SlideShare uma empresa Scribd logo
1 de 114
Baixar para ler offline
Architecting for Data Science
johann@ifwe.co@jssmith github.com/ifwe
Johann Schleier-Smith
CTO, if(we)
O’Reilly Software Architecture Conference

Boston, March 19, 2015
Data Science
value from data
Alternative Denitions
extraction of knowledge from data
making discoveries in the world of big data
statistics + machine learning + scalable
computation + visualization + computer science +
business acumen + skilled communication
Related and Alternative Language
business intelligence
statistics
data mining
forecasting
business reporting
predictive modeling
analyticsknowledge extraction
value
Types of Value
understanding
revenue
product improvements
projections
new inspirations
predictions
customer satisfaction
Today’s Examples
• >10 million candidates to draw from
• >1000 updates/sec
• Must be responsive to current activity
• Users expect instant query results
Recommendation engine

for dating product
• Real-time is challenging
• Human behavior is
complicated, especially in
social context
• Previous interactions are
perhaps our best hope for
predicting future interactions
• Human connections
• User engagement ecosystem
• Subscription and other revenues
Value
♥
Kaggle competition

with Best Buy data
https://www.kaggle.com/c/acm-sf-chapter-hackathon-small
Kaggle competition

with Best Buy data
“outgoing and social (heavy messaging ---
especially distant recipients and opposite gender,
many outgoing comments, many friend requests to
distant people), doesn’t play Pets much”
“receives many messages, active user,
views many proles, doesn't use meet me,
sends many messages to distant people”
Heavy user overall, (pets, meet me, messaging)!
“heavy user overall, (pets, meet me, messaging)”
value
data
• Vote history
• Social interaction history
• Profile information
Dating product data
{
“timestamp” : “2011-10-31 09:48:46”,
“query” : “Assassin’s Creed”,
“skuSelected” : “2670133”
}
product views
{
“sku” : “1032361”,
“regularPrice” : “19.99”,
“name” : “Need for Speed: Hot Pursuit”,
“description” : “Fasten your seatbelt and

get ready to drive like your life depends

on it...”
...
}
product updates
Formats for Data
log les
web services
relational databases
unstructured documents
spreadsheets
xml les
Types of Data
technical data
government data
usage records
sensor data
academic data
reference data
yet uncollected data
Vasant Dhar. 2013. Data science and prediction. Commun. ACM 56, 12 (December 2013), 64-73.

And International Telecommunication Union (ITU) and United Nations Population Division via www.internetlivestats.com/internet-users/
Trends
data quantity
machine learning maturing
data variety
data velocity
Machine Learning
classication
decision trees
supervised methods
unsupervised methodsclustering
what matters most is mapping of data
Machine Learning Techniques
• Classification - (Logistic Regression, Decision
Trees, Random Forests)
• Prediction - (Generalized Linear Models, Support
Vector Regression, …)
• Clustering - (K-Means, Hierarchical, Latent
Dirichlet Allocation, …)
• Collaborative filtering, …
Features often matter more than choice of algorithm
data
tools of
the
trade
• Created in 1993
• Implementation of S language but also
inherits from Scheme
• Object oriented code is possible but
not encouraged
• Vast high-quality package ecosystem
• Data is vectors and data frames
Demo
• Statistics
• Visualization
• Machine learning
• REPL, scripts, interactive IDE
• In-memory data sets
http://scikit-learn.org/stable/auto_examples/linear_model/plot_iris_logistic.html
• More of a general purpose language than R
• Arrays and matrices as basic data structures
• Supports data frames through Pandas
• Sophisticated machine learning libraries
• Generally limited to in-memory data sets
• Leverages commodity hardware to
store large data sets at low cost
• Vibrant and diverse ecosystem
• Popular but not always best solution
• Probably best viewed as marketing
terminology, as opposed to technology
https://hadoopecosystemtable.github.io/
Category
Number of
projects
Distributed Filesystem 7
Distributed Programming 18
NoSQL Database 4
Document Data Model 3
Stream Data Model 1
Key-Value Data Model 4
Graph Data Model 3
NewSQL 9
SQL-On-Hadoop 11
Data Ingestion 11
Service Programming 7
Scheduling 3
Machine Learning 6
Benchmarking 5
Security 3
System Deployment 12
Applications 5
Development Frameworks 2
Categorize Pending 16
130 freely licensed open source
projects listed in the Hadoop
Ecosystem Table
Hadoop for Data Scientists
• Pulling data from repository (SQL, Hive)
• MapReduce programming (Java, Scala,
Pig, Python)
• Spark in-memory framework is gaining
adoption rapidly
tools rarely used in data science
version control
automated testing
automated deployment
shared code
agile methodology
code review
software architecture
the cycle

of data science
Data ProcessingData Collection
Models, Algorithms
Data-driven Product
Features
Data Analysis &
Understanding
Reports &
Visualizations
Product Improvements
data science

at
• Profitable startup actively pursuing big
opportunities in social apps
• Millions of users on existing products
• Thousands of social contacts per second
what it should look like
1. Gain understanding of the product usage
2. See opportunity to make the product better
3. Create training data
4. Train predictive models
5. Put models in production
6. See improvements
what it often looks like
1. Gain understanding of the product usage
2. See opportunity to make the product better
3. Pull records from relational database to create
interesting features (usually aggregates)
4. Train predictive models
5. Go implement models for production
6. See improvements
1. Gain understanding of the product usage
2. See opportunity to make the product better
3. Pull records from relational database to create
interesting features (usually aggregates)
4. Train predictive models
5. Go implement models for production
6. See improvements
3-6

months
1. Gain understanding of the product usage
2. See opportunity to make the product better
3. Pull records from relational database to create
interesting features (usually aggregates)
4. Train predictive models
5. Go implement models for production
6. See improvements Cool!

Was it worth it?
implementation

pain points
• Data scientist hands model description to
software engineer
• May need to translate features from SQL to Java
• Aggregate features require batch processing
• May need to adjust features and model to achieve
real-time updates
• Fast scoring requires high-performance in-
memory data structures
new thinking
new architecture
one right way to data
event history
one right way to data
everything is an event
Bob registers
Alice registers
Alice updates prole
Bob opens app
Bob sees Alice in recommendations
Bob swipes yes on Alice
Alice receives push notication
Alice sees Bob swiped yes
Alice swipes yes
Alice sends message to Bob
architecture
comparison
Database'
Applica-on'
Web'API'
Ranking' Solr'Search'
Indexing'
Service'
Change'logs'
Occasional'
index'rebuilds'
Change'logs'
Produc'on)
Development)
Exploratory'
Analysis'
Training'&'
Backtes-ng'
Database'
Applica-on'
Web'API'
Ranking' Solr'Search'
Indexing'
Service'
Change'logs'
Occasional'
index'rebuilds'
Change'logs'
Produc'on)
Development)
Exploratory'
Analysis'
Training'&'
Backtes-ng'
Database'
Applica-on'
Web'API'
Ranking' Solr'Search'
Indexing'
Service'
Change'logs'
Occasional'
index'rebuilds'
Change'logs'
Produc'on)
Development)
Exploratory'
Analysis'
Training'&'
Backtes-ng'
Database'
Applica-on'
Web'API'
Ranking' Solr'Search'
Indexing'
Service'
Change'logs'
Occasional'
index'rebuilds'
Change'logs'
Produc'on)
Development)
Exploratory'
Analysis'
Training'&'
Backtes-ng'
Database'
Applica-on'
Web'API'
Ranking' Solr'Search'
Indexing'
Service'
Change'logs'
Occasional'
index'rebuilds'
Change'logs'
Produc'on)
Development)
Exploratory'
Analysis'
Training'&'
Backtes-ng'
Database'
Applica-on'
Web'API'
Ranking'
Event'History'Repository'
Solr'Search'
Real=-me'
State'Updates'
Change'logs'Real=-me'events'
Occasional'
index'rebuilds'
Exploratory'
Analysis'&'
Visualiza-on'
Produc'on)
Development)
Training'&'
Backtes-ng'
State'Updates'
Monitoring'
Database'
Applica-on'
Web'API'
Ranking'
Event'History'Repository'
Solr'Search'
Real=-me'
State'Updates'
Change'logs'Real=-me'events'
Occasional'
index'rebuilds'
Exploratory'
Analysis'&'
Visualiza-on'
Produc'on)
Development)
Training'&'
Backtes-ng'
State'Updates'
Monitoring'
Database'
Applica-on'
Web'API'
Ranking'
Event'History'Repository'
Solr'Search'
Real=-me'
State'Updates'
Change'logs'Real=-me'events'
Occasional'
index'rebuilds'
Exploratory'
Analysis'&'
Visualiza-on'
Produc'on)
Development)
Training'&'
Backtes-ng'
State'Updates'
Monitoring'
Database'
Applica-on'
Web'API'
Ranking'
Event'History'Repository'
Solr'Search'
Real=-me'
State'Updates'
Change'logs'Real=-me'events'
Occasional'
index'rebuilds'
Exploratory'
Analysis'&'
Visualiza-on'
Produc'on)
Development)
Training'&'
Backtes-ng'
State'Updates'
Monitoring'
Database'
Applica-on'
Web'API'
Ranking'
Event'History'Repository'
Solr'Search'
Real=-me'
State'Updates'
Change'logs'Real=-me'events'
Occasional'
index'rebuilds'
Exploratory'
Analysis'&'
Visualiza-on'
Produc'on)
Development)
Training'&'
Backtes-ng'
State'Updates'
Monitoring'
Event History API
trait EventHistory {
def publishEvent(e: Event)
"
def getEvents(
startTime: Date,
endTime: Date,
eventFilter: EventFilter,
eventHandler: EventHandler
)
}
Event History API
trait EventHistory {
def publishEvent(e: Event)
"
def getEvents(
startTime: Date,
endTime: Date,
eventFilter: EventFilter,
eventHandler: EventHandler
)
}
Event History API
trait EventHistory {
def publishEvent(e: Event)
"
def getEvents(
startTime: Date,
endTime: Date,
eventFilter: EventFilter,
eventHandler: EventHandler
)
}
+∞ for

real-time

streaming
training data
comparison
Database'
Applica-on'
Web'API'
Ranking' Solr'Search'
Indexing'
Service'
Change'logs'
Occasional'
index'rebuilds'
Change'logs'
Produc'on)
Development)
Exploratory'
Analysis'
Training'&'
Backtes-ng'
Events'State'Snapshots''
Training'
Features'
Training'
Outcomes'
Time'
Database'
Applica-on'
Web'API'
Ranking'
Event'History'Repository'
Solr'Search'
Real=-me'
State'Updates'
Change'logs'Real=-me'events'
Occasional'
index'rebuilds'
Exploratory'
Analysis'&'
Visualiza-on'
Produc'on)
Development)
Training'&'
Backtes-ng'
State'Updates'
Monitoring'
Updates(
Training(
features(
Training(
outcome(
Time(
Online(State(
Events(
Training(Data(
1. Gain understanding of the product usage
2. See opportunity to make the product better
3. Create training data
4. Train predictive models
5. Put models in production
6. See improvements
Fast cycles!!
Live Demo
https://www.kaggle.com/c/acm-sf-chapter-hackathon-small
https://github.com/ifwe/antelope
• Open source implementation derived from if(we)’s
proprietary platform
• Not ready scale or production, but useful for
demonstration purposes
• Seeking collaborators
product update events
{
“timestamp” : “2012-05-03 6:43:15”,
“eventType” : “ProductUpdate”,
“eventProperties” : {
“sku” : “1032361”,
“regularPrice” : “19.99”,
“name” : “Need for Speed: Hot Pursuit”,
“description” : “Fasten your seatbelt and

get ready to drive like your life depends

on it...”
...
}
}
product view events
{
“timestamp” : “2011-10-31 09:48:46”,
“eventType” : “ProductView”,
“eventProperties” : {
“query” : “Modern warfare”,
“skuSelected” : “2670133”
}
}
demo
Try it yourself, code and instructions at:

https://github.com/ifweco/antelope/blob/master/doc/demo.md
new
tools :)
Architecture for

Data Science
data warehousing
Database'
Applica-on'
Web'API'
Ranking' Solr'Search'
Indexing'
Service'
Change'logs'
Occasional'
index'rebuilds'
Change'logs'
Produc'on)
Development)
Exploratory'
Analysis'
Training'&'
Backtes-ng'
Data$
Warehouse$
Extract$
Load$
Transform$
Opera7onal$
Data$Store$
Staging$
log transform use
extract transform load
Data Architecture
dimensional modeling
relational modeling
Warehouse Design
normalization
slowly changing

dimensions
denormalization
star schema
slowly changing dimensions
type 1: overwrite the old data
type 2: multiple rows with versioning
type 3: extra columns for older versions
slowly changing dimensions
event history
Database'
Applica-on'
Web'API'
Ranking'
Event'History'Repository'
Solr'Search'
Real=-me'
State'Updates'
Change'logs'Real=-me'events'
Occasional'
index'rebuilds'
Exploratory'
Analysis'&'
Visualiza-on'
Produc'on)
Development)
Training'&'
Backtes-ng'
State'Updates'
Monitoring'
Event&
Indexing&
Applica1on&
Web&API&
Ranking&
Event&History&Repository&
Solr&Search&
Real>1me&
State&Updates&
Change&logs&Real>1me&events&
Exploratory&
Analysis&&&
Visualiza1on&
Produc'on)
Development)
Training&&&
Backtes1ng&
State&Updates&
Monitoring&
trait EventHistory {
def publishEvent(e: Event)
"
def getEvents(
startTime: Date,
endTime: Date,
eventFilter: EventFilter,
eventHandler: EventHandler
)
}
event history design
Data Architecture
ok to denormalize
log a lot
think about the types
• Make sure that events are simple facts
• Files are ok for event history, don’t really need a database
• Use an object hierarchy to model events in code
• Use online features that are efficient to update incrementally
• Write efficient implementations before than scaling out
• Functional style makes it easier
• Encourage reactive processing
Data Quality
• Matters more than transformations, more than algorithms
• Data that doesn’t make sense often indicates an application bug
• Do assertions, e.g., make sure things aren’t happening out of order
• All data in form of events – no exceptions!
• Same feature code in production and development
• Emphasis on creative feature engineering
• Quick cycles between ideas and production
github.com/ifwe/antelope
@jssmith
Try the Antelope Demo:

https://github.com/ifwe/antelope/blob/master/doc/demo.md

Mais conteĂşdo relacionado

Mais procurados

Ml infra at an early stage
Ml infra at an early stageMl infra at an early stage
Ml infra at an early stageNick Handel
 
Guiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning PipelineGuiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning PipelineMichael Gerke
 
Production machine learning_infrastructure
Production machine learning_infrastructureProduction machine learning_infrastructure
Production machine learning_infrastructurejoshwills
 
Spark Summit EU 2017 - Preventing revenue leakage and monitoring distributed ...
Spark Summit EU 2017 - Preventing revenue leakage and monitoring distributed ...Spark Summit EU 2017 - Preventing revenue leakage and monitoring distributed ...
Spark Summit EU 2017 - Preventing revenue leakage and monitoring distributed ...Flavio Clesio
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLJordan Birdsell
 
Modern Machine Learning Infrastructure and Practices
Modern Machine Learning Infrastructure and PracticesModern Machine Learning Infrastructure and Practices
Modern Machine Learning Infrastructure and PracticesWill Gardella
 
MLOps Using MLflow
MLOps Using MLflowMLOps Using MLflow
MLOps Using MLflowDatabricks
 
Rsqrd AI: How to Design a Reliable and Reproducible Pipeline
Rsqrd AI: How to Design a Reliable and Reproducible PipelineRsqrd AI: How to Design a Reliable and Reproducible Pipeline
Rsqrd AI: How to Design a Reliable and Reproducible PipelineSanjana Chowdhury
 
The Machine Learning Workflow with Azure
The Machine Learning Workflow with AzureThe Machine Learning Workflow with Azure
The Machine Learning Workflow with AzureIvo Andreev
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionProvectus
 
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.Provectus
 
DevOps for DataScience
DevOps for DataScienceDevOps for DataScience
DevOps for DataScienceStepan Pushkarev
 
MLOps Bridging the gap between Data Scientists and Ops.
MLOps Bridging the gap between Data Scientists and Ops.MLOps Bridging the gap between Data Scientists and Ops.
MLOps Bridging the gap between Data Scientists and Ops.Knoldus Inc.
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringDatabricks
 
FrugalML: Using ML APIs More Accurately and Cheaply
FrugalML: Using ML APIs More Accurately and CheaplyFrugalML: Using ML APIs More Accurately and Cheaply
FrugalML: Using ML APIs More Accurately and CheaplyDatabricks
 
Productionising Machine Learning Models
Productionising Machine Learning ModelsProductionising Machine Learning Models
Productionising Machine Learning ModelsTash Bickley
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data SciencePouria Amirian
 
Machine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive ServicesMachine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive ServicesTuri, Inc.
 
Importance of ML Reproducibility & Applications with MLfLow
Importance of ML Reproducibility & Applications with MLfLowImportance of ML Reproducibility & Applications with MLfLow
Importance of ML Reproducibility & Applications with MLfLowDatabricks
 

Mais procurados (20)

Ml infra at an early stage
Ml infra at an early stageMl infra at an early stage
Ml infra at an early stage
 
Guiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning PipelineGuiding through a typical Machine Learning Pipeline
Guiding through a typical Machine Learning Pipeline
 
Production machine learning_infrastructure
Production machine learning_infrastructureProduction machine learning_infrastructure
Production machine learning_infrastructure
 
Spark Summit EU 2017 - Preventing revenue leakage and monitoring distributed ...
Spark Summit EU 2017 - Preventing revenue leakage and monitoring distributed ...Spark Summit EU 2017 - Preventing revenue leakage and monitoring distributed ...
Spark Summit EU 2017 - Preventing revenue leakage and monitoring distributed ...
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of ML
 
Modern Machine Learning Infrastructure and Practices
Modern Machine Learning Infrastructure and PracticesModern Machine Learning Infrastructure and Practices
Modern Machine Learning Infrastructure and Practices
 
MLOps Using MLflow
MLOps Using MLflowMLOps Using MLflow
MLOps Using MLflow
 
Rsqrd AI: How to Design a Reliable and Reproducible Pipeline
Rsqrd AI: How to Design a Reliable and Reproducible PipelineRsqrd AI: How to Design a Reliable and Reproducible Pipeline
Rsqrd AI: How to Design a Reliable and Reproducible Pipeline
 
The Machine Learning Workflow with Azure
The Machine Learning Workflow with AzureThe Machine Learning Workflow with Azure
The Machine Learning Workflow with Azure
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
 
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
Intelligent Document Processing in Healthcare. Choosing the Right Solutions.
 
DevOps for DataScience
DevOps for DataScienceDevOps for DataScience
DevOps for DataScience
 
MLOps Bridging the gap between Data Scientists and Ops.
MLOps Bridging the gap between Data Scientists and Ops.MLOps Bridging the gap between Data Scientists and Ops.
MLOps Bridging the gap between Data Scientists and Ops.
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML Monitoring
 
FrugalML: Using ML APIs More Accurately and Cheaply
FrugalML: Using ML APIs More Accurately and CheaplyFrugalML: Using ML APIs More Accurately and Cheaply
FrugalML: Using ML APIs More Accurately and Cheaply
 
What is MLOps
What is MLOpsWhat is MLOps
What is MLOps
 
Productionising Machine Learning Models
Productionising Machine Learning ModelsProductionising Machine Learning Models
Productionising Machine Learning Models
 
Data Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
 
Machine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive ServicesMachine Learning in Production with Dato Predictive Services
Machine Learning in Production with Dato Predictive Services
 
Importance of ML Reproducibility & Applications with MLfLow
Importance of ML Reproducibility & Applications with MLfLowImportance of ML Reproducibility & Applications with MLfLow
Importance of ML Reproducibility & Applications with MLfLow
 

Semelhante a Architecting for Data Science

ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...DATAVERSITY
 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?DIGITALSAI1
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification courseKumarNaik21
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)SayyedYusufali
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabadVamsiNihal
 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabadsaitejavella
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training HyderabadNithinsunil1
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabadVamsiNihal
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)SayyedYusufali
 
data science training and placement
data science training and placementdata science training and placement
data science training and placementSaiprasadVella
 
online data science training
online data science trainingonline data science training
online data science trainingDIGITALSAI1
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabadVamsiNihal
 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabadVamsiNihal
 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in HyderabadKumarNaik21
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training HyderabadNithinsunil1
 
Data Science Training and Placement
Data Science Training and PlacementData Science Training and Placement
Data Science Training and PlacementAkhilGGM
 
Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)SayyedYusufali
 
Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)SayyedYusufali
 
Data science training in hydpdf converted (1)
Data science training in hydpdf  converted (1)Data science training in hydpdf  converted (1)
Data science training in hydpdf converted (1)SayyedYusufali
 

Semelhante a Architecting for Data Science (20)

ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification course
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabad
 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
data science training and placement
data science training and placementdata science training and placement
data science training and placement
 
online data science training
online data science trainingonline data science training
online data science training
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabad
 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 
Data Science Training and Placement
Data Science Training and PlacementData Science Training and Placement
Data Science Training and Placement
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)
 
Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)
 
Data science training in hydpdf converted (1)
Data science training in hydpdf  converted (1)Data science training in hydpdf  converted (1)
Data science training in hydpdf converted (1)
 

Último

Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based projectAnoyGreter
 
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in NoidaBuds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in Noidabntitsolutionsrishis
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfFerryKemperman
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceBrainSell Technologies
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfStefano Stabellini
 
2.pdf Ejercicios de programaciĂłn competitiva
2.pdf Ejercicios de programaciĂłn competitiva2.pdf Ejercicios de programaciĂłn competitiva
2.pdf Ejercicios de programaciĂłn competitivaDiego IvĂĄn Oliveros Acosta
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 

Último (20)

Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
MYjobs Presentation Django-based project
MYjobs Presentation Django-based projectMYjobs Presentation Django-based project
MYjobs Presentation Django-based project
 
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in NoidaBuds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdf
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. Salesforce
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Xen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdfXen Safety Embedded OSS Summit April 2024 v4.pdf
Xen Safety Embedded OSS Summit April 2024 v4.pdf
 
2.pdf Ejercicios de programaciĂłn competitiva
2.pdf Ejercicios de programaciĂłn competitiva2.pdf Ejercicios de programaciĂłn competitiva
2.pdf Ejercicios de programaciĂłn competitiva
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 

Architecting for Data Science