Valencian Summer School in Machine Learning
4th edition
September 13-14, 2018
BigML, Inc 2
Machine Learning Automation
Mostly Unhelpful
Charles Parker
VP Algorithms, BigML, Inc
BigML, Inc #VSSML18 3
ML Automation is Awesome!
http://www.clparker.org/ml_benchmark/
BigML, Inc #VSSML18 4
Talk Overview
• Introduction to BigML
• Recent Ideas in Automation
• Automation vs. Abstraction
• Some Nice Abstractions
• BigML’s Abstractions
BigML, Inc #VSSML18 5
Talk Overview
• Introduction to BigML
• Recent Ideas in Automation
• Automation vs. Abstraction
• Some Nice Abstractions
• BigML’s Abstractions
BigML, Inc #VSSML18 6
Goals of BigML
• Accessible to non-programmers
• Limit the distance from “I have data” to a model in production
• It sounds like more automation is what we need!
BigML, Inc #VSSML18 7
Talk Overview
• Introduction to BigML
• Recent Ideas in Automation
• Automation vs. Abstraction
• Some Nice Abstractions
• BigML’s Abstractions
BigML, Inc #VSSML18 8
What Is ML Automation?
[Diagram: Data → ML Algorithm → Model]
BigML, Inc #VSSML18 9
What Is ML Automation?
[Diagram: Data → ML Algorithm → Model, with Preprocessing and Parameterization shown as additional steps]
BigML, Inc #VSSML18 10
What Is ML Automation?
[Diagram: Data → ML Algorithm → Model, with Preprocessing and Parameterization shown as additional steps]
Solution: Just Re-Blacken The Box!
BigML, Inc #VSSML18 11
NIPS Workshops
• People want their learning automated . . .
• Towards an AI for Data Science
• Bayesian Parameter Optimization
• But they also want to interact with it?
• Deep Learning Visualizations
• The Future of Interactive Machine Learning
• Interpretable Machine Learning for Complex Systems
BigML, Inc #VSSML18 12
BayesOpt
• Bayesian Parameter Optimization: Tuning the
parameters for anything
• One experiment at a time; BPO gives you “next” experiment by
modeling previous experiments and preferring novelty
• Classic case is neural networks
• Some important caveats learned at Twitter
• Engineers wanted importance metrics
• Difficult to elicit objective functions; often they were only
determinable after interaction with consumers
• System has the tendency to be myopic or “cheat”
https://bayesopt.github.io/
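To make the "one experiment at a time" loop concrete, here is a minimal sketch in plain Python. It is not a real Bayesian optimizer: instead of a Gaussian process it uses a simple kernel-weighted average of past scores, plus a bonus for being far from any previous experiment. The function names, the bandwidth, the candidate grid, and the toy objective are all assumptions made for illustration.

```python
import math

def surrogate(x, history, bandwidth=0.15):
    """Guess the score at x from past experiments, and measure how novel x is.

    Stand-in for a real probabilistic model: a kernel-weighted mean of
    previous scores, and the distance to the nearest previous experiment.
    """
    weights = [math.exp(-((x - xi) / bandwidth) ** 2) for xi, _ in history]
    total = sum(weights)
    if total > 1e-12:
        mean = sum(w * yi for w, (_, yi) in zip(weights, history)) / total
    else:
        mean = 0.0
    novelty = min(abs(x - xi) for xi, _ in history)
    return mean, novelty

def propose_next(history, candidates, explore=1.0):
    """Pick the candidate with the best predicted score plus novelty bonus."""
    def acquisition(x):
        mean, novelty = surrogate(x, history)
        return mean + explore * novelty
    return max(candidates, key=acquisition)

def optimize(objective, n_experiments=15, explore=1.0):
    """Run experiments one at a time, each chosen from all previous results."""
    candidates = [i / 100 for i in range(101)]  # hypothetical parameter grid on [0, 1]
    history = [(0.0, objective(0.0)), (1.0, objective(1.0))]
    for _ in range(n_experiments):
        x = propose_next(history, candidates, explore)
        history.append((x, objective(x)))
    return max(history, key=lambda pair: pair[1])

def toy_objective(x):
    """Hypothetical stand-in for an expensive experiment; best at x = 0.3."""
    return -(x - 0.3) ** 2

best_x, best_y = optimize(toy_objective)
```

Note that the whole loop depends on `toy_objective` being the thing you actually care about, which is exactly the caveat the Twitter experience points at.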
BigML, Inc #VSSML18 13
Interactive Machine Learning
• Program to Tutor Children In Math
• Has a series of tutorial modules it can show to children
• Optimizes number and order using RL
• But:
• Optimality determined using comprehensive test scores
• What happens when no module can improve students’ scores
in a given subject?
http://www.filmnips.com/
BigML, Inc #VSSML18 14
Trigonometry?
Probably wasn’t important anyway
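The trigonometry outcome can be reproduced with a toy planner. This sketch uses a greedy allocation with diminishing returns rather than reinforcement learning, and the subjects, gain numbers, and decay factor are invented for illustration. The point is only that an objective of "maximize total score gain" never spends a lesson on a subject where no module helps.

```python
def plan_lessons(base_gain, slots, decay=0.5):
    """Greedily assign lesson slots to whichever subject adds the most score.

    base_gain: hypothetical expected score gain of the first lesson per subject.
    decay: each further lesson in the same subject is worth this fraction
    of the previous one.
    """
    plan = {subject: 0 for subject in base_gain}
    for _ in range(slots):
        def marginal(subject):
            return base_gain[subject] * decay ** plan[subject]
        best = max(plan, key=marginal)
        plan[best] += 1
    return plan

# No available module improves trigonometry scores, so its gain is zero.
base_gain = {"fractions": 4.0, "algebra": 3.0, "trigonometry": 0.0}
plan = plan_lessons(base_gain, slots=6)
```

With these numbers the plan alternates between fractions and algebra and trigonometry is silently dropped, even though nobody asked for that.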
BigML, Inc #VSSML18 15
Fraud Prediction
• Model to detect fraudulent loan applications
• “Non-fraud” instances are passed to human underwriters for
verification
• Human underwriters provide the training data for the model
• Which applications will be marked as “passing”?
• Legitimate ones, of course
• But also . . .
BigML, Inc #VSSML18 16
Thieves!
Thanks, machine learning!
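A small sketch of how the labels get contaminated, assuming (hypothetically) that an underwriter can only judge an application by how it looks. The `Application` class and its fields are invented for this illustration and do not describe any real system.

```python
from dataclasses import dataclass

@dataclass
class Application:
    is_fraud: bool          # ground truth, unknown to everyone at decision time
    looks_suspicious: bool  # what the human underwriter can actually observe

def underwriter_label(app):
    """The training label comes from appearance, not from the ground truth."""
    return "fraud" if app.looks_suspicious else "passing"

applications = [
    Application(is_fraud=False, looks_suspicious=False),  # legitimate applicant
    Application(is_fraud=True, looks_suspicious=True),    # clumsy fraud, caught
    Application(is_fraud=True, looks_suspicious=False),   # skilled fraud, passes
]

training_data = [(app, underwriter_label(app)) for app in applications]

# Fraudulent applications that enter the training set labeled "passing".
mislabeled = [app for app, label in training_data
              if app.is_fraud and label == "passing"]
```

A model trained on `training_data` is told that the skilled fraud is a good example of a passing application.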
BigML, Inc #VSSML18 17
Themes
• People love the fact that computers can automate
the optimization process
• People don’t want to do it (as there is often drudgery involved)
• People are worse at it
• People are suspicious (and should be) of the
results of that optimization
• Humans are bad at specifying objective functions
• This isn’t just a nuisance, it’s a security issue
• Humans don’t mind being (and should be) in the loop on their
own terms
BigML, Inc #VSSML18 18
Talk Overview
• Introduction to BigML
• Recent Ideas in Automation
• Automation vs. Abstraction
• Some Nice Abstractions
• BigML’s Abstractions
BigML, Inc #VSSML18 19
Abstractions
• A “translational layer” used to simplify interaction with
complex processes
• Establishes a mode of interaction
• Hides the remaining detail
• Programming languages are a canonical example
• The syntax is the mode of interaction
• The layer below is machine code
• Abstractions make technology usable
BigML, Inc #VSSML18 20
Evaluating Abstractions
• Abstractions can be strong or weak:
• Is the interaction free of much of the detail of the layer below?
• Is the interaction natural and intuitive, perhaps in a way that the
layer below is not?
• Abstractions can be lossy or lossless
• Is the user able to get a satisfactory result with the mode of
interaction that the abstraction provides?
• When they can’t, how graceful is the failure (example: wavelet-based image compression)?
• It’s easy to do just one or the other!
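The wavelet example of graceful failure can be sketched on a one-dimensional signal instead of an image. The code below is an illustrative toy, not a production codec: it applies an orthonormal Haar transform, keeps only the largest coefficients, and reconstructs. The sample signal and function names are assumptions. The user's only control is "how many coefficients to keep", and quality degrades smoothly as that number shrinks.

```python
import math

def haar_forward(signal):
    """Orthonormal Haar transform; len(signal) must be a power of two."""
    coeffs = list(signal)
    n = len(coeffs)
    while n > 1:
        half = n // 2
        averages = [(coeffs[2 * i] + coeffs[2 * i + 1]) / math.sqrt(2) for i in range(half)]
        details = [(coeffs[2 * i] - coeffs[2 * i + 1]) / math.sqrt(2) for i in range(half)]
        coeffs[:n] = averages + details
        n = half
    return coeffs

def haar_inverse(coeffs):
    """Invert haar_forward by rebuilding pairs from averages and details."""
    signal = list(coeffs)
    n = 1
    while n < len(signal):
        merged = []
        for a, d in zip(signal[:n], signal[n:2 * n]):
            merged += [(a + d) / math.sqrt(2), (a - d) / math.sqrt(2)]
        signal[:2 * n] = merged
        n *= 2
    return signal

def compress(signal, keep):
    """Zero all but the `keep` largest-magnitude coefficients, then reconstruct."""
    coeffs = haar_forward(signal)
    ranked = sorted(range(len(coeffs)), key=lambda i: abs(coeffs[i]), reverse=True)
    kept = set(ranked[:keep])
    return haar_inverse([c if i in kept else 0.0 for i, c in enumerate(coeffs)])

def squared_error(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

signal = [4.0, 6.0, 10.0, 12.0, 8.0, 6.0, 5.0, 5.0]  # hypothetical data
errors = [squared_error(signal, compress(signal, k)) for k in range(1, 9)]
```

Here `errors` never increases as more coefficients are kept, and it reaches (numerically) zero when all eight are kept: a lossy abstraction whose failure mode is a blurrier result rather than a broken one.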
BigML, Inc #VSSML18 21
Some Abstractions
• Some good abstractions (rich interaction mostly
free of technical detail)
• Driving controls
• WYSIWYG Editors
• What about Bayesian Parameter Optimization?
• Hides: Hyper-parameter optimization / Model selection
• Interaction?
• As an abstraction, BPO is lossy and weak (but I
love it anyway)
BigML, Inc #VSSML18 22
Talk Overview
• Introduction to BigML
• Recent Ideas in Automation
• Automation vs. Abstraction
• Some Nice Abstractions
• You Can Help!
BigML, Inc #VSSML18 23
People Will Meet You Halfway
Case 1: Programming Language Primitives
(and the Sapir-Whorf Hypothesis)
Case 2: Google
(actually not very good)
Case 3: Decision Trees
(actually better than you think)
BigML, Inc #VSSML18 24
Models Are Abstractions!
• A learned model is an abstraction
• You want to answer a question using the data
• The model hides data details and provides modes of interaction
• How good an abstraction is it?
• Hiding details: Pretty excellent
• Interaction? (a “predict” method?)
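As a rough illustration of "hides the details, offers only predict", here is a deliberately tiny model. The class name and the threshold rule are assumptions for this sketch, and it only works when every positive example has a larger feature value than every negative one.

```python
class ThresholdModel:
    """A one-feature classifier whose only mode of interaction is predict()."""

    def __init__(self, rows):
        # rows: (feature value, boolean label) pairs.
        # Assumes all positives lie above all negatives.
        positives = [x for x, label in rows if label]
        negatives = [x for x, label in rows if not label]
        self._threshold = (min(positives) + max(negatives)) / 2

    def predict(self, x):
        """Answer the question without exposing any of the training data."""
        return x > self._threshold

model = ThresholdModel([(1.0, False), (2.0, False), (5.0, True), (6.0, True)])
answers = [model.predict(3.0), model.predict(4.0)]
```

The data is fully hidden, which is the strong part of the abstraction. The weak part is that the caller can ask nothing except "what is the label for this input?".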
BigML, Inc #VSSML18 25
Going Somewhere?
“We’ve developed technology that
automates travel between two places!”
Here you go!
BigML, Inc #VSSML18 26
Talk Overview
• Introduction to BigML
• Recent Ideas in Automation
• Automation vs. Abstraction
• Some Nice Abstractions
• BigML’s Abstractions
BigML, Inc #VSSML18 27
Go Broader with Abstraction
• Underlying Architecture
• Data storage / manipulation
• Speed / scaling
• “Technology Debt”
• How much software do you need before you even start (data
manipulation, scaling, visualization)?
• How many different software systems will you have to
integrate?
• Even worse: Having to hire technical people
• These things are “easier”, and also probably less
lossy (very few people will miss them)
BigML, Inc #VSSML18 28
Technology Debt
• There are lots of programs out there for data
preprocessing / feature engineering
• Download, pay for, learn, set up software
• Often has difficulties with data ingestion
• We want to be able to do all of this stuff server-side
without constraining which operations we can do
• Solution: A DSL (Flatline)
BigML, Inc #VSSML18 29
What Can Flatline Do?
• A whole lot
• Binary, arithmetic, statistical functions
• Regular expressions
• Date/Time parsing
• Discretization / Binarization
• Temporary variable binding
• Window transformations
• It compiles down to Java, so adding new features
is pretty trivial
https://github.com/bigmlcom/flatline/blob/master/user-manual.md
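For readers unfamiliar with two of the operations listed above, the sketch below shows what a window transformation and a discretization compute. It is written in plain Python and is not Flatline syntax; see the user manual linked above for the real language. The function names, bin edges, and sample values are assumptions.

```python
import bisect

def window_mean(values, before, after):
    """Replace each value by the mean of itself and its neighbors.

    Uses up to `before` earlier rows and `after` later rows; windows are
    truncated at the start and end of the column.
    """
    out = []
    for i in range(len(values)):
        window = values[max(0, i - before): i + after + 1]
        out.append(sum(window) / len(window))
    return out

def discretize(values, edges, labels):
    """Map each number to a labeled bin; `edges` must be sorted ascending.

    A value equal to an edge falls into the bin above that edge.
    """
    return [labels[bisect.bisect_right(edges, v)] for v in values]

smoothed = window_mean([1, 2, 3, 4], before=1, after=1)
binned = discretize([5, 10, 25], edges=[10, 20], labels=["low", "mid", "high"])
```

Doing this client-side is easy for four rows; the argument on the slide is about doing it server-side, on the full dataset, without the user setting anything up.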
BigML, Inc #VSSML18 30
Is This Automation?
• You’re making your users learn a language (true)!
• What didn’t they have to do?
• Install and/or learn sed, Perl, or Python
• Prepare their data for ingestion into those tools (and debug that
process)
• Worry at all about scaling anything
• Take the results and feed them into BigML
• Iterate?
• Perhaps not automation, but a near-lossless and
fairly useful abstraction
 

VSSML18. Advanced WhizzML Workflows

  • 1. Valencian Summer School in Machine Learning, 4th edition, September 13-14, 2018
  • 2. BigML, Inc 2 Machine Learning Automation: Mostly Unhelpful. Charles Parker, VP Algorithms, BigML, Inc
  • 3. BigML, Inc #VSSML18 3 ML Automation is Awesome! http://www.clparker.org/ml_benchmark/
  • 4. Talk Overview • Introduction to BigML • Recent Ideas in Automation • Automation vs. Abstraction • Some Nice Abstractions • BigML’s Abstractions
  • 5. Talk Overview • Introduction to BigML • Recent Ideas in Automation • Automation vs. Abstraction • Some Nice Abstractions • BigML’s Abstractions
  • 6. Goals of BigML • Accessible to non-programmers • Limit the distance from “I have data” to a model in production • It sounds like more automation is what we need!
  • 7. Talk Overview • Introduction to BigML • Recent Ideas in Automation • Automation vs. Abstraction • Some Nice Abstractions • BigML’s Abstractions
  • 8. What Is ML Automation? [Diagram labels: Data, ML Algorithm, Model]
  • 9. What Is ML Automation? [Diagram labels: Data, ML Algorithm, Model, Preprocessing, Parameterization]
  • 10. What Is ML Automation? [Diagram labels: Data, ML Algorithm, Model, Preprocessing, Parameterization] Solution: Just Re-Blacken The Box!
  • 11. NIPS Workshops • People want their learning automated . . . • Towards an AI for Data Science • Bayesian Parameter Optimization • But they also want to interact with it? • Deep Learning Visualizations • The Future of Interactive Machine Learning • Interpretable Machine Learning for Complex Systems
  • 12. BayesOpt • Bayesian Parameter Optimization: Tuning the parameters for anything • One experiment at a time; BPO gives you “next” experiment by modeling previous experiments and preferring novelty • Classic case is neural networks • Some important caveats learned at Twitter • Engineers wanted importance metrics • Difficult to elicit objective functions; often they were only determinable after interaction with consumers • System has the tendency to be myopic or “cheat” https://bayesopt.github.io/
  • 13. Interactive Machine Learning • Program to Tutor Children in Math • Has a series of tutorial modules it can show to children • Optimizes number and order using RL • But: • Optimality determined using comprehensive test scores • What happens when no module can improve students’ scores in a given subject? http://www.filmnips.com/
  • 14. Trigonometry? Probably wasn’t important anyway
  • 15. Fraud Prediction • Model to detect fraudulent loan applications • “Non-fraud” instances are passed to human underwriters for verification • Human underwriters provide the training data for the model • Which applications will be marked as “passing”? • Legitimate ones, of course • But also . . .
  • 16. Thieves! Thanks, machine learning!
  • 17. Themes • People love the fact that computers can automate the optimization process • People don’t want to do it (as there is often drudgery involved) • People are worse at it • People are suspicious (and should be) of the results of that optimization • Humans are bad at specifying objective functions • This isn’t just a nuisance, it’s a security issue • Humans don’t mind being (and should be) in the loop on their own terms
  • 18. Talk Overview • Introduction to BigML • Recent Ideas in Automation • Automation vs. Abstraction • Some Nice Abstractions • BigML’s Abstractions
  • 19. Abstractions • A “translational layer” used to simplify interaction with complex processes • Establishes a mode of interaction • Hides the remaining detail • Programming languages are a canonical example • The syntax is the mode of interaction • The layer below is machine code • Abstractions make technology usable
  • 20. Evaluating Abstractions • Abstractions can be strong or weak: • Is the interaction free of much of the detail of the layer below? • Is the interaction natural and intuitive, perhaps in a way that the layer below is not? • Abstractions can be lossy or lossless: • Is the user able to get a satisfactory result with the mode of interaction that the abstraction provides? • When they can’t, how graceful is the failure (example: wavelet-based image compression)? • It’s easy to do just one or the other!
  • 21. Some Abstractions • Some good abstractions (rich interaction mostly free of technical detail) • Driving controls • WYSIWYG editors • What about Bayesian Parameter Optimization? • Hides: Hyper-parameter optimization / Model selection • Interaction? • As an abstraction, BPO is lossy and weak (but I love it anyway)
  • 22. Talk Overview • Introduction to BigML • Recent Ideas in Automation • Automation vs. Abstraction • Some Nice Abstractions • You Can Help!
  • 23. People Will Meet You Halfway • Case 1: Programming Language Primitives (and the Sapir-Whorf Hypothesis) • Case 2: Google (actually not very good) • Case 3: Decision Trees (actually better than you think)
  • 24. Models Are Abstractions! • A learned model is an abstraction • You want to answer a question using the data • The model hides data details and provides modes of interaction • How good of an abstraction is it? • Hiding details: Pretty excellent • Interaction? (a “predict” method?)
  • 25. Going Somewhere? “We’ve developed technology that automates travel between two places!” Here you go!
  • 26. Talk Overview • Introduction to BigML • Recent Ideas in Automation • Automation vs. Abstraction • Some Nice Abstractions • BigML’s Abstractions
  • 27. Go Broader with Abstraction • Underlying Architecture • Data storage / manipulation • Speed / scaling • “Technology Debt” • How much software do you need before you even start (data manipulation, scaling, visualization)? • How many different software systems will you have to integrate? • Even worse: Having to hire technical people • These things are “easier”, and also probably less lossy (very few people will miss them)
  • 28. Technology Debt • There are lots of programs out there for data preprocessing / feature engineering • Download, pay for, learn, set up software • Often has difficulties with data ingestion • We want to be able to do all of this stuff server-side without constraining which operations we can do • Solution: A DSL (Flatline)
  • 29. What Can Flatline Do? • A whole lot • Binary, arithmetic, statistical functions • Regular expressions • Date/Time parsing • Discretization / Binarization • Temporary variable binding • Window transformations • It compiles down to Java, so adding new features is pretty trivial https://github.com/bigmlcom/flatline/blob/master/user-manual.md
  • 30. Is This Automation? • You’re making your users learn a language (true)! • What didn’t they have to do? • Install and/or learn sed, Perl, or Python • Prepare their data for ingestion into those tools (and debug that process) • Worry at all about scaling anything • Take the results and feed them into BigML • Iterate? • Perhaps not automation, but a near-lossless and fairly useful abstraction
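
Slide 12 describes Bayesian Parameter Optimization as running one experiment at a time, choosing the "next" experiment by modeling the previous ones and preferring novelty. The loop below is a minimal sketch of that idea, not the BayesOpt implementation: it assumes a single numeric parameter, swaps in a simple kernel-weighted average for the usual Gaussian process model, and uses distance to the nearest tried setting as the novelty bonus. The objective function and all names here are hypothetical.

```python
import math


def objective(x):
    # Hypothetical black-box "experiment": the score peaks at x = 0.7.
    return -(x - 0.7) ** 2


def surrogate(x, history, bandwidth=0.1):
    # Model of previous experiments: kernel-weighted average of past scores.
    # (An assumption for brevity; real BPO typically uses a Gaussian process.)
    weights = [math.exp(-((x - xi) / bandwidth) ** 2) for xi, _ in history]
    total = sum(weights)
    if total < 1e-12:
        # Far from every tried setting: fall back to the overall mean score.
        return sum(y for _, y in history) / len(history)
    return sum(w * y for w, (_, y) in zip(weights, history)) / total


def novelty(x, history):
    # Distance to the nearest setting already tried.
    return min(abs(x - xi) for xi, _ in history)


def suggest_next(history, candidates, explore=1.0):
    # Prefer settings that the model predicts are good AND that are novel.
    return max(
        candidates,
        key=lambda x: surrogate(x, history) + explore * novelty(x, history),
    )


def optimize(n_experiments=15):
    candidates = [i / 100 for i in range(101)]
    history = [(0.0, objective(0.0))]  # one initial experiment
    for _ in range(n_experiments - 1):
        x = suggest_next(history, candidates)
        history.append((x, objective(x)))  # run one experiment at a time
    return max(history, key=lambda pair: pair[1])


best_x, best_y = optimize()
print(best_x, best_y)
```

Note that the loop only ever sees the number returned by `objective`; this is where the caveats on the same slide apply, since a poorly specified objective will be optimized just as readily as a good one.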