BigML Spring 2016 Release
BigML, Inc 2
Spring 2016 Release
Speaker: POUL PETERSEN (CIO)
Moderator: ATAKAN CETINSOY (VP Predictive Applications)
Questions: Enter questions into chat box – we’ll answer some via chat; others at the end of the session
Resources: https://bigml.com/releases
Contact: info@bigml.com
Twitter: @bigmlcom, @whizzml
BigML, Inc 3
BigML, Inc 4
Promise of ML
time
Want
•Reduce churn
•Increase conversion
•Improve diagnosis
•Reduce fraud
•Etc.
Automated Insights
Data
Have
BigML, Inc 5
ML Hurdles
time
•Which algorithms?
•How to scale it?
•How to handle real data?
•How to tune it?
•How to automate it?
BigML, Inc 6
Current Resources
Data Exploration: SOURCE, DATASET, CORRELATION, STATISTICAL TEST
Supervised Learning: MODEL, ENSEMBLE, LOGISTIC REGRESSION, EVALUATION
Unsupervised Learning: CLUSTER, ANOMALY DETECTOR, ASSOCIATION DISCOVERY
Scoring: PREDICTION, BATCH PREDICTION
Automation: SCRIPT, LIBRARY, EXECUTION
BigML, Inc 7
BigML Vision
Paving the Path to Automatic Machine Learning
Timeline: 2011 to Spring 2016 (axes: time, Automation)
REST API
A. Programmable Infrastructure: Sauron
•Automatic deployment and auto-scaling
B. Wintermute
•Distributed Machine Learning Framework
C. Data Generation and Filtering: Flatline
•DSL for transformation and new field generation
D. Workflow Automation: WhizzML
•DSL for programmable workflows
E. Automatic Model Selection: SMACdown
•Automatic parameter optimization
BigML, Inc 8
Workflow Map
MODEL: Decision Trees, Bagging, Decision Forest, Logistic Regression
CLUSTER: K-Means, G-Means
ANOMALY: Isolation Forest
ASSOCIATION: Magnum Opus
STATS: Statistical Tests, Correlations
DATASET: Flatline, Flatline Editor
PREDICTION: Batch Prediction, Batch Anomaly, Batch Centroid, Evaluation
SOURCE
BigML, Inc 9
Original Workflow
SOURCE DATASET MODEL PREDICTION
BigML, Inc 10
Regular Workflows
MODEL
FILTER
SOLD HOMES
BATCH
PREDICTION
NEW FEATURES
DATASET DEALS
DATASET
FILTER
FORSALE HOMES
NEW FEATURES
BigML, Inc 11
Model Selection
ENSEMBLE LOGISTIC
REGRESSION
EVALUATION
SOURCE DATASET
TRAINING
TEST
MODEL
EVALUATION
EVALUATION
CHOOSE
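The model selection flow above (split the dataset into training and test, build several candidates, evaluate each, choose the best) can be sketched in plain Python. This is only an illustration under stated assumptions: the two toy trainers stand in for the ENSEMBLE, LOGISTIC REGRESSION and MODEL resources, and every name here (split_rows, train_majority, train_threshold, choose_model) is hypothetical rather than part of BigML or WhizzML.

```python
from collections import Counter


def split_rows(rows, every=5):
    """Deterministic split: every `every`-th row goes to the test set."""
    training = [row for i, row in enumerate(rows) if i % every != 0]
    test = [row for i, row in enumerate(rows) if i % every == 0]
    return training, test


def train_majority(training):
    """Toy candidate 1: always predict the most common training label."""
    label = Counter(y for _, y in training).most_common(1)[0][0]
    return lambda x: label


def train_threshold(training):
    """Toy candidate 2: cut halfway between the two class means."""
    positives = [x for x, y in training if y]
    negatives = [x for x, y in training if not y]
    cut = (sum(positives) / len(positives) + sum(negatives) / len(negatives)) / 2
    return lambda x: x >= cut


def accuracy(predict, test):
    """Fraction of test rows whose prediction matches the label."""
    return sum(predict(x) == y for x, y in test) / len(test)


def choose_model(trainers, rows):
    """Train every candidate on the training split, evaluate on the test split,
    and return the name of the best one together with all scores."""
    training, test = split_rows(rows)
    scores = {name: accuracy(train(training), test) for name, train in trainers.items()}
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical data: one numeric field, label is True from 50 upwards.
rows = [(x, x >= 50) for x in range(100)]
best, scores = choose_model(
    {"majority": train_majority, "threshold": train_threshold}, rows
)
print(best, scores)
```

The important design point is that every candidate is scored on the same held-out test rows, so the final CHOOSE step compares like with like.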
BigML, Inc 12
Model Tuning
ENSEMBLE
N=20
EVALUATION
SOURCE DATASET
TRAINING
TEST
EVALUATION
EVALUATION
ENSEMBLE
N=10
ENSEMBLE
N=1000
CHOOSE
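The tuning loop above (same training and test data, ensembles with N=10, N=20 and N=1000, pick one) amounts to a parameter sweep. A minimal sketch follows, assuming a stand-in evaluation function; toy_evaluation is purely hypothetical and would be replaced by building and evaluating a real ensemble. The tolerance argument is also an assumption added here: it prefers the smallest N whose score is close enough to the best.

```python
def tune(candidates, evaluate, tolerance=0.0):
    """Evaluate every candidate value and return the smallest one whose score
    is within `tolerance` of the best score, plus all scores."""
    scores = {value: evaluate(value) for value in candidates}
    best_score = max(scores.values())
    good_enough = [v for v, s in scores.items() if s >= best_score - tolerance]
    return min(good_enough), scores


def toy_evaluation(n_models):
    """Hypothetical stand-in for 'build an ensemble of n_models and evaluate it':
    the score improves with more models, with diminishing returns."""
    return 1 - 1 / n_models


strict_choice, scores = tune([10, 20, 1000], toy_evaluation)
cheap_choice, _ = tune([10, 20, 1000], toy_evaluation, tolerance=0.05)
print(strict_choice, cheap_choice)
```

With no tolerance the sweep simply picks the top score; with a tolerance it trades a small loss in score for a much smaller ensemble.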
BigML, Inc 13
SMACdown
•How many models?
•How many nodes?
•Missing splits or not?
•Number of random candidates?
•Balance the objective?
SMACdown can tell you!
BigML, Inc 14
Best-First Features
{F1} {F2} {F3} {F4} ... {Fn}
CHOOSE BEST
S = {Fa}
S+{F1} S+{F2} S+{F3} S+{F4} ... S+{Fn-1}
CHOOSE BEST
S = {Fa, Fb}
S+{F1} S+{F2} S+{F3} S+{F4} ... S+{Fn-1}
CHOOSE BEST
S = {Fa, Fb, Fc}
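The best-first procedure above is a greedy forward selection: score every single feature, keep the best as S, then score S plus each remaining feature, and repeat. A minimal sketch, assuming a caller-supplied score function (in practice, an evaluation of a model built on those fields). The stopping rule (stop when no addition improves the score), the feature names, and the toy scoring are assumptions for illustration only.

```python
def best_first_features(features, score, max_features=None):
    """Greedy forward selection: grow the set S one feature at a time,
    always adding the feature that gives the best score for S + {F}."""
    selected = []
    best_score = float("-inf")
    remaining = list(features)
    while remaining and (max_features is None or len(selected) < max_features):
        # Score S + {F} for every feature F not yet selected.
        scored = [(score(selected + [f]), f) for f in remaining]
        top_score, top_feature = max(scored, key=lambda pair: pair[0])
        if top_score <= best_score:
            break  # Assumed stopping rule: no candidate improves on S.
        selected.append(top_feature)
        remaining.remove(top_feature)
        best_score = top_score
    return selected, best_score


# Hypothetical usefulness of each field, minus a fixed cost per field used.
USEFULNESS = {"sqft": 50, "bedrooms": 30, "zip": 15, "noise": 0}


def toy_score(subset):
    return sum(USEFULNESS[f] for f in subset) - 5 * len(subset)


selected, final_score = best_first_features(list(USEFULNESS), toy_score)
print(selected, final_score)
```

Each round costs one evaluation per remaining feature, which is why this kind of search benefits from being automated as a workflow rather than run by hand.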
BigML, Inc 15
Stacked Generalization
ENSEMBLE
LOGISTIC
REGRESSION
SOURCE DATASET
MODEL
BATCH
PREDICTION
BATCH
PREDICTION
BATCH
PREDICTION
EXTENDED
DATASET
EXTENDED
DATASET
EXTENDED
DATASET
LOGISTIC
REGRESSION
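The stacking diagram above can be read as: run each base model's batch prediction over the dataset, append those predictions as new fields (the extended dataset), then train a final model on the extended rows. The sketch below follows that shape in plain Python. To stay dependency-free it swaps the final LOGISTIC REGRESSION for a simple lookup table keyed on the base predictions; the base models, field meanings, and function names are all hypothetical.

```python
from collections import Counter, defaultdict


def extend_dataset(rows, base_models):
    """Append every base model's prediction to each row's inputs."""
    return [(tuple(x) + tuple(m(x) for m in base_models), y) for x, y in rows]


def train_meta_model(extended_rows, n_base):
    """Meta-learner stand-in: for each combination of base predictions,
    remember the most common label seen in the extended dataset."""
    votes = defaultdict(Counter)
    for x, y in extended_rows:
        votes[x[-n_base:]][y] += 1
    table = {key: counts.most_common(1)[0][0] for key, counts in votes.items()}
    fallback = Counter(y for _, y in extended_rows).most_common(1)[0][0]
    return lambda extended_x: table.get(extended_x[-n_base:], fallback)


def stack(rows, base_models):
    """Build the stacked predictor: base predictions first, meta-model second."""
    meta = train_meta_model(extend_dataset(rows, base_models), len(base_models))

    def predict(x):
        extended_x = tuple(x) + tuple(m(x) for m in base_models)
        return meta(extended_x)

    return predict


# Hypothetical base models over (square feet, bedrooms).
base_models = [lambda x: int(x[0] > 1500), lambda x: int(x[1] >= 3)]
# Hypothetical labels: 1 only when the home is both large and has 3+ bedrooms.
rows = [((1000, 2), 0), ((1000, 4), 0), ((2000, 2), 0), ((2000, 4), 1)]

stacked = stack(rows, base_models)
print([stacked(x) for x, _ in rows])
```

Neither base model is right on all four rows by itself, but the meta-model trained on their combined outputs is. In a real workflow the base predictions used to train the meta-model would normally come from held-out data, to avoid leaking training labels.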
BigML, Inc 16
Better Algorithms
•Stacked Generalization
•Boosting
•AdaBoost
•LogitBoost
•Martingale Boosting
•Gradient Boosting
BigML, Inc 17
Why Workflows
•Machine Learning is iterative by nature.
•ML tools still require many repetitive (and manual)
tasks.
•Instead of helping users focus on the output, many
tools force analysts, developers, and scientists to
focus on infrastructure, parallelism, etc.
•Not everybody can implement complex workflows
or meta-algorithms, but many people can reuse
them.
BigML, Inc 18
WhizzML Features
•A Domain-Specific Language (DSL) for
automating Machine Learning workflows.
•Complete programming language.
•Machine Learning “operations” are first-class
citizens.
•Scale is provided for free.
•API First! - Everything is composable.
BigML, Inc 19
WhizzML API Resources
SCRIPT LIBRARY EXECUTION
BigML, Inc 20
Via API
export BIGML_USERNAME=myuser
export BIGML_API_KEY=6ef37b3d791061d345ef51281dae821ac7943ed7
export BIGML_AUTH="username=$BIGML_USERNAME;api_key=$BIGML_API_KEY"
export SCRIPT="https://bigml.io/script?$BIGML_AUTH"
export LIBRARY="https://bigml.io/library?$BIGML_AUTH"
export EXECUTION="https://bigml.io/execution?$BIGML_AUTH"
BigML, Inc 21
Via API
http "$LIBRARY" \
  source_code="(define (addition a b) (+ a b))" | jq ".resource"
"library/573a97f5b95b3941f6000004"
http "$SCRIPT" \
  imports:='["library/573a97f5b95b3941f6000004"]' \
  source_code="(addition x 2)" \
  inputs:='[{"name": "x", "type": "number"}]' | jq ".resource"
"script/573a9862b95b3941ff000015"
http "$EXECUTION" \
  script=script/573a9862b95b3941ff000015 \
  inputs:='[["x", 5]]' | jq ".resource"
"execution/573a987ab95b3941f000000d"
http "https://bigml.io/execution/573a987ab95b3941f000000d?$BIGML_AUTH" \
  | jq ".execution.result"
7
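The same three requests can be prepared from Python using only the standard library. This is a sketch under stated assumptions, not the official bindings: build_request is a hypothetical helper, the credentials are placeholders, and the requests are only constructed here, not sent. The URL shape and the JSON fields (source_code, imports, inputs, script) are taken from the commands above.

```python
import json
import urllib.request

BIGML_IO = "https://bigml.io"


def build_request(resource, payload, username, api_key):
    """Prepare a JSON POST to https://bigml.io/<resource> with the
    username/api_key query string used in the shell examples."""
    url = f"{BIGML_IO}/{resource}?username={username};api_key={api_key}"
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )


# Placeholder credentials; substitute your own before sending anything.
USERNAME, API_KEY = "myuser", "my-api-key"

library_request = build_request(
    "library", {"source_code": "(define (addition a b) (+ a b))"}, USERNAME, API_KEY
)
script_request = build_request(
    "script",
    {
        "imports": ["library/573a97f5b95b3941f6000004"],
        "source_code": "(addition x 2)",
        "inputs": [{"name": "x", "type": "number"}],
    },
    USERNAME,
    API_KEY,
)
execution_request = build_request(
    "execution",
    {"script": "script/573a9862b95b3941ff000015", "inputs": [["x", 5]]},
    USERNAME,
    API_KEY,
)
print(library_request.get_method(), library_request.full_url)
```

To actually create a resource, pass a request to urllib.request.urlopen and read the "resource" field from the JSON response, as the jq filters do above. Each step depends on the identifier returned by the previous one, so the library and script ids shown here would be replaced by the ones you get back.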
BigML, Inc 22
Via Bindings
https://gist.github.com/whizzmler/8a849c282a770ac79a1441df5c5ccf62
BigML, Inc 23
Gallery Scripts
BigML, Inc 24
Importing from GitHub
BigML, Inc 25
WhizzML in GitHub (NEW)
https://github.com/whizzml/examples
BigML, Inc 26
WhizzML UI Resources
BigML, Inc 27
Script Editor
BigML, Inc 28
WhizzML REPL (NEW)
https://bigml.com/whizzml
BigML, Inc 29
Reify
•"Reifies" a resource into a WhizzML script.
•Rapid prototyping meets automation.
•Coming soon…
BigML, Inc 30
Secret Link Scripts
https://bigml.com/shared/script/oazVtg8t2V2JHFf6PLmenUJbNU
https://bigml.com/dashboard/script/573d53a628eb3e026f000012
BigML, Inc 31
A Gallery of Scripts
https://bigml.com/gallery/scripts
BigML, Inc 32
Share or Sell
•Each script is reviewed internally by BigML Team
members.
BigML, Inc 33
Personalizing 1-Click Menus
BigML, Inc 34
Demo
BigML, Inc 35
API Documentation
• https://bigml.com/developers/libraries
• https://bigml.com/developers/scripts
• https://bigml.com/developers/executions
NEW
BigML, Inc 36
WhizzML Page (NEW)
https://bigml.com/whizzml
BigML, Inc 37
Documentation
•Getting Started with WhizzML (The BigML Team, Version 1.0)
•WhizzML Reference Manual (The BigML Team, Version draft)
•WhizzML Tutorials (The BigML Team, Version draft)
MACHINE LEARNING MADE BEAUTIFULLY SIMPLE
Copyright © 2016, BigML, Inc.
NEW
https://bigml.com/whizzml#documentation
BigML, Inc 38
Training (NEW)
BigML, Inc 39
https://bigml.com/events
FREE Training (NEW)
BigML, Inc 40
Spring 2016 Release
https://bigml.com/releases
NEW
BigML, Inc 41
Conclusion
•Automation is critical to fulfilling the promise of ML.
•WhizzML can create workflows that:
•Automate repetitive tasks.
•Automate model tuning and feature
selection.
•Combine ML models into more powerful
algorithms.
•Create shareable and re-usable executions.
Questions?
twitter: @whizzml
mail: info@bigml.com
