Enviar pesquisa
Carregar
Apache Spark Machine Learning
•
4 gostaram
•
1,960 visualizações
Carol McDonald
Seguir
Introduction to Machine Learning with Apache Spark
Leia menos
Leia mais
Software
Denunciar
Compartilhar
Denunciar
Compartilhar
1 de 41
Baixar agora
Baixar para ler offline
Recomendados
Introduction to Machine Learning with Spark
Introduction to Machine Learning with Spark
datamantra
Large-Scale Machine Learning with Apache Spark
Large-Scale Machine Learning with Apache Spark
DB Tsai
Large Scale Machine Learning with Apache Spark
Large Scale Machine Learning with Apache Spark
Cloudera, Inc.
Large Scale Machine learning with Spark
Large Scale Machine learning with Spark
Md. Mahedi Kaysar
2014-10-20 Large-Scale Machine Learning with Apache Spark at Internet of Thin...
2014-10-20 Large-Scale Machine Learning with Apache Spark at Internet of Thin...
DB Tsai
MLlib and Machine Learning on Spark
MLlib and Machine Learning on Spark
Petr Zapletal
Recent Developments in Spark MLlib and Beyond
Recent Developments in Spark MLlib and Beyond
DataWorks Summit
Sparse Data Support in MLlib
Sparse Data Support in MLlib
Xiangrui Meng
Recomendados
Introduction to Machine Learning with Spark
Introduction to Machine Learning with Spark
datamantra
Large-Scale Machine Learning with Apache Spark
Large-Scale Machine Learning with Apache Spark
DB Tsai
Large Scale Machine Learning with Apache Spark
Large Scale Machine Learning with Apache Spark
Cloudera, Inc.
Large Scale Machine learning with Spark
Large Scale Machine learning with Spark
Md. Mahedi Kaysar
2014-10-20 Large-Scale Machine Learning with Apache Spark at Internet of Thin...
2014-10-20 Large-Scale Machine Learning with Apache Spark at Internet of Thin...
DB Tsai
MLlib and Machine Learning on Spark
MLlib and Machine Learning on Spark
Petr Zapletal
Recent Developments in Spark MLlib and Beyond
Recent Developments in Spark MLlib and Beyond
DataWorks Summit
Sparse Data Support in MLlib
Sparse Data Support in MLlib
Xiangrui Meng
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
DB Tsai
Introduction to Spark
Introduction to Spark
Carol McDonald
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
Spark Summit
Lessons Learned while Implementing a Sparse Logistic Regression Algorithm in ...
Lessons Learned while Implementing a Sparse Logistic Regression Algorithm in ...
Spark Summit
Lazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Lazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Databricks
Ray: A Cluster Computing Engine for Reinforcement Learning Applications with ...
Ray: A Cluster Computing Engine for Reinforcement Learning Applications with ...
Databricks
Introduction to Mahout
Introduction to Mahout
Ted Dunning
Machine Learning using Apache Spark MLlib
Machine Learning using Apache Spark MLlib
IMC Institute
Machine learning with Apache Spark MLlib | Big Data Hadoop Spark Tutorial | C...
Machine learning with Apache Spark MLlib | Big Data Hadoop Spark Tutorial | C...
CloudxLab
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
Jose Quesada (hiring)
Large-Scale Lasso and Elastic-Net Regularized Generalized Linear Models (DB T...
Large-Scale Lasso and Elastic-Net Regularized Generalized Linear Models (DB T...
Spark Summit
Machine Learning With Spark
Machine Learning With Spark
Shivaji Dutta
Misha Bilenko, Principal Researcher, Microsoft at MLconf SEA - 5/01/15
Misha Bilenko, Principal Researcher, Microsoft at MLconf SEA - 5/01/15
MLconf
Spark 101
Spark 101
Mohit Garg
Spark Summit EU talk by Reza Karimi
Spark Summit EU talk by Reza Karimi
Spark Summit
Spark ML par Xebia (Spark Meetup du 11/06/2015)
Spark ML par Xebia (Spark Meetup du 11/06/2015)
Modern Data Stack France
A More Scaleable Way of Making Recommendations with MLlib-(Xiangrui Meng, Dat...
A More Scaleable Way of Making Recommendations with MLlib-(Xiangrui Meng, Dat...
Spark Summit
Continuous Evaluation of Deployed Models in Production Many high-tech industr...
Continuous Evaluation of Deployed Models in Production Many high-tech industr...
Databricks
Time-evolving Graph Processing on Commodity Clusters: Spark Summit East talk ...
Time-evolving Graph Processing on Commodity Clusters: Spark Summit East talk ...
Spark Summit
A Graph-Based Method For Cross-Entity Threat Detection
A Graph-Based Method For Cross-Entity Threat Detection
Jen Aman
Crab: A Python Framework for Building Recommender Systems
Crab: A Python Framework for Building Recommender Systems
Marcel Caraciolo
MLlib: Spark's Machine Learning Library
MLlib: Spark's Machine Learning Library
jeykottalam
Mais conteúdo relacionado
Mais procurados
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
DB Tsai
Introduction to Spark
Introduction to Spark
Carol McDonald
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
Spark Summit
Lessons Learned while Implementing a Sparse Logistic Regression Algorithm in ...
Lessons Learned while Implementing a Sparse Logistic Regression Algorithm in ...
Spark Summit
Lazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Lazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Databricks
Ray: A Cluster Computing Engine for Reinforcement Learning Applications with ...
Ray: A Cluster Computing Engine for Reinforcement Learning Applications with ...
Databricks
Introduction to Mahout
Introduction to Mahout
Ted Dunning
Machine Learning using Apache Spark MLlib
Machine Learning using Apache Spark MLlib
IMC Institute
Machine learning with Apache Spark MLlib | Big Data Hadoop Spark Tutorial | C...
Machine learning with Apache Spark MLlib | Big Data Hadoop Spark Tutorial | C...
CloudxLab
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
Jose Quesada (hiring)
Large-Scale Lasso and Elastic-Net Regularized Generalized Linear Models (DB T...
Large-Scale Lasso and Elastic-Net Regularized Generalized Linear Models (DB T...
Spark Summit
Machine Learning With Spark
Machine Learning With Spark
Shivaji Dutta
Misha Bilenko, Principal Researcher, Microsoft at MLconf SEA - 5/01/15
Misha Bilenko, Principal Researcher, Microsoft at MLconf SEA - 5/01/15
MLconf
Spark 101
Spark 101
Mohit Garg
Spark Summit EU talk by Reza Karimi
Spark Summit EU talk by Reza Karimi
Spark Summit
Spark ML par Xebia (Spark Meetup du 11/06/2015)
Spark ML par Xebia (Spark Meetup du 11/06/2015)
Modern Data Stack France
A More Scaleable Way of Making Recommendations with MLlib-(Xiangrui Meng, Dat...
A More Scaleable Way of Making Recommendations with MLlib-(Xiangrui Meng, Dat...
Spark Summit
Continuous Evaluation of Deployed Models in Production Many high-tech industr...
Continuous Evaluation of Deployed Models in Production Many high-tech industr...
Databricks
Time-evolving Graph Processing on Commodity Clusters: Spark Summit East talk ...
Time-evolving Graph Processing on Commodity Clusters: Spark Summit East talk ...
Spark Summit
A Graph-Based Method For Cross-Entity Threat Detection
A Graph-Based Method For Cross-Entity Threat Detection
Jen Aman
Mais procurados
(20)
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
Introduction to Spark
Introduction to Spark
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
Netflix's Recommendation ML Pipeline Using Apache Spark: Spark Summit East ta...
Lessons Learned while Implementing a Sparse Logistic Regression Algorithm in ...
Lessons Learned while Implementing a Sparse Logistic Regression Algorithm in ...
Lazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Lazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Ray: A Cluster Computing Engine for Reinforcement Learning Applications with ...
Ray: A Cluster Computing Engine for Reinforcement Learning Applications with ...
Introduction to Mahout
Introduction to Mahout
Machine Learning using Apache Spark MLlib
Machine Learning using Apache Spark MLlib
Machine learning with Apache Spark MLlib | Big Data Hadoop Spark Tutorial | C...
Machine learning with Apache Spark MLlib | Big Data Hadoop Spark Tutorial | C...
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
Large-Scale Lasso and Elastic-Net Regularized Generalized Linear Models (DB T...
Large-Scale Lasso and Elastic-Net Regularized Generalized Linear Models (DB T...
Machine Learning With Spark
Machine Learning With Spark
Misha Bilenko, Principal Researcher, Microsoft at MLconf SEA - 5/01/15
Misha Bilenko, Principal Researcher, Microsoft at MLconf SEA - 5/01/15
Spark 101
Spark 101
Spark Summit EU talk by Reza Karimi
Spark Summit EU talk by Reza Karimi
Spark ML par Xebia (Spark Meetup du 11/06/2015)
Spark ML par Xebia (Spark Meetup du 11/06/2015)
A More Scaleable Way of Making Recommendations with MLlib-(Xiangrui Meng, Dat...
A More Scaleable Way of Making Recommendations with MLlib-(Xiangrui Meng, Dat...
Continuous Evaluation of Deployed Models in Production Many high-tech industr...
Continuous Evaluation of Deployed Models in Production Many high-tech industr...
Time-evolving Graph Processing on Commodity Clusters: Spark Summit East talk ...
Time-evolving Graph Processing on Commodity Clusters: Spark Summit East talk ...
A Graph-Based Method For Cross-Entity Threat Detection
A Graph-Based Method For Cross-Entity Threat Detection
Destaque
Crab: A Python Framework for Building Recommender Systems
Crab: A Python Framework for Building Recommender Systems
Marcel Caraciolo
MLlib: Spark's Machine Learning Library
MLlib: Spark's Machine Learning Library
jeykottalam
Large-scale Parallel Collaborative Filtering and Clustering using MapReduce f...
Large-scale Parallel Collaborative Filtering and Clustering using MapReduce f...
Varad Meru
Collaborative Filtering using KNN
Collaborative Filtering using KNN
Şeyda Hatipoğlu
Recommender Systems with Apache Spark's ALS Function
Recommender Systems with Apache Spark's ALS Function
Will Johnson
Collaborative Filtering and Recommender Systems By Navisro Analytics
Collaborative Filtering and Recommender Systems By Navisro Analytics
Navisro Analytics
Destaque
(6)
Crab: A Python Framework for Building Recommender Systems
Crab: A Python Framework for Building Recommender Systems
MLlib: Spark's Machine Learning Library
MLlib: Spark's Machine Learning Library
Large-scale Parallel Collaborative Filtering and Clustering using MapReduce f...
Large-scale Parallel Collaborative Filtering and Clustering using MapReduce f...
Collaborative Filtering using KNN
Collaborative Filtering using KNN
Recommender Systems with Apache Spark's ALS Function
Recommender Systems with Apache Spark's ALS Function
Collaborative Filtering and Recommender Systems By Navisro Analytics
Collaborative Filtering and Recommender Systems By Navisro Analytics
Semelhante a Apache Spark Machine Learning
Parallel and Iterative Processing for Machine Learning Recommendations with S...
Parallel and Iterative Processing for Machine Learning Recommendations with S...
MapR Technologies
Hadoop France meetup Feb2016 : recommendations with spark
Hadoop France meetup Feb2016 : recommendations with spark
Modern Data Stack France
Free Code Friday - Machine Learning with Apache Spark
Free Code Friday - Machine Learning with Apache Spark
MapR Technologies
Apache Spark Machine Learning Decision Trees
Apache Spark Machine Learning Decision Trees
Carol McDonald
Intro to Apache Spark by Marco Vasquez
Intro to Apache Spark by Marco Vasquez
MapR Technologies
Introduction to Collaborative Filtering with Apache Mahout
Introduction to Collaborative Filtering with Apache Mahout
sscdotopen
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
PAPIs.io
Training Large-scale Ad Ranking Models in Spark
Training Large-scale Ad Ranking Models in Spark
Patrick Pletscher
Azure Machine Learning Dotnet Campus 2015
Azure Machine Learning Dotnet Campus 2015
antimo musone
Object Oriented Programming in Matlab
Object Oriented Programming in Matlab
AlbanLevy
MACHINE LEARNING FOR OPTIMIZING SEARCH RESULTS WITH DRUPAL & APACHE SOLR
MACHINE LEARNING FOR OPTIMIZING SEARCH RESULTS WITH DRUPAL & APACHE SOLR
DrupalCamp Kyiv
Data Science in the Elastic Stack
Data Science in the Elastic Stack
Rochelle Sonnenberg
ML-Ops how to bring your data science to production
ML-Ops how to bring your data science to production
Herman Wu
Silicon valleycodecamp2013
Silicon valleycodecamp2013
Sanjeev Mishra
Automate ml workflow_transmogrif_ai-_chetan_khatri_berlin-scala
Automate ml workflow_transmogrif_ai-_chetan_khatri_berlin-scala
Chetan Khatri
Designing and Building a Graph Database Application – Architectural Choices, ...
Designing and Building a Graph Database Application – Architectural Choices, ...
Neo4j
Data science with R - Clustering and Classification
Data science with R - Clustering and Classification
Brigitte Mueller
(Py)testing the Limits of Machine Learning
(Py)testing the Limits of Machine Learning
Rebecca Bilbro
DataScience-101
DataScience-101
Karthikeyan VK
Cloudera Data Science Challenge
Cloudera Data Science Challenge
Mark Nichols, P.E.
Semelhante a Apache Spark Machine Learning
(20)
Parallel and Iterative Processing for Machine Learning Recommendations with S...
Parallel and Iterative Processing for Machine Learning Recommendations with S...
Hadoop France meetup Feb2016 : recommendations with spark
Hadoop France meetup Feb2016 : recommendations with spark
Free Code Friday - Machine Learning with Apache Spark
Free Code Friday - Machine Learning with Apache Spark
Apache Spark Machine Learning Decision Trees
Apache Spark Machine Learning Decision Trees
Intro to Apache Spark by Marco Vasquez
Intro to Apache Spark by Marco Vasquez
Introduction to Collaborative Filtering with Apache Mahout
Introduction to Collaborative Filtering with Apache Mahout
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Training Large-scale Ad Ranking Models in Spark
Training Large-scale Ad Ranking Models in Spark
Azure Machine Learning Dotnet Campus 2015
Azure Machine Learning Dotnet Campus 2015
Object Oriented Programming in Matlab
Object Oriented Programming in Matlab
MACHINE LEARNING FOR OPTIMIZING SEARCH RESULTS WITH DRUPAL & APACHE SOLR
MACHINE LEARNING FOR OPTIMIZING SEARCH RESULTS WITH DRUPAL & APACHE SOLR
Data Science in the Elastic Stack
Data Science in the Elastic Stack
ML-Ops how to bring your data science to production
ML-Ops how to bring your data science to production
Silicon valleycodecamp2013
Silicon valleycodecamp2013
Automate ml workflow_transmogrif_ai-_chetan_khatri_berlin-scala
Automate ml workflow_transmogrif_ai-_chetan_khatri_berlin-scala
Designing and Building a Graph Database Application – Architectural Choices, ...
Designing and Building a Graph Database Application – Architectural Choices, ...
Data science with R - Clustering and Classification
Data science with R - Clustering and Classification
(Py)testing the Limits of Machine Learning
(Py)testing the Limits of Machine Learning
DataScience-101
DataScience-101
Cloudera Data Science Challenge
Cloudera Data Science Challenge
Mais de Carol McDonald
Introduction to machine learning with GPUs
Introduction to machine learning with GPUs
Carol McDonald
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Carol McDonald
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Carol McDonald
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
Carol McDonald
Predicting Flight Delays with Spark Machine Learning
Predicting Flight Delays with Spark Machine Learning
Carol McDonald
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Carol McDonald
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Carol McDonald
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Carol McDonald
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...
Carol McDonald
How Big Data is Reducing Costs and Improving Outcomes in Health Care
How Big Data is Reducing Costs and Improving Outcomes in Health Care
Carol McDonald
Demystifying AI, Machine Learning and Deep Learning
Demystifying AI, Machine Learning and Deep Learning
Carol McDonald
Spark graphx
Spark graphx
Carol McDonald
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Carol McDonald
Streaming patterns revolutionary architectures
Streaming patterns revolutionary architectures
Carol McDonald
Spark machine learning predicting customer churn
Spark machine learning predicting customer churn
Carol McDonald
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1
Carol McDonald
Applying Machine Learning to Live Patient Data
Applying Machine Learning to Live Patient Data
Carol McDonald
Streaming Patterns Revolutionary Architectures with the Kafka API
Streaming Patterns Revolutionary Architectures with the Kafka API
Carol McDonald
Advanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming Data
Carol McDonald
Fast, Scalable, Streaming Applications with Spark Streaming, the Kafka API an...
Fast, Scalable, Streaming Applications with Spark Streaming, the Kafka API an...
Carol McDonald
Mais de Carol McDonald
(20)
Introduction to machine learning with GPUs
Introduction to machine learning with GPUs
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Streaming healthcare Data pipeline using Apache APIs: Kafka and Spark with Ma...
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Analyzing Flight Delays with Apache Spark, DataFrames, GraphFrames, and MapR-DB
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
Analysis of Popular Uber Locations using Apache APIs: Spark Machine Learning...
Predicting Flight Delays with Spark Machine Learning
Predicting Flight Delays with Spark Machine Learning
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Structured Streaming Data Pipeline Using Kafka, Spark, and MapR-DB
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Streaming Machine learning Distributed Pipeline for Real-Time Uber Data Using...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real-Ti...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...
Applying Machine Learning to IOT: End to End Distributed Pipeline for Real- T...
How Big Data is Reducing Costs and Improving Outcomes in Health Care
How Big Data is Reducing Costs and Improving Outcomes in Health Care
Demystifying AI, Machine Learning and Deep Learning
Demystifying AI, Machine Learning and Deep Learning
Spark graphx
Spark graphx
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Applying Machine learning to IOT: End to End Distributed Distributed Pipeline...
Streaming patterns revolutionary architectures
Streaming patterns revolutionary architectures
Spark machine learning predicting customer churn
Spark machine learning predicting customer churn
Fast Cars, Big Data How Streaming can help Formula 1
Fast Cars, Big Data How Streaming can help Formula 1
Applying Machine Learning to Live Patient Data
Applying Machine Learning to Live Patient Data
Streaming Patterns Revolutionary Architectures with the Kafka API
Streaming Patterns Revolutionary Architectures with the Kafka API
Advanced Threat Detection on Streaming Data
Advanced Threat Detection on Streaming Data
Fast, Scalable, Streaming Applications with Spark Streaming, the Kafka API an...
Fast, Scalable, Streaming Applications with Spark Streaming, the Kafka API an...
Último
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
Jhone kinadey
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
Wave PLM
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
MyIntelliSource, Inc.
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Steffen Staab
Software Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
Arshad QA
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
ThousandEyes
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
kalichargn70th171
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
Fatema Valibhai
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
Andolasoft Inc
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
aagamshah0812
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
bodapatigopi8531
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
harshavardhanraghave
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
MyIntelliSource, Inc.
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
panagenda
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
Delhi Call girls
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
shikhaohhpro
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
anilsa9823
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
Último
(20)
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Software Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Apache Spark Machine Learning
1.
® © 2014 MapR
Technologies 1 ® © 2014 MapR Technologies Machine Learning with Spark Carol McDonald
2.
® © 2014 MapR
Technologies 2 Agenda • Classification • Clustering • Collaborative Filtering with Spark • Model training • Alternating Least Squares • The code
3.
® © 2014 MapR
Technologies 3 Three Categories of Techniques for Machine Learning classification Collaborative filtering (recommendation) clustering Groups similar items identifies category for item Recommend items
4.
® © 2014 MapR
Technologies 4 What is classification Form of ML that: • identifies which category an item belongs to • Uses supervised learning algorithms • Data is labeled Examples: • Spam Detection • spam/non-spam • Credit Card Fraud Detection • fraud/non-fraud • Sentiment analysis
5.
® © 2014 MapR
Technologies 5 Building and deploying a classifier model
6.
® © 2014 MapR
Technologies 6 If it walks/swims/quacks like a duck … Attributes, Features: • If it walks • If it swims • If it quacks “When I see a bird that walks like a duck and swims like a duck and quacks like a duck, I call that bird a duck.” classify something based on “if” conditions. Answer, Label: • Duck • Not duck
7.
® © 2014 MapR
Technologies 7 … then it must be a duck ducks not ducks walks quacks swims Label: • Duck • Not duck Features: • walks • swims • quacks
8.
® © 2014 MapR
Technologies 8 Building and deploying a classifier model Reference Learning Spark Oreilly Book
9.
® © 2014 MapR
Technologies 9 Vectorizing Data • identify interesting features (those that contribute to the model) • assign features to dimensions Example: vectorize an apple Features: [size, color, weight] Example: vectorize a text document (Term Frequency Inverse Term Frequency) Dictionary: [a, advance, after, …, you, yourself, youth, zigzag] [3.2, 16777184.0, 45.8][223,1,1,0,…,12,10,6,1]
10.
® © 2014 MapR
Technologies 10 Build Term Frequency Feature vectors // examples of spam val spam = sc.textFile("spam.txt") // examples of not spam val normal = sc.textFile("normal.txt”) // Create a HashingTF map email text to vectors of features val tf = new HashingTF(numFeatures = 10000) // Each email each word is mapped to one feature. val spamFeatures = spam .map(email => tf.transform(email.split(" "))) val normalFeatures = normal .map(email => tf.transform(email.split(" "))) Reference Learning Spark Oreilly Book
11.
® © 2014 MapR
Technologies 11 Building and deploying a classifier model
12.
® © 2014 MapR
Technologies 12 Build Model val trainingData = positiveExamples.union(negativeExamples) trainingData.cache() // Cache for iterative algorithm. // Run Logistic Regression using the SGD algorithm. val model = new LogisticRegressionWithSGD() .run(trainingData) Reference Learning Spark Oreilly Book
13.
® © 2014 MapR
Technologies 13 Building and deploying a classifier model Reference Learning Spark Oreilly Book
14.
® © 2014 MapR
Technologies 14 Model Evaluation // Test on a positive example (spam) Vector posTest = tf.transform(Arrays.asList( "O M G GET cheap stuff by sending money to...".split(" "))); // negative test not spam Vector negTest = tf.transform(Arrays.asList( "Hi Dad, I started studying Spark the other ...".split(" "))); System.out.println("Prediction for positive: " + model.predict(posTest)); System.out.println("Prediction for negative: " + model.predict(negTest));
15.
® © 2014 MapR
Technologies 15 Three Categories of Techniques for Machine Learning classification Collaborative filtering (recommendation) clustering
16.
® © 2014 MapR
Technologies 16 Clustering • Clustering is the unsupervised learning task that involves grouping objects into clusters of high similarity – Search results grouping – grouping of customers by similar habits – Anomaly detection • data traffic – Text categorization
17.
® © 2014 MapR
Technologies 17 What is Clustering? Clustering = (unsupervised) task of grouping similar objects MLlib K-means algorithm for clustering 1. randomly initialize centers of clusters 2. Assign all points to the closest cluster center 3. Change cluster centers to be in the middle of its points 4. Repeat until convergence
18.
® © 2014 MapR
Technologies 18 What is Clustering? Clustering = (unsupervised) task of grouping similar objects
19.
® © 2014 MapR
Technologies 19 Examples of ML Algorithms machine learning supervised unsupervised • Classification • Naïve Bayes • SVM • Random Decision Forests • Regression • Linear • logistic • Clustering • K-means • Dimensionality reduction • Principal Component Analysis • SVD
20.
® © 2014 MapR
Technologies 20 ML Algorithms http://scikit-learn.org/stable/tutorial/machine_learning_map/index.html
21.
® © 2014 MapR
Technologies 21 Three Categories of Techniques for Machine Learning classification Collaborative filtering (recommendation) clustering
22.
® © 2014 MapR
Technologies 22 Collaborative Filtering with Spark • Recommend Items – (filtering) • Based on User preferences data – (collaborative)
23.
® © 2014 MapR
Technologies 23 Train a Model to Make Predictions New Data Model Predictions Training Data ModelAlgorithm Ted and Carol like Movie B and C Bob likes Movie B, What might he like ? Bob likes Movie B, Predict C
24.
® © 2014 MapR
Technologies 24 Alternating Least Squares • approximates sparse user item rating matrix – as product of two dense matrices, User and Item factor matrices – tries to learn the hidden features of each user and item – algorithm alternatively fixes one factor matrix and solves for the other ?
25.
® © 2014 MapR
Technologies 25 ML Cross Validation Process Data Model Training/ Building Test Model Predictions Test Set Train Test loop Training Set
26.
® © 2014 MapR
Technologies 26 Ratings Data
27.
® © 2014 MapR
Technologies 27 Parse Input // parse input UserID::MovieID::Rating def parseRating(str: String): Rating= { val fields = str.split("::") Rating(fields(0).toInt, fields(1).toInt, fields(2).toDouble) } // create an RDD of Ratings objects val ratingsRDD = ratingText.map(parseRating).cache()
28.
® © 2014 MapR
Technologies 28 Build Model Data Build Model Test Set Training Set split ratings RDD into training data RDD (80%) and test data RDD (20%) build a user product matrix model
29.
® © 2014 MapR
Technologies 29 Create Model // Randomly split ratings RDD into training data RDD (80%) and test data RDD (20%) val splits = ratingsRDD.randomSplit(Array(0.8, 0.2), 0L) val trainingRatingsRDD = splits(0).cache() val testRatingsRDD = splits(1).cache() // build a ALS user product matrix model with rank=20, iterations=10 val model = (new ALS().setRank(20).setIterations(10) .run(trainingRatingsRDD))
30.
® © 2014 MapR
Technologies 30 Get predictions // get predicted ratings to compare to test ratings val testUserProductRDD = testRatingsRDD.map { case Rating(user, product, rating) => (user, product) } // call model.predict with test Userid, MovieId input data val predictionsForTestRDD = model.predict(testUserProductRDD) User, Movie Test Data Model Predicted Ratings
31.
® © 2014 MapR
Technologies 31 Compare predictions to Tests Join predicted ratings to test ratings in order to compare ((user, product),test rating) ((user, product), predicted rating) ((user, product),(test rating, predicted rating)) Key, Value Key, Value Key, Value
32.
® © 2014 MapR
Technologies 32 Test Model // prepare predictions for comparison val predictionsKeyedByUserProductRDD = predictionsForTestRDD.map{ case Rating(user, product, rating) => ((user, product), rating) } // prepare test for comparison val testKeyedByUserProductRDD = testRatingsRDD.map{ case Rating(user, product, rating) => ((user, product), rating) } //Join the test with predictions val testAndPredictionsJoinedRDD = testKeyedByUserProductRDD .join(predictionsKeyedByUserProductRDD)
33.
® © 2014 MapR
Technologies 33 Compare predictions to Tests Find False positives: Where test rating <= 1 and predicted rating >= 4 ((user, product),(test rating, predicted rating)) Key, Value
34.
® © 2014 MapR
Technologies 34 Test Model val falsePositives =(testAndPredictionsJoinedRDD.filter{ case ((user, product), (ratingT, ratingP)) => (ratingT <= 1 && ratingP >=4) }) falsePositives.take(2) Array[((Int, Int), (Double, Double))] = ((3842,2858),(1.0,4.106488210964762)), ((6031,3194),(1.0,4.790778049100913))
35.
® © 2014 MapR
Technologies 35 Test Model Mean Absolute Error //Evaluate the model using Mean Absolute Error (MAE) between test and predictions val meanAbsoluteError = testAndPredictionsJoinedRDD.map { case ((user, product), (testRating, predRating)) => val err = (testRating - predRating) Math.abs(err) }.mean() meanAbsoluteError: Double = 0.7244940545944053
36.
® © 2014 MapR
Technologies 36 Get Predictions for new user val newRatingsRDD=sc.parallelize(Array(Rating(0,260,4),Rating(0,1,3)) // union val unionRatingsRDD = ratingsRDD.union(newRatingsRDD) // build a ALS user product matrix model val model = (new ALS().setRank(20).setIterations(10) .run(unionRatingsRDD)) // get 5 recs for userid 0 val topRecsForUser = model.recommendProducts(0, 5)
37.
® © 2014 MapR
Technologies 37 Soon to Come • Spark On Demand Training – https://www.mapr.com/services/mapr-academy/ • Blogs and Tutorials: – Movie Recommendations with Collaborative Filtering – Spark Streaming
38.
® © 2014 MapR
Technologies 38 Machine Learning Blog • https://www.mapr.com/blog/parallel-and-iterative-processing- machine-learning-recommendations-spark
39.
® © 2014 MapR
Technologies 39 Spark on MapR • Certified Spark Distribution • Fully supported and packaged by MapR in partnership with Databricks • YARN integration – Spark can then allocate resources from cluster when needed
40.
® © 2014 MapR
Technologies 40 References • Spark Online course: learn.mapr.com • Spark web site: http://spark.apache.org/ • https://databricks.com/ • Spark on MapR: – http://www.mapr.com/products/apache-spark • Spark SQL and DataFrame Guide • Apache Spark vs. MapReduce – Whiteboard Walkthrough • Learning Spark - O'Reilly Book • Apache Spark
41.
® © 2014 MapR
Technologies 41 Q&A @mapr maprtech Engage with us! MapR maprtech mapr-technologies
Baixar agora