SlideShare a Scribd company logo
1 of 68
Download to read offline
Building a
Machine Learning
Recommendation Engine
in SQL
@garyorenstein @memsql
MemSQL 1
Today’s Talk
1. State of Data 2018 according to Gartner
2. Rise of Machine Learning
3. Live Demo - A SQL Recommendation Engine
MemSQL 2
SECTION 1
The State of DataAccording to Gartner 2018
MemSQL 3
Hype Cycle for Data
Management
26 July 2017
Donald Feinberg
Adam M. Ronthal
G00313950
MemSQL 4
MemSQL 5
Multimodel has the potential
to support both relational and
nonrelational use cases
while reducing the number of
disparate DBMS products
in an organization.
MemSQL 6
the idea of a
Hadoop distribution
will become obsolete
before it reaches
the Plateau of Productivity
MemSQL 7
Penetration continues to increase and organizations
should be evaluating these resources for
— cost-efficiency
— infrastructure simplification and
— new use cases, such as Hybrid Transactional/
Analytical Processing (HTAP)
MemSQL 8
Build Your Digital Business
Platform Around Data and
Analytics
31 January 2018
Andrew White
W. Roy Schulte
Roxane Edjlali
Joao Tapadinhas
Svetlana Sicular
G00350435
MemSQL 9
Select Challenges
Data and analytics investments that are tied to
measurable business outcomes are more likely to
produce reportable benefits.
MemSQL 10
Magic Quadrant for Data
Management Solutions for
Analytics
13 February 2018
Adam M. Ronthal
Roxane Edjlali
Rick Greenwald
G00326691
MemSQL 11
We define four primary use cases for DMSAs that reflect
this diversity of data and use cases:
— Traditional data warehouse
— Real-time data warehouse
— Context-independent data warehouse
— Logical data warehouse
MemSQL 12
MemSQL 13
MemSQL 14
Real-Time Data Warehouse
This use case adds a real-time component to analytics
use cases, with the aim of reducing latency — the time
lag between when data is generated and when it can be
analyzed.
MemSQL 15
MemSQL 16
Other Vendors to Consider for
Operational DBMSs
23 November 2017
Donald Feinberg
Merv Adrian
Nick Heudecker
G00327284
MemSQL 17
Other Vendors to Consider for Operational DBMSs
Actian
Aerospike
Alibaba Cloud
Altibase
ArangoDB
Cloudera
Clustrix
Couchbase
FairCom
Fujitsu
General Data Technology
Hortonworks
MariaDB
MemSQL
MongoDB
Neo4j
NuoDB
Percona
Redis Labs
SequoiaDB
TmaxSoft
VoltDB
MemSQL 18
Other Vendors to Consider for Operational DBMSs
also listed as Challenger or Leader
in the Magic Quadrant
for Data Management Solutions for Analytics
MemSQL
MemSQL 19
MemSQL 20
Over the next five years,
the OPDBMS and DMSA
markets converge to a
single DBMS market.
MemSQL 21
Look to your operational DBMS
vendor for both transactional
and analytical workloads.
MemSQL 22
SECTION 2
Rise of Machine Learning
MemSQL 23
MemSQL 24
MemSQL 25
MemSQL 26
MemSQL 27
MemSQL 28
MemSQL 29
2018 Outlook Survey
MemSQL and O’Reilly
1600+ respondents
memsql.com/MLsurvey
MemSQL 30
MemSQL 31
MemSQL 32
Machine Learning and
Databases
MemSQL 33
MemSQL 34
MemSQL 35
MemSQL 36
MemSQL 37
MemSQL 38
MemSQL 39
MemSQL 40
MemSQL 41
MemSQL 42
MemSQL 43
MemSQL 44
MemSQL 45
MemSQL 46
MemSQL 47
SECTION 3
DEMO with Yelp Dataset
MemSQL 48
MemSQL 49
MemSQL 50
MemSQL 51
MemSQL 52
Can you build a machine
learning recommendation
engine in SQL?
Yes
MemSQL 53
Can you build a machine learning
recommendation engine in SQL?
Yes
Should you?
For training? Maybe, maybe not.
For Operational Scoring?
Absolutely!
MemSQL 54
MemSQL 55
MemSQL 56
Secret Weapons to Machine Learning in SQL
— Extensibility
— Stored Procedures
— User Defined Functions
— User Defined Aggregates
— DOT_PRODUCT
— Compare two vectors
MemSQL 57
MemSQL 58
MemSQL 59
Sequel Pro Mac app for MySQL databases
MemSQL 60
MemSQL in one slide
— Distributed SQL database
— Massively parallel, lock-free, fast
— Full ACID features
— In-memory and on-disk
— JSON, key-value, geospatial, full-text search
— Robust security
— Built for transactions and analytics
MemSQL 61
MemSQL 62
MemSQL 63
Why do ML in SQL?
— Train in any number of systems
— Score in the database for applications from real-time
drilling to fraud detection to personalization
— Complete certain functions within the database to
radically simplify operational infrastructure
MemSQL 64
“It is a fine line between
a well executed SQL query on
live data and ML/AI”
MemSQL 65
MemSQL 66
Thank you!
Please visit our booth
www.memsql.com
@garyorenstein
@memsql
MemSQL 67
Abstract: Building a Machine Learning Recommendation Engine in SQL
Modern businesses constantly seek deeper customer relationships and more
compelling experiences.
To accomplish this, companies are looking to machine learning and artificial
intelligence solutions; however, that often involves a host of new systems and
approaches.
With a modern database architecture, it is possible to build compelling machine
learning solutions with SQL, deliver real-time engagements, and rapidly move to
operational applications.
See live, how a modern database can accomplish these feats within a single
integrated solution.
MemSQL 68

More Related Content

What's hot

Bringing olap fully online analyze changing datasets in mem sql and spark wi...
Bringing olap fully online  analyze changing datasets in mem sql and spark wi...Bringing olap fully online  analyze changing datasets in mem sql and spark wi...
Bringing olap fully online analyze changing datasets in mem sql and spark wi...
SingleStore
 
The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...
The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...
The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...
Databricks
 

What's hot (20)

See who is using MemSQL
See who is using MemSQLSee who is using MemSQL
See who is using MemSQL
 
Google App Engine
Google App EngineGoogle App Engine
Google App Engine
 
Introducing MemSQL 4
Introducing MemSQL 4Introducing MemSQL 4
Introducing MemSQL 4
 
Real-Time Geospatial Intelligence at Scale
Real-Time Geospatial Intelligence at Scale Real-Time Geospatial Intelligence at Scale
Real-Time Geospatial Intelligence at Scale
 
Bringing olap fully online analyze changing datasets in mem sql and spark wi...
Bringing olap fully online  analyze changing datasets in mem sql and spark wi...Bringing olap fully online  analyze changing datasets in mem sql and spark wi...
Bringing olap fully online analyze changing datasets in mem sql and spark wi...
 
In-Memory Database Performance on AWS M4 Instances
In-Memory Database Performance on AWS M4 InstancesIn-Memory Database Performance on AWS M4 Instances
In-Memory Database Performance on AWS M4 Instances
 
Internet of Things and Multi-model Data Infrastructure
Internet of Things and Multi-model Data InfrastructureInternet of Things and Multi-model Data Infrastructure
Internet of Things and Multi-model Data Infrastructure
 
Denodo DataFest 2017: Integrating Big Data and Streaming Data with Enterprise...
Denodo DataFest 2017: Integrating Big Data and Streaming Data with Enterprise...Denodo DataFest 2017: Integrating Big Data and Streaming Data with Enterprise...
Denodo DataFest 2017: Integrating Big Data and Streaming Data with Enterprise...
 
Democratizing Data
Democratizing DataDemocratizing Data
Democratizing Data
 
Getting It Right Exactly Once: Principles for Streaming Architectures
Getting It Right Exactly Once: Principles for Streaming ArchitecturesGetting It Right Exactly Once: Principles for Streaming Architectures
Getting It Right Exactly Once: Principles for Streaming Architectures
 
Add Historical Analysis of Operational Data with Easy Configurations in Fivet...
Add Historical Analysis of Operational Data with Easy Configurations in Fivet...Add Historical Analysis of Operational Data with Easy Configurations in Fivet...
Add Historical Analysis of Operational Data with Easy Configurations in Fivet...
 
MemSQL
MemSQLMemSQL
MemSQL
 
Presto: Fast SQL on Everything
Presto: Fast SQL on EverythingPresto: Fast SQL on Everything
Presto: Fast SQL on Everything
 
Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...
Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...
Columbia Migrates from Legacy Data Warehouse to an Open Data Platform with De...
 
The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...
The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...
The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...
 
Making Data Timelier and More Reliable with Lakehouse Technology
Making Data Timelier and More Reliable with Lakehouse TechnologyMaking Data Timelier and More Reliable with Lakehouse Technology
Making Data Timelier and More Reliable with Lakehouse Technology
 
Ebooks - Accelerating Time to Value of Big Data of Apache Spark | Qubole
Ebooks - Accelerating Time to Value of Big Data of Apache Spark | QuboleEbooks - Accelerating Time to Value of Big Data of Apache Spark | Qubole
Ebooks - Accelerating Time to Value of Big Data of Apache Spark | Qubole
 
Personalization Journey: From Single Node to Cloud Streaming
Personalization Journey: From Single Node to Cloud StreamingPersonalization Journey: From Single Node to Cloud Streaming
Personalization Journey: From Single Node to Cloud Streaming
 
Presto: Fast SQL-on-Anything (including Delta Lake, Snowflake, Elasticsearch ...
Presto: Fast SQL-on-Anything (including Delta Lake, Snowflake, Elasticsearch ...Presto: Fast SQL-on-Anything (including Delta Lake, Snowflake, Elasticsearch ...
Presto: Fast SQL-on-Anything (including Delta Lake, Snowflake, Elasticsearch ...
 
IBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lakeIBM Cloud Day January 2021 - A well architected data lake
IBM Cloud Day January 2021 - A well architected data lake
 

Similar to Building a Machine Learning Recommendation Engine in SQL

Making Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsMaking Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High Expectations
Rackspace
 
Guide to NoSQL with MySQL
Guide to NoSQL with MySQLGuide to NoSQL with MySQL
Guide to NoSQL with MySQL
Samuel Rohaut
 
bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000
Kartik Padmanabhan
 
Microsoft Sql Server 2016 Is Now Live
Microsoft Sql Server 2016 Is Now LiveMicrosoft Sql Server 2016 Is Now Live
Microsoft Sql Server 2016 Is Now Live
Amber Moore
 
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATIONBig Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Matt Stubbs
 
Migrating legacy ERP data into Hadoop
Migrating legacy ERP data into HadoopMigrating legacy ERP data into Hadoop
Migrating legacy ERP data into Hadoop
DataWorks Summit
 
GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017
Jeremy Maranitch
 

Similar to Building a Machine Learning Recommendation Engine in SQL (20)

Get a clearer picture of potential cloud performance by looking beyond SPECra...
Get a clearer picture of potential cloud performance by looking beyond SPECra...Get a clearer picture of potential cloud performance by looking beyond SPECra...
Get a clearer picture of potential cloud performance by looking beyond SPECra...
 
Making Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High ExpectationsMaking Sense of NoSQL and Big Data Amidst High Expectations
Making Sense of NoSQL and Big Data Amidst High Expectations
 
Logical Data Warehouse: The Foundation of Modern Data and Analytics
Logical Data Warehouse: The Foundation of Modern Data and AnalyticsLogical Data Warehouse: The Foundation of Modern Data and Analytics
Logical Data Warehouse: The Foundation of Modern Data and Analytics
 
Mule microsoft
Mule  microsoftMule  microsoft
Mule microsoft
 
Mule esb-microsoft
Mule esb-microsoftMule esb-microsoft
Mule esb-microsoft
 
Guide to NoSQL with MySQL
Guide to NoSQL with MySQLGuide to NoSQL with MySQL
Guide to NoSQL with MySQL
 
bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000
 
Microsoft Sql Server 2016 Is Now Live
Microsoft Sql Server 2016 Is Now LiveMicrosoft Sql Server 2016 Is Now Live
Microsoft Sql Server 2016 Is Now Live
 
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATIONBig Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
 
Connecting Silos in Real Time with Data Virtualization
Connecting Silos in Real Time with Data VirtualizationConnecting Silos in Real Time with Data Virtualization
Connecting Silos in Real Time with Data Virtualization
 
SQL Saturday 119 Chicago -- Enterprise Data Mining with SQL Server
SQL Saturday 119 Chicago -- Enterprise Data Mining with SQL ServerSQL Saturday 119 Chicago -- Enterprise Data Mining with SQL Server
SQL Saturday 119 Chicago -- Enterprise Data Mining with SQL Server
 
SQL Saturday 108 -- Enterprise Data Mining with SQL Server
SQL Saturday 108 -- Enterprise Data Mining with SQL ServerSQL Saturday 108 -- Enterprise Data Mining with SQL Server
SQL Saturday 108 -- Enterprise Data Mining with SQL Server
 
Migrating legacy ERP data into Hadoop
Migrating legacy ERP data into HadoopMigrating legacy ERP data into Hadoop
Migrating legacy ERP data into Hadoop
 
SAP and Microsoft Manufacturing Solution
SAP and Microsoft Manufacturing SolutionSAP and Microsoft Manufacturing Solution
SAP and Microsoft Manufacturing Solution
 
Webinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDBWebinar: Faster Big Data Analytics with MongoDB
Webinar: Faster Big Data Analytics with MongoDB
 
Microsoft SQL Server 2012 Data Warehouse on Hitachi Converged Platform
Microsoft SQL Server 2012 Data Warehouse on Hitachi Converged PlatformMicrosoft SQL Server 2012 Data Warehouse on Hitachi Converged Platform
Microsoft SQL Server 2012 Data Warehouse on Hitachi Converged Platform
 
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...
2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...
2016 Sept 1st - IBM Consultants & System Integrators Interchange - Big Data -...
 
GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017
 

More from SingleStore

More from SingleStore (20)

MemSQL 201: Advanced Tips and Tricks Webcast
MemSQL 201: Advanced Tips and Tricks WebcastMemSQL 201: Advanced Tips and Tricks Webcast
MemSQL 201: Advanced Tips and Tricks Webcast
 
Introduction to MemSQL
Introduction to MemSQLIntroduction to MemSQL
Introduction to MemSQL
 
Building a Fault Tolerant Distributed Architecture
Building a Fault Tolerant Distributed ArchitectureBuilding a Fault Tolerant Distributed Architecture
Building a Fault Tolerant Distributed Architecture
 
Stream Processing with Pipelines and Stored Procedures
Stream Processing with Pipelines  and Stored ProceduresStream Processing with Pipelines  and Stored Procedures
Stream Processing with Pipelines and Stored Procedures
 
Curriculum Associates Strata NYC 2017
Curriculum Associates Strata NYC 2017Curriculum Associates Strata NYC 2017
Curriculum Associates Strata NYC 2017
 
Image Recognition on Streaming Data
Image Recognition  on Streaming DataImage Recognition  on Streaming Data
Image Recognition on Streaming Data
 
Spark Summit Dublin 2017 - MemSQL - Real-Time Image Recognition
Spark Summit Dublin 2017 - MemSQL - Real-Time Image RecognitionSpark Summit Dublin 2017 - MemSQL - Real-Time Image Recognition
Spark Summit Dublin 2017 - MemSQL - Real-Time Image Recognition
 
How Database Convergence Impacts the Coming Decades of Data Management
How Database Convergence Impacts the Coming Decades of Data ManagementHow Database Convergence Impacts the Coming Decades of Data Management
How Database Convergence Impacts the Coming Decades of Data Management
 
Teaching Databases to Learn in the World of AI
Teaching Databases to Learn in the World of AITeaching Databases to Learn in the World of AI
Teaching Databases to Learn in the World of AI
 
Gartner Catalyst 2017: The Data Warehouse Blueprint for ML, AI, and Hybrid Cloud
Gartner Catalyst 2017: The Data Warehouse Blueprint for ML, AI, and Hybrid CloudGartner Catalyst 2017: The Data Warehouse Blueprint for ML, AI, and Hybrid Cloud
Gartner Catalyst 2017: The Data Warehouse Blueprint for ML, AI, and Hybrid Cloud
 
Gartner Catalyst 2017: Image Recognition on Streaming Data
Gartner Catalyst 2017: Image Recognition on Streaming DataGartner Catalyst 2017: Image Recognition on Streaming Data
Gartner Catalyst 2017: Image Recognition on Streaming Data
 
Spark Summit West 2017: Real-Time Image Recognition with MemSQL and Spark
Spark Summit West 2017: Real-Time Image Recognition with MemSQL and SparkSpark Summit West 2017: Real-Time Image Recognition with MemSQL and Spark
Spark Summit West 2017: Real-Time Image Recognition with MemSQL and Spark
 
Real-Time Analytics at Uber Scale
Real-Time Analytics at Uber ScaleReal-Time Analytics at Uber Scale
Real-Time Analytics at Uber Scale
 
Machines and the Magic of Fast Learning
Machines and the Magic of Fast LearningMachines and the Magic of Fast Learning
Machines and the Magic of Fast Learning
 
Machines and the Magic of Fast Learning - Strata Keynote
Machines and the Magic of Fast Learning - Strata KeynoteMachines and the Magic of Fast Learning - Strata Keynote
Machines and the Magic of Fast Learning - Strata Keynote
 
Enabling Real-Time Analytics for IoT
Enabling Real-Time Analytics for IoTEnabling Real-Time Analytics for IoT
Enabling Real-Time Analytics for IoT
 
Driving the On-Demand Economy with Predictive Analytics
Driving the On-Demand Economy with Predictive AnalyticsDriving the On-Demand Economy with Predictive Analytics
Driving the On-Demand Economy with Predictive Analytics
 
Tapjoy: Building a Real-Time Data Science Service for Mobile Advertising
Tapjoy: Building a Real-Time Data Science Service for Mobile AdvertisingTapjoy: Building a Real-Time Data Science Service for Mobile Advertising
Tapjoy: Building a Real-Time Data Science Service for Mobile Advertising
 
The Real-Time CDO and the Cloud-Forward Path to Predictive Analytics
The Real-Time CDO and the Cloud-Forward Path to Predictive AnalyticsThe Real-Time CDO and the Cloud-Forward Path to Predictive Analytics
The Real-Time CDO and the Cloud-Forward Path to Predictive Analytics
 
Enabling Real-Time Analytics for IoT
Enabling Real-Time Analytics for IoTEnabling Real-Time Analytics for IoT
Enabling Real-Time Analytics for IoT
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

Building a Machine Learning Recommendation Engine in SQL

  • 1. Building a Machine Learning Recommendation Engine in SQL @garyorenstein @memsql MemSQL 1
  • 2. Today’s Talk 1. State of Data 2018 according to Gartner 2. Rise of Machine Learning 3. Live Demo - A SQL Recommendation Engine MemSQL 2
  • 3. SECTION 1 The State of DataAccording to Gartner 2018 MemSQL 3
  • 4. Hype Cycle for Data Management 26 July 2017 Donald Feinberg Adam M. Ronthal G00313950 MemSQL 4
  • 6. Multimodel has the potential to support both relational and nonrelational use cases while reducing the number of disparate DBMS products in an organization. MemSQL 6
  • 7. the idea of a Hadoop distribution will become obsolete before it reaches the Plateau of Productivity MemSQL 7
  • 8. Penetration continues to increase and organizations should be evaluating these resources for — cost-efficiency — infrastructure simplification and — new use cases, such as Hybrid Transactional/ Analytical Processing (HTAP) MemSQL 8
  • 9. Build Your Digital Business Platform Around Data and Analytics 31 January 2018 Andrew White W. Roy Schulte Roxane Edjlali Joao Tapadinhas Svetlana Sicular G00350435 MemSQL 9
  • 10. Select Challenges Data and analytics investments that are tied to measurable business outcomes are more likely to produce reportable benefits. MemSQL 10
  • 11. Magic Quadrant for Data Management Solutions for Analytics 13 February 2018 Adam M. Ronthal Roxane Edjlali Rick Greenwald G00326691 MemSQL 11
  • 12. We define four primary use cases for DMSAs that reflect this diversity of data and use cases: — Traditional data warehouse — Real-time data warehouse — Context-independent data warehouse — Logical data warehouse MemSQL 12
  • 15. Real-Time Data Warehouse This use case adds a real-time component to analytics use cases, with the aim of reducing latency — the time lag between when data is generated and when it can be analyzed. MemSQL 15
  • 17. Other Vendors to Consider for Operational DBMSs 23 November 2017 Donald Feinberg Merv Adrian Nick Heudecker G00327284 MemSQL 17
  • 18. Other Vendors to Consider for Operational DBMSs Actian Aerospike Alibaba Cloud Altibase ArangoDB Cloudera Clustrix Couchbase FairCom Fujitsu General Data Technology Hortonworks MariaDB MemSQL MongoDB Neo4j NuoDB Percona Redis Labs SequoiaDB TmaxSoft VoltDB MemSQL 18
  • 19. Other Vendors to Consider for Operational DBMSs also listed as Challenger or Leader in the Magic Quadrant for Data Management Solutions for Analytics MemSQL MemSQL 19
  • 21. Over the next five years, the OPDBMS and DMSA markets converge to a single DBMS market. MemSQL 21
  • 22. Look to your operational DBMS vendor for both transactional and analytical workloads. MemSQL 22
  • 23. SECTION 2 Rise of Machine Learning MemSQL 23
  • 30. 2018 Outlook Survey MemSQL and O’Reilly 1600+ respondents memsql.com/MLsurvey MemSQL 30
  • 48. SECTION 3 DEMO with Yelp Dataset MemSQL 48
  • 53. Can you build a machine learning recommendation engine in SQL? Yes MemSQL 53
  • 54. Can you build a machine learning recommendation engine in SQL? Yes Should you? For training? Maybe, maybe not. For Operational Scoring? Absolutely! MemSQL 54
  • 57. Secret Weapons to Machine Learning in SQL — Extensibility — Stored Procedures — User Defined Functions — User Defined Aggregates — DOT_PRODUCT — Compare two vectors MemSQL 57
  • 60. Sequel Pro Mac app for MySQL databases MemSQL 60
  • 61. MemSQL in one slide — Distributed SQL database — Massively parallel, lock-free, fast — Full ACID features — In-memory and on-disk — JSON, key-value, geospatial, full-text search — Robust security — Built for transactions and analytics MemSQL 61
  • 64. Why do ML in SQL? — Train in any number of systems — Score in the database for applications from real-time drilling to fraud detection to personalization — Complete certain functions within the database to radically simplify operational infrastructure MemSQL 64
  • 65. “It is a fine line between a well executed SQL query on live data and ML/AI” MemSQL 65
  • 67. Thank you! Please visit our booth www.memsql.com @garyorenstein @memsql MemSQL 67
  • 68. Abstract: Building a Machine Learning Recommendation Engine in SQL Modern businesses constantly seek deeper customer relationships and more compelling experiences. To accomplish this, companies are looking to machine learning and artificial intelligence solutions; however, that often involves a host of new systems and approaches. With a modern database architecture, it is possible to build compelling machine learning solutions with SQL, deliver real-time engagements, and rapidly move to operational applications. See live, how a modern database can accomplish these feats within a single integrated solution. MemSQL 68