SlideShare uma empresa Scribd logo
1 de 16
Baixar para ler offline
Driving Datascience at
scale using Postgresql,
Greenplum and Dataiku
PostgresConf 2019
Nicolas GAKRELIDZ
Partner Solution Architect
Dataiku DSS is:
• Collaborative,
• For all profiles,
• Polyglot,
• Production ready
End-to-end Enterprise AI platform
Dataiku DSS
End-to-end Enterprise AI platform
Dataiku DSS
Supporting the Enterprise AI Journey of
Manufacturing Financial Services
Services Consumer Goods
Technology Consulting
E-Retail Media
Healthcare Travel
Global Presence
A WIDE USER BASE
POWERED BY A STRONG ORGANIZATION
Dataikers
220
BACKED BY MAJOR PARTNERS
Customers
220+
Users
20,000
+ of customers expand
usage after first year
80%
Raised so far
$146M
Customers Across Industries
POWERING INDUSTRY LEADERS
The “Tower of Babel” Effect of Data Projects
The Classic Data Project Silos
Business
Analyst
DATA PREPARATION ML MODELING ML DEPLOYMENT
Data Preparation
Data Science Notebooks
& API Platforms
AutoML
Solutions
Data Scientist
Data Engineer
Bring Business Analysts, Engineers, and Scientists Together
Share a common environment to have an impact
DATA PREPARATION ML MODELING ML DEPLOYMENT
Business
Analyst
Data Engineer
Data Scientist
Single Collaborative, Governable and Auditable Environment
Leverage existing skills
and secure sustained
availability
Maximise usage of most
up-to-date technologies
Extend based on current
and future operating
requirements
Get Results Today, Build for Tomorrow
Future proof your data effort
Use your current
infrastructure and be
ready for tomorrow’s
Bokeh
Fortune 500 Customer Rockets through Acceleration Phase
Customer Testimony
Quarterly Evolution of Dataiku Users
Analytics
Leader
10 Projects Leaders
Scale their team to
deliver
10x Projects / Briefs /
Models / ...
Business
Analyst
500 Business Analysts
Leverage Large and
Complex Data Sources
Independent to Deliver
New Projects Accelerate
by leveraging tools
packaged by Data
Scientists
100 Data
Scientists
Focus On Complex
Data Processing
Deliver Code and
Plugins for Reuse
Data
Scientist
20 Data Engineers
Ensure availability of data
infrastructures
Operationalize, monitor
and maintain data
projects
Data
Engineer
Delivering 1,000s of analysis, insights,
models and optimized business
processes
Enable Self-Service Analytics and Operationalize ML
The Two Key Modes of Data Innovation
SSA
Quick answers to
unformulated questions
Directly by the end-users
Pervasive
Agile and instantaneous
Limited integration
High volume
o16n
Robust solutions to
business challenges
Organization-driven
Focused
Longer term
Fine integration
High value projects
How a Major Software Player Auto-Deploys 12,000 Models
Customer Testimony
Design complex recommendation
engines combining price, content and
demand logics (the final models actually
combine 3 predictive models)
Automatically generate
such recommendation engines based on
each of its seller’s data and data models
Operate models in real time and
update them with no down time, scaling
up on a fully managed platform on top of
Kubernetes An AI-enabled Layer on top of
an an existing product
Powered by Dataiku
Dataiku Customer provides a sales management software platform to 4,000 B2B clients
(including several Fortune 100 companies), and has deployed Dataiku in order to:
Leverage your full stack and skills
Dataiku Solution Overview: Architecture
LINUX SERVER
ON PREMISE OR MANAGED
CLOUD
CENTRALIZED
OR AD-HOC
DATA SOURCES,
DATABASES,
DATA LAKE
AVAILABLE OR SPUN-UP
PROCESSING RESOURCES
Leveraging best
storage and
compute
resources
Dataiku deployment servers for
enterprise grade
operationalization
PRODUCTION
SYSTEMS
Centralized server to
facilitate
access to data, ressources,
Browser
based
interface
VISUAL DEVELOPMENT
COMPLETE
CODING
ENVIRONMENTS
VISUALIZATIO
N
COLLABORATION AND
PROJECT
MANAGEMENT
AUDIT,
MONITORING
AND
SCHEDULING
User/task specific
interaction modes
4 components
Dataiku DSS Public API
Dataiku DSS components
Data Scientist Business Analyst Data Engineer
Machine Learning Model DeploymentData Management
MADlib
In-database
machine learning
Graph
Relationship
Analytics
Greenplum
Integrated and cleansed data,
parallel SQL processing
GPText
Fast index,
search, text
analytics
PostGIS
Location analytics
Enable In-Database Analytics & Operationalized ML
Dataiku & Pivotal® Greenplum’s Value
High-Performance Analytics at Petabyte Scale
▪ Dataiku leverages Pivotal® Greenplum for in-database parallel
processing of complex queries, visual analysis and charts.
Simplify Collaboration across Data Teams
▪ End-to-end project collaboration for data scientists and
engineers
▪ Self-service access to data sources
▪ Visual Development experience for building comprehensive
analytics pipelines
Mature Your Data Analytics Operations
▪ Enable self-service analytics of large datasets stored in
Pivotal® Greenplum
▪ Enforce data governance between roles and teams
▪ Enable comprehensive of machine learning pipelines and
models.
Solution Features
Dataiku & Pivotal® Greenplum’s Value
Dataiku + Postgres and Greenplum (example)
Order
Data
Movements
(if compatible)
Dataiku Datasets:
● Index definitions
● Incremental
SQL push
back: Charts using
SQL
Pushback
…
Storage
©2019 dataiku, Inc. | dataiku.com | contact@dataiku.com | @dataiku

Mais conteúdo relacionado

Mais procurados

Designing An Enterprise Data Fabric
Designing An Enterprise Data FabricDesigning An Enterprise Data Fabric
Designing An Enterprise Data Fabric
Alan McSweeney
 

Mais procurados (20)

Data Marketplace and the Role of Data Virtualization
Data Marketplace and the Role of Data VirtualizationData Marketplace and the Role of Data Virtualization
Data Marketplace and the Role of Data Virtualization
 
How Amazon.com Uses AWS Analytics: Data Analytics Week SF
How Amazon.com Uses AWS Analytics: Data Analytics Week SFHow Amazon.com Uses AWS Analytics: Data Analytics Week SF
How Amazon.com Uses AWS Analytics: Data Analytics Week SF
 
How to Use a Semantic Layer to Deliver Actionable Insights at Scale
How to Use a Semantic Layer to Deliver Actionable Insights at ScaleHow to Use a Semantic Layer to Deliver Actionable Insights at Scale
How to Use a Semantic Layer to Deliver Actionable Insights at Scale
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and Governance
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data Governance
 
Power BI Premium : pour quels usages ?
Power BI Premium : pour quels usages ?Power BI Premium : pour quels usages ?
Power BI Premium : pour quels usages ?
 
Dataiku data science studio
Dataiku data science studioDataiku data science studio
Dataiku data science studio
 
Sopra Steria: Intelligent Network Analysis in a Telecommunications Environment
Sopra Steria: Intelligent Network Analysis in a Telecommunications EnvironmentSopra Steria: Intelligent Network Analysis in a Telecommunications Environment
Sopra Steria: Intelligent Network Analysis in a Telecommunications Environment
 
Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)
 
Netflix Data Engineering @ Uber Engineering Meetup
Netflix Data Engineering @ Uber Engineering MeetupNetflix Data Engineering @ Uber Engineering Meetup
Netflix Data Engineering @ Uber Engineering Meetup
 
Owning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseOwning Your Own (Data) Lake House
Owning Your Own (Data) Lake House
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
 
Denodo: Enabling a Data Mesh Architecture and Data Sharing Culture at Landsba...
Denodo: Enabling a Data Mesh Architecture and Data Sharing Culture at Landsba...Denodo: Enabling a Data Mesh Architecture and Data Sharing Culture at Landsba...
Denodo: Enabling a Data Mesh Architecture and Data Sharing Culture at Landsba...
 
Apache Kafka® and the Data Mesh
Apache Kafka® and the Data MeshApache Kafka® and the Data Mesh
Apache Kafka® and the Data Mesh
 
Business Intelligence tools comparison
Business Intelligence tools comparisonBusiness Intelligence tools comparison
Business Intelligence tools comparison
 
seven steps to dataops @ dataops.rocks conference Oct 2019
seven steps to dataops @ dataops.rocks conference Oct 2019seven steps to dataops @ dataops.rocks conference Oct 2019
seven steps to dataops @ dataops.rocks conference Oct 2019
 
Screw DevOps, Let's Talk DataOps
Screw DevOps, Let's Talk DataOpsScrew DevOps, Let's Talk DataOps
Screw DevOps, Let's Talk DataOps
 
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
 
Designing An Enterprise Data Fabric
Designing An Enterprise Data FabricDesigning An Enterprise Data Fabric
Designing An Enterprise Data Fabric
 

Semelhante a Driving Datascience at scale using Postgresql, Greenplum and Dataiku - Greenplum Summit 2019

Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data
Pactera_US
 
SIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikSIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess Qlik
Bardess Group
 
Business Discovery PPT
Business Discovery PPTBusiness Discovery PPT
Business Discovery PPT
pdalalau
 
PROG_UntoldStory ISV eBook_0706c FINAL
PROG_UntoldStory ISV eBook_0706c FINALPROG_UntoldStory ISV eBook_0706c FINAL
PROG_UntoldStory ISV eBook_0706c FINAL
SolarWinds MSP
 

Semelhante a Driving Datascience at scale using Postgresql, Greenplum and Dataiku - Greenplum Summit 2019 (20)

Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategy
 
Paris FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant PresentationParis FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant Presentation
 
Big Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesBig Data: It’s all about the Use Cases
Big Data: It’s all about the Use Cases
 
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data
 
New Delhi Cloud Summit 05 26-11
New Delhi Cloud Summit 05 26-11New Delhi Cloud Summit 05 26-11
New Delhi Cloud Summit 05 26-11
 
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
 
Opportunity: Data, Analytic & Azure
Opportunity: Data, Analytic & Azure Opportunity: Data, Analytic & Azure
Opportunity: Data, Analytic & Azure
 
Digital Business Transformation in the Streaming Era
Digital Business Transformation in the Streaming EraDigital Business Transformation in the Streaming Era
Digital Business Transformation in the Streaming Era
 
Revolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleRevolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus Example
 
SIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikSIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess Qlik
 
Business Discovery PPT
Business Discovery PPTBusiness Discovery PPT
Business Discovery PPT
 
Business Discovery
Business DiscoveryBusiness Discovery
Business Discovery
 
Business Discovery Ppt
Business Discovery PptBusiness Discovery Ppt
Business Discovery Ppt
 
SPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDSSPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDS
 
Cloudera and Qlik: Big Data Analytics for Business
Cloudera and Qlik: Big Data Analytics for BusinessCloudera and Qlik: Big Data Analytics for Business
Cloudera and Qlik: Big Data Analytics for Business
 
Microsoft Fabric Introduction
Microsoft Fabric IntroductionMicrosoft Fabric Introduction
Microsoft Fabric Introduction
 
PROG_UntoldStory ISV eBook_0706c FINAL
PROG_UntoldStory ISV eBook_0706c FINALPROG_UntoldStory ISV eBook_0706c FINAL
PROG_UntoldStory ISV eBook_0706c FINAL
 
About CDAP
About CDAPAbout CDAP
About CDAP
 
Scaling Legacy
Scaling LegacyScaling Legacy
Scaling Legacy
 
Modern Thinking área digital MSKM 21/09/2017
Modern Thinking área digital MSKM 21/09/2017Modern Thinking área digital MSKM 21/09/2017
Modern Thinking área digital MSKM 21/09/2017
 

Mais de VMware Tanzu

Mais de VMware Tanzu (20)

What AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About ItWhat AI Means For Your Product Strategy And What To Do About It
What AI Means For Your Product Strategy And What To Do About It
 
Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023Make the Right Thing the Obvious Thing at Cardinal Health 2023
Make the Right Thing the Obvious Thing at Cardinal Health 2023
 
Enhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at ScaleEnhancing DevEx and Simplifying Operations at Scale
Enhancing DevEx and Simplifying Operations at Scale
 
Spring Update | July 2023
Spring Update | July 2023Spring Update | July 2023
Spring Update | July 2023
 
Platforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a ProductPlatforms, Platform Engineering, & Platform as a Product
Platforms, Platform Engineering, & Platform as a Product
 
Building Cloud Ready Apps
Building Cloud Ready AppsBuilding Cloud Ready Apps
Building Cloud Ready Apps
 
Spring Boot 3 And Beyond
Spring Boot 3 And BeyondSpring Boot 3 And Beyond
Spring Boot 3 And Beyond
 
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfSpring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
 
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
Simplify and Scale Enterprise Apps in the Cloud | Boston 2023
 
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
Simplify and Scale Enterprise Apps in the Cloud | Seattle 2023
 
tanzu_developer_connect.pptx
tanzu_developer_connect.pptxtanzu_developer_connect.pptx
tanzu_developer_connect.pptx
 
Tanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - FrenchTanzu Virtual Developer Connect Workshop - French
Tanzu Virtual Developer Connect Workshop - French
 
Tanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - EnglishTanzu Developer Connect Workshop - English
Tanzu Developer Connect Workshop - English
 
Virtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - EnglishVirtual Developer Connect Workshop - English
Virtual Developer Connect Workshop - English
 
Tanzu Developer Connect - French
Tanzu Developer Connect - FrenchTanzu Developer Connect - French
Tanzu Developer Connect - French
 
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
Simplify and Scale Enterprise Apps in the Cloud | Dallas 2023
 
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring BootSpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
SpringOne Tour: Deliver 15-Factor Applications on Kubernetes with Spring Boot
 
SpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software EngineerSpringOne Tour: The Influential Software Engineer
SpringOne Tour: The Influential Software Engineer
 
SpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs PracticeSpringOne Tour: Domain-Driven Design: Theory vs Practice
SpringOne Tour: Domain-Driven Design: Theory vs Practice
 
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense SolutionsSpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
SpringOne Tour: Spring Recipes: A Collection of Common-Sense Solutions
 

Último

%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
masabamasaba
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
masabamasaba
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
masabamasaba
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
masabamasaba
 
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
masabamasaba
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
shinachiaurasa2
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
masabamasaba
 

Último (20)

%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
%+27788225528 love spells in Huntington Beach Psychic Readings, Attraction sp...
 
%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
 

Driving Datascience at scale using Postgresql, Greenplum and Dataiku - Greenplum Summit 2019

  • 1. Driving Datascience at scale using Postgresql, Greenplum and Dataiku PostgresConf 2019 Nicolas GAKRELIDZ Partner Solution Architect
  • 2. Dataiku DSS is: • Collaborative, • For all profiles, • Polyglot, • Production ready End-to-end Enterprise AI platform Dataiku DSS
  • 3. End-to-end Enterprise AI platform Dataiku DSS
  • 4. Supporting the Enterprise AI Journey of Manufacturing Financial Services Services Consumer Goods Technology Consulting E-Retail Media Healthcare Travel Global Presence A WIDE USER BASE POWERED BY A STRONG ORGANIZATION Dataikers 220 BACKED BY MAJOR PARTNERS Customers 220+ Users 20,000 + of customers expand usage after first year 80% Raised so far $146M Customers Across Industries POWERING INDUSTRY LEADERS
  • 5. The “Tower of Babel” Effect of Data Projects The Classic Data Project Silos Business Analyst DATA PREPARATION ML MODELING ML DEPLOYMENT Data Preparation Data Science Notebooks & API Platforms AutoML Solutions Data Scientist Data Engineer
  • 6. Bring Business Analysts, Engineers, and Scientists Together Share a common environment to have an impact DATA PREPARATION ML MODELING ML DEPLOYMENT Business Analyst Data Engineer Data Scientist Single Collaborative, Governable and Auditable Environment
  • 7. Leverage existing skills and secure sustained availability Maximise usage of most up-to-date technologies Extend based on current and future operating requirements Get Results Today, Build for Tomorrow Future proof your data effort Use your current infrastructure and be ready for tomorrow’s Bokeh
  • 8. Fortune 500 Customer Rockets through Acceleration Phase Customer Testimony Quarterly Evolution of Dataiku Users Analytics Leader 10 Projects Leaders Scale their team to deliver 10x Projects / Briefs / Models / ... Business Analyst 500 Business Analysts Leverage Large and Complex Data Sources Independent to Deliver New Projects Accelerate by leveraging tools packaged by Data Scientists 100 Data Scientists Focus On Complex Data Processing Deliver Code and Plugins for Reuse Data Scientist 20 Data Engineers Ensure availability of data infrastructures Operationalize, monitor and maintain data projects Data Engineer Delivering 1,000s of analysis, insights, models and optimized business processes
  • 9. Enable Self-Service Analytics and Operationalize ML The Two Key Modes of Data Innovation SSA Quick answers to unformulated questions Directly by the end-users Pervasive Agile and instantaneous Limited integration High volume o16n Robust solutions to business challenges Organization-driven Focused Longer term Fine integration High value projects
  • 10. How a Major Software Player Auto-Deploys 12,000 Models Customer Testimony Design complex recommendation engines combining price, content and demand logics (the final models actually combine 3 predictive models) Automatically generate such recommendation engines based on each of its seller’s data and data models Operate models in real time and update them with no down time, scaling up on a fully managed platform on top of Kubernetes An AI-enabled Layer on top of an an existing product Powered by Dataiku Dataiku Customer provides a sales management software platform to 4,000 B2B clients (including several Fortune 100 companies), and has deployed Dataiku in order to:
  • 11. Leverage your full stack and skills Dataiku Solution Overview: Architecture LINUX SERVER ON PREMISE OR MANAGED CLOUD CENTRALIZED OR AD-HOC DATA SOURCES, DATABASES, DATA LAKE AVAILABLE OR SPUN-UP PROCESSING RESOURCES Leveraging best storage and compute resources Dataiku deployment servers for enterprise grade operationalization PRODUCTION SYSTEMS Centralized server to facilitate access to data, ressources, Browser based interface VISUAL DEVELOPMENT COMPLETE CODING ENVIRONMENTS VISUALIZATIO N COLLABORATION AND PROJECT MANAGEMENT AUDIT, MONITORING AND SCHEDULING User/task specific interaction modes
  • 12. 4 components Dataiku DSS Public API Dataiku DSS components
  • 13. Data Scientist Business Analyst Data Engineer Machine Learning Model DeploymentData Management MADlib In-database machine learning Graph Relationship Analytics Greenplum Integrated and cleansed data, parallel SQL processing GPText Fast index, search, text analytics PostGIS Location analytics Enable In-Database Analytics & Operationalized ML Dataiku & Pivotal® Greenplum’s Value
  • 14. High-Performance Analytics at Petabyte Scale ▪ Dataiku leverages Pivotal® Greenplum for in-database parallel processing of complex queries, visual analysis and charts. Simplify Collaboration across Data Teams ▪ End-to-end project collaboration for data scientists and engineers ▪ Self-service access to data sources ▪ Visual Development experience for building comprehensive analytics pipelines Mature Your Data Analytics Operations ▪ Enable self-service analytics of large datasets stored in Pivotal® Greenplum ▪ Enforce data governance between roles and teams ▪ Enable comprehensive of machine learning pipelines and models. Solution Features Dataiku & Pivotal® Greenplum’s Value
  • 15. Dataiku + Postgres and Greenplum (example) Order Data Movements (if compatible) Dataiku Datasets: ● Index definitions ● Incremental SQL push back: Charts using SQL Pushback … Storage
  • 16. ©2019 dataiku, Inc. | dataiku.com | contact@dataiku.com | @dataiku