SlideShare a Scribd company logo
1 of 47
Download to read offline
How to build your own Delve: combining
machine learning, big data and SharePoint
#SPSBE11
Joris Poelmans
April 18th, 2015
PlatinumGoldSilver
Thanks to our sponsors!
http://jopx.blogspot.com
Agenda
 Introduction to Delve
 Office Graph
 Big Data and Machine Learning
 Building your own Delve - architectural concept
Agenda
 Introduction to Delve
 Office Graph
 Big Data and Machine Learning
 Building your own Delve - architectural concept
Stay In the Know Find What you Need Discover New Connections
Connect with the right experts and
learn more about their content.
Find just the right results from any
source and take action
Discover new information tailored
to you from your network
Delve – Search and Discovery Across O365
Powered by Office Graph
Agenda
 Introduction to Delve
 Office Graph
 Big Data and Machine Learning
 Building your own Delve - architectural concept
What is The Office Graph?
User Documents People Conversations
What is The Office Graph?
Manager
Direct report
Works with
Shared with me
Viewed by me
Trending around me
Presented to me
Liked by me
Connected Enterprise
Signals sent from Delve, Exchange, O365, …
Click person
Modify/Save
Elevate
Share
Follow
Like
Comments
Email
Ignore
Presented to
Shown document
Open document
Shown board
++
Content and signals across O365 auto-
populating the Office Graph insights
Insights derived with machine learning for proactive and intelligent experiences
Agenda
 Introduction to Delve
 Office Graph
 Big Data and Machine Learning
 Building your own Delve - architectural concept
Big data is what
happened
when the cost
of storing user data
became cheaper
than making the
decision
to throw it away
Transactions + Interactions +
Observations = Big Data
Megabytes
Gigabytes
Terabytes
Petabytes
Purchase detail
Purchase record
Payment record
ERP
CRM
WEB
Offer details
Support Contacts
Customer Touches
Segmentation
Web logs
Offer history
A/B testing
Dynamic Pricing
Affiliate Networks
Search Marketing
Behavioral Targeting
Dynamic Funnels
User Generated Content
Mobile Web
SMS/MMSSentiment
External Demographics
HD Video, Audio, Images
Speech to Text
Product/Service Logs
Social Interactions & Feeds
Business Data Feeds
User Click Stream
Sensors / RFID / Devices
Spatial & GPS Coordinates
Increasing Data Variety and Complexity
Big Data Core Technology landscape
• New paradigm for
storing data
• 100+ Non-SQL DB’s
and growing
• Support SQL querying
• Internal architecture
different from classic DBs
• Appliances
• Teradata
• Microsoft
PDW/APS
• Oracle BDA X4-2
• Hadoop/HDFS+
MapReduce
• Key Big Data
technology
Hadoop MPP
NoSQLNewSQL
Modern Data Architecture
• Apache Hadoop is an open source
framework that supports data-
intensive distributed applications
 Uses HDFS storage to enable
applications to work with 1000s of
nodes and petabytes of data using a
scale-out model
 Uses MapReduce to process data
 Inspired by Google
 MapReduce
 Google File System
 Related projects:
 HBase, Hive, Mahout, Pig,Sqoop,
Ambari, Storm, Zookeeper, ... And
many more
HDFS and MapReduce in a nutshell
Hadoop components
Distributed Storage
(HDFS)
Hive
Distributed Processing
(MapReduce)
Pig
HBase HCatalog
DataIntegration
(ODBC/SQOOP/REST/Flume)
MahoutPegasus Rhadoop
Oozie
Data integration
Data access
Hadoop core
Operations
AmbariZookeeper
StormKafka
http://jopx.blogspot.be/2015/03/overview-of-apache-hadoop-components-in.html
Microsoft Azure HDInsight
Support HBase as NoSQL columnar
database on Azure Blobs
Support Storm as stream processing
Hadoop in Azure
Data Node Data Node Data Node Data Node
Task Tracker Task Tracker Task Tracker Task Tracker
Name Node
Job Tracker
HMaster
Coordination
Region Server Region Server Region Server Region Server
Able to leverage Azure Blob Storage
Pay per use model
Based on Hortonworks Data Platform
Hive
• Hadoop feature to perform data warehouse
operations
• HiveQL
 High-level, SQL-like language, abstraction over MapReduce
 Supports equi-joins
 Schema on read NOT schema on write
 Automatically invokes MapReduce jobs
 Much simpler than using MapReduce directly
• Metadata store
 Contains descriptions of tables
• Acts as a bridge to many BI products which expect
tabular data
Sample Hive queries
Machine learning
finding the needle in the haystack
• Formal definition: “A computer program is said to learn from
experience E with respect to some class of tasks T and performance
measure P, if its performance at tasks in T, as measured by P,
improves with experience E” - Tom M. Mitchell
• Another definition: “The goal of machine learning is to program
computers to use example data or past experience to solve a given
problem.” – Introduction to Machine Learning, 2nd Edition, MIT Press
• ML often involves two primary techniques:
– Supervised Learning: Finding the mapping between inputs and outputs using
correct values to “train” a model
– Unsupervised Learning: Finding patterns in the input data (similar to Density
Estimates in Statistics)
Vision Analytics
Recommendation
engines
Advertising analysis
Weather forecasting for
business planning
Social network analysis
Legal
discovery and document
archiving
Pricing analysis
Fraud
detection
Churn
analysis
Equipment monitoring
Location-based tracking
and services
Personalized Insurance
Some retailers profit
… by predicting major changes in your life.
Steps to build a machine learning solution
Typical machine learning algorithms
• Clustering (k-means, orthogonal partitioning,…)
• Association rule learning ( A priori)
• Regression (linear/logistic)
• Recommendation engines
• Classification (C4.5, decision trees, SVM, Naïve Bayes, AdaBoost, Random Forest, …)
• Similarity matching
• Neural networks
• Bayesian networks
• Genetic algorithms
• Ensembles
See http://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/
And http://www.cs.umd.edu/~samir/498/10Algorithms-08.pdf and
http://www.quora.com/What-are-the-top-10-data-mining-or-machine-learning-algorithms
Doing recommendations – some approaches
• Collaborative filtering
• Feature based recommendations
• K-nearest neighbours
Collaborative filtering
• A set of items
(books, beers,
blogposts,…)
• Ratings from users
• Recommended
items based on
your ratings and
other people’s
ratings
Feature based recommendations
• Use user’s ratings of items
 Create an algorithm to define
which features (metadata ) of
items the user likes
• Requires detailed
information about items -
content based
 An item can be a person as well –
see “People you may know”
• Most approaches combine
“feature based” and
“collaborative filtering”
K-Nearest Neighbours (Classification approach)
• Find ratings from people similar
to you and see what they liked
 Use similarity functions (Minkowski
distance, RMSE, Pearson Correlation
Coefficient,…)
• Take the average ratings of the k
people most similar to you
 Display the items with the highest
averages
• Conclusion – requires solid
background in Math and
Statistics
Machine Learning and Data Scientists
Developing predictive analytics and
machine learning must be simpler,
today it requires specialized skills:
• Data management
• Data exploration
• Math & statistics
• Domain expertise
• Machine learning
• Software development
• Data visualization
65% of enterprise feel they have a
strategic shortage of data scientists, a
role many did not know existed 12
months ago …
Microsoft Azure Machine Learning
Microsoft Azure Machine Learning (Ctd.)
Personalized Workspace
Combine R modules with Microsoft’s
best in class algorithms running Xbox
and Bing
Work with anyone, anywhere by simply
sharing the workspace
Easy Access to All Data
Drop in desktop data sets into the
built-in storage space.
Bring in cloud data with the ease of a
drop down
Deploy Models as Web Services
Operationalize in minutes and refine
models at the speed of the market
Partner Tools
ML partners enjoy SDK access for
robust solutions
Microsoft Azure
Machine Learning Studio
Microsoft Azure
Machine Learning API service
Microsoft Azure
Machine Learning SDK
Agenda
 Introduction to Delve
 Office Graph
 Big Data and Machine Learning
 Building your own Delve - architectural concept
E vent producers
Web logs
Documents &
metadata
Transform Long-term
storage
Azure SQL
Database & Azure
Storage
Predictive
Analytics
Azure
Machine
Learning
Presentation
and action
On premise
Building your own Delve - high level architecture
Building your own Delve – remarks
• Graph technology left out for simplicity
 Take a look at Neo4J or Pegasus on Hadoop if you are interested
• Not very realistic to rebuild Delve but possible to
define point solutions
• If you still go ahead
 Think about the end-to-end data pipeline
 Fast track with Recommendation API in datamarket
http://datamarket.azure.com/dataset/amla/recommendations
 Cache recommendations for performance and cost optimization
 Learn R or Python to extend AzureML capabilities
Online Resources
• www.coursera.org (MOOC)
• Microsoft Virtual Academy
 http://www.microsoftvirtualacademy.com/training-courses/getting-started-with-microsoft-
azure-machine-learning
 http://www.microsoftvirtualacademy.com/training-courses/implementing-big-data-
analysis
• Cloud Data Science process - http://azure.microsoft.com/en-
us/documentation/articles/machine-learning-data-science-how-to-create-machine-learning-service/
• Blogs
 http://blogs.msdn.com/b/benjguin/
 http://hortonworks.com/blog/
 http://blogs.msdn.com/b/bigdatasupport/
 http://blogs.msdn.com/b/big_data_france/
 http://blogs.msdn.com/b/brian_swan/
 http://blogs.msdn.com/b/mwinkle/
 http://blogs.msdn.com/b/avkashchauhan/
 http://blogs.msdn.com/b/carlnol/
 http://blogs.technet.com/b/machinelearning/
Recommended books
Thank you!
How to build your own Delve: combining machine learning, big data and SharePoint

More Related Content

What's hot

Einführung in Neo4j
Einführung in Neo4jEinführung in Neo4j
Einführung in Neo4jNeo4j
 
Power BI For Office 365 Overview
Power BI For Office 365 OverviewPower BI For Office 365 Overview
Power BI For Office 365 OverviewRichard Harbridge
 
Intro to Neo4j Webinar
Intro to Neo4j WebinarIntro to Neo4j Webinar
Intro to Neo4j WebinarNeo4j
 
What is Business Intelligence
What is Business IntelligenceWhat is Business Intelligence
What is Business IntelligenceDries Vyvey
 
Search-Driven Applications with SharePoint 2013 (#SBSBE16)
Search-Driven Applications with SharePoint 2013 (#SBSBE16)Search-Driven Applications with SharePoint 2013 (#SBSBE16)
Search-Driven Applications with SharePoint 2013 (#SBSBE16)Maximilian Melcher
 
Enhancing your career: Building your personal brand
Enhancing your career: Building your personal brandEnhancing your career: Building your personal brand
Enhancing your career: Building your personal brandJames Serra
 
How Lyft Drives Data Discovery
How Lyft Drives Data DiscoveryHow Lyft Drives Data Discovery
How Lyft Drives Data DiscoveryNeo4j
 
Better Insights from Your Master Data - Graph Database LA Meetup
Better Insights from Your Master Data - Graph Database LA MeetupBetter Insights from Your Master Data - Graph Database LA Meetup
Better Insights from Your Master Data - Graph Database LA MeetupBenjamin Nussbaum
 
Graphs for Enterprise Architects
Graphs for Enterprise ArchitectsGraphs for Enterprise Architects
Graphs for Enterprise ArchitectsNeo4j
 
Spsbe 18-04-15 - should i move my network folders to office 365
Spsbe   18-04-15 - should i move my network folders to office 365Spsbe   18-04-15 - should i move my network folders to office 365
Spsbe 18-04-15 - should i move my network folders to office 365BIWUG
 
Building Dynamic Applications on both Office 365 and On-Prem
Building Dynamic Applications on both Office 365 and On-PremBuilding Dynamic Applications on both Office 365 and On-Prem
Building Dynamic Applications on both Office 365 and On-PremChristian Buckley
 
Metadata Management In A Social Media World, Spsbos, 2 2010
Metadata Management In A Social Media World, Spsbos, 2 2010Metadata Management In A Social Media World, Spsbos, 2 2010
Metadata Management In A Social Media World, Spsbos, 2 2010Christian Buckley
 
Sp24 design a share point 2013 architecture – the basics
Sp24   design a share point 2013 architecture – the basicsSp24   design a share point 2013 architecture – the basics
Sp24 design a share point 2013 architecture – the basicsAlexander Meijers
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesNeo4j
 
How Social and the Cloud Impact Your Governance Strategy
How Social and the Cloud Impact Your Governance StrategyHow Social and the Cloud Impact Your Governance Strategy
How Social and the Cloud Impact Your Governance StrategyChristian Buckley
 
Graph Algorithms for Developers
Graph Algorithms for DevelopersGraph Algorithms for Developers
Graph Algorithms for DevelopersNeo4j
 
Data Mashups -Data Science Summit
Data Mashups -Data Science SummitData Mashups -Data Science Summit
Data Mashups -Data Science SummitPeter Skomoroch
 
RDBMS to Graphs
RDBMS to GraphsRDBMS to Graphs
RDBMS to GraphsNeo4j
 
Fried data summit data quality data analytics together
Fried data summit data quality data analytics togetherFried data summit data quality data analytics together
Fried data summit data quality data analytics togetherJeff Fried
 

What's hot (20)

Einführung in Neo4j
Einführung in Neo4jEinführung in Neo4j
Einführung in Neo4j
 
Power BI For Office 365 Overview
Power BI For Office 365 OverviewPower BI For Office 365 Overview
Power BI For Office 365 Overview
 
Intro to Neo4j Webinar
Intro to Neo4j WebinarIntro to Neo4j Webinar
Intro to Neo4j Webinar
 
Is BCS Dead?
Is BCS Dead?Is BCS Dead?
Is BCS Dead?
 
What is Business Intelligence
What is Business IntelligenceWhat is Business Intelligence
What is Business Intelligence
 
Search-Driven Applications with SharePoint 2013 (#SBSBE16)
Search-Driven Applications with SharePoint 2013 (#SBSBE16)Search-Driven Applications with SharePoint 2013 (#SBSBE16)
Search-Driven Applications with SharePoint 2013 (#SBSBE16)
 
Enhancing your career: Building your personal brand
Enhancing your career: Building your personal brandEnhancing your career: Building your personal brand
Enhancing your career: Building your personal brand
 
How Lyft Drives Data Discovery
How Lyft Drives Data DiscoveryHow Lyft Drives Data Discovery
How Lyft Drives Data Discovery
 
Better Insights from Your Master Data - Graph Database LA Meetup
Better Insights from Your Master Data - Graph Database LA MeetupBetter Insights from Your Master Data - Graph Database LA Meetup
Better Insights from Your Master Data - Graph Database LA Meetup
 
Graphs for Enterprise Architects
Graphs for Enterprise ArchitectsGraphs for Enterprise Architects
Graphs for Enterprise Architects
 
Spsbe 18-04-15 - should i move my network folders to office 365
Spsbe   18-04-15 - should i move my network folders to office 365Spsbe   18-04-15 - should i move my network folders to office 365
Spsbe 18-04-15 - should i move my network folders to office 365
 
Building Dynamic Applications on both Office 365 and On-Prem
Building Dynamic Applications on both Office 365 and On-PremBuilding Dynamic Applications on both Office 365 and On-Prem
Building Dynamic Applications on both Office 365 and On-Prem
 
Metadata Management In A Social Media World, Spsbos, 2 2010
Metadata Management In A Social Media World, Spsbos, 2 2010Metadata Management In A Social Media World, Spsbos, 2 2010
Metadata Management In A Social Media World, Spsbos, 2 2010
 
Sp24 design a share point 2013 architecture – the basics
Sp24   design a share point 2013 architecture – the basicsSp24   design a share point 2013 architecture – the basics
Sp24 design a share point 2013 architecture – the basics
 
Intro to Neo4j and Graph Databases
Intro to Neo4j and Graph DatabasesIntro to Neo4j and Graph Databases
Intro to Neo4j and Graph Databases
 
How Social and the Cloud Impact Your Governance Strategy
How Social and the Cloud Impact Your Governance StrategyHow Social and the Cloud Impact Your Governance Strategy
How Social and the Cloud Impact Your Governance Strategy
 
Graph Algorithms for Developers
Graph Algorithms for DevelopersGraph Algorithms for Developers
Graph Algorithms for Developers
 
Data Mashups -Data Science Summit
Data Mashups -Data Science SummitData Mashups -Data Science Summit
Data Mashups -Data Science Summit
 
RDBMS to Graphs
RDBMS to GraphsRDBMS to Graphs
RDBMS to Graphs
 
Fried data summit data quality data analytics together
Fried data summit data quality data analytics togetherFried data summit data quality data analytics together
Fried data summit data quality data analytics together
 

Viewers also liked

Delve and the Office Graph for IT- Pros & Admins
Delve and the Office Graph for IT- Pros & AdminsDelve and the Office Graph for IT- Pros & Admins
Delve and the Office Graph for IT- Pros & AdminsSPC Adriatics
 
What’s new on the Microsoft Azure Data Platform
What’s new on the Microsoft Azure Data Platform What’s new on the Microsoft Azure Data Platform
What’s new on the Microsoft Azure Data Platform Joris Poelmans
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRahul Jain
 
SPSKC Machine Learning in SharePoint
SPSKC Machine Learning in SharePointSPSKC Machine Learning in SharePoint
SPSKC Machine Learning in SharePointNaomi Moneypenny
 
DF1 - ML - Petukhov - Azure Ml Machine Learning as a Service
DF1 - ML - Petukhov - Azure Ml Machine Learning as a ServiceDF1 - ML - Petukhov - Azure Ml Machine Learning as a Service
DF1 - ML - Petukhov - Azure Ml Machine Learning as a ServiceMoscowDataFest
 
Digital Marketing Class: Big data
Digital Marketing Class: Big dataDigital Marketing Class: Big data
Digital Marketing Class: Big dataAlex Brown
 
Big data & Digital Marketing
Big data & Digital MarketingBig data & Digital Marketing
Big data & Digital MarketingKarthik Bharath
 
Azure Machine Learning - A Full Journey
Azure Machine Learning - A Full JourneyAzure Machine Learning - A Full Journey
Azure Machine Learning - A Full JourneySolidQIT
 
Optimizing Your Web Traffic: Turning Data and Insight into Actionable Market...
Optimizing Your  Web Traffic: Turning Data and Insight into Actionable Market...Optimizing Your  Web Traffic: Turning Data and Insight into Actionable Market...
Optimizing Your Web Traffic: Turning Data and Insight into Actionable Market...Alex Harris
 
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...Mark Tabladillo
 
[Research] azure ml anatomy of a machine learning service - Sharat Chikkerur
[Research] azure ml  anatomy of a machine learning service - Sharat Chikkerur[Research] azure ml  anatomy of a machine learning service - Sharat Chikkerur
[Research] azure ml anatomy of a machine learning service - Sharat ChikkerurPAPIs.io
 
La interfaz del servidor de directorios
La interfaz del servidor de directoriosLa interfaz del servidor de directorios
La interfaz del servidor de directoriospaola2545
 
Agentes inteligentes
Agentes inteligentesAgentes inteligentes
Agentes inteligentesmenamigue
 
DL on Azure ML with Python where type DL = Deep Learning | Deep LOVE
DL on Azure ML with Python where type DL = Deep Learning | Deep LOVEDL on Azure ML with Python where type DL = Deep Learning | Deep LOVE
DL on Azure ML with Python where type DL = Deep Learning | Deep LOVEYoshiyuki Nakamura
 
How Machine Learning is Shaping Digital Marketing
How Machine Learning is Shaping Digital MarketingHow Machine Learning is Shaping Digital Marketing
How Machine Learning is Shaping Digital Marketingindico data
 

Viewers also liked (20)

Delve and the Office Graph for IT- Pros & Admins
Delve and the Office Graph for IT- Pros & AdminsDelve and the Office Graph for IT- Pros & Admins
Delve and the Office Graph for IT- Pros & Admins
 
What’s new on the Microsoft Azure Data Platform
What’s new on the Microsoft Azure Data Platform What’s new on the Microsoft Azure Data Platform
What’s new on the Microsoft Azure Data Platform
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Graph
GraphGraph
Graph
 
SPSKC Machine Learning in SharePoint
SPSKC Machine Learning in SharePointSPSKC Machine Learning in SharePoint
SPSKC Machine Learning in SharePoint
 
DF1 - ML - Petukhov - Azure Ml Machine Learning as a Service
DF1 - ML - Petukhov - Azure Ml Machine Learning as a ServiceDF1 - ML - Petukhov - Azure Ml Machine Learning as a Service
DF1 - ML - Petukhov - Azure Ml Machine Learning as a Service
 
Digital Marketing Class: Big data
Digital Marketing Class: Big dataDigital Marketing Class: Big data
Digital Marketing Class: Big data
 
Big data & Digital Marketing
Big data & Digital MarketingBig data & Digital Marketing
Big data & Digital Marketing
 
Big Data of Digital Awesomeness
Big Data of Digital AwesomenessBig Data of Digital Awesomeness
Big Data of Digital Awesomeness
 
Azure Machine Learning - A Full Journey
Azure Machine Learning - A Full JourneyAzure Machine Learning - A Full Journey
Azure Machine Learning - A Full Journey
 
Optimizing Your Web Traffic: Turning Data and Insight into Actionable Market...
Optimizing Your  Web Traffic: Turning Data and Insight into Actionable Market...Optimizing Your  Web Traffic: Turning Data and Insight into Actionable Market...
Optimizing Your Web Traffic: Turning Data and Insight into Actionable Market...
 
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
 
[Research] azure ml anatomy of a machine learning service - Sharat Chikkerur
[Research] azure ml  anatomy of a machine learning service - Sharat Chikkerur[Research] azure ml  anatomy of a machine learning service - Sharat Chikkerur
[Research] azure ml anatomy of a machine learning service - Sharat Chikkerur
 
La interfaz del servidor de directorios
La interfaz del servidor de directoriosLa interfaz del servidor de directorios
La interfaz del servidor de directorios
 
Lenguajes
LenguajesLenguajes
Lenguajes
 
Agentes inteligentes
Agentes inteligentesAgentes inteligentes
Agentes inteligentes
 
DL on Azure ML with Python where type DL = Deep Learning | Deep LOVE
DL on Azure ML with Python where type DL = Deep Learning | Deep LOVEDL on Azure ML with Python where type DL = Deep Learning | Deep LOVE
DL on Azure ML with Python where type DL = Deep Learning | Deep LOVE
 
Pres eraa
Pres eraaPres eraa
Pres eraa
 
How Machine Learning is Shaping Digital Marketing
How Machine Learning is Shaping Digital MarketingHow Machine Learning is Shaping Digital Marketing
How Machine Learning is Shaping Digital Marketing
 
CRM Process
CRM ProcessCRM Process
CRM Process
 

Similar to How to build your own Delve: combining machine learning, big data and SharePoint

How does Microsoft solve Big Data?
How does Microsoft solve Big Data?How does Microsoft solve Big Data?
How does Microsoft solve Big Data?James Serra
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategyJames Serra
 
Microsoft Azure BI Solutions in the Cloud
Microsoft Azure BI Solutions in the CloudMicrosoft Azure BI Solutions in the Cloud
Microsoft Azure BI Solutions in the CloudMark Kromer
 
Cloud as a Data Platform
Cloud as a Data PlatformCloud as a Data Platform
Cloud as a Data PlatformAndrei Savu
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...Mihai Criveti
 
Developing and deploying AI solutions on the cloud using Team Data Science Pr...
Developing and deploying AI solutions on the cloud using Team Data Science Pr...Developing and deploying AI solutions on the cloud using Team Data Science Pr...
Developing and deploying AI solutions on the cloud using Team Data Science Pr...Debraj GuhaThakurta
 
Big Data for Data Scientists - Info Session
Big Data for Data Scientists - Info SessionBig Data for Data Scientists - Info Session
Big Data for Data Scientists - Info SessionWeCloudData
 
Big Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKBig Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKRajesh Jayarman
 
SMAC - Social, Mobile, Analytics and Cloud - An overview
SMAC - Social, Mobile, Analytics and Cloud - An overview SMAC - Social, Mobile, Analytics and Cloud - An overview
SMAC - Social, Mobile, Analytics and Cloud - An overview Rajesh Menon
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Perficient, Inc.
 
Digital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdfDigital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdfssuserd23711
 
K-MUG Azure Machine Learning
K-MUG Azure Machine LearningK-MUG Azure Machine Learning
K-MUG Azure Machine LearningPraveen Nair
 
OSCON 2014: Data Workflows for Machine Learning
OSCON 2014: Data Workflows for Machine LearningOSCON 2014: Data Workflows for Machine Learning
OSCON 2014: Data Workflows for Machine LearningPaco Nathan
 
IARE_BDBA_ PPT_0.pptx
IARE_BDBA_ PPT_0.pptxIARE_BDBA_ PPT_0.pptx
IARE_BDBA_ PPT_0.pptxAIMLSEMINARS
 
Microsoft Fabric Introduction
Microsoft Fabric IntroductionMicrosoft Fabric Introduction
Microsoft Fabric IntroductionJames Serra
 
Hadoop meets Agile! - An Agile Big Data Model
Hadoop meets Agile! - An Agile Big Data ModelHadoop meets Agile! - An Agile Big Data Model
Hadoop meets Agile! - An Agile Big Data ModelUwe Printz
 
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientistAi & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientistAlberto Diaz Martin
 

Similar to How to build your own Delve: combining machine learning, big data and SharePoint (20)

Data engineering design patterns
Data engineering design patternsData engineering design patterns
Data engineering design patterns
 
IT webinar 2016
IT webinar 2016IT webinar 2016
IT webinar 2016
 
How does Microsoft solve Big Data?
How does Microsoft solve Big Data?How does Microsoft solve Big Data?
How does Microsoft solve Big Data?
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategy
 
Microsoft Azure BI Solutions in the Cloud
Microsoft Azure BI Solutions in the CloudMicrosoft Azure BI Solutions in the Cloud
Microsoft Azure BI Solutions in the Cloud
 
Cloud as a Data Platform
Cloud as a Data PlatformCloud as a Data Platform
Cloud as a Data Platform
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
 
Developing and deploying AI solutions on the cloud using Team Data Science Pr...
Developing and deploying AI solutions on the cloud using Team Data Science Pr...Developing and deploying AI solutions on the cloud using Team Data Science Pr...
Developing and deploying AI solutions on the cloud using Team Data Science Pr...
 
Big Data for Data Scientists - Info Session
Big Data for Data Scientists - Info SessionBig Data for Data Scientists - Info Session
Big Data for Data Scientists - Info Session
 
Big Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKBig Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RK
 
SMAC - Social, Mobile, Analytics and Cloud - An overview
SMAC - Social, Mobile, Analytics and Cloud - An overview SMAC - Social, Mobile, Analytics and Cloud - An overview
SMAC - Social, Mobile, Analytics and Cloud - An overview
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
 
Digital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdfDigital_IOT_(Microsoft_Solution).pdf
Digital_IOT_(Microsoft_Solution).pdf
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Big Data in Azure
 
K-MUG Azure Machine Learning
K-MUG Azure Machine LearningK-MUG Azure Machine Learning
K-MUG Azure Machine Learning
 
OSCON 2014: Data Workflows for Machine Learning
OSCON 2014: Data Workflows for Machine LearningOSCON 2014: Data Workflows for Machine Learning
OSCON 2014: Data Workflows for Machine Learning
 
IARE_BDBA_ PPT_0.pptx
IARE_BDBA_ PPT_0.pptxIARE_BDBA_ PPT_0.pptx
IARE_BDBA_ PPT_0.pptx
 
Microsoft Fabric Introduction
Microsoft Fabric IntroductionMicrosoft Fabric Introduction
Microsoft Fabric Introduction
 
Hadoop meets Agile! - An Agile Big Data Model
Hadoop meets Agile! - An Agile Big Data ModelHadoop meets Agile! - An Agile Big Data Model
Hadoop meets Agile! - An Agile Big Data Model
 
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientistAi & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientist
 

More from Joris Poelmans

Dynamics Power! Saturday Brussels 2019 - transitioning to the unified interface
Dynamics Power! Saturday Brussels 2019 - transitioning to the unified interfaceDynamics Power! Saturday Brussels 2019 - transitioning to the unified interface
Dynamics Power! Saturday Brussels 2019 - transitioning to the unified interfaceJoris Poelmans
 
GDPR and Dynamics 365 - the Waldorf and Statler perspective
GDPR and Dynamics 365 - the Waldorf and Statler perspectiveGDPR and Dynamics 365 - the Waldorf and Statler perspective
GDPR and Dynamics 365 - the Waldorf and Statler perspectiveJoris Poelmans
 
CRM UG Belux March 2017 - Power BI and Dynamics 365
CRM UG Belux March 2017 - Power BI and Dynamics 365CRM UG Belux March 2017 - Power BI and Dynamics 365
CRM UG Belux March 2017 - Power BI and Dynamics 365Joris Poelmans
 
Dynamics 365 Saturday Amsterdam 02/2018 - Dynamics 365 and chatbots
Dynamics 365 Saturday Amsterdam 02/2018 - Dynamics 365 and chatbotsDynamics 365 Saturday Amsterdam 02/2018 - Dynamics 365 and chatbots
Dynamics 365 Saturday Amsterdam 02/2018 - Dynamics 365 and chatbotsJoris Poelmans
 
imec Share - An Office 365 customer case
imec Share - An Office 365 customer caseimec Share - An Office 365 customer case
imec Share - An Office 365 customer caseJoris Poelmans
 
IMEC Share - Innovate, collaborate and excel
IMEC Share - Innovate, collaborate and excelIMEC Share - Innovate, collaborate and excel
IMEC Share - Innovate, collaborate and excelJoris Poelmans
 
The future of business process apps - a Microsoft perspective
The future of business process apps - a Microsoft perspectiveThe future of business process apps - a Microsoft perspective
The future of business process apps - a Microsoft perspectiveJoris Poelmans
 
Yammer Social Data Mining
Yammer Social Data MiningYammer Social Data Mining
Yammer Social Data MiningJoris Poelmans
 
MSDN - SharePoint 2013 to app or not to app
MSDN - SharePoint 2013 to app or not to appMSDN - SharePoint 2013 to app or not to app
MSDN - SharePoint 2013 to app or not to appJoris Poelmans
 
Everything you always wanted to know about SharePoint 2013 Search relevance
Everything you always wanted to know about SharePoint 2013 Search relevanceEverything you always wanted to know about SharePoint 2013 Search relevance
Everything you always wanted to know about SharePoint 2013 Search relevanceJoris Poelmans
 
The Connected Company - Event Anders Vergaderen
The Connected Company - Event Anders VergaderenThe Connected Company - Event Anders Vergaderen
The Connected Company - Event Anders VergaderenJoris Poelmans
 
Building search-driven Windows 8 and Windows Phone 8 apps for SharePoint Serv...
Building search-driven Windows 8 and Windows Phone 8 apps for SharePoint Serv...Building search-driven Windows 8 and Windows Phone 8 apps for SharePoint Serv...
Building search-driven Windows 8 and Windows Phone 8 apps for SharePoint Serv...Joris Poelmans
 
Intro to MUI and variations in SharePoint 2010
Intro to MUI and variations in SharePoint 2010Intro to MUI and variations in SharePoint 2010
Intro to MUI and variations in SharePoint 2010Joris Poelmans
 
Building the SharePoint hot or not app ... or how not sell social to your boss
Building the SharePoint hot or not app ... or how not sell social to your bossBuilding the SharePoint hot or not app ... or how not sell social to your boss
Building the SharePoint hot or not app ... or how not sell social to your bossJoris Poelmans
 
SharePoint Server 2013 : The big five
SharePoint Server 2013 : The big fiveSharePoint Server 2013 : The big five
SharePoint Server 2013 : The big fiveJoris Poelmans
 
Apps for Office Introduction
Apps for Office IntroductionApps for Office Introduction
Apps for Office IntroductionJoris Poelmans
 
Fun with Social, Windows 8 and Javascript
Fun with Social, Windows 8 and JavascriptFun with Social, Windows 8 and Javascript
Fun with Social, Windows 8 and JavascriptJoris Poelmans
 
Exploring search driven applications with SharePoint 2013
Exploring search driven applications with SharePoint 2013Exploring search driven applications with SharePoint 2013
Exploring search driven applications with SharePoint 2013Joris Poelmans
 
SharePoint 2013 - What's new for Devs - Belgian IT Bootcamp 2012
SharePoint 2013 - What's new for Devs - Belgian IT Bootcamp 2012SharePoint 2013 - What's new for Devs - Belgian IT Bootcamp 2012
SharePoint 2013 - What's new for Devs - Belgian IT Bootcamp 2012Joris Poelmans
 
Building your first Windows Phone 7 application for SharePoint
Building your first Windows Phone 7 application for SharePointBuilding your first Windows Phone 7 application for SharePoint
Building your first Windows Phone 7 application for SharePointJoris Poelmans
 

More from Joris Poelmans (20)

Dynamics Power! Saturday Brussels 2019 - transitioning to the unified interface
Dynamics Power! Saturday Brussels 2019 - transitioning to the unified interfaceDynamics Power! Saturday Brussels 2019 - transitioning to the unified interface
Dynamics Power! Saturday Brussels 2019 - transitioning to the unified interface
 
GDPR and Dynamics 365 - the Waldorf and Statler perspective
GDPR and Dynamics 365 - the Waldorf and Statler perspectiveGDPR and Dynamics 365 - the Waldorf and Statler perspective
GDPR and Dynamics 365 - the Waldorf and Statler perspective
 
CRM UG Belux March 2017 - Power BI and Dynamics 365
CRM UG Belux March 2017 - Power BI and Dynamics 365CRM UG Belux March 2017 - Power BI and Dynamics 365
CRM UG Belux March 2017 - Power BI and Dynamics 365
 
Dynamics 365 Saturday Amsterdam 02/2018 - Dynamics 365 and chatbots
Dynamics 365 Saturday Amsterdam 02/2018 - Dynamics 365 and chatbotsDynamics 365 Saturday Amsterdam 02/2018 - Dynamics 365 and chatbots
Dynamics 365 Saturday Amsterdam 02/2018 - Dynamics 365 and chatbots
 
imec Share - An Office 365 customer case
imec Share - An Office 365 customer caseimec Share - An Office 365 customer case
imec Share - An Office 365 customer case
 
IMEC Share - Innovate, collaborate and excel
IMEC Share - Innovate, collaborate and excelIMEC Share - Innovate, collaborate and excel
IMEC Share - Innovate, collaborate and excel
 
The future of business process apps - a Microsoft perspective
The future of business process apps - a Microsoft perspectiveThe future of business process apps - a Microsoft perspective
The future of business process apps - a Microsoft perspective
 
Yammer Social Data Mining
Yammer Social Data MiningYammer Social Data Mining
Yammer Social Data Mining
 
MSDN - SharePoint 2013 to app or not to app
MSDN - SharePoint 2013 to app or not to appMSDN - SharePoint 2013 to app or not to app
MSDN - SharePoint 2013 to app or not to app
 
Everything you always wanted to know about SharePoint 2013 Search relevance
Everything you always wanted to know about SharePoint 2013 Search relevanceEverything you always wanted to know about SharePoint 2013 Search relevance
Everything you always wanted to know about SharePoint 2013 Search relevance
 
The Connected Company - Event Anders Vergaderen
The Connected Company - Event Anders VergaderenThe Connected Company - Event Anders Vergaderen
The Connected Company - Event Anders Vergaderen
 
Building search-driven Windows 8 and Windows Phone 8 apps for SharePoint Serv...
Building search-driven Windows 8 and Windows Phone 8 apps for SharePoint Serv...Building search-driven Windows 8 and Windows Phone 8 apps for SharePoint Serv...
Building search-driven Windows 8 and Windows Phone 8 apps for SharePoint Serv...
 
Intro to MUI and variations in SharePoint 2010
Intro to MUI and variations in SharePoint 2010Intro to MUI and variations in SharePoint 2010
Intro to MUI and variations in SharePoint 2010
 
Building the SharePoint hot or not app ... or how not sell social to your boss
Building the SharePoint hot or not app ... or how not sell social to your bossBuilding the SharePoint hot or not app ... or how not sell social to your boss
Building the SharePoint hot or not app ... or how not sell social to your boss
 
SharePoint Server 2013 : The big five
SharePoint Server 2013 : The big fiveSharePoint Server 2013 : The big five
SharePoint Server 2013 : The big five
 
Apps for Office Introduction
Apps for Office IntroductionApps for Office Introduction
Apps for Office Introduction
 
Fun with Social, Windows 8 and Javascript
Fun with Social, Windows 8 and JavascriptFun with Social, Windows 8 and Javascript
Fun with Social, Windows 8 and Javascript
 
Exploring search driven applications with SharePoint 2013
Exploring search driven applications with SharePoint 2013Exploring search driven applications with SharePoint 2013
Exploring search driven applications with SharePoint 2013
 
SharePoint 2013 - What's new for Devs - Belgian IT Bootcamp 2012
SharePoint 2013 - What's new for Devs - Belgian IT Bootcamp 2012SharePoint 2013 - What's new for Devs - Belgian IT Bootcamp 2012
SharePoint 2013 - What's new for Devs - Belgian IT Bootcamp 2012
 
Building your first Windows Phone 7 application for SharePoint
Building your first Windows Phone 7 application for SharePointBuilding your first Windows Phone 7 application for SharePoint
Building your first Windows Phone 7 application for SharePoint
 

Recently uploaded

Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024Timothy Spann
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 

Recently uploaded (20)

Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 

How to build your own Delve: combining machine learning, big data and SharePoint

  • 1. How to build your own Delve: combining machine learning, big data and SharePoint #SPSBE11 Joris Poelmans April 18th, 2015
  • 4. Agenda  Introduction to Delve  Office Graph  Big Data and Machine Learning  Building your own Delve - architectural concept
  • 5. Agenda  Introduction to Delve  Office Graph  Big Data and Machine Learning  Building your own Delve - architectural concept
  • 6. Stay In the Know Find What you Need Discover New Connections Connect with the right experts and learn more about their content. Find just the right results from any source and take action Discover new information tailored to you from your network Delve – Search and Discovery Across O365 Powered by Office Graph
  • 7.
  • 8. Agenda  Introduction to Delve  Office Graph  Big Data and Machine Learning  Building your own Delve - architectural concept
  • 9. What is The Office Graph? User Documents People Conversations
  • 10. What is The Office Graph? Manager Direct report Works with Shared with me Viewed by me Trending around me Presented to me Liked by me
  • 12.
  • 13. Signals sent from Delve, Exchange, O365, … Click person Modify/Save Elevate Share Follow Like Comments Email Ignore Presented to Shown document Open document Shown board ++
  • 14. Content and signals across O365 auto- populating the Office Graph insights Insights derived with machine learning for proactive and intelligent experiences
  • 15. Agenda  Introduction to Delve  Office Graph  Big Data and Machine Learning  Building your own Delve - architectural concept
  • 16. Big data is what happened when the cost of storing user data became cheaper than making the decision to throw it away
  • 17. Transactions + Interactions + Observations = Big Data Megabytes Gigabytes Terabytes Petabytes Purchase detail Purchase record Payment record ERP CRM WEB Offer details Support Contacts Customer Touches Segmentation Web logs Offer history A/B testing Dynamic Pricing Affiliate Networks Search Marketing Behavioral Targeting Dynamic Funnels User Generated Content Mobile Web SMS/MMSSentiment External Demographics HD Video, Audio, Images Speech to Text Product/Service Logs Social Interactions & Feeds Business Data Feeds User Click Stream Sensors / RFID / Devices Spatial & GPS Coordinates Increasing Data Variety and Complexity
  • 18. Big Data Core Technology landscape • New paradigm for storing data • 100+ Non-SQL DB’s and growing • Support SQL querying • Internal architecture different from classic DBs • Appliances • Teradata • Microsoft PDW/APS • Oracle BDA X4-2 • Hadoop/HDFS+ MapReduce • Key Big Data technology Hadoop MPP NoSQLNewSQL
  • 19. Modern Data Architecture • Apache Hadoop is an open source framework that supports data- intensive distributed applications  Uses HDFS storage to enable applications to work with 1000s of nodes and petabytes of data using a scale-out model  Uses MapReduce to process data  Inspired by Google  MapReduce  Google File System  Related projects:  HBase, Hive, Mahout, Pig,Sqoop, Ambari, Storm, Zookeeper, ... And many more
  • 20. HDFS and MapReduce in a nutshell
  • 21. Hadoop components Distributed Storage (HDFS) Hive Distributed Processing (MapReduce) Pig HBase HCatalog DataIntegration (ODBC/SQOOP/REST/Flume) MahoutPegasus Rhadoop Oozie Data integration Data access Hadoop core Operations AmbariZookeeper StormKafka http://jopx.blogspot.be/2015/03/overview-of-apache-hadoop-components-in.html
  • 22. Microsoft Azure HDInsight Support HBase as NoSQL columnar database on Azure Blobs Support Storm as stream processing Hadoop in Azure Data Node Data Node Data Node Data Node Task Tracker Task Tracker Task Tracker Task Tracker Name Node Job Tracker HMaster Coordination Region Server Region Server Region Server Region Server Able to leverage Azure Blob Storage Pay per use model Based on Hortonworks Data Platform
  • 23.
  • 24. Hive • Hadoop feature to perform data warehouse operations • HiveQL  High-level, SQL-like language, abstraction over MapReduce  Supports equi-joins  Schema on read NOT schema on write  Automatically invokes MapReduce jobs  Much simpler than using MapReduce directly • Metadata store  Contains descriptions of tables • Acts as a bridge to many BI products which expect tabular data
  • 26.
  • 27.
  • 28. Machine learning finding the needle in the haystack • Formal definition: “A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E” - Tom M. Mitchell • Another definition: “The goal of machine learning is to program computers to use example data or past experience to solve a given problem.” – Introduction to Machine Learning, 2nd Edition, MIT Press • ML often involves two primary techniques: – Supervised Learning: Finding the mapping between inputs and outputs using correct values to “train” a model – Unsupervised Learning: Finding patterns in the input data (similar to Density Estimates in Statistics)
  • 29. Vision Analytics Recommendation engines Advertising analysis Weather forecasting for business planning Social network analysis Legal discovery and document archiving Pricing analysis Fraud detection Churn analysis Equipment monitoring Location-based tracking and services Personalized Insurance
  • 30. Some retailers profit … by predicting major changes in your life.
  • 31. Steps to build a machine learning solution
  • 32. Typical machine learning algorithms • Clustering (k-means, orthogonal partitioning,…) • Association rule learning ( A priori) • Regression (linear/logistic) • Recommendation engines • Classification (C4.5, decision trees, SVM, Naïve Bayes, AdaBoost, Random Forest, …) • Similarity matching • Neural networks • Bayesian networks • Genetic algorithms • Ensembles See http://machinelearningmastery.com/a-tour-of-machine-learning-algorithms/ And http://www.cs.umd.edu/~samir/498/10Algorithms-08.pdf and http://www.quora.com/What-are-the-top-10-data-mining-or-machine-learning-algorithms
  • 33. Doing recommendations – some approaches • Collaborative filtering • Feature based recommendations • K-nearest neighbours
  • 34. Collaborative filtering • A set of items (books, beers, blogposts,…) • Ratings from users • Recommended items based on your ratings and other people’s ratings
  • 35. Feature based recommendations • Use user’s ratings of items  Create an algorithm to define which features (metadata ) of items the user likes • Requires detailed information about items - content based  An item can be a person as well – see “People you may know” • Most approaches combine “feature based” and “collaborative filtering”
  • 36. K-Nearest Neighbours (Classification approach) • Find ratings from people similar to you and see what they liked  Use similarity functions (Minkowski distance, RMSE, Pearson Correlation Coefficient,…) • Take the average ratings of the k people most similar to you  Display the items with the highest averages • Conclusion – requires solid background in Math and Statistics
  • 37. Machine Learning and Data Scientists Developing predictive analytics and machine learning must be simpler, today it requires specialized skills: • Data management • Data exploration • Math & statistics • Domain expertise • Machine learning • Software development • Data visualization 65% of enterprise feel they have a strategic shortage of data scientists, a role many did not know existed 12 months ago …
  • 39. Microsoft Azure Machine Learning (Ctd.) Personalized Workspace Combine R modules with Microsoft’s best in class algorithms running Xbox and Bing Work with anyone, anywhere by simply sharing the workspace Easy Access to All Data Drop in desktop data sets into the built-in storage space. Bring in cloud data with the ease of a drop down Deploy Models as Web Services Operationalize in minutes and refine models at the speed of the market Partner Tools ML partners enjoy SDK access for robust solutions Microsoft Azure Machine Learning Studio Microsoft Azure Machine Learning API service Microsoft Azure Machine Learning SDK
  • 40.
  • 41. Agenda  Introduction to Delve  Office Graph  Big Data and Machine Learning  Building your own Delve - architectural concept
  • 42. E vent producers Web logs Documents & metadata Transform Long-term storage Azure SQL Database & Azure Storage Predictive Analytics Azure Machine Learning Presentation and action On premise Building your own Delve - high level architecture
  • 43. Building your own Delve – remarks • Graph technology left out for simplicity  Take a look at Neo4J or Pegasus on Hadoop if you are interested • Not very realistic to rebuild Delve but possible to define point solutions • If you still go ahead  Think about the end-to-end data pipeline  Fast track with Recommendation API in datamarket http://datamarket.azure.com/dataset/amla/recommendations  Cache recommendations for performance and cost optimization  Learn R or Python to extend AzureML capabilities
  • 44. Online Resources • www.coursera.org (MOOC) • Microsoft Virtual Academy  http://www.microsoftvirtualacademy.com/training-courses/getting-started-with-microsoft- azure-machine-learning  http://www.microsoftvirtualacademy.com/training-courses/implementing-big-data- analysis • Cloud Data Science process - http://azure.microsoft.com/en- us/documentation/articles/machine-learning-data-science-how-to-create-machine-learning-service/ • Blogs  http://blogs.msdn.com/b/benjguin/  http://hortonworks.com/blog/  http://blogs.msdn.com/b/bigdatasupport/  http://blogs.msdn.com/b/big_data_france/  http://blogs.msdn.com/b/brian_swan/  http://blogs.msdn.com/b/mwinkle/  http://blogs.msdn.com/b/avkashchauhan/  http://blogs.msdn.com/b/carlnol/  http://blogs.technet.com/b/machinelearning/