Enterprise Data Mining with SQL Server

        Mark Tabladillo Ph.D.
        Microsoft MVP
        MarkTab Consulting


        March 21, 2012
About Mark Tabladillo

    • 20 Years in Atlanta, Georgia
    • Consulting since 1998; Incorporated 2003
      – Part-Time Faculty at University of Phoenix
    • SAS and Microsoft Expert
      – Presenter since 1998 at conferences like Microsoft
        TechEd and SAS Global Forum
    • Taught statistics at undergraduate and graduate level
    • Blog: http://marktab.net    @MarkTabNet



Enterprise:
Leaders of Leaders of Leaders
Enterprise Challenge
“Data Mining”
Definitions

Phrase               Goal
“Data Mining”        Inform actionable decisions
“Machine Learning”   Determine best performing algorithm
Data Mining > Just Drilldown

Query     Typical Result
T-SQL     Exact values and calculations
MDX       Exact values and calculations
DAX       Exact values and calculations
DMX       Values plus probabilities
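
The contrast in the table above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not DMX and not the speaker's demo: the toy records, the field names, and both function names are hypothetical, and the "prediction" is a plain frequency estimate rather than any Analysis Services algorithm. It shows a drilldown-style query returning an exact count, while a mining-style query returns a value together with a probability.

```python
from collections import Counter

# Hypothetical toy purchase records (illustrative only, not Contoso Retail data).
purchases = [
    {"region": "North", "bought_bike": True},
    {"region": "North", "bought_bike": True},
    {"region": "North", "bought_bike": False},
    {"region": "South", "bought_bike": False},
    {"region": "South", "bought_bike": False},
    {"region": "South", "bought_bike": True},
    {"region": "South", "bought_bike": False},
]


def exact_count(rows, region):
    """Drilldown-style answer: the exact number of buyers in a region."""
    return sum(1 for r in rows if r["region"] == region and r["bought_bike"])


def predict_with_probability(rows, region):
    """Mining-style answer: the most frequent outcome and its estimated probability."""
    outcomes = Counter(r["bought_bike"] for r in rows if r["region"] == region)
    total = sum(outcomes.values())
    if total == 0:
        raise ValueError(f"no rows for region {region!r}")
    value, count = outcomes.most_common(1)[0]
    return value, count / total


print(exact_count(purchases, "North"))               # exact value
print(predict_with_probability(purchases, "South"))  # value plus probability
```

The first call answers "how many did buy?", while the second answers "what is a new customer in this region likely to do, and how sure are we?".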
SQL Server 2008 R2:
Physical and Logical
OLAP Engine
Physical Architecture
• http://msdn.microsoft.com/en-us/library/ms174776.aspx
Analysis Services
Logical Architecture
• http://msdn.microsoft.com/en-us/library/ms174587.aspx
Outline

• Contoso Retail and Fundamentals
• Enterprise-Level Data Mining Demo for
  SQL Server
• What is my next step?
What is Contoso Retail?

• Demonstration dataset for SQL Server
  Database Engine and Analysis Services
• http://www.microsoft.com/downloads/en/details.aspx?displaylang=en&FamilyID=868662dc-187a-4a85-b611-b7df7dc909fc
What are the fundamentals?

• ‘Readin’ (Reading)
• ‘Ritin’ (Writing)
• ‘Rithmetic (Arithmetic)
What Enterprise Tools support Data
Mining?

• SQL Server Management Studio (SSMS)
• Business Intelligence Development Studio
  (BIDS)
  – SQL Server Integration Services (SSIS)
• PowerShell version 2
What Enterprise Tools support Data
Mining?



• Data Mining: SSMS, SSIS, PowerShell
Variable content types (figure, values 0 to 7)

• Discrete
• Continuous
• Discretized
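
To make the difference between continuous and discretized variables concrete, here is a minimal Python sketch. It assumes simple equal-width bucketing, chosen only because it is easy to follow; it is not necessarily the method Analysis Services applies, and the function name and sample readings are hypothetical.

```python
def discretize(values, bucket_count):
    """Assign each continuous value to one of bucket_count equal-width buckets.

    Equal-width bucketing is an assumption made for illustration only.
    """
    if bucket_count < 1:
        raise ValueError("bucket_count must be at least 1")
    low, high = min(values), max(values)
    if low == high:
        # All values are identical, so everything falls into the first bucket.
        return [0] * len(values)
    width = (high - low) / bucket_count
    # The maximum value would compute to index bucket_count, so clamp it
    # into the last bucket.
    return [min(int((v - low) / width), bucket_count - 1) for v in values]


# Hypothetical continuous readings on the 0 to 7 scale used in the figure.
readings = [0.0, 1.5, 2.2, 3.9, 5.1, 6.8, 7.0]
buckets = discretize(readings, 4)
print(buckets)
```

Each reading is replaced by a bucket index, so a model that expects discrete inputs can still use the continuous column.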
Documentation

• Data Mining Structures
 – http://msdn.microsoft.com/en-us/library/cc645741.aspx
 – http://msdn.microsoft.com/en-us/library/ms174757.aspx
• Data Mining Models
 – http://msdn.microsoft.com/en-us/library/cc645779.aspx
Contoso Retail:
Enterprise Data Mining

   Demonstration
What is my next step?

• SQL Server 2008 R2 Enterprise
  (includes database engine, Analysis Services,
  SSMS and BIDS)
 – http://www.microsoft.com/sqlserver/2008/en/us/trial-software.aspx
• Microsoft Office 2010 Professional
 – http://office.microsoft.com/en-us/try
• PowerShell 2.0
 – http://support.microsoft.com/kb/968929
• Data Mining Portal and Blog
 – http://www.marktab.net
Conclusion

  • Data mining leaders can tackle enterprise
    data mining challenges with
    – SQL Server Management Studio
    – Business Intelligence Development Studio
    – PowerShell version 2
  • Become leaders of leaders of leaders
Where Can I Find More Information?

•   http://marktab.net Data Mining Resource
•   http://marktab.net/datamining Data Mining Blog
•   http://sqlserverdatamining.com SQL Server Data Mining
•   http://technet.microsoft.com Microsoft’s TechNet
Graphics

• Ship graphics Copyright © 1995-2006 Nova Development
  and its licensors. All rights reserved. Used with
  permission.
Abstract

     This presentation introduces SQL Server Data Mining (SSDM) for SQL
     Server Professionals based on the speaker's past presentation for
     Microsoft TechEd. Starting with SQL Server Management Studio
     (SSMS), the demo includes the interfaces important for professional
     development, including Business Intelligence Development Studio
     (BIDS), highlighting Integration Services, and PowerShell. The
     interactive demos are based on Microsoft's Contoso Retail sample
     data. Finally, we will evaluate where Microsoft data mining can help you
     in a practical business environment, which may include Oracle and
     SAS.

     Online Video:
     http://channel9.msdn.com/Events/TechEd/NorthAmerica/2011/DBI326


Thank You to our Sponsors

24 Hours of PASS -- Enterprise Data Mining with SQL Server
