SlideShare a Scribd company logo
1 of 22
©2014 DesignMind. All Rights Reserved. 
Big Data and Data Science at DesignMind
2 
©2014 DesignMind. All Rights Reserved. 
DesignMind’s Expertise and Offering 
Power BI 
Applications 
Databases 
Data Warehousing 
Big Data 
BI & Data Visualization 
Information Sharing & Collaboration 
Cloud Computing 
Data Science
3 
©2014 DesignMind. All Rights Reserved. 
Our Clients
4 
©2014 DesignMind. All Rights Reserved. 
Agenda 
Big Data 
Data Science 
Big Data and Data Science at DesignMind 
Partners & Products 
The Team
5 
©2014 DesignMind. All Rights Reserved.
6 
©2014 DesignMind. All Rights Reserved. 
What is Big Data? 
 
Large data sets 
 
excessive retrieval and processing time 
 
structured and unstructured collections 
BIG DATA
7 
©2014 DesignMind. All Rights Reserved. 
SQL vs. Big Data 
 
volume velocity variety 
 
New tools and methodologies are needed! 
Volume Velocity VarietySQLBIG DATA
8 
©2014 DesignMind. All Rights Reserved. 
Examples of Big Data Applications 
Vertical 
Use Case 
Financial Services 
New Accounts Risk Assessment 
Loans and Credit Cards Application Processing 
Fraud Prevention 
Insurance Underwriting 
Cross-sell and Up-sell Consideration 
Trading Risk 
Telecom 
Call Detail Records (CDRs) 
Real-Time Bandwidth Allocation 
Infrastructure Investment 
New Product Development 
Next Product to Buy 
Retail 
360' View of the Customer 
Brand Sentiment Analysis 
Localized, Personalized Promotions 
Website Optimization 
Optimal Store Layout 
Manufacturing 
Supply Chain and Logistics 
Assembly Line Quality Assurance 
Proactive Maintenance 
Services 
Resources Allocation 
Quality Assurance 
Workflows Routing
9 
©2014 DesignMind. All Rights Reserved.
10 
©2014 DesignMind. All Rights Reserved. 
Big Data Analysis vs. Traditional Data Analysis 
 
different tools and methodologies 
 
same underlying process 
- 
- 
- 
 
common goal 
1. Collect 
2. Prepare 
3. Examine 
4. Model 
5. Decide 
6. Act 
Decision Cycle
11 
©2014 DesignMind. All Rights Reserved. 
Why Data Science? 
 
broader skillset 
Technology 
Business 
Statistics & Math 
IT Professionals 
DATA 
SCIENTISTS
12 
©2014 DesignMind. All Rights Reserved. 
BI vs. Data Science 
 
Broader skillsetbroaderanalyticsspectrum 
SQL Analytics 
Descriptive Analytics 
Data Mining 
Predictive Analytics 
Simulation 
Optimization 
- 
Count 
- 
Mean 
- 
OLAP 
- 
Univariate Distribution 
- 
Central tendency 
- 
Dispersion 
- 
Association rules 
- 
Clustering 
- 
Features extraction 
- 
Classification 
- 
Regression 
- 
Forecasting 
- 
Spatial 
- 
Machine learnings 
- 
Text analytics 
- 
Monte Carlo 
- 
Agent-based modeling 
- 
Discrete event modeling 
- 
Linear optimization 
- 
Non-linear optimization
13 
©2014 DesignMind. All Rights Reserved.
14 
©2014 DesignMind. All Rights Reserved. 
Our Goal 
 
to help our customers to fully exploit their big data 
- 
high-quality data 
- 
tools 
- 
services
15 
©2014 DesignMind. All Rights Reserved. 
High-Quality Data is … 
 
T 
 
R 
 
A 
 
C 
 
E
16 
©2014 DesignMind. All Rights Reserved. 
Tools for… 
 
D 
 
R 
 
E 
 
A 
 
M
17 
©2014 DesignMind. All Rights Reserved. 
Services to better exploit data and use tools … 
 
O 
 
P 
 
T 
 
I 
 
C
18 
©2014 DesignMind. All Rights Reserved.
19 
©2014 DesignMind. All Rights Reserved. 
Cloudera is the leader in Apache Hadoop-based software and services and offers a powerful new data platform that enables enterprises and organizations to look at all their data (structured as well as unstructured) and ask bigger questions for unprecedented insight at the speed of thought. 
Platfora's Big Data Analytics Platform masks the complexity of Hadoop, making it easy for customers to understand all the facts in their business across events, actions, behaviors and time. 
Tableau makes it easy for people to rapidly transform data into smart business analytics. 
Windows Azure HDInsight is Microsoft’s easy to manage, agile, and open Enterprise-ready Hadoop in the cloud. 
Enterprises can combine both relation and non-relational data with the new PolyBase, available in SQL Server 2012 Parallel Data Warehouse. 
Partners
20 
©2014 DesignMind. All Rights Reserved. 
Products 
Pig
21 
©2014 DesignMind. All Rights Reserved. 
 
big data better questions better answers  better decisions 
big data to convert the invisible into the visible
22 
©2014 DesignMind. All Rights Reserved. 
www.designmind.com

More Related Content

More from Mark Ginnebaugh

More from Mark Ginnebaugh (20)

Microsoft SQL Server Continuous Integration
Microsoft SQL Server Continuous IntegrationMicrosoft SQL Server Continuous Integration
Microsoft SQL Server Continuous Integration
 
Hortonworks Big Data & Hadoop
Hortonworks Big Data & HadoopHortonworks Big Data & Hadoop
Hortonworks Big Data & Hadoop
 
Microsoft SQL Server Physical Join Operators
Microsoft SQL Server Physical Join OperatorsMicrosoft SQL Server Physical Join Operators
Microsoft SQL Server Physical Join Operators
 
Microsoft PowerPivot & Power View in Excel 2013
Microsoft PowerPivot & Power View in Excel 2013Microsoft PowerPivot & Power View in Excel 2013
Microsoft PowerPivot & Power View in Excel 2013
 
Microsoft Data Warehouse Business Intelligence Lifecycle - The Kimball Approach
Microsoft Data Warehouse Business Intelligence Lifecycle - The Kimball ApproachMicrosoft Data Warehouse Business Intelligence Lifecycle - The Kimball Approach
Microsoft Data Warehouse Business Intelligence Lifecycle - The Kimball Approach
 
Fusion-io Memory Flash for Microsoft SQL Server 2012
Fusion-io Memory Flash for Microsoft SQL Server 2012Fusion-io Memory Flash for Microsoft SQL Server 2012
Fusion-io Memory Flash for Microsoft SQL Server 2012
 
Microsoft Data Mining 2012
Microsoft Data Mining 2012Microsoft Data Mining 2012
Microsoft Data Mining 2012
 
Microsoft SQL Server PASS News August 2012
Microsoft SQL Server PASS News August 2012Microsoft SQL Server PASS News August 2012
Microsoft SQL Server PASS News August 2012
 
Business Intelligence Dashboard Design Best Practices
Business Intelligence Dashboard Design Best PracticesBusiness Intelligence Dashboard Design Best Practices
Business Intelligence Dashboard Design Best Practices
 
Microsoft Mobile Business Intelligence
Microsoft Mobile Business Intelligence Microsoft Mobile Business Intelligence
Microsoft Mobile Business Intelligence
 
Microsoft SQL Server 2012 Cloud Ready
Microsoft SQL Server 2012 Cloud ReadyMicrosoft SQL Server 2012 Cloud Ready
Microsoft SQL Server 2012 Cloud Ready
 
Microsoft SQL Server 2012 Master Data Services
Microsoft SQL Server 2012 Master Data ServicesMicrosoft SQL Server 2012 Master Data Services
Microsoft SQL Server 2012 Master Data Services
 
Microsoft SQL Server PowerPivot
Microsoft SQL Server PowerPivotMicrosoft SQL Server PowerPivot
Microsoft SQL Server PowerPivot
 
Microsoft SQL Server Testing Frameworks
Microsoft SQL Server Testing FrameworksMicrosoft SQL Server Testing Frameworks
Microsoft SQL Server Testing Frameworks
 
Microsoft SQL Server - How to Collaboratively Manage Excel Data
Microsoft SQL Server - How to Collaboratively Manage Excel DataMicrosoft SQL Server - How to Collaboratively Manage Excel Data
Microsoft SQL Server - How to Collaboratively Manage Excel Data
 
Microsoft SQL Server Flash Storage
Microsoft SQL Server Flash StorageMicrosoft SQL Server Flash Storage
Microsoft SQL Server Flash Storage
 
Microsoft Business Intelligence Performance Management Dan Bulos_2011
Microsoft Business Intelligence Performance Management Dan Bulos_2011Microsoft Business Intelligence Performance Management Dan Bulos_2011
Microsoft Business Intelligence Performance Management Dan Bulos_2011
 
Microsoft SQL Server Filtered Indexes & Sparse Columns Feb 2011
Microsoft SQL Server Filtered Indexes & Sparse Columns Feb 2011Microsoft SQL Server Filtered Indexes & Sparse Columns Feb 2011
Microsoft SQL Server Filtered Indexes & Sparse Columns Feb 2011
 
SQL Server Managing Test Data & Stress Testing January 2011
SQL Server Managing Test Data & Stress Testing January 2011SQL Server Managing Test Data & Stress Testing January 2011
SQL Server Managing Test Data & Stress Testing January 2011
 
Microsoft SQL Server Query Tuning
Microsoft SQL Server Query TuningMicrosoft SQL Server Query Tuning
Microsoft SQL Server Query Tuning
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

Big Data & Data Science at DesignMind

  • 1. ©2014 DesignMind. All Rights Reserved. Big Data and Data Science at DesignMind
  • 2. 2 ©2014 DesignMind. All Rights Reserved. DesignMind’s Expertise and Offering Power BI Applications Databases Data Warehousing Big Data BI & Data Visualization Information Sharing & Collaboration Cloud Computing Data Science
  • 3. 3 ©2014 DesignMind. All Rights Reserved. Our Clients
  • 4. 4 ©2014 DesignMind. All Rights Reserved. Agenda Big Data Data Science Big Data and Data Science at DesignMind Partners & Products The Team
  • 5. 5 ©2014 DesignMind. All Rights Reserved.
  • 6. 6 ©2014 DesignMind. All Rights Reserved. What is Big Data?  Large data sets  excessive retrieval and processing time  structured and unstructured collections BIG DATA
  • 7. 7 ©2014 DesignMind. All Rights Reserved. SQL vs. Big Data  volume velocity variety  New tools and methodologies are needed! Volume Velocity VarietySQLBIG DATA
  • 8. 8 ©2014 DesignMind. All Rights Reserved. Examples of Big Data Applications Vertical Use Case Financial Services New Accounts Risk Assessment Loans and Credit Cards Application Processing Fraud Prevention Insurance Underwriting Cross-sell and Up-sell Consideration Trading Risk Telecom Call Detail Records (CDRs) Real-Time Bandwidth Allocation Infrastructure Investment New Product Development Next Product to Buy Retail 360' View of the Customer Brand Sentiment Analysis Localized, Personalized Promotions Website Optimization Optimal Store Layout Manufacturing Supply Chain and Logistics Assembly Line Quality Assurance Proactive Maintenance Services Resources Allocation Quality Assurance Workflows Routing
  • 9. 9 ©2014 DesignMind. All Rights Reserved.
  • 10. 10 ©2014 DesignMind. All Rights Reserved. Big Data Analysis vs. Traditional Data Analysis  different tools and methodologies  same underlying process - - -  common goal 1. Collect 2. Prepare 3. Examine 4. Model 5. Decide 6. Act Decision Cycle
  • 11. 11 ©2014 DesignMind. All Rights Reserved. Why Data Science?  broader skillset Technology Business Statistics & Math IT Professionals DATA SCIENTISTS
  • 12. 12 ©2014 DesignMind. All Rights Reserved. BI vs. Data Science  Broader skillsetbroaderanalyticsspectrum SQL Analytics Descriptive Analytics Data Mining Predictive Analytics Simulation Optimization - Count - Mean - OLAP - Univariate Distribution - Central tendency - Dispersion - Association rules - Clustering - Features extraction - Classification - Regression - Forecasting - Spatial - Machine learnings - Text analytics - Monte Carlo - Agent-based modeling - Discrete event modeling - Linear optimization - Non-linear optimization
  • 13. 13 ©2014 DesignMind. All Rights Reserved.
  • 14. 14 ©2014 DesignMind. All Rights Reserved. Our Goal  to help our customers to fully exploit their big data - high-quality data - tools - services
  • 15. 15 ©2014 DesignMind. All Rights Reserved. High-Quality Data is …  T  R  A  C  E
  • 16. 16 ©2014 DesignMind. All Rights Reserved. Tools for…  D  R  E  A  M
  • 17. 17 ©2014 DesignMind. All Rights Reserved. Services to better exploit data and use tools …  O  P  T  I  C
  • 18. 18 ©2014 DesignMind. All Rights Reserved.
  • 19. 19 ©2014 DesignMind. All Rights Reserved. Cloudera is the leader in Apache Hadoop-based software and services and offers a powerful new data platform that enables enterprises and organizations to look at all their data (structured as well as unstructured) and ask bigger questions for unprecedented insight at the speed of thought. Platfora's Big Data Analytics Platform masks the complexity of Hadoop, making it easy for customers to understand all the facts in their business across events, actions, behaviors and time. Tableau makes it easy for people to rapidly transform data into smart business analytics. Windows Azure HDInsight is Microsoft’s easy to manage, agile, and open Enterprise-ready Hadoop in the cloud. Enterprises can combine both relation and non-relational data with the new PolyBase, available in SQL Server 2012 Parallel Data Warehouse. Partners
  • 20. 20 ©2014 DesignMind. All Rights Reserved. Products Pig
  • 21. 21 ©2014 DesignMind. All Rights Reserved.  big data better questions better answers  better decisions big data to convert the invisible into the visible
  • 22. 22 ©2014 DesignMind. All Rights Reserved. www.designmind.com