SlideShare uma empresa Scribd logo
1 de 30
Baixar para ler offline
Trendwise Analytics 
Introduction to 
Big Data 
S.Mohan Kumar 
Copyright Trendwise Analytics
Trendwise Analytics 
Big Data – Hype or Reality? 
Gartner's Hype Cycles for Emerging Technologies, 2011-2013 
Copyright Trendwise Analytics
Trendwise Analytics 
Market opportunity 
Source: wikibon 2014
Trendwise Analytics 
Market opportunity 
Source: wikibon 2014
Trendwise Analytics 
InformationWeek 2013 Analytic 
Copyright Trendwise Analytics 
InformationWeek 2013 Analytic
Trendwise Analytics 
Data Growth? 
Source: The Digital Universe in 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East (IDC & EMC, December 2012) 
Copyright Trendwise Analytics
Trendwise Analytics 
What is Big Data?
Trendwise Analytics 
Internet Minute .... 
Copyright Trendwise Analytics
Trendwise Analytics 
Other sources 
Copyright Trendwise Analytics 
· A commercial aircraft 
generates 3GB of 
flight sensor data in 1 
hour 
An ERP system for an mid 
size company grows by 
1-2TB annually 
A Video Suveillance 
Camera generates 1- 
3TB data in 3 months 
Airtel or Vodafone 
generates 3TB of 
Call Details 
Records (CDR) 
every day 
Every day 2.5 quintillion 
(2.5×10^18) bytes of data 
is created 
i.e., 2,500,000TB
Trendwise Analytics 
How are Organizations 
using Big Data 
Technology?
Trendwise Analytics 
Watson wins Jeopardy! 
Feb 14th 2011 – Watson wins Jeopardy! 
beating its human opponents. 
Watson is IBM’s super computer built 
using Big Data Technology. 
Copyright Trendwise Analytics 9 Dec 2013 11
Trendwise Analytics 
Across All Industries 
Web app 
optimization 
Smart meter 
Equipment monitoring 
monitoring 
Advertising 
analysis 
Life sciences 
research 
Fraud 
detection 
Healthcare 
outcomes 
Weather 
forecasting 
Natural 
resource 
exploration 
Social 
network 
analysis 
Churn 
analysis 
Traffic flow 
optimization 
IT 
infrastructure 
optimization 
Legal 
discovery
Trendwise Analytics 
Real Life Cases 
Other Examples: 
i. UPS implemented Big Data to track data on 16.3 million 
packages per day for 8.8 million customers, with an 
average of 39.5 million tracking requests from customers 
per day. The company stores over 16 petabytes of data. 
ii. Caesars Entertainment uses Big Data to analyze customer 
data from its Total Rewards loyalty program,web clickstreams, 
and from real-time play in slot machines for its Marketing 
and Service. They plan to implement video analytics to increase 
customer satisfaction. 
Source: http://www.csc.com/big_data/success_stories 
Copyright Trendwise Analytics
Trendwise Analytics 
Some more... 
Copyright Trendwise Analytics
Trendwise Analytics 
TECHNOLOGY 
Copyright Trendwise Analytics
Trendwise Analytics 
Hadoop 
NoSQL 
SAP Hana 
Copyright Trendwise Analytics
Trendwise Analytics 
What is Hadoop? 
Open source project started by Doug Cutting 
A platform to manage Big Data 
Helps in Distributed computing 
Runs on Commodity Hardware
Trendwise Analytics 
Core components of Hadoop 
 Data storage (HDFS) 
Runs on commodity hardware 
Horizontally scalable 
 Processing (MapReduce) 
Parallelized (scalable) processing 
Fault Tolerant 
 Other Tools / Frameworks 
HBase, Hive, Pig, Mahout
Trendwise Analytics 
Hadoop Components 
Copyright Trendwise Analytics
Trendwise Analytics 
Commercial Hadoop Distributions 
• Cloudera 
• Hortonworks 
• Greenplum, A Division of EMC 
• IBM InfoSphere BigInsights 
Copyright Trendwise Analytics
Trendwise Analytics 
Where to start? 
1. http://hadoop.apache.org/ 
2. Pre-requisite: 
- Linux ( Preferred OS) 
- Java JDK 
3. Install and run a single node cluster – pseudo mode
Trendwise Analytics 
How it looks?
Trendwise Analytics 
Web Interface – NameNode Tracker
Trendwise Analytics 
Hadoop 
NoSQL 
SAP Hana 
Copyright Trendwise Analytics
Trendwise Analytics 
What is NoSQL 
It does not mean No SQL... 
But it means Not only SQL 
- It is a schema less database 
- Or NonRelational Database 
Copyright Trendwise Analytics
Trendwise Analytics 
Why NoSQL? 
Source: www.Benstopford.com 
Copyright Trendwise Analytics
Trendwise Analytics 
Advantages of NoSQL 
• Flexible (schema-less) 
• Highly scalable 
– Horizontal scalability 
Copyright Trendwise Analytics 
• Cheap 
– Most of them are open source
Trendwise Analytics 
Types of NoSQL 
• Key-Value Store (persistent & Volatile) 
• Column store 
• Document database 
• Graph database 
Copyright Trendwise Analytics
Trendwise Analytics 
Examples 
Copyright Trendwise Analytics 
Key Value 
Store 
Column 
Family 
Document Graph 
Riak Cassandra MongoDB Neo4j 
Redis HBase CouchDB
Trendwise Analytics 
Contact us 
Website: www.TrendwiseAnalytics.com 
Email: info@TrendwiseAnalytics.com 
US Tollfree: +1 877 268 2872 
India Number: +91 80 4094 9600 
Copyright Trendwise Analytics

Mais conteúdo relacionado

Mais procurados

Big data landscape map collection by aibdp
Big data landscape map collection by aibdpBig data landscape map collection by aibdp
Big data landscape map collection by aibdpAIBDP
 
Ensuring compliance of patient data with big data
Ensuring compliance of patient data with big dataEnsuring compliance of patient data with big data
Ensuring compliance of patient data with big dataAyad Shammout
 
From Data to Insights to Action: When Transactions and Analytics Converge
From Data to Insights to Action: When Transactions and Analytics ConvergeFrom Data to Insights to Action: When Transactions and Analytics Converge
From Data to Insights to Action: When Transactions and Analytics ConvergeAli Hodroj
 
Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop iACT Global
 
Real time analytics of big data
Real time analytics of big dataReal time analytics of big data
Real time analytics of big dataDeependra Jyoti
 
Momentum in Big Data, IoT and Machine Intelligence
Momentum in Big Data, IoT and Machine IntelligenceMomentum in Big Data, IoT and Machine Intelligence
Momentum in Big Data, IoT and Machine IntelligenceShamshad Ansari
 
Geo-Analytics with Apache Spark and In-Memory Data Grids
Geo-Analytics with Apache Spark and In-Memory Data GridsGeo-Analytics with Apache Spark and In-Memory Data Grids
Geo-Analytics with Apache Spark and In-Memory Data GridsAli Hodroj
 
A chart of the big data ecosystem
A chart of the big data ecosystemA chart of the big data ecosystem
A chart of the big data ecosystemMatt Turck
 
Real-time Analytics in Financial
Real-time Analytics in FinancialReal-time Analytics in Financial
Real-time Analytics in FinancialYifeng Jiang
 
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricUsing a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricCambridge Semantics
 
Big data landscape version 2.0
Big data landscape version 2.0Big data landscape version 2.0
Big data landscape version 2.0Matt Turck
 
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient..."Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...Dataconomy Media
 
Automating Splunk at Large Scale with Cloudify
Automating Splunk at Large Scale with CloudifyAutomating Splunk at Large Scale with Cloudify
Automating Splunk at Large Scale with CloudifyCloudify Community
 
Data Science in the Enterprise
Data Science in the EnterpriseData Science in the Enterprise
Data Science in the EnterpriseThe Hive
 
13 2792 big-data_keynote_presentation_finalpass_05_d_v02
13 2792 big-data_keynote_presentation_finalpass_05_d_v0213 2792 big-data_keynote_presentation_finalpass_05_d_v02
13 2792 big-data_keynote_presentation_finalpass_05_d_v02Erin Kerrigan
 
High Performance Computing and Big Data: The coming wave
High Performance Computing and Big Data: The coming waveHigh Performance Computing and Big Data: The coming wave
High Performance Computing and Big Data: The coming waveIntel IT Center
 
CASE 1 : Big Data Big Reward
CASE 1 : Big Data Big RewardCASE 1 : Big Data Big Reward
CASE 1 : Big Data Big RewardAya Wan Idris
 
The Hive Think Tank: AI in The Enterprise by Venkat Srinivasan
The Hive Think Tank: AI in The Enterprise by Venkat SrinivasanThe Hive Think Tank: AI in The Enterprise by Venkat Srinivasan
The Hive Think Tank: AI in The Enterprise by Venkat SrinivasanThe Hive
 

Mais procurados (20)

Big data landscape map collection by aibdp
Big data landscape map collection by aibdpBig data landscape map collection by aibdp
Big data landscape map collection by aibdp
 
Michael Hummel - Stop Storing Data! - Parstream
Michael Hummel - Stop Storing Data! - ParstreamMichael Hummel - Stop Storing Data! - Parstream
Michael Hummel - Stop Storing Data! - Parstream
 
Ensuring compliance of patient data with big data
Ensuring compliance of patient data with big dataEnsuring compliance of patient data with big data
Ensuring compliance of patient data with big data
 
From Data to Insights to Action: When Transactions and Analytics Converge
From Data to Insights to Action: When Transactions and Analytics ConvergeFrom Data to Insights to Action: When Transactions and Analytics Converge
From Data to Insights to Action: When Transactions and Analytics Converge
 
Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop Introduction to Big Data & Hadoop
Introduction to Big Data & Hadoop
 
Real time analytics of big data
Real time analytics of big dataReal time analytics of big data
Real time analytics of big data
 
Momentum in Big Data, IoT and Machine Intelligence
Momentum in Big Data, IoT and Machine IntelligenceMomentum in Big Data, IoT and Machine Intelligence
Momentum in Big Data, IoT and Machine Intelligence
 
Geo-Analytics with Apache Spark and In-Memory Data Grids
Geo-Analytics with Apache Spark and In-Memory Data GridsGeo-Analytics with Apache Spark and In-Memory Data Grids
Geo-Analytics with Apache Spark and In-Memory Data Grids
 
A chart of the big data ecosystem
A chart of the big data ecosystemA chart of the big data ecosystem
A chart of the big data ecosystem
 
IoT Big Data Analytics Insights from Patents
IoT Big Data Analytics Insights from PatentsIoT Big Data Analytics Insights from Patents
IoT Big Data Analytics Insights from Patents
 
Real-time Analytics in Financial
Real-time Analytics in FinancialReal-time Analytics in Financial
Real-time Analytics in Financial
 
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data FabricUsing a Semantic and Graph-based Data Catalog in a Modern Data Fabric
Using a Semantic and Graph-based Data Catalog in a Modern Data Fabric
 
Big data landscape version 2.0
Big data landscape version 2.0Big data landscape version 2.0
Big data landscape version 2.0
 
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient..."Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
"Hadoop: What we've learned in 5 years", Martin Oberhuber, Senior Data Scient...
 
Automating Splunk at Large Scale with Cloudify
Automating Splunk at Large Scale with CloudifyAutomating Splunk at Large Scale with Cloudify
Automating Splunk at Large Scale with Cloudify
 
Data Science in the Enterprise
Data Science in the EnterpriseData Science in the Enterprise
Data Science in the Enterprise
 
13 2792 big-data_keynote_presentation_finalpass_05_d_v02
13 2792 big-data_keynote_presentation_finalpass_05_d_v0213 2792 big-data_keynote_presentation_finalpass_05_d_v02
13 2792 big-data_keynote_presentation_finalpass_05_d_v02
 
High Performance Computing and Big Data: The coming wave
High Performance Computing and Big Data: The coming waveHigh Performance Computing and Big Data: The coming wave
High Performance Computing and Big Data: The coming wave
 
CASE 1 : Big Data Big Reward
CASE 1 : Big Data Big RewardCASE 1 : Big Data Big Reward
CASE 1 : Big Data Big Reward
 
The Hive Think Tank: AI in The Enterprise by Venkat Srinivasan
The Hive Think Tank: AI in The Enterprise by Venkat SrinivasanThe Hive Think Tank: AI in The Enterprise by Venkat Srinivasan
The Hive Think Tank: AI in The Enterprise by Venkat Srinivasan
 

Destaque

NoSQL & Big Data Analytics: History, Hype, Opportunities
NoSQL & Big Data Analytics: History, Hype, OpportunitiesNoSQL & Big Data Analytics: History, Hype, Opportunities
NoSQL & Big Data Analytics: History, Hype, OpportunitiesVishy Poosala
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementTony Bain
 
Marketingcamp2015 - B&O User Engagement i Analytics - Rasmus Himmelstrup
Marketingcamp2015 - B&O User Engagement i Analytics - Rasmus HimmelstrupMarketingcamp2015 - B&O User Engagement i Analytics - Rasmus Himmelstrup
Marketingcamp2015 - B&O User Engagement i Analytics - Rasmus HimmelstrupRasmus Gi
 
Big Data with Not Only SQL
Big Data with Not Only SQLBig Data with Not Only SQL
Big Data with Not Only SQLPhilippe Julio
 
Advanced Analytics and Big Data (August 2014)
Advanced Analytics and Big Data (August 2014)Advanced Analytics and Big Data (August 2014)
Advanced Analytics and Big Data (August 2014)Thomas W. Dinsmore
 
SQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data ArchitectureSQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data ArchitectureVenu Anuganti
 
Sql vs NoSQL
Sql vs NoSQLSql vs NoSQL
Sql vs NoSQLRTigger
 
SAP HANA Cloud Platform - Overview
SAP HANA Cloud Platform - OverviewSAP HANA Cloud Platform - Overview
SAP HANA Cloud Platform - OverviewMatthias Steiner
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with HadoopPhilippe Julio
 

Destaque (12)

NoSQL & Big Data Analytics: History, Hype, Opportunities
NoSQL & Big Data Analytics: History, Hype, OpportunitiesNoSQL & Big Data Analytics: History, Hype, Opportunities
NoSQL & Big Data Analytics: History, Hype, Opportunities
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data Management
 
Marketingcamp2015 - B&O User Engagement i Analytics - Rasmus Himmelstrup
Marketingcamp2015 - B&O User Engagement i Analytics - Rasmus HimmelstrupMarketingcamp2015 - B&O User Engagement i Analytics - Rasmus Himmelstrup
Marketingcamp2015 - B&O User Engagement i Analytics - Rasmus Himmelstrup
 
Big Data with Not Only SQL
Big Data with Not Only SQLBig Data with Not Only SQL
Big Data with Not Only SQL
 
Advanced Analytics and Big Data (August 2014)
Advanced Analytics and Big Data (August 2014)Advanced Analytics and Big Data (August 2014)
Advanced Analytics and Big Data (August 2014)
 
SQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data ArchitectureSQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data Architecture
 
Sql vs NoSQL
Sql vs NoSQLSql vs NoSQL
Sql vs NoSQL
 
SAP HANA Cloud Platform - Overview
SAP HANA Cloud Platform - OverviewSAP HANA Cloud Platform - Overview
SAP HANA Cloud Platform - Overview
 
Liquid & Linked marketing
Liquid & Linked marketingLiquid & Linked marketing
Liquid & Linked marketing
 
Big Data and Advanced Analytics
Big Data and Advanced AnalyticsBig Data and Advanced Analytics
Big Data and Advanced Analytics
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 

Semelhante a Introduction to Big Data

Hadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and MoreHadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and MoreTrendwise Analytics
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyRohit Dubey
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptxElsonPaul2
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data PlatformVikas Manoria
 
Big Data Analytics for Real Time Systems
Big Data Analytics for Real Time SystemsBig Data Analytics for Real Time Systems
Big Data Analytics for Real Time SystemsKamalika Dutta
 
Big Data - A Real Life Revolution
Big Data - A Real Life RevolutionBig Data - A Real Life Revolution
Big Data - A Real Life RevolutionCapgemini
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Introduction Big Data
Introduction Big DataIntroduction Big Data
Introduction Big DataFrank Kienle
 
Information Security Analytics
Information Security AnalyticsInformation Security Analytics
Information Security AnalyticsAmrit Chhetri
 
Hitachi Data Systems Big Data Roadmap
Hitachi Data Systems Big Data RoadmapHitachi Data Systems Big Data Roadmap
Hitachi Data Systems Big Data RoadmapHitachi Vantara
 
Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantageAmazon Web Services
 
Capturing big value in big data
Capturing big value in big data Capturing big value in big data
Capturing big value in big data BSP Media Group
 
How Startups can leverage big data?
How Startups can leverage big data?How Startups can leverage big data?
How Startups can leverage big data?Rackspace
 
Harness the Power of Big Data with Oracle
Harness the Power of Big Data with OracleHarness the Power of Big Data with Oracle
Harness the Power of Big Data with OracleSai Janakiram Penumuru
 

Semelhante a Introduction to Big Data (20)

Big data Introduction by Mohan
Big data Introduction by MohanBig data Introduction by Mohan
Big data Introduction by Mohan
 
Hadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and MoreHadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and More
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptx
 
Big data
Big dataBig data
Big data
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data Platform
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Big Data Analytics for Real Time Systems
Big Data Analytics for Real Time SystemsBig Data Analytics for Real Time Systems
Big Data Analytics for Real Time Systems
 
Big Data - A Real Life Revolution
Big Data - A Real Life RevolutionBig Data - A Real Life Revolution
Big Data - A Real Life Revolution
 
Bigdata
BigdataBigdata
Bigdata
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Introduction Big Data
Introduction Big DataIntroduction Big Data
Introduction Big Data
 
Information Security Analytics
Information Security AnalyticsInformation Security Analytics
Information Security Analytics
 
Hitachi Data Systems Big Data Roadmap
Hitachi Data Systems Big Data RoadmapHitachi Data Systems Big Data Roadmap
Hitachi Data Systems Big Data Roadmap
 
Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantage
 
Capturing big value in big data
Capturing big value in big data Capturing big value in big data
Capturing big value in big data
 
How Startups can leverage big data?
How Startups can leverage big data?How Startups can leverage big data?
How Startups can leverage big data?
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
 
Harness the Power of Big Data with Oracle
Harness the Power of Big Data with OracleHarness the Power of Big Data with Oracle
Harness the Power of Big Data with Oracle
 

Último

🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 

Último (20)

🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

Introduction to Big Data

  • 1. Trendwise Analytics Introduction to Big Data S.Mohan Kumar Copyright Trendwise Analytics
  • 2. Trendwise Analytics Big Data – Hype or Reality? Gartner's Hype Cycles for Emerging Technologies, 2011-2013 Copyright Trendwise Analytics
  • 3. Trendwise Analytics Market opportunity Source: wikibon 2014
  • 4. Trendwise Analytics Market opportunity Source: wikibon 2014
  • 5. Trendwise Analytics InformationWeek 2013 Analytic Copyright Trendwise Analytics InformationWeek 2013 Analytic
  • 6. Trendwise Analytics Data Growth? Source: The Digital Universe in 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East (IDC & EMC, December 2012) Copyright Trendwise Analytics
  • 8. Trendwise Analytics Internet Minute .... Copyright Trendwise Analytics
  • 9. Trendwise Analytics Other sources Copyright Trendwise Analytics · A commercial aircraft generates 3GB of flight sensor data in 1 hour An ERP system for an mid size company grows by 1-2TB annually A Video Suveillance Camera generates 1- 3TB data in 3 months Airtel or Vodafone generates 3TB of Call Details Records (CDR) every day Every day 2.5 quintillion (2.5×10^18) bytes of data is created i.e., 2,500,000TB
  • 10. Trendwise Analytics How are Organizations using Big Data Technology?
  • 11. Trendwise Analytics Watson wins Jeopardy! Feb 14th 2011 – Watson wins Jeopardy! beating its human opponents. Watson is IBM’s super computer built using Big Data Technology. Copyright Trendwise Analytics 9 Dec 2013 11
  • 12. Trendwise Analytics Across All Industries Web app optimization Smart meter Equipment monitoring monitoring Advertising analysis Life sciences research Fraud detection Healthcare outcomes Weather forecasting Natural resource exploration Social network analysis Churn analysis Traffic flow optimization IT infrastructure optimization Legal discovery
  • 13. Trendwise Analytics Real Life Cases Other Examples: i. UPS implemented Big Data to track data on 16.3 million packages per day for 8.8 million customers, with an average of 39.5 million tracking requests from customers per day. The company stores over 16 petabytes of data. ii. Caesars Entertainment uses Big Data to analyze customer data from its Total Rewards loyalty program,web clickstreams, and from real-time play in slot machines for its Marketing and Service. They plan to implement video analytics to increase customer satisfaction. Source: http://www.csc.com/big_data/success_stories Copyright Trendwise Analytics
  • 14. Trendwise Analytics Some more... Copyright Trendwise Analytics
  • 15. Trendwise Analytics TECHNOLOGY Copyright Trendwise Analytics
  • 16. Trendwise Analytics Hadoop NoSQL SAP Hana Copyright Trendwise Analytics
  • 17. Trendwise Analytics What is Hadoop? Open source project started by Doug Cutting A platform to manage Big Data Helps in Distributed computing Runs on Commodity Hardware
  • 18. Trendwise Analytics Core components of Hadoop  Data storage (HDFS) Runs on commodity hardware Horizontally scalable  Processing (MapReduce) Parallelized (scalable) processing Fault Tolerant  Other Tools / Frameworks HBase, Hive, Pig, Mahout
  • 19. Trendwise Analytics Hadoop Components Copyright Trendwise Analytics
  • 20. Trendwise Analytics Commercial Hadoop Distributions • Cloudera • Hortonworks • Greenplum, A Division of EMC • IBM InfoSphere BigInsights Copyright Trendwise Analytics
  • 21. Trendwise Analytics Where to start? 1. http://hadoop.apache.org/ 2. Pre-requisite: - Linux ( Preferred OS) - Java JDK 3. Install and run a single node cluster – pseudo mode
  • 23. Trendwise Analytics Web Interface – NameNode Tracker
  • 24. Trendwise Analytics Hadoop NoSQL SAP Hana Copyright Trendwise Analytics
  • 25. Trendwise Analytics What is NoSQL It does not mean No SQL... But it means Not only SQL - It is a schema less database - Or NonRelational Database Copyright Trendwise Analytics
  • 26. Trendwise Analytics Why NoSQL? Source: www.Benstopford.com Copyright Trendwise Analytics
  • 27. Trendwise Analytics Advantages of NoSQL • Flexible (schema-less) • Highly scalable – Horizontal scalability Copyright Trendwise Analytics • Cheap – Most of them are open source
  • 28. Trendwise Analytics Types of NoSQL • Key-Value Store (persistent & Volatile) • Column store • Document database • Graph database Copyright Trendwise Analytics
  • 29. Trendwise Analytics Examples Copyright Trendwise Analytics Key Value Store Column Family Document Graph Riak Cassandra MongoDB Neo4j Redis HBase CouchDB
  • 30. Trendwise Analytics Contact us Website: www.TrendwiseAnalytics.com Email: info@TrendwiseAnalytics.com US Tollfree: +1 877 268 2872 India Number: +91 80 4094 9600 Copyright Trendwise Analytics