SlideShare uma empresa Scribd logo
1 de 26
Baixar para ler offline
Trendwise Analytics
Copyright Trendwise Analytics
Big Data/Hadoop
Introduction
Trendwise Analytics
Agenda
1.Introduction to Big Data and Hadoop
2.Big Data Business cases
3.Technology
4.Q&A - Wind up
Trendwise Analytics
What is Big Data
Three V's
• Volume
• Variety
• Velocity
Trendwise Analytics
Copyright Trendwise Analytics
Volume of Data
•
A commercial aircraft
generates 3GB of
flight sensor data in 1
hour
An ERP system for an mid
size company grows by
1-2TB annually
A Video Suveillance
Camera generates 1-
3TB data in 3 months
Airtel or Vodafone
generates 3TB of
Call Details
Records (CDR)
every day
Every day 2.5 quintillion
(2.5×10^18) bytes of data
is created
i.e., 2,500,000TB
Trendwise Analytics
Copyright Trendwise Analytics
Internet Minute ....
Trendwise Analytics
Market opportunity
IDC, a research firm, predicts that
the market for Big Data technology and services will reach $16.9 billion by 2015,
up from $3.2 billion in 2010. That is a 40 percent-a-year growth rate —
about seven times the estimated growth rate for the overall information
technology and communications business, according to IDC.
Billions and billions: big data
becomes a big deal :
Deloitte predicts that in 2012, “big
data” will likely experience
accelerating growth and market
penetration.
Trendwise Analytics
How Companies are
using Big Data?
Trendwise Analytics
Common Big Data Customer Scenarios
in your industry
Web app
optimization
Smart meter
monitoring
Equipment
monitoring
Advertising
analysis
Life sciences
research
Fraud
detection
Healthcare
outcomes
Weather
forecasting
Natural
resource
exploration
Social network
analysis
Churn
analysis
Traffic flow
optimization
IT
infrastructure
optimization
Legal
discovery
Trendwise Analytics
Copyright Trendwise Analytics
916 Apr 2013
Watson wins Jeopardy!
Feb 14th
2011 – Watson wins Jeopardy!
beating its human opponents.
Watson is IBM’s super computer built
using Big Data Technology.
Trendwise Analytics
Big data Applications
 Social media analytics ‒ ”People You May Know” at
LinkedIn
 Voice analytics ‒ Call center
 Text analytics ‒ Voice of customer,
sentiment analysis, warranty analysis
 Video analytics ‒ Intelligence, policing, retail
applications
 Telecom – customer churn
Trendwise Analytics
Big Data at GE
11
 New $1B corporate center for software and analytics
 Hiring 400 data scientists
 Includes financial and marketing applications,
but with special focus on industrial uses of big data
 When will this gas turbine need maintenance?
 How can we optimize the performance of a locomotive?
 What is the best way to make decisions about energy finance?
Trendwise Analytics
Copyright Trendwise Analytics
Ford Gets Smarter About Marketing and
Design
• Ford collects and aggregates data from the 4 million vehicles that use
in-car sensing and remote app management software
• The data allows to glean information on a range of issues, from how
drivers are using their vehicles, to the driving environment that could
help them improve the quality of the vehicle
• Partnered with Microsoft to develop SYNC
Trendwise Analytics
Copyright Trendwise Analytics
How Amazon Uses Big Data To Make You
Love Them
• Amazon has been collecting customer information for years--not just
addresses and payment information but the identity of everything that a
customer had ever bought or even looked at.
• They’re using that data to build customer relationship
Trendwise Analytics
Copyright Trendwise Analytics
How LinkedIn is Riding a Wave of Big Data
All the Way to the Bank
• LinkedIn is a trove of data not just about people, but how people are
making their money and what industries they are working in and how
they connect to each other.
Trendwise Analytics
Copyright Trendwise Analytics
How AT&T is using cell phone to watch
user movements?
• AT&T has 300 million customers
• A team of researchers is working to turn data collected through the
company’s cellular network into a trove of information for
policymakers, urban planners and traffic engineers.
• The researchers want to see how the city changes hourly by looking at
calls and text messages relayed through cell towers around the region,
noting that certain towers see more activity at different times
Trendwise Analytics
Copyright Trendwise Analytics
Govt of India
• Aadhar project by Govt. of India uses Hadoop
Trendwise Analytics
Copyright Trendwise Analytics
TECHNOLOGY
Hadoop Non Hadoop
Trendwise Analytics
Copyright Trendwise Analytics
Hadoop Components
Trendwise Analytics
Copyright Trendwise Analytics
Hadoop HDFS and MapReduce

Hadoop runs on HDFS,
Hadoop Distributed Filesystem

Any data stored is converted to blocks
and distributed across the cluster nodes
Trendwise Analytics
Copyright Trendwise Analytics
Other components
Hive
• Data Warehouse infrastructure
that provides data
summarization and ad hoc
querying on top of Hadoop
PIG
• A high-level data-flow language and
execution framework for parallel
computation
Sqoop
• Sqoop is a tool designed to
help users of large data
import existing relational
databases into their Hadoop
clusters
Zookeeper
• Zookeeper is a centralized
service for maintaining
configuration information,
naming, providing
distributed synchronization,
and providing group services
Trendwise Analytics
Benefits of Hadoop
• Hadoop is designed to run on cheap commodity
hardware
• It automatically handles data replication and node
failure
• Handles large volumes of unstructured data easily
• Last but not least – its free! ( Open source)
Trendwise Analytics
Copyright Trendwise Analytics
Commercial Hadoop Distributions
• Cloudera
• Hortonworks
• Greenplum, A Division of EMC
• IBM InfoSphere BigInsights
Trendwise Analytics
Technology – Non Hadoop
•HPCC - HPCC Systems from LexisNexis Risk Solutions offers a proven,
• open-source, data-intensive supercomputing platform
• designed for the enterprise to solve big data problems.
• SAP HANA is SAP AG’s implementation of in-memory database technology.
•NoSQL Databases
Key-Values Stores – Redis, Riak
Column Family Stores – Cassandra, HBase
Document Databases – CouchDB, MongoDB
Graph Database – InfoGrid, Infinite Graph
Trendwise Analytics
SAP HANA In-memory Database System
 Hana is an in-memory database system
developed by SAP AG.
 It takes the advantage of –
 low-cost of main memory
 Fast data access of solid state drives.
 Data processing abilities
of multi-core processors.
 It supports both row-oriented
and column-oriented data storage.
 It incorporates powerful graph and
text processing capabilities to work
with semi and full unstructured data.
 SAP has positioned HANA as its solution
to big data challenges at the low end of this
scale.
Trendwise Analytics
Copyright Trendwise Analytics
Thank You!
Trendwise Analytics
Additional resources
Hadoop:
http://hadoop.apache.org/
http://bigdatauniversity.com/
Others:
http://www.cloudera.com/content/cloudera/en/home.html
http://hortonworks.com/

Mais conteúdo relacionado

Mais procurados

Data mining with big data
Data mining with big dataData mining with big data
Data mining with big datakk1718
 
Introduction to big data
Introduction to big dataIntroduction to big data
Introduction to big dataRichard Vidgen
 
Big Data’s Big Impact on Businesses
Big Data’s Big Impact on BusinessesBig Data’s Big Impact on Businesses
Big Data’s Big Impact on BusinessesCRISIL Limited
 
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...Edureka!
 
Presentation Big Data
Presentation Big DataPresentation Big Data
Presentation Big DataRené Kuipers
 
Big data deep learning: applications and challenges
Big data deep learning: applications and challengesBig data deep learning: applications and challenges
Big data deep learning: applications and challengesfazail amin
 
Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018Leanne Hwee
 
On Big Data Analytics - opportunities and challenges
On Big Data Analytics - opportunities and challengesOn Big Data Analytics - opportunities and challenges
On Big Data Analytics - opportunities and challengesPetteri Alahuhta
 
Digital-Warriors-Marketing Roadmap with Big Data Analytics
Digital-Warriors-Marketing Roadmap with Big Data AnalyticsDigital-Warriors-Marketing Roadmap with Big Data Analytics
Digital-Warriors-Marketing Roadmap with Big Data AnalyticsJaysonBowden
 
Presentation on Big Data Analytics
Presentation on Big Data AnalyticsPresentation on Big Data Analytics
Presentation on Big Data AnalyticsS P Sajjan
 
Lesson 1 introduction to_big_data_and_hadoop.pptx
Lesson 1 introduction to_big_data_and_hadoop.pptxLesson 1 introduction to_big_data_and_hadoop.pptx
Lesson 1 introduction to_big_data_and_hadoop.pptxPankajkumar496281
 
Big Data and Classification
Big Data and ClassificationBig Data and Classification
Big Data and Classification303Computing
 
Big Data
Big DataBig Data
Big DataNGDATA
 
Big data analytics with Apache Hadoop
Big data analytics with Apache  HadoopBig data analytics with Apache  Hadoop
Big data analytics with Apache HadoopSuman Saurabh
 
Big data analytics, research report
Big data analytics, research reportBig data analytics, research report
Big data analytics, research reportJULIO GONZALEZ SANZ
 

Mais procurados (20)

Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Introduction to big data
Introduction to big dataIntroduction to big data
Introduction to big data
 
Big Data’s Big Impact on Businesses
Big Data’s Big Impact on BusinessesBig Data’s Big Impact on Businesses
Big Data’s Big Impact on Businesses
 
A Big Data Concept
A Big Data ConceptA Big Data Concept
A Big Data Concept
 
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
 
Presentation Big Data
Presentation Big DataPresentation Big Data
Presentation Big Data
 
Big data deep learning: applications and challenges
Big data deep learning: applications and challengesBig data deep learning: applications and challenges
Big data deep learning: applications and challenges
 
Big data(1st presentation)
Big data(1st presentation)Big data(1st presentation)
Big data(1st presentation)
 
Sina Sohangir Presentation on IWMC 2015
Sina Sohangir Presentation on IWMC 2015Sina Sohangir Presentation on IWMC 2015
Sina Sohangir Presentation on IWMC 2015
 
Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018
 
Big data 101
Big data 101Big data 101
Big data 101
 
On Big Data Analytics - opportunities and challenges
On Big Data Analytics - opportunities and challengesOn Big Data Analytics - opportunities and challenges
On Big Data Analytics - opportunities and challenges
 
Digital-Warriors-Marketing Roadmap with Big Data Analytics
Digital-Warriors-Marketing Roadmap with Big Data AnalyticsDigital-Warriors-Marketing Roadmap with Big Data Analytics
Digital-Warriors-Marketing Roadmap with Big Data Analytics
 
Presentation on Big Data Analytics
Presentation on Big Data AnalyticsPresentation on Big Data Analytics
Presentation on Big Data Analytics
 
Lesson 1 introduction to_big_data_and_hadoop.pptx
Lesson 1 introduction to_big_data_and_hadoop.pptxLesson 1 introduction to_big_data_and_hadoop.pptx
Lesson 1 introduction to_big_data_and_hadoop.pptx
 
Big Data and Classification
Big Data and ClassificationBig Data and Classification
Big Data and Classification
 
Exploring Big Data Analytics Tools
Exploring Big Data Analytics ToolsExploring Big Data Analytics Tools
Exploring Big Data Analytics Tools
 
Big Data
Big DataBig Data
Big Data
 
Big data analytics with Apache Hadoop
Big data analytics with Apache  HadoopBig data analytics with Apache  Hadoop
Big data analytics with Apache Hadoop
 
Big data analytics, research report
Big data analytics, research reportBig data analytics, research report
Big data analytics, research report
 

Destaque

Big data introduction - Big Data from a Consulting perspective - Sogeti
Big data introduction - Big Data from a Consulting perspective - SogetiBig data introduction - Big Data from a Consulting perspective - Sogeti
Big data introduction - Big Data from a Consulting perspective - SogetiEdzo Botjes
 
Introduction to predictive modeling v1
Introduction to predictive modeling v1Introduction to predictive modeling v1
Introduction to predictive modeling v1Venkata Reddy Konasani
 
Learning Tableau - Data, Graphs, Filters, Dashboards and Advanced features
Learning Tableau -  Data, Graphs, Filters, Dashboards and Advanced featuresLearning Tableau -  Data, Graphs, Filters, Dashboards and Advanced features
Learning Tableau - Data, Graphs, Filters, Dashboards and Advanced featuresVenkata Reddy Konasani
 
Data exploration validation and sanitization
Data exploration validation and sanitizationData exploration validation and sanitization
Data exploration validation and sanitizationVenkata Reddy Konasani
 
Data Exploration, Validation and Sanitization
Data Exploration, Validation and SanitizationData Exploration, Validation and Sanitization
Data Exploration, Validation and SanitizationVenkata Reddy Konasani
 

Destaque (20)

Big data introduction - Big Data from a Consulting perspective - Sogeti
Big data introduction - Big Data from a Consulting perspective - SogetiBig data introduction - Big Data from a Consulting perspective - Sogeti
Big data introduction - Big Data from a Consulting perspective - Sogeti
 
Machine Learning for Dummies
Machine Learning for DummiesMachine Learning for Dummies
Machine Learning for Dummies
 
Statistical Distributions
Statistical DistributionsStatistical Distributions
Statistical Distributions
 
Testing of hypothesis case study
Testing of hypothesis case study Testing of hypothesis case study
Testing of hypothesis case study
 
Testing of hypothesis
Testing of hypothesisTesting of hypothesis
Testing of hypothesis
 
Introduction to predictive modeling v1
Introduction to predictive modeling v1Introduction to predictive modeling v1
Introduction to predictive modeling v1
 
ARIMA
ARIMA ARIMA
ARIMA
 
Multiple regression
Multiple regressionMultiple regression
Multiple regression
 
Step By Step Guide to Learn R
Step By Step Guide to Learn RStep By Step Guide to Learn R
Step By Step Guide to Learn R
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
A data analyst view of Bigdata
A data analyst view of Bigdata A data analyst view of Bigdata
A data analyst view of Bigdata
 
Decision tree
Decision treeDecision tree
Decision tree
 
SAS basics Step by step learning
SAS basics Step by step learningSAS basics Step by step learning
SAS basics Step by step learning
 
Credit Risk Model Building Steps
Credit Risk Model Building StepsCredit Risk Model Building Steps
Credit Risk Model Building Steps
 
Learning Tableau - Data, Graphs, Filters, Dashboards and Advanced features
Learning Tableau -  Data, Graphs, Filters, Dashboards and Advanced featuresLearning Tableau -  Data, Graphs, Filters, Dashboards and Advanced features
Learning Tableau - Data, Graphs, Filters, Dashboards and Advanced features
 
Data exploration validation and sanitization
Data exploration validation and sanitizationData exploration validation and sanitization
Data exploration validation and sanitization
 
Data analysis Design Document
Data analysis Design DocumentData analysis Design Document
Data analysis Design Document
 
Data Exploration, Validation and Sanitization
Data Exploration, Validation and SanitizationData Exploration, Validation and Sanitization
Data Exploration, Validation and Sanitization
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Timeseries forecasting
Timeseries forecastingTimeseries forecasting
Timeseries forecasting
 

Semelhante a Big data Introduction by Mohan

Hadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and MoreHadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and MoreTrendwise Analytics
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017SingleStore
 
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTBig Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTKiththi Perera
 
Big data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardBig data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardKiththi Perera
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixNicolas Morales
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data PlatformVikas Manoria
 
Let's make money from big data!
Let's make money from big data! Let's make money from big data!
Let's make money from big data! B Spot
 
Big Data Use Cases
Big Data Use CasesBig Data Use Cases
Big Data Use Casesaziksa
 
Aziksa hadoop for buisness users2 santosh jha
Aziksa hadoop for buisness users2 santosh jhaAziksa hadoop for buisness users2 santosh jha
Aziksa hadoop for buisness users2 santosh jhaData Con LA
 
Harnessing Big Data_UCLA
Harnessing Big Data_UCLAHarnessing Big Data_UCLA
Harnessing Big Data_UCLAPaul Barsch
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationDenodo
 
How Startups can leverage big data?
How Startups can leverage big data?How Startups can leverage big data?
How Startups can leverage big data?Rackspace
 
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...Matt Stubbs
 
Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overviewStratebi
 
Hadoop 2.0: YARN to Further Optimize Data Processing
Hadoop 2.0: YARN to Further Optimize Data ProcessingHadoop 2.0: YARN to Further Optimize Data Processing
Hadoop 2.0: YARN to Further Optimize Data ProcessingHortonworks
 
Introduction to Harnessing Big Data
Introduction to Harnessing Big DataIntroduction to Harnessing Big Data
Introduction to Harnessing Big DataPaul Barsch
 

Semelhante a Big data Introduction by Mohan (20)

Hadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and MoreHadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and More
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017
 
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLTBig Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
 
Big data solutions on cloud – the way forward
Big data solutions on cloud – the way forwardBig data solutions on cloud – the way forward
Big data solutions on cloud – the way forward
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with Bluemix
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data Platform
 
Let's make money from big data!
Let's make money from big data! Let's make money from big data!
Let's make money from big data!
 
Big data
Big dataBig data
Big data
 
Big Data Use Cases
Big Data Use CasesBig Data Use Cases
Big Data Use Cases
 
Aziksa hadoop for buisness users2 santosh jha
Aziksa hadoop for buisness users2 santosh jhaAziksa hadoop for buisness users2 santosh jha
Aziksa hadoop for buisness users2 santosh jha
 
Harnessing Big Data_UCLA
Harnessing Big Data_UCLAHarnessing Big Data_UCLA
Harnessing Big Data_UCLA
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
How Startups can leverage big data?
How Startups can leverage big data?How Startups can leverage big data?
How Startups can leverage big data?
 
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
 
Big data and analytics
Big data and analyticsBig data and analytics
Big data and analytics
 
Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overview
 
Hadoop 2.0: YARN to Further Optimize Data Processing
Hadoop 2.0: YARN to Further Optimize Data ProcessingHadoop 2.0: YARN to Further Optimize Data Processing
Hadoop 2.0: YARN to Further Optimize Data Processing
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
Introduction to Harnessing Big Data
Introduction to Harnessing Big DataIntroduction to Harnessing Big Data
Introduction to Harnessing Big Data
 

Mais de Venkata Reddy Konasani

Mais de Venkata Reddy Konasani (14)

Transformers 101
Transformers 101 Transformers 101
Transformers 101
 
Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science
 
Model selection and cross validation techniques
Model selection and cross validation techniquesModel selection and cross validation techniques
Model selection and cross validation techniques
 
Neural Network Part-2
Neural Network Part-2Neural Network Part-2
Neural Network Part-2
 
GBM theory code and parameters
GBM theory code and parametersGBM theory code and parameters
GBM theory code and parameters
 
Neural Networks made easy
Neural Networks made easyNeural Networks made easy
Neural Networks made easy
 
Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS Table of Contents - Practical Business Analytics using SAS
Table of Contents - Practical Business Analytics using SAS
 
L101 predictive modeling case_study
L101 predictive modeling case_studyL101 predictive modeling case_study
L101 predictive modeling case_study
 
Online data sources for analaysis
Online data sources for analaysis Online data sources for analaysis
Online data sources for analaysis
 
R- Introduction
R- IntroductionR- Introduction
R- Introduction
 
Cluster Analysis for Dummies
Cluster Analysis for DummiesCluster Analysis for Dummies
Cluster Analysis for Dummies
 
Data Analyst - Interview Guide
Data Analyst - Interview GuideData Analyst - Interview Guide
Data Analyst - Interview Guide
 
Model building in credit card and loan approval
Model building in credit card and loan approval Model building in credit card and loan approval
Model building in credit card and loan approval
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 

Último

Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationRosabel UA
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxAshokKarra1
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...JojoEDelaCruz
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 

Último (20)

Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translation
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Karra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptxKarra SKD Conference Presentation Revised.pptx
Karra SKD Conference Presentation Revised.pptx
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 

Big data Introduction by Mohan

  • 1. Trendwise Analytics Copyright Trendwise Analytics Big Data/Hadoop Introduction
  • 2. Trendwise Analytics Agenda 1.Introduction to Big Data and Hadoop 2.Big Data Business cases 3.Technology 4.Q&A - Wind up
  • 3. Trendwise Analytics What is Big Data Three V's • Volume • Variety • Velocity
  • 4. Trendwise Analytics Copyright Trendwise Analytics Volume of Data • A commercial aircraft generates 3GB of flight sensor data in 1 hour An ERP system for an mid size company grows by 1-2TB annually A Video Suveillance Camera generates 1- 3TB data in 3 months Airtel or Vodafone generates 3TB of Call Details Records (CDR) every day Every day 2.5 quintillion (2.5×10^18) bytes of data is created i.e., 2,500,000TB
  • 5. Trendwise Analytics Copyright Trendwise Analytics Internet Minute ....
  • 6. Trendwise Analytics Market opportunity IDC, a research firm, predicts that the market for Big Data technology and services will reach $16.9 billion by 2015, up from $3.2 billion in 2010. That is a 40 percent-a-year growth rate — about seven times the estimated growth rate for the overall information technology and communications business, according to IDC. Billions and billions: big data becomes a big deal : Deloitte predicts that in 2012, “big data” will likely experience accelerating growth and market penetration.
  • 7. Trendwise Analytics How Companies are using Big Data?
  • 8. Trendwise Analytics Common Big Data Customer Scenarios in your industry Web app optimization Smart meter monitoring Equipment monitoring Advertising analysis Life sciences research Fraud detection Healthcare outcomes Weather forecasting Natural resource exploration Social network analysis Churn analysis Traffic flow optimization IT infrastructure optimization Legal discovery
  • 9. Trendwise Analytics Copyright Trendwise Analytics 916 Apr 2013 Watson wins Jeopardy! Feb 14th 2011 – Watson wins Jeopardy! beating its human opponents. Watson is IBM’s super computer built using Big Data Technology.
  • 10. Trendwise Analytics Big data Applications  Social media analytics ‒ ”People You May Know” at LinkedIn  Voice analytics ‒ Call center  Text analytics ‒ Voice of customer, sentiment analysis, warranty analysis  Video analytics ‒ Intelligence, policing, retail applications  Telecom – customer churn
  • 11. Trendwise Analytics Big Data at GE 11  New $1B corporate center for software and analytics  Hiring 400 data scientists  Includes financial and marketing applications, but with special focus on industrial uses of big data  When will this gas turbine need maintenance?  How can we optimize the performance of a locomotive?  What is the best way to make decisions about energy finance?
  • 12. Trendwise Analytics Copyright Trendwise Analytics Ford Gets Smarter About Marketing and Design • Ford collects and aggregates data from the 4 million vehicles that use in-car sensing and remote app management software • The data allows to glean information on a range of issues, from how drivers are using their vehicles, to the driving environment that could help them improve the quality of the vehicle • Partnered with Microsoft to develop SYNC
  • 13. Trendwise Analytics Copyright Trendwise Analytics How Amazon Uses Big Data To Make You Love Them • Amazon has been collecting customer information for years--not just addresses and payment information but the identity of everything that a customer had ever bought or even looked at. • They’re using that data to build customer relationship
  • 14. Trendwise Analytics Copyright Trendwise Analytics How LinkedIn is Riding a Wave of Big Data All the Way to the Bank • LinkedIn is a trove of data not just about people, but how people are making their money and what industries they are working in and how they connect to each other.
  • 15. Trendwise Analytics Copyright Trendwise Analytics How AT&T is using cell phone to watch user movements? • AT&T has 300 million customers • A team of researchers is working to turn data collected through the company’s cellular network into a trove of information for policymakers, urban planners and traffic engineers. • The researchers want to see how the city changes hourly by looking at calls and text messages relayed through cell towers around the region, noting that certain towers see more activity at different times
  • 16. Trendwise Analytics Copyright Trendwise Analytics Govt of India • Aadhar project by Govt. of India uses Hadoop
  • 17. Trendwise Analytics Copyright Trendwise Analytics TECHNOLOGY Hadoop Non Hadoop
  • 18. Trendwise Analytics Copyright Trendwise Analytics Hadoop Components
  • 19. Trendwise Analytics Copyright Trendwise Analytics Hadoop HDFS and MapReduce  Hadoop runs on HDFS, Hadoop Distributed Filesystem  Any data stored is converted to blocks and distributed across the cluster nodes
  • 20. Trendwise Analytics Copyright Trendwise Analytics Other components Hive • Data Warehouse infrastructure that provides data summarization and ad hoc querying on top of Hadoop PIG • A high-level data-flow language and execution framework for parallel computation Sqoop • Sqoop is a tool designed to help users of large data import existing relational databases into their Hadoop clusters Zookeeper • Zookeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services
  • 21. Trendwise Analytics Benefits of Hadoop • Hadoop is designed to run on cheap commodity hardware • It automatically handles data replication and node failure • Handles large volumes of unstructured data easily • Last but not least – its free! ( Open source)
  • 22. Trendwise Analytics Copyright Trendwise Analytics Commercial Hadoop Distributions • Cloudera • Hortonworks • Greenplum, A Division of EMC • IBM InfoSphere BigInsights
  • 23. Trendwise Analytics Technology – Non Hadoop •HPCC - HPCC Systems from LexisNexis Risk Solutions offers a proven, • open-source, data-intensive supercomputing platform • designed for the enterprise to solve big data problems. • SAP HANA is SAP AG’s implementation of in-memory database technology. •NoSQL Databases Key-Values Stores – Redis, Riak Column Family Stores – Cassandra, HBase Document Databases – CouchDB, MongoDB Graph Database – InfoGrid, Infinite Graph
  • 24. Trendwise Analytics SAP HANA In-memory Database System  Hana is an in-memory database system developed by SAP AG.  It takes the advantage of –  low-cost of main memory  Fast data access of solid state drives.  Data processing abilities of multi-core processors.  It supports both row-oriented and column-oriented data storage.  It incorporates powerful graph and text processing capabilities to work with semi and full unstructured data.  SAP has positioned HANA as its solution to big data challenges at the low end of this scale.