Big Data
Submitted By
J. Subha, M.Tech II Year
M.S. University, Tirunelveli.
• Introduction to Big Data
• Data Facts
• Characteristics of Big Data
• Types of Data
• Big Data Tools
• Hadoop
There is no single definition; this one is from Wikipedia:
• Big data is the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications.
• It involves various tools, techniques and frameworks.

[Figure labels: Customer, Social Media, Gaming, Entertainment, Banking, Finance, Our Known History, Purchase]
• Over 90% of all the data in the world was created in the past 2 years.
• Every 2 days we create as much information.
• The total amount of data being captured and stored by industry doubles every years.
• Every minute we send 204 million emails, generate 1.8 million Facebook likes, send 278 thousand tweets, and upload 200,000 photos to Facebook.
• Around 100 hours of video are uploaded to every minute.
• Big data (terabytes) cannot fit in the memory of a single computer.
• RDBMSs fail to handle big data.
• Processing big data on a single computer takes a lot of time.
• Big data cannot be analyzed with traditional tools.
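The single-computer limitation can be made concrete with a back-of-the-envelope estimate. The sketch below is illustrative only: the 10 TB dataset size, the 100 MB/s read throughput, the 100-machine cluster, and the helper name `scan_time_hours` are all assumptions chosen for the example, and the calculation assumes the work splits perfectly across machines.

```python
# Back-of-the-envelope estimate: time to read a dataset once from disk.
# All figures are illustrative assumptions, not measurements.

def scan_time_hours(dataset_tb: float, throughput_mb_per_s: float, machines: int = 1) -> float:
    """Hours needed to read the whole dataset once, assuming perfect parallelism."""
    dataset_mb = dataset_tb * 1_000_000  # 1 TB = 1,000,000 MB (decimal units)
    seconds = dataset_mb / (throughput_mb_per_s * machines)
    return seconds / 3600

single = scan_time_hours(10, 100)                 # one machine
cluster = scan_time_hours(10, 100, machines=100)  # hypothetical 100-machine cluster

print(f"1 machine:    {single:.1f} h")           # 27.8 h
print(f"100 machines: {cluster * 60:.1f} min")   # 16.7 min
```

Under these assumptions, a single pass over the data drops from more than a day to under twenty minutes, which is the motivation for distributing both storage and processing.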
• Characteristics of Big Data: the 5 V’s
• Volume – data quantity
• Velocity – data speed
• Variety – data types
• Veracity – data quality and accuracy
• Value – data value
• Turning Big Data into value: the latest technologies, such as distributed systems and cloud computing, together with the latest software and analysis approaches, allow us to leverage all types of data to gain insights and add value.
The Model of Generating/Consuming Data Has Changed
Old model: a few companies generate data; all others consume data.
New model: all of us generate data, and all of us consume data.
Processing Big Data
• Unstructured – video data, audio data (e.g., PDF)
• Semi-structured – many sources of big data (e.g., XML)
• Structured – most traditional data sources (e.g., tables)
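The three categories above can be illustrated with a small sketch. The sample records, field names, and variable names are made up for the example; the point is only how much structure the program can rely on in each case.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Structured: fixed columns, every row has the same fields (a table).
table_text = "id,name,amount\n1,Asha,250\n2,Ravi,120\n"
rows = list(csv.DictReader(io.StringIO(table_text)))
total = sum(int(row["amount"]) for row in rows)

# Semi-structured: tags describe the data, but records may differ in shape.
xml_text = """
<orders>
  <order id="1"><item>book</item><note>gift wrap</note></order>
  <order id="2"><item>pen</item></order>
</orders>
"""
root = ET.fromstring(xml_text)
orders_with_note = [o.get("id") for o in root.findall("order") if o.find("note") is not None]

# Unstructured: no schema at all; any structure must be inferred by the program.
review = "Great phone, great battery, poor camera"
word_counts = {}
for word in review.lower().replace(",", "").split():
    word_counts[word] = word_counts.get(word, 0) + 1

print(total)                 # 370
print(orders_with_note)      # ['1']
print(word_counts["great"])  # 2
```

The structured case can be queried directly by column, the semi-structured case needs checks for optional elements, and the unstructured case requires the program to impose its own interpretation (here, a simple word count).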
• Sensors
• CCTV cameras
• Social networks (e.g., Facebook)
• Online shopping
• Airlines
• Hospitality data, etc.
• Why Big Data is needed: increase in storage capacities, increase in processing power, and availability of data (different data types).
• Collecting
• Organizing
• Analyzing large sets of data to discover patterns or other useful information.
[Figure labels: Organizing, Analyzing, Collecting, Representation]
• Hadoop – getting huge data processed in less time
• Storing and processing huge amounts of data
• Hadoop is an open-source software framework, developed by Apache, that supports distributed processing of data.
• Initially, Java was used to write Hadoop programs, but today many other languages are used for scripting Hadoop.
• Hadoop is used to help with data analytics.
• Hadoop implements Google’s MapReduce, using HDFS.
• MapReduce divides applications into many small blocks of work.
• HDFS creates multiple replicas of data blocks for reliability, placing them on compute nodes around the cluster.
• MapReduce can then process the data where it is located.
• Hadoop’s target is to run on clusters on the order of 10,000 nodes.
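The MapReduce idea described above can be sketched in a few lines of plain Python. This is a single-process illustration of the map, shuffle, and reduce steps for a word count, not the Hadoop API; the function names and the two sample input blocks are assumptions made for the example. In a real cluster, each block would be stored (and replicated) by HDFS and mapped on the node that holds it.

```python
from collections import defaultdict

def map_phase(block):
    """Map: emit a (word, 1) pair for every word in one block of input."""
    return [(word.lower(), 1) for word in block.split()]

def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single result."""
    return key, sum(values)

# Each string stands in for one block of a larger file.
blocks = ["big data needs big tools", "hadoop processes big data"]

mapped = [pair for block in blocks for pair in map_phase(block)]
counts = dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())

print(counts["big"], counts["data"], counts["hadoop"])  # 3 2 1
```

Because each call to `map_phase` depends only on its own block, the map step can run independently on many machines; only the grouped intermediate pairs need to move across the network before the reduce step.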
• Hardware Requirements
• Quad-core processor, 64-bit
• RAM – 8 GB
• Free disk space – 20 GB
• Software Requirements
• Windows 7+, Mac OS X 10.10+, ...
• Several open-source software tools, including Apache Hadoop.
Thank you.
