Big Data
Mintu Butani (04)
Introduction
• Big Data may well be the Next Big Thing in the IT world.
• Big data burst upon the scene in the first decade of the 21st century.
• The first organizations to embrace it were online and startup firms.
Firms like Google, eBay, LinkedIn, and Facebook were built around
big data from the beginning.
• Like many new information technologies, big data can bring about
dramatic cost reductions, substantial improvements in the time
required to perform a computing task, or new product and service
offerings.
What is BIG DATA?
• ‘Big Data’ is similar to ‘small data’, but bigger in size.
• Because the data is bigger, it requires different
approaches:
– Techniques, tools and architecture
• The aim is to solve new problems, or old problems in a
better way.
• Big Data generates value from the storage and
processing of very large quantities of digital information
that cannot be analyzed with traditional computing
techniques.
©2012 DataStax
What is it?
“Big data is data that exceeds the
processing capacity of conventional
database systems. The data is too big,
moves too fast, or doesn't fit the strictures of
your database architectures. To gain value
from this data, you must choose an
alternative way to process it.”
Why Big Data
• Growth of Big Data is driven by:
– Increase of storage capacities
– Increase of processing power
– Availability of data (different data types)
– Every day we create 2.5 quintillion bytes of data; 90% of the data in
the world today has been created in the last two years alone
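To put the figures above on a more familiar scale, here is a rough back-of-the-envelope sketch. The helper name `to_decimal_unit` is made up for illustration, and the growth estimate assumes that no data is deleted and that growth is steady, which the slide does not claim.

```python
import math

SECONDS_PER_DAY = 24 * 60 * 60
BYTES_PER_DAY = 25 * 10**17  # 2.5 quintillion bytes, the figure quoted above


def to_decimal_unit(n_bytes: int, power_of_ten: int) -> float:
    """Convert a byte count to a decimal (SI) unit, e.g. TB = 10**12, EB = 10**18."""
    return n_bytes / 10**power_of_ten


daily_exabytes = to_decimal_unit(BYTES_PER_DAY, 18)
terabytes_per_second = to_decimal_unit(BYTES_PER_DAY, 12) / SECONDS_PER_DAY

# Assumption for illustration only: if 90% of all data is under two years old,
# today's total is 10x the total of two years ago, i.e. about sqrt(10)x per year.
implied_yearly_growth = math.sqrt(1 / (1 - 0.90))

print(f"{daily_exabytes:.1f} EB per day")
print(f"{terabytes_per_second:.1f} TB per second")
print(f"~{implied_yearly_growth:.2f}x growth per year")
```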
“Big data is the data characterized by 4 key
attributes: volume, variety, velocity and
Complexity.”
-- Oracle
The characteristics that define big data are:
1. Velocity – includes the speed at which data comes in, and the
number of events/elements being stored
2. Variety – involves structured, semi-structured, and unstructured data
3. Volume – can equate to TBs to PBs of data
4. Complexity – typically entails the difficulty of distributing the data
(e.g. multiple data centers, cloud, etc.) and managing the data
traffic/movement (e.g. ETL, migrations, etc.)
1 – Can it handle high data velocity?
• Data has high rate of input
• Data has large quantity of elements/events
• Sensor data
• Media streaming
• Mobile devices
• Financial streams
• Web clickstream
• Traffic monitoring
• Patient care
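One way to make "rate of input" concrete is to count events over a sliding time window. The sketch below is only an illustration: the class name `RateMeter` and its methods are hypothetical, and timestamps are passed in explicitly so the behaviour is easy to follow.

```python
from collections import deque


class RateMeter:
    """Tracks how many events arrived in the last `window_s` seconds."""

    def __init__(self, window_s: float) -> None:
        self.window_s = window_s
        self._times = deque()  # timestamps of recent events, oldest first

    def _evict(self, now: float) -> None:
        # Drop timestamps that have fallen out of the window.
        while self._times and self._times[0] <= now - self.window_s:
            self._times.popleft()

    def record(self, t: float) -> None:
        """Register one event that happened at time `t` (seconds)."""
        self._times.append(t)
        self._evict(t)

    def rate(self, now: float) -> float:
        """Average events per second over the window ending at `now`."""
        self._evict(now)
        return len(self._times) / self.window_s


meter = RateMeter(window_s=10.0)
for second in range(20):  # one simulated sensor reading per second
    meter.record(float(second))
print(meter.rate(now=19.0))
```

A real ingestion pipeline would measure this per source (sensor, clickstream, and so on) and use it to size buffers and storage.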
2 – Can it handle data variety?
• Includes structured, semi-structured, and unstructured data
• Involves real-time, analytic, and search data
• Necessitates new data models and file formats
The Structure of Big Data
Structured
• Most traditional data sources
Semi-structured
• Many sources of big data
Unstructured
• Video data, audio data
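A small sketch of how the three kinds of data look to a program. The sample values are invented for illustration; only the Python standard library is used.

```python
import csv
import io
import json

# Structured: fixed columns, every row has the same fields (e.g. a table export).
structured = "id,name,amount\n1,alice,9.50\n2,bob,12.00\n"
rows = list(csv.DictReader(io.StringIO(structured)))

# Semi-structured: self-describing records whose fields vary from one to the next.
semi_structured = [
    '{"user": "alice", "tags": ["a", "b"]}',
    '{"user": "bob", "device": {"os": "android"}}',
]
records = [json.loads(line) for line in semi_structured]
all_keys = sorted({key for record in records for key in record})

# Unstructured: raw bytes with no schema at all (a stand-in for audio or video).
unstructured = bytes([0x00, 0xFF, 0x10, 0x80])

print(rows[0]["name"], all_keys, len(unstructured))
```

The structured rows can be queried by column name right away, the semi-structured records need their fields discovered first, and the unstructured bytes need a dedicated decoder before any analysis is possible.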
3 – Can it handle large data volume?
• TBs to PBs
• Also involves data maintenance functions (e.g. purging, etc.)
4 – Can it handle complexity?
• Typically involves data distribution, movement,
etc., across multiple data centers and
geographies
• Can be on-premise, cloud, or hybrid
Turning Big Data into Value:
The ‘Datafication’ of our World:
• Activities
• Conversations
• Words
• Voice
• Social Media
• Browser logs
• Photos
• Videos
• Sensors
• Etc.
Volume, Veracity, Variety, Velocity
Analysing Big Data:
• Text analytics
• Sentiment analysis
• Face recognition
• Voice analytics
• Movement analytics
• Etc.
Value
1. Real-time – transactional, online, streaming, low
latency data
2. Analytic – aggregated data from real-time feeds or
other sources; many times batch in nature
3. Search – supporting data, both external and
internal, used for locating desired information and/or
objects (e.g. products, documents, etc.)
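The three kinds of data above can be illustrated on one tiny set of events. This is a toy sketch with made-up records, not how any particular product implements it: a latest-value lookup stands in for real-time access, a batch aggregate for analytics, and an inverted index for search.

```python
from collections import defaultdict

events = [
    {"id": 1, "user": "alice", "amount": 20, "note": "red shoes"},
    {"id": 2, "user": "bob", "amount": 5, "note": "blue hat"},
    {"id": 3, "user": "alice", "amount": 7, "note": "red hat"},
]

# Real-time: keep only the most recent event per user for low-latency lookups.
latest = {}
for event in events:
    latest[event["user"]] = event

# Analytic: aggregate over all events, typically done as a batch job.
totals = defaultdict(int)
for event in events:
    totals[event["user"]] += event["amount"]

# Search: an inverted index from each word to the ids of events containing it.
index = defaultdict(set)
for event in events:
    for word in event["note"].split():
        index[word].add(event["id"])

print(latest["alice"]["id"], totals["alice"], sorted(index["red"]))
```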
How Is Big Data Different?
1) Automatically generated by a machine
(e.g. Sensor embedded in an engine)
2) Typically an entirely new source of data
(e.g. Use of the internet)
3) Not designed to be friendly
(e.g. Text streams)
4) May not have much value
• Need to focus on the important part
Big Data sources
Users
Application
Systems
Sensors
Large and growing files
(Big data files)
Data generation points Examples
Mobile Devices
Microphones
Readers/Scanners
Science facilities
Programs/ Software
Social Media
Cameras
Big Data Analytics
• Examining large amounts of data
• Appropriate information
• Identification of hidden patterns, unknown correlations
• Competitive advantage
• Better business decisions: strategic and operational
• Effective marketing, customer satisfaction, increased
revenue
Application Of Big Data analytics
• Homeland Security
• Smarter Healthcare
• Multi-channel sales
• Telecom
• Manufacturing
• Traffic Control
• Trading Analytics
• Search Quality
Benefits of Big Data
• Our newest research finds that organizations are using big data
to target customer-centric outcomes, tap into internal data and
build a better information ecosystem.
• Big Data is an important part of the $64 billion database and
data analytics market.
• It offers commercial opportunities of a comparable scale
to enterprise software in the late 1980s, the Internet boom
of the 1990s, and the social media explosion of today.
Risks of Big Data
• Risk of being overwhelmed by the data
– Need the right people, solving the right problems
• Costs escalate too fast
– It isn’t necessary to capture 100% of the data
• Many sources of big data raise privacy concerns
– Self-regulation
– Legal regulation
Leading Technology Vendors
Example Vendors
• IBM – Netezza
• EMC – Greenplum
• Oracle – Exadata
Commonality
• MPP architectures
• Commodity Hardware
• RDBMS based
• Full SQL compliance
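The MPP (massively parallel processing) idea shared by these systems can be sketched as scatter-gather: rows are spread across nodes, each node aggregates its own share, and the partial results are merged. The code below is only an illustration of that pattern, not any vendor's implementation; the function names are hypothetical and threads stand in for separate machines.

```python
import zlib
from concurrent.futures import ThreadPoolExecutor


def partition(rows, n_nodes):
    """Scatter rows across nodes using a deterministic hash of the row id."""
    parts = [[] for _ in range(n_nodes)]
    for row in rows:
        node = zlib.crc32(str(row["id"]).encode()) % n_nodes
        parts[node].append(row)
    return parts


def local_sum(part):
    """Work done independently on one node: sum sales per region."""
    totals = {}
    for row in part:
        totals[row["region"]] = totals.get(row["region"], 0) + row["sales"]
    return totals


def mpp_sum(rows, n_nodes=3):
    """Roughly: SELECT region, SUM(sales) FROM rows GROUP BY region."""
    parts = partition(rows, n_nodes)
    with ThreadPoolExecutor(max_workers=n_nodes) as pool:
        partials = list(pool.map(local_sum, parts))
    merged = {}
    for partial in partials:  # gather step: combine per-node results
        for region, total in partial.items():
            merged[region] = merged.get(region, 0) + total
    return merged


sales = [
    {"id": 1, "region": "east", "sales": 10},
    {"id": 2, "region": "west", "sales": 12},
    {"id": 3, "region": "east", "sales": 20},
]
print(mpp_sum(sales))
```

Because each node only touches its own partition, adding nodes lets the same query run over more data, which is why commodity hardware is enough.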
Thank You…
