BIG DATA
By Sakshi Chawla
What is BIG DATA?
• The Oxford English Dictionary (OED) defines big data as:
“data of a very large size, typically to the extent that its
manipulation and management present significant
logistical challenges.”
• Big data is an evolving term that describes any
voluminous amount of structured, semi-structured and
unstructured data that has the potential to be mined for
information. Although big data doesn't refer to any
specific quantity, the term is often used when speaking
about petabytes and exabytes of data.
• Big Data generates value from the storage and processing of very large quantities of
digital information that cannot be analyzed with traditional computing techniques.
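
To give a feel for why petabyte-scale data defeats a single machine, here is a minimal arithmetic sketch. The 200 MB/s disk throughput and the helper name scan_time_days are assumptions chosen for illustration, not figures from the slides.

```python
# Decimal (SI) storage units, expressed in bytes.
UNITS = {
    "TB": 10**12,
    "PB": 10**15,
    "EB": 10**18,
}


def scan_time_days(size_bytes, throughput_bytes_per_s):
    """Days needed to read size_bytes once at a sustained throughput."""
    return size_bytes / throughput_bytes_per_s / 86_400


# Assumed figure for illustration only: one disk sustaining 200 MB/s.
SINGLE_DISK = 200 * 10**6

# Reading one petabyte from a single disk versus 1,000 disks in parallel.
one_disk_days = scan_time_days(UNITS["PB"], SINGLE_DISK)
thousand_disk_days = scan_time_days(UNITS["PB"], 1000 * SINGLE_DISK)

print(f"1 disk:      {one_disk_days:.1f} days")
print(f"1,000 disks: {thousand_disk_days * 24:.2f} hours")
```

Under these assumptions a single pass over one petabyte takes roughly two months on one disk and under two hours when the read is spread over a thousand disks, which is the motivation for the distributed approaches described later.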
Characteristics of Big Data: The 5 Vs
Volume:
• Volume is the size of the data. It determines the
value and potential of the data under
consideration, and whether it can be considered
Big Data at all. The name ‘Big Data’ itself refers
to size, hence the characteristic.
• Big data implies enormous volumes of data.
• Now that data is generated by machines,
networks and human interaction on systems like
social media, the volume of data to be analyzed is
massive.
Velocity:
• In this context, ‘velocity’ refers to the speed
at which data is generated and how quickly it
must be processed to meet the demands and
challenges of growth and development.
• Big Data Velocity deals with the pace at which
data flows in from sources like business
processes, machines, networks and human
interaction with things like social media sites,
mobile devices, etc.
Variety:
• The next aspect of Big Data is its variety.
The category to which the data belongs
(structured, semi-structured or unstructured)
is an essential fact that data analysts need
to know.
• Knowing the category helps the people who
closely analyze the data to use it effectively
to their advantage, which upholds the
importance of Big Data.
Veracity:
• Big Data veracity refers to the biases, noise and
abnormality in data. Is the data that is being
stored and mined meaningful to the problem
being analyzed?
• The quality of the data being captured can vary
greatly. Accuracy of analysis depends on the
veracity of the source data.
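
A minimal sketch of a veracity check on one field follows. The function name quality_report, the sample readings and the cut-off of 5 median absolute deviations are hypothetical choices for illustration; a real pipeline would tune them to the data.

```python
from statistics import median


def quality_report(values):
    """Summarise missing entries and outliers in a list of readings.

    Outliers are flagged with the median absolute deviation (MAD), which is
    less distorted by extreme values than a mean/standard-deviation rule.
    """
    present = [v for v in values if v is not None]
    missing = len(values) - len(present)
    med = median(present)
    mad = median(abs(v - med) for v in present)
    # Hypothetical cut-off: more than 5 MADs from the median counts as noise.
    outliers = [v for v in present if mad and abs(v - med) > 5 * mad]
    return {"missing": missing, "outliers": outliers, "median": med}


# Hypothetical sensor readings with two gaps and one implausible spike.
readings = [20.1, 19.8, None, 20.4, 250.0, 20.0, None, 19.9]
report = quality_report(readings)
print(report)
```

Counting the gaps and flagging the spike before analysis gives a rough measure of how far the source data can be trusted.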
Variability:
• Variability refers to the inconsistency that data can show at times. It can be a
problem for those who analyse the data, because it hampers their ability to handle
and manage the data effectively.
Storage And Architecture:
• Recent studies show that a multiple-layer architecture is one option for
dealing with big data. A distributed parallel architecture spreads data across
multiple processing units, which work in parallel and so deliver results much
faster.
• This type of architecture inserts data into a parallel DBMS, which makes use of
the MapReduce and Hadoop frameworks. Such a framework aims to make the
processing power transparent to the end user by using a front-end application
server.
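
The MapReduce model mentioned above can be sketched in a few lines. This is a single-process illustration of the map, shuffle and reduce phases using word counting as the task; it does not use Hadoop's API, and the function names are chosen here for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    # On a real cluster each split would be mapped on a different node;
    # here the splits are processed one after another in a single process.
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    return dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())


splits = ["big data needs big storage", "data flows fast"]
counts = word_count(splits)
print(counts)
```

Because each split is mapped independently and each key is reduced independently, both phases can be spread across many machines, which is what a framework such as Hadoop automates.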
Hadoop
• Hadoop is an open-source software framework, written in Java, for distributed
storage and distributed processing of very large data sets (Big Data) on computer
clusters built from commodity hardware.
• It is designed to scale from a single server to thousands of machines, with a very
high degree of fault tolerance.
• Hadoop changes the economics and the dynamics of large-scale computing.
Applications:
Smart Healthcare
Homeland Security
Traffic control
Manufacturing
Multi-channel sales
Telecom
Trading analytics
Search quality
Government:
The use and adoption of Big Data within governmental processes is beneficial and
enables efficiencies in terms of cost, productivity and innovation. That said, this
process does not come without its flaws. Data analysis often requires multiple parts
of government (central and local) to work in collaboration and create new and
innovative processes to deliver the desired outcome. Below is a leading example
from the governmental Big Data space.
India:
• Big data analysis was, in part, responsible for the success of the BJP and its
allies in the 2014 Indian General Election.
• The Indian Government utilises numerous techniques to ascertain how the Indian
electorate is responding to government action, and to gather ideas for policy
augmentation.
Risks of Big Data:
#1: Loss of agility
In a typical large-scale organization, data is housed on multiple platforms.
There is transactional data, email data, analytics data, etc. Management wants
people to be able to locate, analyze, and make decisions based on this data
quickly. But if the data isn’t evaluated, organized, and stored properly, critical
information can be either difficult or impossible to find – slowing a business
down at the exact moment when speed is essential.
#2: Loss of compliance
Laws are getting more and more complex with regard to how long companies need
to retain data, how they need to retain it, and where they need to retain it. There are
both general regulations in place as well as state- or industry-specific regulations
that may apply. It is not uncommon for regulators to perform random audits to
examine a company’s policies regarding data and their actual management of that
data. A compliance failure can result in significant fines or reputational
damage.
#3: Loss of security
With more data located in and moving between more places than ever before, there
is also a vastly increased number of ways to hack into that data. A security breach
can result in theft, fraud, fines … and, of course, reputational loss.
#4: Loss of money
A server may seem inexpensive at first glance – but never assume that storage is
cheap.
Benefits:
• Cost reduction
Big data technologies like Hadoop and cloud-based analytics can provide
substantial cost advantages. While comparisons between big data technology and
traditional architectures (data warehouses and marts in particular) are difficult
because of differences in functionality, a price comparison alone can suggest
order-of-magnitude improvements.
• Faster, better decision making
Analytics has always involved attempts to improve decision making, and big data
doesn’t change that.
• New products and services
Perhaps the most interesting use of big data analytics is to create new products and
services for customers. Online companies have done this for a decade or so, but
now predominantly offline firms are doing it too. GE, for example, has made a
major investment in new service models for its industrial products using big data
analytics.
Thank You
