SlideShare a Scribd company logo
1 of 17
Amity Institute of Information Technology
Introduction to Data Science
BSc.IT/BCA/ DUAL VI Semester
Faculty: Dr. Shambhu Kumar Jha
1
Amity Institute of Information Technology
Module I
Introduction to Big Data
Difference between Big Data and Data Science,
2
Amity Institute of Information Technology
Introduction to Big Data
let’s start by understanding what is Big Data?
 Big Data: It is large or voluminous data, information,
or the relevant statistics acquired by large
organizations and ventures from various sources.
 Many software and data storages is created and
prepared as it is difficult to compute the big data
manually.
 It is used to discover patterns and trends and make
decisions related to human behavior and interaction
technology
3
Amity Institute of Information Technology
Introduction to Big Data
Big data encompasses following wide variety of data types:
 Structured data, such as transactions and financial records;
 Unstructured data, such as email, text, documents and multimedia
files;
 Semi structured data, such as web server logs and streaming data
from sensors.
Big data is often characterized by the three V's:
• Large volume of data in many environments;
• Variety of data types frequently stored in big data systems; and
• Velocity at which much of the data is generated, collected and
processed. 4
Amity Institute of Information Technology
Why Big Data is Important
Companies use big data to :
 Improve operations,
 Provide better customer service,
 Create personalized marketing campaigns and take other actions to
increase revenue and profits.
 Competitive advantage over those that don't because they're able to
make faster and more informed business decisions.
 Example: Big data provides valuable insights into customers that
companies can use to refine their marketing, advertising and
promotions in order to increase customer engagement and conversion
rates.
 Both historical and real-time data can be analyzed to assess the
evolving preferences of consumers or corporate buyers, enabling
businesses to become more responsive to customer wants and needs.5
Amity Institute of Information Technology
Is big data part of data science?
• Big Data is essentially a special application of data science, in
which the data sets are enormous and require overcoming logistical
challenges to deal with them.
• The primary concern is efficiently capturing, storing, extracting,
processing, and analyzing information from these enormous data
sets.
• Big data is a combination of structured, semi structured and
unstructured data collected by organizations that can be mined for
information and used in machine learning projects, predictive
modeling and other advanced analytics applications.
6
Amity Institute of Information Technology
Differences between Big Data and Data
Science:
DATA SCIENCE BIG DATA
It is about the collection,
processing, analysing, and utilizing
of data in various operations.
It is about extracting vital and valuable
information from a huge amount of data.
It is a field of study just like
Computer Science, Applied
Statistics, or Applied Mathematics.
It is a technique for tracking and
discovering trends in complex data sets.
7
Amity Institute of Information Technology
Differences between Big Data and Data
Science:
The goal is to build data-dominant
products for a venture.
The goal is to make data more vital and
usable i.e. by extracting only important
information from the huge data within
existing traditional aspects.
Tools mainly used in Data Science
include SAS, R, Python, etc
Tools mostly used in Big Data include
Hadoop, Spark, Flink, etc.
It is a superset of Big Data as data
science consists of Data scrapping,
cleaning, visualization, statistics, and
many more techniques.
It is a sub-set of Data Science as mining
activities which is in a pipeline of Data
science.
8
Amity Institute of Information Technology
Differences between Big Data and Data
Science:
It is mainly used for scientific purposes.
It is mainly used for business purposes and
customer satisfaction.
It broadly focuses on the science of the data.
It is more involved with the processes of handling
voluminous data.
It is mainly used for scientific purposes.
It is mainly used for business purposes and
customer satisfaction.
9
AMITY INSTITUTE OF INFORMATION TECHNOLOGY
Applications Of Big Data Finance
o
o
o
o
AMITY INSTITUTE OF INFORMATION TECHNOLOGY
Applications of Big Data: Social Network
Social media in the current scenario is
considered as the largest data generator.
The stats have shown that around 500+
terabytes of new data get generated into the
databases of social media every day, particularly
in the case of Facebook.
AMITY INSTITUTE OF INFORMATION TECHNOLOGY
Applications of Big Data: Healthcare
Nowadays, doctors rely mostly on patients’
clinical records, which means that a lot of data
needs to be gathered, that too for different
patients.
Since there is a large amount of data coming
from different sources, in various formats, the
need to handle this large amount of data is
increased
AMITY INSTITUTE OF INFORMATION TECHNOLOGY
Applications of Big Data E-Commerce
Maintaining customer relationships is the most important in the e-
commerce industry.
E-commerce websites have different marketing ideas to retail their
merchandise to their customers, to manage transactions, and to
implement better tactics of using innovative ideas with Big Data to
improve businesses.
AMITY INSTITUTE OF INFORMATION TECHNOLOGY
Applications of Big Data: Education
The education sector holds a lot of information with regard to curriculum,
students, and faculty.
The information is analyzed to get insights that can enhance the operational
adequacy of the educational organization.
Collecting and analyzing information of a student such as attendance, test scores,
grades, and other issues take up a lot of data.
So, big data makes an approach for a progressive framework wherein this data
can be stored and analyzed making it easier for the institutes to work with.
Amity Institute of Information Technology
Application of Big data
Big Data in Communications
•Gaining new subscribers, retaining customers, and expanding
within current subscriber bases are top priorities for
telecommunication service providers.
•The solutions to these challenges lie in the ability to combine
and analyze the masses of customer-generated data and
machine-generated data that is being created every day.
15
Amity Institute of Information Technology
Skills Required Becoming a Data Scientist
• In-depth knowledge of SAS or R. For data science, R is generally
preferred.
• Python coding: Python is the most common coding language that is
used in data science, along with Java, Perl, and C/C++.
• Hadoop platform: Although not always a requirement, knowing the
Hadoop platform is still preferred for the field. Having some
experience in Hive or Pig is also beneficial.
• SQL database/coding: Although NoSQL and Hadoop have become a
significant part of data science, it is still preferred if you can write
and execute complex queries in SQL.
• Working with unstructured data: It is essential that a data
scientist can work with unstructured data, whether on social media,
video feeds, or audio.
16
Amity Institute of Information Technology
Skills Required Becoming a Big Data Specialist
• Analytical skills: These skills are essential for making sense of data,
and determining which data is relevant when creating reports and
looking for solutions.
• Creativity: You need to have the ability to create new methods to
gather, interpret, and analyze a data strategy. Mathematics and
statistical skills: Good, old-fashioned “number crunching” is also
necessary, be it in data science, data analytics, or big data.
• Computer science: Computers are the backbone of every data strategy.
Programmers will have a constant need to come up with algorithms to
process data into insights.
• Business skills: Big data professionals will need to have an
understanding of the business objectives that are in place, as well as
the underlying processes that drive the growth of the business and its
profits.
17

More Related Content

Similar to L3 Big Data and Application.pptx

data analytics lecture2.pptx
data analytics lecture2.pptxdata analytics lecture2.pptx
data analytics lecture2.pptxNamrataBhatt8
 
Big Data - Everything you need to know
Big Data - Everything you need to knowBig Data - Everything you need to know
Big Data - Everything you need to knowV2Soft
 
A study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesA study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesBhanu Prakash
 
The Comparison of Big Data Strategies in Corporate Environment
The Comparison of Big Data Strategies in Corporate EnvironmentThe Comparison of Big Data Strategies in Corporate Environment
The Comparison of Big Data Strategies in Corporate EnvironmentIRJET Journal
 
QuickView #3 - Big Data
QuickView #3 - Big DataQuickView #3 - Big Data
QuickView #3 - Big DataSonovate
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Oomph! Recruitment
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)Shahbaz Anjam
 
Big Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - WhitepaperBig Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - WhitepaperVasu S
 
OVERVIEW OF DATA SCIENCE (3).pdf
OVERVIEW OF DATA SCIENCE (3).pdfOVERVIEW OF DATA SCIENCE (3).pdf
OVERVIEW OF DATA SCIENCE (3).pdfcareer tech
 
IRJET- Big Data: A Study
IRJET-  	  Big Data: A StudyIRJET-  	  Big Data: A Study
IRJET- Big Data: A StudyIRJET Journal
 
Big Data why Now and where to?
Big Data why Now and where to?Big Data why Now and where to?
Big Data why Now and where to?Fady Sayah
 
06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyan06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyanIAESIJEECS
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Aditya205306
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big DataIRJET Journal
 
Learn All about Data Science from the Best Private University in Karnataka
Learn All about Data Science from the Best Private University in KarnatakaLearn All about Data Science from the Best Private University in Karnataka
Learn All about Data Science from the Best Private University in KarnatakaREVA University
 

Similar to L3 Big Data and Application.pptx (20)

data analytics lecture2.pptx
data analytics lecture2.pptxdata analytics lecture2.pptx
data analytics lecture2.pptx
 
What is data science ?
What is data science ?What is data science ?
What is data science ?
 
Big Data - Everything you need to know
Big Data - Everything you need to knowBig Data - Everything you need to know
Big Data - Everything you need to know
 
A study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websitesA study on web analytics with reference to select sports websites
A study on web analytics with reference to select sports websites
 
The Comparison of Big Data Strategies in Corporate Environment
The Comparison of Big Data Strategies in Corporate EnvironmentThe Comparison of Big Data Strategies in Corporate Environment
The Comparison of Big Data Strategies in Corporate Environment
 
QuickView #3 - Big Data
QuickView #3 - Big DataQuickView #3 - Big Data
QuickView #3 - Big Data
 
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
Quick view Big Data, brought by Oomph!, courtesy of our partner Sonovate
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
 
Unlocking big data
Unlocking big dataUnlocking big data
Unlocking big data
 
Big Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - WhitepaperBig Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - Whitepaper
 
OVERVIEW OF DATA SCIENCE (3).pdf
OVERVIEW OF DATA SCIENCE (3).pdfOVERVIEW OF DATA SCIENCE (3).pdf
OVERVIEW OF DATA SCIENCE (3).pdf
 
IRJET- Big Data: A Study
IRJET-  	  Big Data: A StudyIRJET-  	  Big Data: A Study
IRJET- Big Data: A Study
 
Big Data at a Glance
Big Data at a GlanceBig Data at a Glance
Big Data at a Glance
 
Big Data why Now and where to?
Big Data why Now and where to?Big Data why Now and where to?
Big Data why Now and where to?
 
Unit III.pdf
Unit III.pdfUnit III.pdf
Unit III.pdf
 
06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyan06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyan
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.Bda assignment can also be used for BDA notes and concept understanding.
Bda assignment can also be used for BDA notes and concept understanding.
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big Data
 
Learn All about Data Science from the Best Private University in Karnataka
Learn All about Data Science from the Best Private University in KarnatakaLearn All about Data Science from the Best Private University in Karnataka
Learn All about Data Science from the Best Private University in Karnataka
 

More from Shambhavi Vats

L4 Intro Statistical Inferencing.pptx
L4 Intro Statistical Inferencing.pptxL4 Intro Statistical Inferencing.pptx
L4 Intro Statistical Inferencing.pptxShambhavi Vats
 
L2 DS Tools and Application.pptx
L2 DS Tools and Application.pptxL2 DS Tools and Application.pptx
L2 DS Tools and Application.pptxShambhavi Vats
 
L1 Introduction DS.pptx
L1 Introduction DS.pptxL1 Introduction DS.pptx
L1 Introduction DS.pptxShambhavi Vats
 
Vaastav talwar a010145020052 adbms psda vaastav talwar
Vaastav talwar a010145020052 adbms psda   vaastav talwarVaastav talwar a010145020052 adbms psda   vaastav talwar
Vaastav talwar a010145020052 adbms psda vaastav talwarShambhavi Vats
 
38 object-concepts (1)
38 object-concepts (1)38 object-concepts (1)
38 object-concepts (1)Shambhavi Vats
 
Aakarsh 038 csit142_lab_work (1)
Aakarsh 038 csit142_lab_work (1)Aakarsh 038 csit142_lab_work (1)
Aakarsh 038 csit142_lab_work (1)Shambhavi Vats
 
wool from fibre to fabric
wool from fibre to fabricwool from fibre to fabric
wool from fibre to fabricShambhavi Vats
 
road side rain water harvesting
road side rain water harvestingroad side rain water harvesting
road side rain water harvestingShambhavi Vats
 

More from Shambhavi Vats (11)

L4 Intro Statistical Inferencing.pptx
L4 Intro Statistical Inferencing.pptxL4 Intro Statistical Inferencing.pptx
L4 Intro Statistical Inferencing.pptx
 
L2 DS Tools and Application.pptx
L2 DS Tools and Application.pptxL2 DS Tools and Application.pptx
L2 DS Tools and Application.pptx
 
L1 Introduction DS.pptx
L1 Introduction DS.pptxL1 Introduction DS.pptx
L1 Introduction DS.pptx
 
Vaastav talwar a010145020052 adbms psda vaastav talwar
Vaastav talwar a010145020052 adbms psda   vaastav talwarVaastav talwar a010145020052 adbms psda   vaastav talwar
Vaastav talwar a010145020052 adbms psda vaastav talwar
 
38 object-concepts (1)
38 object-concepts (1)38 object-concepts (1)
38 object-concepts (1)
 
Bca 2b
Bca 2bBca 2b
Bca 2b
 
Aakarsh 038 csit142_lab_work (1)
Aakarsh 038 csit142_lab_work (1)Aakarsh 038 csit142_lab_work (1)
Aakarsh 038 csit142_lab_work (1)
 
wool from fibre to fabric
wool from fibre to fabricwool from fibre to fabric
wool from fibre to fabric
 
urban livehood
urban livehoodurban livehood
urban livehood
 
road side rain water harvesting
road side rain water harvestingroad side rain water harvesting
road side rain water harvesting
 
Water
WaterWater
Water
 

Recently uploaded

VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxolyaivanovalion
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsJoseMangaJr1
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 

Recently uploaded (20)

VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptx
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 

L3 Big Data and Application.pptx

  • 1. Amity Institute of Information Technology Introduction to Data Science BSc.IT/BCA/ DUAL VI Semester Faculty: Dr. Shambhu Kumar Jha 1
  • 2. Amity Institute of Information Technology Module I Introduction to Big Data Difference between Big Data and Data Science, 2
  • 3. Amity Institute of Information Technology Introduction to Big Data let’s start by understanding what is Big Data?  Big Data: It is large or voluminous data, information, or the relevant statistics acquired by large organizations and ventures from various sources.  Many software and data storages is created and prepared as it is difficult to compute the big data manually.  It is used to discover patterns and trends and make decisions related to human behavior and interaction technology 3
  • 4. Amity Institute of Information Technology Introduction to Big Data Big data encompasses following wide variety of data types:  Structured data, such as transactions and financial records;  Unstructured data, such as email, text, documents and multimedia files;  Semi structured data, such as web server logs and streaming data from sensors. Big data is often characterized by the three V's: • Large volume of data in many environments; • Variety of data types frequently stored in big data systems; and • Velocity at which much of the data is generated, collected and processed. 4
  • 5. Amity Institute of Information Technology Why Big Data is Important Companies use big data to :  Improve operations,  Provide better customer service,  Create personalized marketing campaigns and take other actions to increase revenue and profits.  Competitive advantage over those that don't because they're able to make faster and more informed business decisions.  Example: Big data provides valuable insights into customers that companies can use to refine their marketing, advertising and promotions in order to increase customer engagement and conversion rates.  Both historical and real-time data can be analyzed to assess the evolving preferences of consumers or corporate buyers, enabling businesses to become more responsive to customer wants and needs.5
  • 6. Amity Institute of Information Technology Is big data part of data science? • Big Data is essentially a special application of data science, in which the data sets are enormous and require overcoming logistical challenges to deal with them. • The primary concern is efficiently capturing, storing, extracting, processing, and analyzing information from these enormous data sets. • Big data is a combination of structured, semi structured and unstructured data collected by organizations that can be mined for information and used in machine learning projects, predictive modeling and other advanced analytics applications. 6
  • 7. Amity Institute of Information Technology Differences between Big Data and Data Science: DATA SCIENCE BIG DATA It is about the collection, processing, analysing, and utilizing of data in various operations. It is about extracting vital and valuable information from a huge amount of data. It is a field of study just like Computer Science, Applied Statistics, or Applied Mathematics. It is a technique for tracking and discovering trends in complex data sets. 7
  • 8. Amity Institute of Information Technology Differences between Big Data and Data Science: The goal is to build data-dominant products for a venture. The goal is to make data more vital and usable i.e. by extracting only important information from the huge data within existing traditional aspects. Tools mainly used in Data Science include SAS, R, Python, etc Tools mostly used in Big Data include Hadoop, Spark, Flink, etc. It is a superset of Big Data as data science consists of Data scrapping, cleaning, visualization, statistics, and many more techniques. It is a sub-set of Data Science as mining activities which is in a pipeline of Data science. 8
  • 9. Amity Institute of Information Technology Differences between Big Data and Data Science: It is mainly used for scientific purposes. It is mainly used for business purposes and customer satisfaction. It broadly focuses on the science of the data. It is more involved with the processes of handling voluminous data. It is mainly used for scientific purposes. It is mainly used for business purposes and customer satisfaction. 9
  • 10. AMITY INSTITUTE OF INFORMATION TECHNOLOGY Applications Of Big Data Finance o o o o
  • 11. AMITY INSTITUTE OF INFORMATION TECHNOLOGY Applications of Big Data: Social Network Social media in the current scenario is considered as the largest data generator. The stats have shown that around 500+ terabytes of new data get generated into the databases of social media every day, particularly in the case of Facebook.
  • 12. AMITY INSTITUTE OF INFORMATION TECHNOLOGY Applications of Big Data: Healthcare Nowadays, doctors rely mostly on patients’ clinical records, which means that a lot of data needs to be gathered, that too for different patients. Since there is a large amount of data coming from different sources, in various formats, the need to handle this large amount of data is increased
  • 13. AMITY INSTITUTE OF INFORMATION TECHNOLOGY Applications of Big Data E-Commerce Maintaining customer relationships is the most important in the e- commerce industry. E-commerce websites have different marketing ideas to retail their merchandise to their customers, to manage transactions, and to implement better tactics of using innovative ideas with Big Data to improve businesses.
  • 14. AMITY INSTITUTE OF INFORMATION TECHNOLOGY Applications of Big Data: Education The education sector holds a lot of information with regard to curriculum, students, and faculty. The information is analyzed to get insights that can enhance the operational adequacy of the educational organization. Collecting and analyzing information of a student such as attendance, test scores, grades, and other issues take up a lot of data. So, big data makes an approach for a progressive framework wherein this data can be stored and analyzed making it easier for the institutes to work with.
  • 15. Amity Institute of Information Technology Application of Big data Big Data in Communications •Gaining new subscribers, retaining customers, and expanding within current subscriber bases are top priorities for telecommunication service providers. •The solutions to these challenges lie in the ability to combine and analyze the masses of customer-generated data and machine-generated data that is being created every day. 15
  • 16. Amity Institute of Information Technology Skills Required Becoming a Data Scientist • In-depth knowledge of SAS or R. For data science, R is generally preferred. • Python coding: Python is the most common coding language that is used in data science, along with Java, Perl, and C/C++. • Hadoop platform: Although not always a requirement, knowing the Hadoop platform is still preferred for the field. Having some experience in Hive or Pig is also beneficial. • SQL database/coding: Although NoSQL and Hadoop have become a significant part of data science, it is still preferred if you can write and execute complex queries in SQL. • Working with unstructured data: It is essential that a data scientist can work with unstructured data, whether on social media, video feeds, or audio. 16
  • 17. Amity Institute of Information Technology Skills Required Becoming a Big Data Specialist • Analytical skills: These skills are essential for making sense of data, and determining which data is relevant when creating reports and looking for solutions. • Creativity: You need to have the ability to create new methods to gather, interpret, and analyze a data strategy. Mathematics and statistical skills: Good, old-fashioned “number crunching” is also necessary, be it in data science, data analytics, or big data. • Computer science: Computers are the backbone of every data strategy. Programmers will have a constant need to come up with algorithms to process data into insights. • Business skills: Big data professionals will need to have an understanding of the business objectives that are in place, as well as the underlying processes that drive the growth of the business and its profits. 17

Editor's Notes

  1. 1
  2. The data generated mainly consist of videos, photos, message exchanges, etc. A single activity on any social media site generates a lot of data which is again stored and gets processed whenever required. Since the data stored is in terabytes, it would take a lot of time for processing if it is done by our legacy systems. Big Data is a solution to this problem.