SlideShare uma empresa Scribd logo
1 de 16
Mrs. D. Suja Mary
Assistant Professor
Nanjil Catholic College of Arts and Science,
Kaliyakkavilai
What is Data?
 The quantities, characters, or symbols on which
operations are performed by a computer, which
may be stored and transmitted in the form of
electrical signals and recorded on magnetic,
optical, or mechanical recording media.
What is Big Data?
 Big Data is a collection of data that is huge in
volume, yet growing exponentially with time. It
is a data with so large size and complexity that
none of traditional data management tools can
store it or process it efficiently. Big data is also a
data but with huge size.
What Is Data Analytics?
 The term data analytics refers to the process of
examining datasets to draw conclusions about the
information they contain. Data analytic techniques
enable you to take raw data and uncover patterns to
extract valuable insights from it.
 Data Scientists and Analysts use data analytics
techniques in their research, and businesses also use
it to inform their decisions.
 Data analysis can help companies better understand
their customers, evaluate their ad campaigns,
personalize content, create content strategies and
develop products.
 businesses can use data analytics to boost business
performance and improve their bottom line.
 Analysis techniques give businesses access to insights that can help
them to improve their performance.
 As the importance of data analytics in the business world
increases, it becomes more critical that our company understand
how to implement it. Some benefits of data analytics include:
 1. Improved Decision Making
Companies can use the insights they gain from data analytics to
inform their decisions, leading to better outcomes.
 2. More Effective Marketing
Data analytics also gives us useful insights into how our campaigns
are performing so that can fine-tune them for optimal outcomes.
 3. Better Customer Service
Data analytics provide us with more insights into our customers,
allowing us to tailor customer service to their needs, provide more
personalization and build stronger relationships with them.
 4. More Efficient Operations
Data analytics can help us streamline our processes, save money
and boost our bottom line. When we have an improved
understanding of what our audience wants, we waste less time on
creating ads and content that don’t match our audience’s
interests.
 It is an organized collection of structured data. It is a
collection of related information.
 DB stores and access data electronically.
 A database is stored as a file or a set of files on magnetic
disk or tape, optical disk, or some other secondary storage
device.
 It is an data structure that stores organized information.
 They are administrated to facilitate the storage of data,
retrieval of data, modification of data, and deletion of
data.
 It allows processing various data-processing operations.
 Databases bolster stockpiling and control of information.
 Databases make information administration simple.
 Any database developer with certain sets of syntax can
process can work on the database
 A DB is a collection of related data. There are
two types of databases – Relation Database
Management System while other is Non –
Relational Database Management System.
 If we are storing and capable of processing a
very huge volume of data in databases,
Definitely we can store and process Big Data
through relational or Non-relational Databases.
 Big data is not going to replace databases. In one
form or other we will be using SQL databases to
store and process Big Data. In this regard, Big
Data is completely separate from DB.
Given below is the difference between Big Data and Database:
 Big Data is a term applied to data sets whose size or type is beyond
the ability of traditional relational databases. A traditional
database is not able to capture, manage, and process the high
volume of data with low-latency While Database is a collection of
information that is organized so that it can be easily captured,
accessed, managed and updated.
 Big Data refers to technologies and initiatives that involve data
that is too diverse i.e. varieties, rapid-changing or massive for
skills, conventional technologies, and infrastructure to address
efficiently While Database management system (DBMS) extracts
information from the database in response to queries but it in
restricted conditions.
 There can be any varieties of data while DB can be defined through
some schema.
 It is difficult to store and process while Databases like SQL, data
can be easily stored and process.
 Raw data is the data that is collected from a
source, but in its initial state. It has not yet
been processed — or cleaned, organized, and
visually presented. Raw data can be
manually written down or typed, recorded,
or automatically input by a machine. You can
find raw data in a variety of places, including
databases, files, spreadsheets, and even on
source devices, such as a camera.
 Data analysts, software, and artificial
intelligence (AI) all work to transform raw data
into processed data.
 They start by organizing and cleaning the raw
data. One of the most important parts of this
process is removing outliers and duplicates within
the data set.
 The next step is an initial analysis that may
involve data manipulation. Especially if analysts
are analyzing raw data based on human responses
to a question, they will look closely at those
responses and determine if respondents
inaccurately replied to the question in a way that
will change the results.
 Raw data serves several purposes, particularly in businesses
where full data visibility is key to statistical and predictive
analytics.
Here are a few reasons why businesses heavily rely on raw data
sources:
 Raw data is the starting phase of all data and the initial source of
data-based decisions. You can’t make visually compelling charts
or overarching analytical statements about processed data until
we’ve worked through all of the raw data.
 We can trust the integrity of raw data. We don’t have to worry
that something has been removed or adjusted, because the
format has not yet been manipulated by humans or machines.
 AI and machine learning methods can only analyze data in a raw
format. Once the data has been processed, it is illegible to these
technologies.
 Raw data gives you a backup resource. We can check our work
and go back to the source after processing and manipulating our
data sets. It’s all there for your reference if we run into a
problem and need a new analysis.
All data inside a computer is transmitted as a
series of electrical signals that are either on
or off. Therefore, in order for a computer to
be able to process any kind of data, including
text, images and sound, they must be
converted into binary form.
 Data Representation Types of data: – Numbers – Text – Images
– Audio & Video
 Text
 When any key on a keyboard is pressed, it needs to be
converted into a binary number so that it can be processed by
the computer and the typed character can appear on the
screen.
 A code where each number represents a character can be used
to convert text into binary. One code we can use for this is
called ASCII. The ASCII code takes each character on the
keyboard and assigns it a binary number.
 Images also need to be converted into binary in
order for a computer to process them so that
they can be seen on our screen. Digital images
are made up of pixels. Each pixel in an image is
made up of binary numbers.
 If we say that 1 is black (or on) and 0 is white (or
off), then a simple black and white picture can
be created using binary.
 The terms audio and video commonly refers
to the time-based media storage format for
sound/music and moving pictures
information. Audio and video digital
recording, also referred as audio and video
codecs, can be uncompressed, lossless
compressed, or lossy compressed depending
on the desired quality and use cases.
 Connected objects are another source of raw
data, which retrieves a large amount of data
through their sensors.
 The Internet of Things (IoT) contributes to
double the size of the digital universe every 2
years, which could be 44,000 billion gigabytes in
2020, 10 times more than in 2013
 The connected object thus allows extend the
scope of internet allowing any object, machine
or living element to transmit information about
its environment and eventually be activated
remotely.
Analyzing Raw Data Sources

Mais conteúdo relacionado

Mais procurados

The Growing Importance of Data Cleaning
The Growing Importance of Data CleaningThe Growing Importance of Data Cleaning
The Growing Importance of Data CleaningCarolineSmith912130
 
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...Edureka!
 
Exploratory data analysis data visualization
Exploratory data analysis data visualizationExploratory data analysis data visualization
Exploratory data analysis data visualizationDr. Hamdan Al-Sabri
 
What Is Unstructured Data And Why Is It So Important To Businesses?
What Is Unstructured Data And Why Is It So Important To Businesses?What Is Unstructured Data And Why Is It So Important To Businesses?
What Is Unstructured Data And Why Is It So Important To Businesses?Bernard Marr
 
Data science life cycle
Data science life cycleData science life cycle
Data science life cycleManoj Mishra
 
Socable Influence Maximization
Socable Influence MaximizationSocable Influence Maximization
Socable Influence Maximizationrobertlz
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning Gopal Sakarkar
 
4.3 multimedia datamining
4.3 multimedia datamining4.3 multimedia datamining
4.3 multimedia dataminingKrish_ver2
 
R and Visualization: A match made in Heaven
R and Visualization: A match made in HeavenR and Visualization: A match made in Heaven
R and Visualization: A match made in HeavenEdureka!
 
Information cascades
Information cascadesInformation cascades
Information cascadesLeonid Zhukov
 
Feature Engineering & Selection
Feature Engineering & SelectionFeature Engineering & Selection
Feature Engineering & SelectionEng Teong Cheah
 
Chapter 4 Classification
Chapter 4 ClassificationChapter 4 Classification
Chapter 4 ClassificationKhalid Elshafie
 
PCA (Principal component analysis)
PCA (Principal component analysis)PCA (Principal component analysis)
PCA (Principal component analysis)Learnbay Datascience
 
Knowledge discovery thru data mining
Knowledge discovery thru data miningKnowledge discovery thru data mining
Knowledge discovery thru data miningDevakumar Jain
 
The Data Science Process
The Data Science ProcessThe Data Science Process
The Data Science ProcessVishal Patel
 
Classification in data mining
Classification in data mining Classification in data mining
Classification in data mining Sulman Ahmed
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Edureka!
 

Mais procurados (20)

The Growing Importance of Data Cleaning
The Growing Importance of Data CleaningThe Growing Importance of Data Cleaning
The Growing Importance of Data Cleaning
 
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
 
Exploratory data analysis data visualization
Exploratory data analysis data visualizationExploratory data analysis data visualization
Exploratory data analysis data visualization
 
What Is Unstructured Data And Why Is It So Important To Businesses?
What Is Unstructured Data And Why Is It So Important To Businesses?What Is Unstructured Data And Why Is It So Important To Businesses?
What Is Unstructured Data And Why Is It So Important To Businesses?
 
Data science life cycle
Data science life cycleData science life cycle
Data science life cycle
 
Socable Influence Maximization
Socable Influence MaximizationSocable Influence Maximization
Socable Influence Maximization
 
Data preprocessing using Machine Learning
Data  preprocessing using Machine Learning Data  preprocessing using Machine Learning
Data preprocessing using Machine Learning
 
4.3 multimedia datamining
4.3 multimedia datamining4.3 multimedia datamining
4.3 multimedia datamining
 
R and Visualization: A match made in Heaven
R and Visualization: A match made in HeavenR and Visualization: A match made in Heaven
R and Visualization: A match made in Heaven
 
Data Science
Data ScienceData Science
Data Science
 
Big Data analytics best practices
Big Data analytics best practicesBig Data analytics best practices
Big Data analytics best practices
 
Information cascades
Information cascadesInformation cascades
Information cascades
 
Feature Engineering & Selection
Feature Engineering & SelectionFeature Engineering & Selection
Feature Engineering & Selection
 
Classification of data
Classification of dataClassification of data
Classification of data
 
Chapter 4 Classification
Chapter 4 ClassificationChapter 4 Classification
Chapter 4 Classification
 
PCA (Principal component analysis)
PCA (Principal component analysis)PCA (Principal component analysis)
PCA (Principal component analysis)
 
Knowledge discovery thru data mining
Knowledge discovery thru data miningKnowledge discovery thru data mining
Knowledge discovery thru data mining
 
The Data Science Process
The Data Science ProcessThe Data Science Process
The Data Science Process
 
Classification in data mining
Classification in data mining Classification in data mining
Classification in data mining
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
 

Semelhante a Analyzing Raw Data Sources

Business Intelligence
Business IntelligenceBusiness Intelligence
Business IntelligenceSukirti Garg
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data scienceJohnson Ubah
 
Chapter 2 - Intro to Data Sciences[2].pptx
Chapter 2 - Intro to Data Sciences[2].pptxChapter 2 - Intro to Data Sciences[2].pptx
Chapter 2 - Intro to Data Sciences[2].pptxJethroDignadice2
 
Security issues in big data
Security issues in big data Security issues in big data
Security issues in big data Shallote Dsouza
 
C21027_Aditya_Big Data Analytics In Baking Sector.pptx
C21027_Aditya_Big Data Analytics In Baking Sector.pptxC21027_Aditya_Big Data Analytics In Baking Sector.pptx
C21027_Aditya_Big Data Analytics In Baking Sector.pptxAdityaDeshpande674450
 
IRJET- Comparative Study of Efficacy of Big Data Analysis and Deep Learni...
IRJET-  	  Comparative Study of Efficacy of Big Data Analysis and Deep Learni...IRJET-  	  Comparative Study of Efficacy of Big Data Analysis and Deep Learni...
IRJET- Comparative Study of Efficacy of Big Data Analysis and Deep Learni...IRJET Journal
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingwork
 
Dataware housing
Dataware housingDataware housing
Dataware housingwork
 
Harness the power of data
Harness the power of dataHarness the power of data
Harness the power of dataHarsha MV
 
Using Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales GoalsUsing Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales GoalsIrshadKhan682442
 
Using Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales GoalsUsing Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales GoalsWilliamJohnson288536
 
Using Data Lakes To Sail Through Your Sales Goals
Using Data Lakes To Sail Through Your Sales GoalsUsing Data Lakes To Sail Through Your Sales Goals
Using Data Lakes To Sail Through Your Sales GoalsKevinJohnson667312
 
WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM
WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEMWHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM
WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEMRajaraj64
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)Shahbaz Anjam
 

Semelhante a Analyzing Raw Data Sources (20)

Business Intelligence
Business IntelligenceBusiness Intelligence
Business Intelligence
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Big Data.pptx
Big Data.pptxBig Data.pptx
Big Data.pptx
 
1 UNIT-DSP.pptx
1 UNIT-DSP.pptx1 UNIT-DSP.pptx
1 UNIT-DSP.pptx
 
Chapter 2 - Intro to Data Sciences[2].pptx
Chapter 2 - Intro to Data Sciences[2].pptxChapter 2 - Intro to Data Sciences[2].pptx
Chapter 2 - Intro to Data Sciences[2].pptx
 
Bigdata
Bigdata Bigdata
Bigdata
 
U - 2 Emerging.pptx
U - 2 Emerging.pptxU - 2 Emerging.pptx
U - 2 Emerging.pptx
 
365 Data Science
365 Data Science365 Data Science
365 Data Science
 
Security issues in big data
Security issues in big data Security issues in big data
Security issues in big data
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
C21027_Aditya_Big Data Analytics In Baking Sector.pptx
C21027_Aditya_Big Data Analytics In Baking Sector.pptxC21027_Aditya_Big Data Analytics In Baking Sector.pptx
C21027_Aditya_Big Data Analytics In Baking Sector.pptx
 
IRJET- Comparative Study of Efficacy of Big Data Analysis and Deep Learni...
IRJET-  	  Comparative Study of Efficacy of Big Data Analysis and Deep Learni...IRJET-  	  Comparative Study of Efficacy of Big Data Analysis and Deep Learni...
IRJET- Comparative Study of Efficacy of Big Data Analysis and Deep Learni...
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Dataware housing
Dataware housingDataware housing
Dataware housing
 
Harness the power of data
Harness the power of dataHarness the power of data
Harness the power of data
 
Using Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales GoalsUsing Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales Goals
 
Using Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales GoalsUsing Data Lakes to Sail Through Your Sales Goals
Using Data Lakes to Sail Through Your Sales Goals
 
Using Data Lakes To Sail Through Your Sales Goals
Using Data Lakes To Sail Through Your Sales GoalsUsing Data Lakes To Sail Through Your Sales Goals
Using Data Lakes To Sail Through Your Sales Goals
 
WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM
WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEMWHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM
WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
 

Último

Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlkumarajju5765
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 

Último (20)

Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girlCall Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
Call Girls 🫤 Dwarka ➡️ 9711199171 ➡️ Delhi 🫦 Two shot with one girl
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 

Analyzing Raw Data Sources

  • 1. Mrs. D. Suja Mary Assistant Professor Nanjil Catholic College of Arts and Science, Kaliyakkavilai
  • 2. What is Data?  The quantities, characters, or symbols on which operations are performed by a computer, which may be stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media. What is Big Data?  Big Data is a collection of data that is huge in volume, yet growing exponentially with time. It is a data with so large size and complexity that none of traditional data management tools can store it or process it efficiently. Big data is also a data but with huge size.
  • 3. What Is Data Analytics?  The term data analytics refers to the process of examining datasets to draw conclusions about the information they contain. Data analytic techniques enable you to take raw data and uncover patterns to extract valuable insights from it.  Data Scientists and Analysts use data analytics techniques in their research, and businesses also use it to inform their decisions.  Data analysis can help companies better understand their customers, evaluate their ad campaigns, personalize content, create content strategies and develop products.  businesses can use data analytics to boost business performance and improve their bottom line.
  • 4.  Analysis techniques give businesses access to insights that can help them to improve their performance.  As the importance of data analytics in the business world increases, it becomes more critical that our company understand how to implement it. Some benefits of data analytics include:  1. Improved Decision Making Companies can use the insights they gain from data analytics to inform their decisions, leading to better outcomes.  2. More Effective Marketing Data analytics also gives us useful insights into how our campaigns are performing so that can fine-tune them for optimal outcomes.  3. Better Customer Service Data analytics provide us with more insights into our customers, allowing us to tailor customer service to their needs, provide more personalization and build stronger relationships with them.  4. More Efficient Operations Data analytics can help us streamline our processes, save money and boost our bottom line. When we have an improved understanding of what our audience wants, we waste less time on creating ads and content that don’t match our audience’s interests.
  • 5.  It is an organized collection of structured data. It is a collection of related information.  DB stores and access data electronically.  A database is stored as a file or a set of files on magnetic disk or tape, optical disk, or some other secondary storage device.  It is an data structure that stores organized information.  They are administrated to facilitate the storage of data, retrieval of data, modification of data, and deletion of data.  It allows processing various data-processing operations.  Databases bolster stockpiling and control of information.  Databases make information administration simple.  Any database developer with certain sets of syntax can process can work on the database
  • 6.  A DB is a collection of related data. There are two types of databases – Relation Database Management System while other is Non – Relational Database Management System.  If we are storing and capable of processing a very huge volume of data in databases, Definitely we can store and process Big Data through relational or Non-relational Databases.  Big data is not going to replace databases. In one form or other we will be using SQL databases to store and process Big Data. In this regard, Big Data is completely separate from DB.
  • 7. Given below is the difference between Big Data and Database:  Big Data is a term applied to data sets whose size or type is beyond the ability of traditional relational databases. A traditional database is not able to capture, manage, and process the high volume of data with low-latency While Database is a collection of information that is organized so that it can be easily captured, accessed, managed and updated.  Big Data refers to technologies and initiatives that involve data that is too diverse i.e. varieties, rapid-changing or massive for skills, conventional technologies, and infrastructure to address efficiently While Database management system (DBMS) extracts information from the database in response to queries but it in restricted conditions.  There can be any varieties of data while DB can be defined through some schema.  It is difficult to store and process while Databases like SQL, data can be easily stored and process.
  • 8.  Raw data is the data that is collected from a source, but in its initial state. It has not yet been processed — or cleaned, organized, and visually presented. Raw data can be manually written down or typed, recorded, or automatically input by a machine. You can find raw data in a variety of places, including databases, files, spreadsheets, and even on source devices, such as a camera.
  • 9.  Data analysts, software, and artificial intelligence (AI) all work to transform raw data into processed data.  They start by organizing and cleaning the raw data. One of the most important parts of this process is removing outliers and duplicates within the data set.  The next step is an initial analysis that may involve data manipulation. Especially if analysts are analyzing raw data based on human responses to a question, they will look closely at those responses and determine if respondents inaccurately replied to the question in a way that will change the results.
  • 10.  Raw data serves several purposes, particularly in businesses where full data visibility is key to statistical and predictive analytics. Here are a few reasons why businesses heavily rely on raw data sources:  Raw data is the starting phase of all data and the initial source of data-based decisions. You can’t make visually compelling charts or overarching analytical statements about processed data until we’ve worked through all of the raw data.  We can trust the integrity of raw data. We don’t have to worry that something has been removed or adjusted, because the format has not yet been manipulated by humans or machines.  AI and machine learning methods can only analyze data in a raw format. Once the data has been processed, it is illegible to these technologies.  Raw data gives you a backup resource. We can check our work and go back to the source after processing and manipulating our data sets. It’s all there for your reference if we run into a problem and need a new analysis.
  • 11. All data inside a computer is transmitted as a series of electrical signals that are either on or off. Therefore, in order for a computer to be able to process any kind of data, including text, images and sound, they must be converted into binary form.
  • 12.  Data Representation Types of data: – Numbers – Text – Images – Audio & Video  Text  When any key on a keyboard is pressed, it needs to be converted into a binary number so that it can be processed by the computer and the typed character can appear on the screen.  A code where each number represents a character can be used to convert text into binary. One code we can use for this is called ASCII. The ASCII code takes each character on the keyboard and assigns it a binary number.
  • 13.  Images also need to be converted into binary in order for a computer to process them so that they can be seen on our screen. Digital images are made up of pixels. Each pixel in an image is made up of binary numbers.  If we say that 1 is black (or on) and 0 is white (or off), then a simple black and white picture can be created using binary.
  • 14.  The terms audio and video commonly refers to the time-based media storage format for sound/music and moving pictures information. Audio and video digital recording, also referred as audio and video codecs, can be uncompressed, lossless compressed, or lossy compressed depending on the desired quality and use cases.
  • 15.  Connected objects are another source of raw data, which retrieves a large amount of data through their sensors.  The Internet of Things (IoT) contributes to double the size of the digital universe every 2 years, which could be 44,000 billion gigabytes in 2020, 10 times more than in 2013  The connected object thus allows extend the scope of internet allowing any object, machine or living element to transmit information about its environment and eventually be activated remotely.