9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
DS.pptx
1. STAGES IN A DATA SCIENCE
PROJECT
By
LAKSHAY AGARWAL
01224402021
BCA E1
2. There are 5 Stages Of a Data
Science Project :-
3.
4. So, What is Data Collection??????
•Data collection is the process of collecting, measuring and
analyzing different types of information using a set of
standard validated techniques.
•Once the data is collected, it goes through a rigorous
process of data cleaning and data processing to make this
data truly useful for businesses.
5. • There Are two methods of data collection:-
• Primary Data :- Primary data refers to data collected from first-hand
experience directly from the main source.
• This is collected through Interview, Observations, Surveys and
Questionnaires, Focus Group, etc.
• Secondary Data :- Secondary data is the data that has already been
collected through primary sources and made readily available for
researchers to use for their own research. It is a type of data that has
already been collected in the past.
• This is collected through Internet, Government Archives, Libraries,
etc.
6. •So, There Are three types of Data:-
•Structured Data :- Structured data is generally tabular
data that is represented by columns and rows in a
database.
•Databases that hold tables in this form are called
relational databases.
7. •Unstructured Data :- Unstructured data is information
that either does not organize in a pre-defined manner or
not have a pre-defined data model.
•Examples:- Video Files , Audio Files,etc.
•Semi- Structured Data :- It is data which consists both
parts of Structured And Unstructured Data.
8. So, What is Data Cleaning?????
•It is the process in which we eliminates duplicate and null
values, corrupt data, inconsistent data types, invalid
entries, missing data, and improper formatting.
9. Exploratory Data Analysis (EDA)
•It helps to uncover valuable insights that will be useful in
the next phase of the data science lifecycle.