SlideShare uma empresa Scribd logo
1 de 16
Descriptive Statistics
for one variable
Statistics has two major chapters:
• Descriptive Statistics
• Inferential statistics
Statistics
Descriptive Statistics
• Gives numerical and
graphic procedures to
summarize a collection
of data in a clear and
understandable way
Inferential Statistics
• Provides procedures
to draw inferences
about a population
from a sample
Descriptive Measures
• Central Tendency measures. They are
computed to give a “center” around which the
measurements in the data are distributed.
• Variation or Variability measures. They
describe “data spread” or how far away the
measurements are from the center.
• Relative Standing measures. They describe
the relative position of specific measurements in the
data.
Measures of Central Tendency
• Mean:
Sum of all measurements divided by the number
of measurements.
• Median:
A number such that at most half of the
measurements are below it and at most half of the
measurements are above it.
• Mode:
The most frequent measurement in the data.
Example of Mean
Measurements Deviation
x x - mean
3 -1
5 1
5 1
1 -3
7 3
2 -2
6 2
7 3
0 -4
4 0
40 0
• MEAN = 40/10 = 4
• Notice that the sum of the
“deviations” is 0.
• Notice that every single
observation intervenes in
the computation of the
mean.
Example of Median
• Median: (4+5)/2 =
4.5
• Notice that only the
two central values are
used in the
computation.
• The median is not
sensible to extreme
values
Measurements Measurements
Ranked
x x
3 0
5 1
5 2
1 3
7 4
2 5
6 5
7 6
0 7
4 7
40 40
Example of Mode
Measurements
x
3
5
5
1
7
2
6
7
0
4
• In this case the data have
two modes:
• 5 and 7
• Both measurements are
repeated twice
Example of Mode
Measurements
x
3
5
1
1
4
7
3
8
3
• Mode: 3
• Notice that it is possible for a
data not to have any mode.
Variance (for a sample)
• Steps:
– Compute each deviation
– Square each deviation
– Sum all the squares
– Divide by the data size (sample size) minus
one: n-1
Example of Variance
Measurements Deviations Square of
deviations
x x - mean
3 -1 1
5 1 1
5 1 1
1 -3 9
7 3 9
2 -2 4
6 2 4
7 3 9
0 -4 16
4 0 0
40 0 54
• Variance = 54/9 = 6
• It is a measure of
“spread”.
• Notice that the larger
the deviations (positive
or negative) the larger
the variance
The standard deviation
• It is defines as the square root of the
variance
• In the previous example
• Variance = 6
• Standard deviation = Square root of the
variance = Square root of 6 = 2.45
Percentiles
• The p-the percentile is a number such that at most p%
of the measurements are below it and at most 100 – p
percent of the data are above it.
• Example, if in a certain data the 85th
percentile is 340
means that 15% of the measurements in the data are
above 340. It also means that 85% of the
measurements are below 340
• Notice that the median is the 50th
percentile
For any data
• At least 75% of the measurements differ from the mean
less than twice the standard deviation.
• At least 89% of the measurements differ from the mean
less than three times the standard deviation.
Note: This is a general property and it is called Tchebichev’s Rule: At
least 1-1/k2
of the observation falls within k standard deviations from the
mean. It is true for every dataset.
Example of Tchebichev’s Rule
Suppose that for a certain
data is :
• Mean = 20
• Standard deviation =3
Then:
• A least 75% of the
measurements are
between 14 and 26
• At least 89% of the
measurements are
between 11 and 29
Further Notes
• When the Mean is greater than the Median the
data distribution is skewed to the Right.
• When the Median is greater than the Mean the
data distribution is skewed to the Left.
• When Mean and Median are very close to each
other the data distribution is approximately
symmetric.

Mais conteúdo relacionado

Mais procurados

Data Analysis, Presentation and Interpretation of Data
Data Analysis, Presentation and Interpretation of DataData Analysis, Presentation and Interpretation of Data
Data Analysis, Presentation and Interpretation of Data
Roqui Malijan
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Aiden Yeh
 

Mais procurados (20)

Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Spss tutorial 1
Spss tutorial 1Spss tutorial 1
Spss tutorial 1
 
Data Analysis, Presentation and Interpretation of Data
Data Analysis, Presentation and Interpretation of DataData Analysis, Presentation and Interpretation of Data
Data Analysis, Presentation and Interpretation of Data
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Measures of central tendancy
Measures of central tendancy Measures of central tendancy
Measures of central tendancy
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Data Analysis and Statistics
Data Analysis and StatisticsData Analysis and Statistics
Data Analysis and Statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Basic statistics
Basic statisticsBasic statistics
Basic statistics
 
Basics of Educational Statistics (Descriptive statistics)
Basics of Educational Statistics (Descriptive statistics)Basics of Educational Statistics (Descriptive statistics)
Basics of Educational Statistics (Descriptive statistics)
 
Ses 1 basic fundamentals of mathematics and statistics
Ses 1 basic fundamentals of mathematics and statisticsSes 1 basic fundamentals of mathematics and statistics
Ses 1 basic fundamentals of mathematics and statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Basic Statistics & Data Analysis
Basic Statistics & Data AnalysisBasic Statistics & Data Analysis
Basic Statistics & Data Analysis
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Analysis and Interpretation of Data
Analysis and Interpretation of DataAnalysis and Interpretation of Data
Analysis and Interpretation of Data
 
Measures of Central Tendency
Measures of Central TendencyMeasures of Central Tendency
Measures of Central Tendency
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
descriptive data analysis
 descriptive data analysis descriptive data analysis
descriptive data analysis
 

Semelhante a Descriptive statistics -review(2)

Ch5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptxCh5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptx
zerihunnana
 

Semelhante a Descriptive statistics -review(2) (20)

Statr sessions 4 to 6
Statr sessions 4 to 6Statr sessions 4 to 6
Statr sessions 4 to 6
 
2. chapter ii(analyz)
2. chapter ii(analyz)2. chapter ii(analyz)
2. chapter ii(analyz)
 
Chapter 3 Ken Black 2.ppt
Chapter 3 Ken Black 2.pptChapter 3 Ken Black 2.ppt
Chapter 3 Ken Black 2.ppt
 
determinatiion of
determinatiion of determinatiion of
determinatiion of
 
Measure of Variability Report.pptx
Measure of Variability Report.pptxMeasure of Variability Report.pptx
Measure of Variability Report.pptx
 
IV STATISTICS I.pdf
IV STATISTICS I.pdfIV STATISTICS I.pdf
IV STATISTICS I.pdf
 
Ch5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptxCh5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptx
 
State presentation2
State presentation2State presentation2
State presentation2
 
Dscriptive statistics
Dscriptive statisticsDscriptive statistics
Dscriptive statistics
 
Business statistics
Business statisticsBusiness statistics
Business statistics
 
Statistics for Medical students
Statistics for Medical studentsStatistics for Medical students
Statistics for Medical students
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptx
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 
Statistics for machine learning shifa noorulain
Statistics for machine learning   shifa noorulainStatistics for machine learning   shifa noorulain
Statistics for machine learning shifa noorulain
 
trs-9.ppt
trs-9.ppttrs-9.ppt
trs-9.ppt
 
Basic statisctis -Anandh Shankar
Basic statisctis -Anandh ShankarBasic statisctis -Anandh Shankar
Basic statisctis -Anandh Shankar
 
Statistics
StatisticsStatistics
Statistics
 
Biostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxBiostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptx
 
Descriptive Analysis.pptx
Descriptive Analysis.pptxDescriptive Analysis.pptx
Descriptive Analysis.pptx
 
Basic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxBasic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptx
 

Mais de Hanimarcelo slideshare (14)

Biostatistics
BiostatisticsBiostatistics
Biostatistics
 
Biostatistics
BiostatisticsBiostatistics
Biostatistics
 
History and philosophy of science
History and  philosophy of scienceHistory and  philosophy of science
History and philosophy of science
 
Genetics and evolution
Genetics and evolutionGenetics and evolution
Genetics and evolution
 
Biostatistics
BiostatisticsBiostatistics
Biostatistics
 
Understanding inferential statistics
Understanding inferential statisticsUnderstanding inferential statistics
Understanding inferential statistics
 
Plant morphology
Plant morphologyPlant morphology
Plant morphology
 
Animals let
Animals letAnimals let
Animals let
 
Philsophical foundation of Curriculum
Philsophical foundation of CurriculumPhilsophical foundation of Curriculum
Philsophical foundation of Curriculum
 
Problem Centered Approach
Problem Centered ApproachProblem Centered Approach
Problem Centered Approach
 
Professional Organization
 Professional Organization Professional Organization
Professional Organization
 
Curriculum Design Models
Curriculum Design ModelsCurriculum Design Models
Curriculum Design Models
 
Classroom management
Classroom managementClassroom management
Classroom management
 
Curriculum development internal stakeholders
Curriculum development internal stakeholdersCurriculum development internal stakeholders
Curriculum development internal stakeholders
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 

Descriptive statistics -review(2)

  • 2. Statistics has two major chapters: • Descriptive Statistics • Inferential statistics
  • 3. Statistics Descriptive Statistics • Gives numerical and graphic procedures to summarize a collection of data in a clear and understandable way Inferential Statistics • Provides procedures to draw inferences about a population from a sample
  • 4. Descriptive Measures • Central Tendency measures. They are computed to give a “center” around which the measurements in the data are distributed. • Variation or Variability measures. They describe “data spread” or how far away the measurements are from the center. • Relative Standing measures. They describe the relative position of specific measurements in the data.
  • 5. Measures of Central Tendency • Mean: Sum of all measurements divided by the number of measurements. • Median: A number such that at most half of the measurements are below it and at most half of the measurements are above it. • Mode: The most frequent measurement in the data.
  • 6. Example of Mean Measurements Deviation x x - mean 3 -1 5 1 5 1 1 -3 7 3 2 -2 6 2 7 3 0 -4 4 0 40 0 • MEAN = 40/10 = 4 • Notice that the sum of the “deviations” is 0. • Notice that every single observation intervenes in the computation of the mean.
  • 7. Example of Median • Median: (4+5)/2 = 4.5 • Notice that only the two central values are used in the computation. • The median is not sensible to extreme values Measurements Measurements Ranked x x 3 0 5 1 5 2 1 3 7 4 2 5 6 5 7 6 0 7 4 7 40 40
  • 8. Example of Mode Measurements x 3 5 5 1 7 2 6 7 0 4 • In this case the data have two modes: • 5 and 7 • Both measurements are repeated twice
  • 9. Example of Mode Measurements x 3 5 1 1 4 7 3 8 3 • Mode: 3 • Notice that it is possible for a data not to have any mode.
  • 10. Variance (for a sample) • Steps: – Compute each deviation – Square each deviation – Sum all the squares – Divide by the data size (sample size) minus one: n-1
  • 11. Example of Variance Measurements Deviations Square of deviations x x - mean 3 -1 1 5 1 1 5 1 1 1 -3 9 7 3 9 2 -2 4 6 2 4 7 3 9 0 -4 16 4 0 0 40 0 54 • Variance = 54/9 = 6 • It is a measure of “spread”. • Notice that the larger the deviations (positive or negative) the larger the variance
  • 12. The standard deviation • It is defines as the square root of the variance • In the previous example • Variance = 6 • Standard deviation = Square root of the variance = Square root of 6 = 2.45
  • 13. Percentiles • The p-the percentile is a number such that at most p% of the measurements are below it and at most 100 – p percent of the data are above it. • Example, if in a certain data the 85th percentile is 340 means that 15% of the measurements in the data are above 340. It also means that 85% of the measurements are below 340 • Notice that the median is the 50th percentile
  • 14. For any data • At least 75% of the measurements differ from the mean less than twice the standard deviation. • At least 89% of the measurements differ from the mean less than three times the standard deviation. Note: This is a general property and it is called Tchebichev’s Rule: At least 1-1/k2 of the observation falls within k standard deviations from the mean. It is true for every dataset.
  • 15. Example of Tchebichev’s Rule Suppose that for a certain data is : • Mean = 20 • Standard deviation =3 Then: • A least 75% of the measurements are between 14 and 26 • At least 89% of the measurements are between 11 and 29
  • 16. Further Notes • When the Mean is greater than the Median the data distribution is skewed to the Right. • When the Median is greater than the Mean the data distribution is skewed to the Left. • When Mean and Median are very close to each other the data distribution is approximately symmetric.