SlideShare uma empresa Scribd logo
1 de 22
Baixar para ler offline
Stat310            Sequences of rvs


                            Hadley Wickham
Wednesday, 17 March 2010
Major’s day
                   2:30-4:30pm Today
                   Oshman Engineering Design Kitchen


                   Come along and talk to me (or Rudy
                   Guerra) if you’re interested in becoming a
                   stat major


Wednesday, 17 March 2010
Assessment

                   Test model answers online tonight
                   (hopefully)
                   Usual help session tonight 4-5pm.




Wednesday, 17 March 2010
1. Sequences
               2. Limits
               3. Chebyshev’s theorem
               4. The law of large numbers
               5. The central limit theorem



Wednesday, 17 March 2010
Sequences

                   1 variable: X
                   2 variables: X, Y
                   ...
                   n variables: X1, X2, X3, ..., Xn




Wednesday, 17 March 2010
Sequences
                   Xi ~ Normal(μi, σi)
                   Xi ~ Normal(μ, σi)
                   Xi ~ Normal(μi, σ)
                   Xi ~ Normal(μ, σ)
                   Almost always assume that the Xi’s are
                   independent. In the last case they are
                   also identically distributed.


Wednesday, 17 March 2010
iid = independent &
                identically distributed


Wednesday, 17 March 2010
Your turn

                   Xi are iid N(0, 2).
                   What is E(X30)? What is Var(X2001)?
                   What is Cor(X10, X11)? Cor(X1, X1000)?




Wednesday, 17 March 2010
n
                                               n
                                                
               E(                   Xi ) =           E(Xi )
                               i                 i
                           n
                                                n
                                                 
         V ar(                      ai Xi ) =         2
                                                     ai V   ar(Xi )
                               i                 i
                                                     If what is true?
                       n
                                           n
                                            
          E(                       Xi ) =        E(Xi )
                           i                 i        If what is true?
Wednesday, 17 March 2010
Limits
                   Typically will define some function of n
                                           ¯
                   random variables, e.g. Xn
                                   ¯
                   What happens to Xn when n → ∞?
                   Why? Because often it will converge, and
                   we can use this to approximate results for
                   any large n.



Wednesday, 17 March 2010
New notation

                   If xn → 0, and n is big, we can say xn ≈ 0.
                   If Xn → Z, Z ~ N(0, 1), and n is big,
                   we can say Xn ~ . N(0,1).

                   Read as approximately distributed.
                   Other ways to write it



Wednesday, 17 March 2010
N
                                       go

                                          o
                                        od
                                             lim art
                           Chebyshev




                                                it ing
                                            st

                                                  -b p
                                                     ut oin
                                                       a t
                                1
         P (|X − µ|  Kσ) ≥ 1 − 2
                               K
                             1
         P (|X − µ|  Kσ) ≤ 2
                            K
                                        For K  0
Wednesday, 17 March 2010
Your turn

                   How can you put this in words?
                                      1
                   P (|X − µ|  Kσ) ≤ 2
                                     K


Wednesday, 17 March 2010
The probability of being more
                               than K standard deviations
        80                     away from the mean is less
                               than one over K squared.
        60
                               (For K  0)
 1 K2




        40




        20




                 0         2     4           6     8       10
                                     K
Wednesday, 17 March 2010
(For K  1)
        1.0




        0.8




        0.6
 1 K2




        0.4




        0.2




        0.0

                           2   4       6   8           10
                                   K
Wednesday, 17 March 2010
Your turn

                   How does this compare to the normal
                   distribution? Compare the probability of
                   being less than 1, 2 and 3 standard
                   deviations away from the mean given by
                   Chebychev and what we know about the
                   normal.



Wednesday, 17 March 2010
1.0




         0.8




         0.6

                                                    variable
 value




                                                        cheby
                                                        norm
         0.4




         0.2




         0.0

                           2   4       6   8   10
                                   x
Wednesday, 17 March 2010
LLN
                   Law of large numbers
                   X1, X2, ..., Xn iid.

                           n
                           
                  ¯
                  Xn =          Xi
                            i


                   There are five ways to write the result.


Wednesday, 17 March 2010
What does it mean?
                   As we collect more and more data, the
                   sample mean gets closer and closer to
                   the true mean.
                   Not that surprising!
                   But note that we didn’t make any
                   assumptions about the distributions



Wednesday, 17 March 2010
CLT

                   Central limit theorem.
                   The distribution of a mean is normal when
                   gets big.




Wednesday, 17 March 2010
Approximation


                   This implies that if n is big then ...




Wednesday, 17 March 2010
Reading


                Section 4.1
                Focus on the general ideas and the
                defintions




Wednesday, 17 March 2010

Mais conteúdo relacionado

Destaque

Model Visualisation (with ggplot2)
Model Visualisation (with ggplot2)Model Visualisation (with ggplot2)
Model Visualisation (with ggplot2)
Hadley Wickham
 
Introducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rIntroducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with r
Vivian S. Zhang
 

Destaque (20)

20 date-times
20 date-times20 date-times
20 date-times
 
04 Wrapup
04 Wrapup04 Wrapup
04 Wrapup
 
Correlations, Trends, and Outliers in ggplot2
Correlations, Trends, and Outliers in ggplot2Correlations, Trends, and Outliers in ggplot2
Correlations, Trends, and Outliers in ggplot2
 
24 modelling
24 modelling24 modelling
24 modelling
 
21 spam
21 spam21 spam
21 spam
 
03 Conditional
03 Conditional03 Conditional
03 Conditional
 
Model Visualisation (with ggplot2)
Model Visualisation (with ggplot2)Model Visualisation (with ggplot2)
Model Visualisation (with ggplot2)
 
Graphical inference
Graphical inferenceGraphical inference
Graphical inference
 
03 Modelling
03 Modelling03 Modelling
03 Modelling
 
R workshop iii -- 3 hours to learn ggplot2 series
R workshop iii -- 3 hours to learn ggplot2 seriesR workshop iii -- 3 hours to learn ggplot2 series
R workshop iii -- 3 hours to learn ggplot2 series
 
23 data-structures
23 data-structures23 data-structures
23 data-structures
 
R packages
R packagesR packages
R packages
 
02 Ddply
02 Ddply02 Ddply
02 Ddply
 
01 Intro
01 Intro01 Intro
01 Intro
 
Reshaping Data in R
Reshaping Data in RReshaping Data in R
Reshaping Data in R
 
Machine learning in R
Machine learning in RMachine learning in R
Machine learning in R
 
4 R Tutorial DPLYR Apply Function
4 R Tutorial DPLYR Apply Function4 R Tutorial DPLYR Apply Function
4 R Tutorial DPLYR Apply Function
 
Data manipulation with dplyr
Data manipulation with dplyrData manipulation with dplyr
Data manipulation with dplyr
 
Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)
 
Introducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rIntroducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with r
 

Mais de Hadley Wickham (20)

27 development
27 development27 development
27 development
 
27 development
27 development27 development
27 development
 
22 spam
22 spam22 spam
22 spam
 
19 tables
19 tables19 tables
19 tables
 
18 cleaning
18 cleaning18 cleaning
18 cleaning
 
17 polishing
17 polishing17 polishing
17 polishing
 
16 critique
16 critique16 critique
16 critique
 
15 time-space
15 time-space15 time-space
15 time-space
 
14 case-study
14 case-study14 case-study
14 case-study
 
13 case-study
13 case-study13 case-study
13 case-study
 
12 adv-manip
12 adv-manip12 adv-manip
12 adv-manip
 
11 adv-manip
11 adv-manip11 adv-manip
11 adv-manip
 
11 adv-manip
11 adv-manip11 adv-manip
11 adv-manip
 
10 simulation
10 simulation10 simulation
10 simulation
 
10 simulation
10 simulation10 simulation
10 simulation
 
09 bootstrapping
09 bootstrapping09 bootstrapping
09 bootstrapping
 
08 functions
08 functions08 functions
08 functions
 
07 problem-solving
07 problem-solving07 problem-solving
07 problem-solving
 
06 data
06 data06 data
06 data
 
05 subsetting
05 subsetting05 subsetting
05 subsetting
 

Último

Último (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

16 Sequences

  • 1. Stat310 Sequences of rvs Hadley Wickham Wednesday, 17 March 2010
  • 2. Major’s day 2:30-4:30pm Today Oshman Engineering Design Kitchen Come along and talk to me (or Rudy Guerra) if you’re interested in becoming a stat major Wednesday, 17 March 2010
  • 3. Assessment Test model answers online tonight (hopefully) Usual help session tonight 4-5pm. Wednesday, 17 March 2010
  • 4. 1. Sequences 2. Limits 3. Chebyshev’s theorem 4. The law of large numbers 5. The central limit theorem Wednesday, 17 March 2010
  • 5. Sequences 1 variable: X 2 variables: X, Y ... n variables: X1, X2, X3, ..., Xn Wednesday, 17 March 2010
  • 6. Sequences Xi ~ Normal(μi, σi) Xi ~ Normal(μ, σi) Xi ~ Normal(μi, σ) Xi ~ Normal(μ, σ) Almost always assume that the Xi’s are independent. In the last case they are also identically distributed. Wednesday, 17 March 2010
  • 7. iid = independent & identically distributed Wednesday, 17 March 2010
  • 8. Your turn Xi are iid N(0, 2). What is E(X30)? What is Var(X2001)? What is Cor(X10, X11)? Cor(X1, X1000)? Wednesday, 17 March 2010
  • 9. n n E( Xi ) = E(Xi ) i i n n V ar( ai Xi ) = 2 ai V ar(Xi ) i i If what is true? n n E( Xi ) = E(Xi ) i i If what is true? Wednesday, 17 March 2010
  • 10. Limits Typically will define some function of n ¯ random variables, e.g. Xn ¯ What happens to Xn when n → ∞? Why? Because often it will converge, and we can use this to approximate results for any large n. Wednesday, 17 March 2010
  • 11. New notation If xn → 0, and n is big, we can say xn ≈ 0. If Xn → Z, Z ~ N(0, 1), and n is big, we can say Xn ~ . N(0,1). Read as approximately distributed. Other ways to write it Wednesday, 17 March 2010
  • 12. N go o od lim art Chebyshev it ing st -b p ut oin a t 1 P (|X − µ| Kσ) ≥ 1 − 2 K 1 P (|X − µ| Kσ) ≤ 2 K For K 0 Wednesday, 17 March 2010
  • 13. Your turn How can you put this in words? 1 P (|X − µ| Kσ) ≤ 2 K Wednesday, 17 March 2010
  • 14. The probability of being more than K standard deviations 80 away from the mean is less than one over K squared. 60 (For K 0) 1 K2 40 20 0 2 4 6 8 10 K Wednesday, 17 March 2010
  • 15. (For K 1) 1.0 0.8 0.6 1 K2 0.4 0.2 0.0 2 4 6 8 10 K Wednesday, 17 March 2010
  • 16. Your turn How does this compare to the normal distribution? Compare the probability of being less than 1, 2 and 3 standard deviations away from the mean given by Chebychev and what we know about the normal. Wednesday, 17 March 2010
  • 17. 1.0 0.8 0.6 variable value cheby norm 0.4 0.2 0.0 2 4 6 8 10 x Wednesday, 17 March 2010
  • 18. LLN Law of large numbers X1, X2, ..., Xn iid. n ¯ Xn = Xi i There are five ways to write the result. Wednesday, 17 March 2010
  • 19. What does it mean? As we collect more and more data, the sample mean gets closer and closer to the true mean. Not that surprising! But note that we didn’t make any assumptions about the distributions Wednesday, 17 March 2010
  • 20. CLT Central limit theorem. The distribution of a mean is normal when gets big. Wednesday, 17 March 2010
  • 21. Approximation This implies that if n is big then ... Wednesday, 17 March 2010
  • 22. Reading Section 4.1 Focus on the general ideas and the defintions Wednesday, 17 March 2010