Effect of Number of Categories and Category
  Boundaries on Recovery of Latent Linear
   Correlations from Optimally Weighted
              Categorical Data

                 Johnny Lin
            Advisor: Peter Bentler


             November 19, 2008
Outline


   Introduction
       LINEALS
       Forming a Hypothesis

   Method
      Description
      Simulation
      Analysis

   Results
      Main Effects
      Interactions
Introducing LINEALS
A Method of Optimal Scaling




    Algorithm
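    An iterative process that minimizes $\sum_{j=1}^{m} \sum_{l=1}^{m} (\eta_{jl}^2 - r_{jl}^2)$,
    where $\eta_{jl}^2$ is the correlation ratio and $r_{jl}^2$ the squared correlation,
    so each term is a measure of nonlinearity.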
    Developed by Jan de Leeuw and implemented by Patrick Mair.

    Assumption
    That bi-linearization is possible. No assumption of normality.
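
A minimal sketch of one term of the loss above, for a single pair of categorical variables with fixed category scores. It is not the LINEALS algorithm itself (which iterates over the scores); the function name `nonlinearity` and the two example tables are hypothetical, chosen only to show that the term is zero when the regression is linear and positive otherwise.

```python
def nonlinearity(table, x_scores, y_scores):
    """Return (eta2, r2) for the regression of y on x.

    table[a][b] is the count of cases in x-category a and y-category b.
    Assumes every x-category has at least one case.
    """
    n = sum(map(sum, table))
    row_tot = [sum(row) for row in table]
    col_tot = [sum(col) for col in zip(*table)]
    mean_x = sum(s * t for s, t in zip(x_scores, row_tot)) / n
    mean_y = sum(s * t for s, t in zip(y_scores, col_tot)) / n
    var_x = sum(t * (s - mean_x) ** 2 for s, t in zip(x_scores, row_tot)) / n
    var_y = sum(t * (s - mean_y) ** 2 for s, t in zip(y_scores, col_tot)) / n
    cov = sum(
        table[a][b] * (x_scores[a] - mean_x) * (y_scores[b] - mean_y)
        for a in range(len(x_scores))
        for b in range(len(y_scores))
    ) / n
    # Conditional mean of y within each x-category.
    cond_mean = [
        sum(c * s for c, s in zip(row, y_scores)) / t
        for row, t in zip(table, row_tot)
    ]
    # Between-category variance of the conditional means.
    between = sum(t * (m - mean_y) ** 2 for t, m in zip(row_tot, cond_mean)) / n
    return between / var_y, cov ** 2 / (var_x * var_y)


# Two categories per variable: the regression is trivially linear.
two_by_two = nonlinearity([[40, 10], [10, 40]], [0, 1], [0, 1])

# A U-shaped 3x3 table: conditional means do not lie on a line.
u_shaped = nonlinearity([[5, 5, 30], [30, 5, 5], [5, 5, 30]], [1, 2, 3], [1, 2, 3])

print("2x2 eta2 - r2:", two_by_two[0] - two_by_two[1])
print("U-shaped eta2 - r2:", u_shaped[0] - u_shaped[1])
```

The slide's criterion sums this difference over all ordered pairs of variables, so both the regression of X on Y and that of Y on X enter the loss.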
Plot of LINEALS Transformation
   Criterion: Linearize both X on Y and Y on X simultaneously.




                  Figure: Red: X on Y, Blue: Y on X
Questions to ask




   First, define good recovery as small deviation from true score.
    1. Does LINEALS recover true population correlations better
       than Pearson for categorical data?
    2. Is the performance of LINEALS robust?
    3. What factors influence good recovery?
Conditions tested



   Correlation Type, True Population Correlation, Number of
   Categories, and Homogeneity


    Condition                            Parameters
    1. Correlation Type (r)              {0=LINEALS, 1=Pearson}
    2. True Population Correlation (P)   {0.3, 0.5, 0.7, 0.9}
    3. Number of Categories (V)          {2, 3, 5, 7, 10}
    4. Homogeneity (h)                   {0=Non-Homogeneous, 1=Homogeneous}


       Total of 80 combinations (2x4x5x2).
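
The design grid can be enumerated directly. This is a small Python sketch (the talk itself used R); the dictionary name `conditions` and the list name `design` are illustrative, while the levels are those listed on the slide.

```python
from itertools import product

# Factor levels as listed on the slide.
conditions = {
    "r": [0, 1],                # 0 = LINEALS, 1 = Pearson
    "P": [0.3, 0.5, 0.7, 0.9],  # true population correlation
    "V": [2, 3, 5, 7, 10],      # number of categories
    "h": [0, 1],                # 0 = non-homogeneous, 1 = homogeneous
}

# One dict per cell of the fully crossed design.
design = [dict(zip(conditions, combo)) for combo in product(*conditions.values())]

print(len(design))  # 80
```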
Creating functions in R




   For each combination (total of 80):
    1. Generate 1000 sets of bivariate normal data.
    2. Make “cuts” (homogeneous vs. non-homogeneous).
    3. Run through LINEALS / Pearson.
    4. Calculate the deviation of the result from the true population correlation.
    5. Repeat Steps 1-4 twenty-five times.
   Result: Total of 2000 deviations (80x25).
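
The steps above can be sketched for a single design cell as follows. This is an illustrative Python version, not the author's R functions, and it covers only the Pearson arm (LINEALS is not reimplemented here). Several details are assumptions: "1000 sets" is read as 1000 observations, homogeneous cuts are taken to be equal-width intervals on [-3, 3], and the helper names are hypothetical.

```python
import math
import random


def pearson(xs, ys):
    """Plain Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def equal_width_cuts(n_categories, lo=-3.0, hi=3.0):
    """Assumed 'homogeneous' cuts: equal-width intervals on [lo, hi]."""
    step = (hi - lo) / n_categories
    return [lo + step * k for k in range(1, n_categories)]


def categorize(value, cuts):
    """Index of the interval that value falls into."""
    return sum(value > c for c in cuts)


def one_deviation(rho, n_categories, n_obs=1000, rng=random):
    """Steps 1-4 for one replication: simulate, cut, correlate, compare."""
    cuts = equal_width_cuts(n_categories)
    xs, ys = [], []
    for _ in range(n_obs):
        z1, z2 = rng.gauss(0, 1), rng.gauss(0, 1)
        # Bivariate normal pair with correlation rho.
        xs.append(categorize(z1, cuts))
        ys.append(categorize(rho * z1 + math.sqrt(1 - rho ** 2) * z2, cuts))
    return abs(rho) - abs(pearson(xs, ys))


# Step 5: twenty-five replications of one cell (P = 0.7, V = 2, Pearson).
rng = random.Random(2008)
deviations = [one_deviation(0.7, 2, rng=rng) for _ in range(25)]
print(sum(deviations) / len(deviations))
```

With only two categories the Pearson correlation of the categorized data is noticeably attenuated, so the deviations in this cell are positive.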
Hierarchical Regression
Description




          DV: deviation of sample correlation from true population
          correlation, $|\rho_{12}| - |\hat{\rho}_{12}|$
          IVs: main effects and interactions of the four conditions (total of
          15)
              Four main effects (h,r,P,V)
              Six 2-way interactions (hr, hP, hV, . . . )
              Four 3-way interactions (hrP, hrV, . . . )
              One 4-way interaction (hrPV)
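
The count of 15 predictors follows from taking every non-empty subset of the four factors. A short Python sketch (variable names are illustrative):

```python
from itertools import combinations

factors = ["h", "r", "P", "V"]

# terms[k] lists the k-way terms: 1 = main effects, 2 = two-way, and so on.
terms = {k: ["".join(c) for c in combinations(factors, k)] for k in range(1, 5)}

for k, names in terms.items():
    print(k, len(names), names)

total = sum(len(names) for names in terms.values())
print(total)  # 15
```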
Hierarchical Regression
Model Selection



          Tested full model against nested models.
          Confirmed with Best Subset Regression.
                  Optimal adjusted R² and Mallows's Cp found with 7-8 parameters.

          Figure: (a) Adjusted R², (b) Mallows's Cp
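
For reference, the two selection criteria named on this slide are conventionally defined as below. The notation is an assumption, since the slide does not spell it out: $n$ is the number of observations (2000 deviations here), $p$ the number of predictors in the candidate model, $\mathrm{SSE}_p$ its residual sum of squares, and $\hat{\sigma}^2$ the residual mean square of the full model.

```latex
\[
  R^2_{\mathrm{adj}} = 1 - (1 - R^2)\,\frac{n - 1}{n - p - 1},
  \qquad
  C_p = \frac{\mathrm{SSE}_p}{\hat{\sigma}^2} - n + 2(p + 1).
\]
```

A candidate model with little bias has $C_p$ close to its number of estimated coefficients, $p + 1$ including the intercept.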
Final Model
SPSS Output

                                 Coefficients (a)

                        Unstandardized        Standardized
                        Coefficients          Coefficients
      Model             B        Std. Error   Beta           t         Sig.
      1   (Constant)     .189    .006                         31.240   .000
          h             -.113    .012         -.620           -9.299   .000
          r              .007    .002          .041            3.054   .002
          V             -.024    .001         -.773          -40.558   .000
          P              .098    .008          .241           12.655   .000
          hV             .013    .002          .487            7.164   .000
          hP             .117    .018          .435            6.392   .000
          hPV           -.017    .003         -.422           -6.326   .000
      a. Dependent Variable: difference



    The coefficient on r implies that Pearson deviations exceed LINEALS
    deviations by .007, controlling for the other factors.
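
As a worked example, the fitted equation can be evaluated for any cell of the design. This Python sketch copies the unstandardized coefficients from the SPSS table and assumes the predictors were entered in their raw units (h and r as 0/1, V as the category count, P as the correlation), which the slides do not state; the function name is hypothetical.

```python
# Unstandardized coefficients (B) from the final model.
COEF = {
    "const": 0.189, "h": -0.113, "r": 0.007, "V": -0.024,
    "P": 0.098, "hV": 0.013, "hP": 0.117, "hPV": -0.017,
}


def predicted_deviation(h, r, V, P):
    """Predicted |rho| - |rho_hat| under the raw-units assumption."""
    return (
        COEF["const"]
        + COEF["h"] * h
        + COEF["r"] * r
        + COEF["V"] * V
        + COEF["P"] * P
        + COEF["hV"] * h * V
        + COEF["hP"] * h * P
        + COEF["hPV"] * h * P * V
    )


# Same cell, LINEALS (r = 0) versus Pearson (r = 1).
lineals = predicted_deviation(h=1, r=0, V=5, P=0.5)
pearson = predicted_deviation(h=1, r=1, V=5, P=0.5)
print(round(pearson - lineals, 3))  # 0.007
```

Because r enters the final model only as a main effect, the predicted gap between the two methods is the same .007 in every cell.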
Plot of Main Effects I




Figure: Main Effect of Population Correlation P
Figure: Main Effect of Number of Categories V
Plot of Main Effects II




Figure: Main Effect of Homogeneity h
Figure: Main Effect of Correlation Type r
Plot of Significant Interactions

    Note: The significant 3-way interaction hPV is not plotted.




Figure: Population Correlation by Levels of Homogeneity hP
Figure: Number of Categories by Levels of Homogeneity hV
Interaction of Correlation Type and Number of Categories
   When rV is added to the regression model, the main effect of
   Correlation Type r is no longer significant.
        Suggests that the number of categories may contribute to the LINEALS vs.
        Pearson difference.




    Figure: Number of Categories by Correlation Type (rV, marginally sig.)
Summary



   1. LINEALS performs slightly better than Pearson under
      bivariate normal categorizations.
   2. The non-significant interactions with Correlation Type suggest
      that LINEALS is robust.
   3. Recovery of true population correlations is highly influenced by
      homogeneity (i.e., the underlying equality of interval widths).


      Future Studies
          How does it compare against polychoric correlations?
          Is the resulting matrix positive definite?
