SlideShare uma empresa Scribd logo
1 de 20
CORRELATION
  -Rank Correlation
Correlation
Definition:
        In a distribution if the change in one variable
  effects a change in the other variable,
  the variable are said to be correlated(or there
  is a correlation between the variables)

        Let X and Y measure some characteristics
   of a particular system .To study the overall
   measure of the system it is necessary to
   measure the interdependence of X and Y.
Correlation
        If the quantities(X,Y) vary in such a way that
   change in one variable corresponds to
   change in the other variable then the variables
   X and Y are correlated.
Types of Correlation:
        The important ways of classifying the
correlation are:
1.      Positive and Negative
2.      Simple , Partial and Multiple
3.      Linear and non-Linear.
Correlation
Methods of Studying Correlation:
      The following are the important methods of
  ascertaining between two variables.
     Scatter diagram method
     Karl Pearson’s Co-efficient
     Spearman’s Rank Correlation Co-Efficient

• Scatter Diagram Method: The simplest device for
studying correlation between two variables is a special
   type of dot chart.
Correlation
Karl Pearson’s Co-Efficient of Correlation:
       r = (∑ xy) / (N σxσ y )
       where
              x
       x = (x-¯)
               y
       y= (y- ¯)
        σx =Standard Deviation of Series X.
        σ y =Standard Deviation of Series Y.
       r= The correlation Coefficient
       N=Number of Pairs of Observation.
Demerits
•        By applying Scatter diagram method we can get
an idea about the direction of correlation and also
whether it is high or low. But we cannot establish the
exact degree of correlation between the variables as it is
possible by applying mathematical methods.
• The Karl Pearson’s method is based on the
assumption that the population being studied is normally
distributed. When it is known that the population is not
normal or when he shape the distribution is not known,
there is need for a measure of correlation that involves
no assumption about the parameter of the population.
So Why Rank Correlation????
         It is possible to avoid making any assumptions
about the populations being studied by ranking the
observations according to size and basing the
calculations on the ranks rather than upon the original
observations. It does not matter which way the items are
ranked, item number one may be the largest or it may be
the smallest. Using ranks rather than actual observations
gives the coefficient of rank correlations.
         This method of finding out co variability or the
lack of it between two variables was developed by the
British Psychologist Charles Edward Spearman in 1904.
Ranking
      A ranking is a relationship between a set of items
 such that, for any two items, the first is either 'ranked
 higher than', 'ranked lower than' or 'ranked equal to'
 the second. In mathematics, this is known as a weak
 order or total preorder of objects. It is not necessarily
 a total order of objects because two different objects
 can have the same ranking. The rankings themselves
 are totally ordered. For example, materials are totally
 preordered by hardness, while degrees of hardness
 are totally ordered.
Rank Correlation
         “Rank correlation” is the study of relationships
between different rankings on the same set of items. It
deals with measuring correspondence between two
rankings, and assessing the significance of this
correspondence. Spearman’s correlation coefficient is
defined as:
                  r = 1-((6∑D2)/(N(N-1)2))
Where r , denotes rank coefficient of correlation and D
refers to the difference of rank relation between paired I
tems in two series.
Features
The rank method has principal uses:

• The sum of the differences between two variables is
  zero.

• Spearman’s rank correlation coefficient ρ is the
  Pearsonian correlation coefficient between the ranks.

• The rank correlation can be interpreted in the same
  way as Karl Pearson’s correlation coefficient.
Features
• Karl Pearson correlation coefficient assumes that the
sample observations are drawn from a normal
population. Rank correlation coefficient is a distribution-
free measure since no strict assumption is made about
the population from which it is drawn.

• The values obtained for two formulae are different due
to the fact that when ranking is used some information
is hidden.
• Spearman’s formula is the only formulae available to
find the correlation between qualitative characters.
Types of Rank Methods
 In the rank correlation we may have two types of
   problems:
• Where ranks are given
• Where ranks are not given
• Where repeated ranks occur

Note:
  If r = 1 then there is a perfect Positive correlation
  If r = 0 then the variables are uncorrelated
  If r=-1 then there is a perfect Negative Correlation
Where ranks are given :
Where actual ranks are given to us the steps required for

computing rank correlation are:

• Take the difference of the two ranks, i.e., (R1-R2)
and denotes these differences by D.
• Square these difference and obtain the total
• Apply the formula
Where ranks are not given

• When we are given the actual data and not the
ranks, it will be necessary to assign the ranks. Ranks
can be assigned by taking either highest values as 1 or
the lowest value as 1. But whether we start with the
lowest value or the highest value we must follow the
same method in case of both the variables.
Steps To Find RC
• Step 1:
  – Draw the table like
• Step 2:
  – Fill the data field with the given data
• Step 3:
     • Give the Rank for the data
• Step 4
  – Find the difference d & d2
• Step 5:
  – Apply the formula:




                     r=



            Where d= difference, n=no.of data
• THANK YOU………

Mais conteúdo relacionado

Mais procurados

Correlation coefficient
Correlation coefficientCorrelation coefficient
Correlation coefficient
Carlo Magno
 

Mais procurados (20)

Tabular and Graphical Representation of Data
Tabular and Graphical Representation of Data Tabular and Graphical Representation of Data
Tabular and Graphical Representation of Data
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
F test and ANOVA
F test and ANOVAF test and ANOVA
F test and ANOVA
 
Spearman’s rank correlation
Spearman’s rank correlation Spearman’s rank correlation
Spearman’s rank correlation
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Correlation IN STATISTICS
Correlation IN STATISTICSCorrelation IN STATISTICS
Correlation IN STATISTICS
 
Correlation and regression analysis
Correlation and regression analysisCorrelation and regression analysis
Correlation and regression analysis
 
Spearman's Rank order Correlation
Spearman's Rank order CorrelationSpearman's Rank order Correlation
Spearman's Rank order Correlation
 
Spearman Rank Correlation - Thiyagu
Spearman Rank Correlation - ThiyaguSpearman Rank Correlation - Thiyagu
Spearman Rank Correlation - Thiyagu
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Pearson's correlation
Pearson's  correlationPearson's  correlation
Pearson's correlation
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Correlation
CorrelationCorrelation
Correlation
 
Factor analysis
Factor analysis Factor analysis
Factor analysis
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distribution
 
Correlation coefficient
Correlation coefficientCorrelation coefficient
Correlation coefficient
 
Rank correlation
Rank correlationRank correlation
Rank correlation
 
Karl pearson's correlation
Karl pearson's correlationKarl pearson's correlation
Karl pearson's correlation
 
Correlation Analysis
Correlation AnalysisCorrelation Analysis
Correlation Analysis
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 

Destaque (16)

Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Binomial and Poission Probablity distribution
Binomial and Poission Probablity distributionBinomial and Poission Probablity distribution
Binomial and Poission Probablity distribution
 
Correlation continued
Correlation continuedCorrelation continued
Correlation continued
 
Time Series
Time SeriesTime Series
Time Series
 
Linear correlation
Linear correlationLinear correlation
Linear correlation
 
correlation_and_covariance
correlation_and_covariancecorrelation_and_covariance
correlation_and_covariance
 
Consumer Price Index
Consumer Price IndexConsumer Price Index
Consumer Price Index
 
Stat
StatStat
Stat
 
time series analysis
time series analysistime series analysis
time series analysis
 
Correlation and Simple Regression
Correlation  and Simple RegressionCorrelation  and Simple Regression
Correlation and Simple Regression
 
Student t-test
Student t-testStudent t-test
Student t-test
 
Tests of significance
Tests of significanceTests of significance
Tests of significance
 
Correlation ppt...
Correlation ppt...Correlation ppt...
Correlation ppt...
 
Basic Statistical Concepts and Methods
Basic Statistical Concepts and MethodsBasic Statistical Concepts and Methods
Basic Statistical Concepts and Methods
 
Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis ppt
 

Semelhante a Correlation

Semelhante a Correlation (20)

Statistics ppt
Statistics pptStatistics ppt
Statistics ppt
 
Correlation
CorrelationCorrelation
Correlation
 
Correlation.pdf
Correlation.pdfCorrelation.pdf
Correlation.pdf
 
Study of Correlation
Study of Correlation Study of Correlation
Study of Correlation
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Power point presentationCORRELATION.pptx
Power point presentationCORRELATION.pptxPower point presentationCORRELATION.pptx
Power point presentationCORRELATION.pptx
 
Correlation - Biostatistics
Correlation - BiostatisticsCorrelation - Biostatistics
Correlation - Biostatistics
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
Correlationanalysis
CorrelationanalysisCorrelationanalysis
Correlationanalysis
 
Correlation and Regression.pptx
Correlation and Regression.pptxCorrelation and Regression.pptx
Correlation and Regression.pptx
 
Correlation
CorrelationCorrelation
Correlation
 
Association between-variables
Association between-variablesAssociation between-variables
Association between-variables
 
Correlation
CorrelationCorrelation
Correlation
 
Correlation
CorrelationCorrelation
Correlation
 
Biostatistics - Correlation explanation.pptx
Biostatistics - Correlation explanation.pptxBiostatistics - Correlation explanation.pptx
Biostatistics - Correlation explanation.pptx
 
Correlation (1)
Correlation (1)Correlation (1)
Correlation (1)
 
Association between-variables
Association between-variablesAssociation between-variables
Association between-variables
 
13943056.ppt
13943056.ppt13943056.ppt
13943056.ppt
 
Correlation analysis in Biostatistics .pptx
Correlation analysis in Biostatistics .pptxCorrelation analysis in Biostatistics .pptx
Correlation analysis in Biostatistics .pptx
 
12943625.ppt
12943625.ppt12943625.ppt
12943625.ppt
 

Mais de Tech_MX

Virtual base class
Virtual base classVirtual base class
Virtual base class
Tech_MX
 
Theory of estimation
Theory of estimationTheory of estimation
Theory of estimation
Tech_MX
 
Templates in C++
Templates in C++Templates in C++
Templates in C++
Tech_MX
 
String & its application
String & its applicationString & its application
String & its application
Tech_MX
 
Statistical quality__control_2
Statistical  quality__control_2Statistical  quality__control_2
Statistical quality__control_2
Tech_MX
 
Stack data structure
Stack data structureStack data structure
Stack data structure
Tech_MX
 
Stack Data Structure & It's Application
Stack Data Structure & It's Application Stack Data Structure & It's Application
Stack Data Structure & It's Application
Tech_MX
 
Spanning trees & applications
Spanning trees & applicationsSpanning trees & applications
Spanning trees & applications
Tech_MX
 
Set data structure 2
Set data structure 2Set data structure 2
Set data structure 2
Tech_MX
 
Set data structure
Set data structure Set data structure
Set data structure
Tech_MX
 
Real time Operating System
Real time Operating SystemReal time Operating System
Real time Operating System
Tech_MX
 
Mouse interrupts (Assembly Language & C)
Mouse interrupts (Assembly Language & C)Mouse interrupts (Assembly Language & C)
Mouse interrupts (Assembly Language & C)
Tech_MX
 
Motherboard of a pc
Motherboard of a pcMotherboard of a pc
Motherboard of a pc
Tech_MX
 
More on Lex
More on LexMore on Lex
More on Lex
Tech_MX
 
MultiMedia dbms
MultiMedia dbmsMultiMedia dbms
MultiMedia dbms
Tech_MX
 
Merging files (Data Structure)
Merging files (Data Structure)Merging files (Data Structure)
Merging files (Data Structure)
Tech_MX
 
Memory dbms
Memory dbmsMemory dbms
Memory dbms
Tech_MX
 

Mais de Tech_MX (20)

Virtual base class
Virtual base classVirtual base class
Virtual base class
 
Uid
UidUid
Uid
 
Theory of estimation
Theory of estimationTheory of estimation
Theory of estimation
 
Templates in C++
Templates in C++Templates in C++
Templates in C++
 
String & its application
String & its applicationString & its application
String & its application
 
Statistical quality__control_2
Statistical  quality__control_2Statistical  quality__control_2
Statistical quality__control_2
 
Stack data structure
Stack data structureStack data structure
Stack data structure
 
Stack Data Structure & It's Application
Stack Data Structure & It's Application Stack Data Structure & It's Application
Stack Data Structure & It's Application
 
Spss
SpssSpss
Spss
 
Spanning trees & applications
Spanning trees & applicationsSpanning trees & applications
Spanning trees & applications
 
Set data structure 2
Set data structure 2Set data structure 2
Set data structure 2
 
Set data structure
Set data structure Set data structure
Set data structure
 
Real time Operating System
Real time Operating SystemReal time Operating System
Real time Operating System
 
Parsing
ParsingParsing
Parsing
 
Mouse interrupts (Assembly Language & C)
Mouse interrupts (Assembly Language & C)Mouse interrupts (Assembly Language & C)
Mouse interrupts (Assembly Language & C)
 
Motherboard of a pc
Motherboard of a pcMotherboard of a pc
Motherboard of a pc
 
More on Lex
More on LexMore on Lex
More on Lex
 
MultiMedia dbms
MultiMedia dbmsMultiMedia dbms
MultiMedia dbms
 
Merging files (Data Structure)
Merging files (Data Structure)Merging files (Data Structure)
Merging files (Data Structure)
 
Memory dbms
Memory dbmsMemory dbms
Memory dbms
 

Último

Último (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 

Correlation

  • 1. CORRELATION -Rank Correlation
  • 2. Correlation Definition: In a distribution if the change in one variable effects a change in the other variable, the variable are said to be correlated(or there is a correlation between the variables) Let X and Y measure some characteristics of a particular system .To study the overall measure of the system it is necessary to measure the interdependence of X and Y.
  • 3. Correlation If the quantities(X,Y) vary in such a way that change in one variable corresponds to change in the other variable then the variables X and Y are correlated. Types of Correlation: The important ways of classifying the correlation are: 1. Positive and Negative 2. Simple , Partial and Multiple 3. Linear and non-Linear.
  • 4. Correlation Methods of Studying Correlation: The following are the important methods of ascertaining between two variables.  Scatter diagram method  Karl Pearson’s Co-efficient  Spearman’s Rank Correlation Co-Efficient • Scatter Diagram Method: The simplest device for studying correlation between two variables is a special type of dot chart.
  • 5. Correlation Karl Pearson’s Co-Efficient of Correlation: r = (∑ xy) / (N σxσ y ) where x x = (x-¯) y y= (y- ¯) σx =Standard Deviation of Series X. σ y =Standard Deviation of Series Y. r= The correlation Coefficient N=Number of Pairs of Observation.
  • 6. Demerits • By applying Scatter diagram method we can get an idea about the direction of correlation and also whether it is high or low. But we cannot establish the exact degree of correlation between the variables as it is possible by applying mathematical methods. • The Karl Pearson’s method is based on the assumption that the population being studied is normally distributed. When it is known that the population is not normal or when he shape the distribution is not known, there is need for a measure of correlation that involves no assumption about the parameter of the population.
  • 7. So Why Rank Correlation???? It is possible to avoid making any assumptions about the populations being studied by ranking the observations according to size and basing the calculations on the ranks rather than upon the original observations. It does not matter which way the items are ranked, item number one may be the largest or it may be the smallest. Using ranks rather than actual observations gives the coefficient of rank correlations. This method of finding out co variability or the lack of it between two variables was developed by the British Psychologist Charles Edward Spearman in 1904.
  • 8. Ranking A ranking is a relationship between a set of items such that, for any two items, the first is either 'ranked higher than', 'ranked lower than' or 'ranked equal to' the second. In mathematics, this is known as a weak order or total preorder of objects. It is not necessarily a total order of objects because two different objects can have the same ranking. The rankings themselves are totally ordered. For example, materials are totally preordered by hardness, while degrees of hardness are totally ordered.
  • 9. Rank Correlation “Rank correlation” is the study of relationships between different rankings on the same set of items. It deals with measuring correspondence between two rankings, and assessing the significance of this correspondence. Spearman’s correlation coefficient is defined as: r = 1-((6∑D2)/(N(N-1)2)) Where r , denotes rank coefficient of correlation and D refers to the difference of rank relation between paired I tems in two series.
  • 10. Features The rank method has principal uses: • The sum of the differences between two variables is zero. • Spearman’s rank correlation coefficient ρ is the Pearsonian correlation coefficient between the ranks. • The rank correlation can be interpreted in the same way as Karl Pearson’s correlation coefficient.
  • 11. Features • Karl Pearson correlation coefficient assumes that the sample observations are drawn from a normal population. Rank correlation coefficient is a distribution- free measure since no strict assumption is made about the population from which it is drawn. • The values obtained for two formulae are different due to the fact that when ranking is used some information is hidden. • Spearman’s formula is the only formulae available to find the correlation between qualitative characters.
  • 12. Types of Rank Methods In the rank correlation we may have two types of problems: • Where ranks are given • Where ranks are not given • Where repeated ranks occur Note: If r = 1 then there is a perfect Positive correlation If r = 0 then the variables are uncorrelated If r=-1 then there is a perfect Negative Correlation
  • 13. Where ranks are given : Where actual ranks are given to us the steps required for computing rank correlation are: • Take the difference of the two ranks, i.e., (R1-R2) and denotes these differences by D. • Square these difference and obtain the total • Apply the formula
  • 14. Where ranks are not given • When we are given the actual data and not the ranks, it will be necessary to assign the ranks. Ranks can be assigned by taking either highest values as 1 or the lowest value as 1. But whether we start with the lowest value or the highest value we must follow the same method in case of both the variables.
  • 15. Steps To Find RC • Step 1: – Draw the table like
  • 16. • Step 2: – Fill the data field with the given data
  • 17. • Step 3: • Give the Rank for the data
  • 18. • Step 4 – Find the difference d & d2
  • 19. • Step 5: – Apply the formula: r= Where d= difference, n=no.of data