Menu Top
Latest Economics NCERT Notes, Solutions and Extra Q & A (Class 9th to 12th)
9th 10th 11th 12th

Class 11th Chapters
Indian Economic Development
1. The Indian Economy On The Eve Of Independence 2. Indian Economy 1950-1990 3. Liberalisation, Privatisation And Globalisation: An Appraisal
4. Human Capital Formation In India 5. Rural Development 6. Employment: Growth, Informalisation And Other Issues
7. Environment And Sustainable Development 8. Comparative Development Experiences Of India And Its Neighbours
Statistics For Economics
1. Introduction 2. Collection Of Data 3. Organisation Of Data
4. Presentation Of Data 5. Measures Of Central Tendency 6. Correlation
7. Index Numbers 8. Use Of Statistical Tools



Chapter 6 Correlation



This chapter explores Correlation, a statistical tool used to measure and analyse the relationship between two variables. It helps determine if a change in one variable is associated with a change in another, and it quantifies the direction and strength of this linear relationship. The chapter explains the difference between Positive Correlation, where both variables move in the same direction (e.g., income and consumption), and Negative Correlation, where they move in opposite directions (e.g., price and quantity demanded). A crucial point emphasized is that correlation measures association, not causation; it shows that two variables move together, but does not prove that one causes the other.

Three main techniques for measuring correlation are discussed. The Scatter Diagram provides a visual representation of the relationship, allowing for a quick assessment of its nature and strength. For a precise numerical measure, Karl Pearson’s Coefficient of Correlation (r) is used, which ranges from -1 (perfect negative correlation) to +1 (perfect positive correlation), with 0 indicating no linear relationship. For qualitative data or data containing extreme values, Spearman’s Rank Correlation ($r_k$) is a more suitable alternative, as it measures the association between the ranks of the data rather than their actual values.

Introduction to Correlation

In previous chapters, you've learned how to summarise a single dataset. Now, we will explore how to examine the relationship between two variables. This is the essence of correlation analysis.

In our daily lives, we often observe relationships between different phenomena. For example:

Correlation analysis provides a systematic means to examine such relationships. It helps answer questions like:


What Does Correlation Measure?

Correlation studies and measures the direction and intensity of the relationship between variables. It's crucial to understand that correlation measures covariation, not causation. The presence of a correlation between two variables, X and Y, simply means that when X changes, Y also changes in a definite way. It does not mean that a change in X causes a change in Y.

For instance, a brisk sale of ice-creams might be correlated with a higher number of deaths by drowning. This doesn't mean eating ice cream causes drowning. Instead, a third variable—rising temperature—leads to both increased ice-cream sales and more people going swimming, which in turn might increase the number of drowning incidents.

For simplicity, we will assume that the correlation, if it exists, is linear, meaning the relationship can be represented by a straight line on a graph.



Types of Correlation

Correlation is commonly classified based on the direction of the relationship between the two variables.

1. Positive Correlation

The correlation is said to be positive when the two variables move together in the same direction. If one variable increases, the other variable also increases. If one decreases, the other also decreases.

Examples:


2. Negative Correlation

The correlation is negative when the two variables move in opposite directions. If one variable increases, the other variable decreases, and vice versa.

Examples:


3. No Correlation

When there is no discernible relationship between the movements of two variables, it is a case of no correlation. Changes in one variable do not lead to any definite change in the other.

Example: The relationship between the size of your shoes and the amount of money in your pocket is likely a coincidence with no real correlation.



Techniques for Measuring Correlation

There are three important tools used to study and measure correlation:

  1. Scatter Diagrams: A graphical method to visually assess the nature of the relationship.
  2. Karl Pearson’s Coefficient of Correlation: A numerical measure of the linear relationship between two variables.
  3. Spearman’s Rank Correlation: A measure of the linear association between the ranks of variables, particularly useful for qualitative data.

Scatter Diagram

A scatter diagram is a graphical technique where the values of two variables are plotted as points on a graph paper. This visual representation provides a good idea of the nature and strength of the relationship without calculating any numerical value.

The pattern of the scattered points reveals the relationship:

A collection of six scatter diagrams showing different types of correlation: positive, negative, no correlation, perfect positive, perfect negative, and non-linear.


Karl Pearson’s Coefficient of Correlation

Also known as the product-moment correlation coefficient, this is a precise numerical measure of the degree of linear relationship between two variables, X and Y. It is denoted by 'r'. It is advisable to first examine a scatter diagram to ensure the relationship is linear before calculating this coefficient.

Formula for Karl Pearson’s Coefficient (r)

The correlation coefficient is the covariance of X and Y divided by the product of their standard deviations.

$ r = \frac{Cov(X,Y)}{\sigma_x \sigma_y} $

Where:

The formula can be expressed in various forms for calculation:

$ r = \frac{\sum xy}{\sqrt{\sum x^2 \sum y^2}} \quad \text{where } x = X - \bar{X} \text{ and } y = Y - \bar{Y} $

Or, for direct calculation from raw data:

$ r = \frac{N \sum XY - (\sum X)(\sum Y)}{\sqrt{[N \sum X^2 - (\sum X)^2][N \sum Y^2 - (\sum Y)^2]}} $


Properties of the Correlation Coefficient (r)



Spearman’s Rank Correlation

Developed by the British psychologist C.E. Spearman, this method measures the linear association between the ranks assigned to individual items, not their actual values. It is denoted by $r_k$.

When to Use Rank Correlation

Spearman’s rank correlation is particularly useful in the following situations:

  1. When the data is qualitative and cannot be measured numerically but can be ranked. For example, ranking individuals based on attributes like honesty, beauty, or intelligence.
  2. When the relationship between variables is clearly non-linear, but the direction of the relationship is consistent.
  3. When the data contains extreme values (outliers), as rank correlation is not affected by them, unlike Pearson’s correlation.

Formula for Spearman’s Rank Correlation ($r_k$)

The formula for calculating the rank correlation coefficient is:

$ r_k = 1 - \frac{6 \sum D^2}{n(n^2 - 1)} $

Where:

Calculation of Rank Correlation

The process involves three main scenarios:

  1. When Ranks are Given: Simply calculate the differences (D) between the ranks for each pair, square them ($D^2$), sum them up ($\sum D^2$), and apply the formula.
  2. When Ranks are Not Given: First, assign ranks to the values of each variable separately. The highest value is typically given rank 1, the next highest rank 2, and so on. Then, proceed as above.
  3. When Ranks are Repeated (Tied Ranks): If two or more items have the same value, a common rank is assigned to them. This common rank is the average of the ranks they would have occupied if they were slightly different. For example, if two items are tied for the 3rd and 4th positions, both are assigned the rank $ (3+4)/2 = 3.5 $.

When ranks are repeated, a correction factor is added to the $\sum D^2$ term in the formula:

$ r_k = 1 - \frac{6 \left( \sum D^2 + \frac{m_1(m_1^2 - 1)}{12} + \frac{m_2(m_2^2 - 1)}{12} + \dots \right)}{n(n^2 - 1)} $

Where $m_1, m_2, \dots$ are the number of times each rank is repeated.



NCERT Questions Solution



Question 1. The unit of correlation coefficient between height in feet and weight in kgs is

(i) kg/feet

(ii) percentage

(iii) non-existent

Answer:

Question 2. The range of simple correlation coefficient is

(i) 0 to infinity

(ii) minus one to plus one

(iii) minus infinity to infinity

Answer:

Question 3. If rxy is positive the relation between X and Y is of the type

(i) When Y increases X increases

(ii) When Y decreases X increases

(iii) When Y increases X does not change

Answer:

Question 4. If rxy = 0 the variable X and Y are

(i) linearly related

(ii) not linearly related

(iii) independent

Answer:

Question 5. Of the following three measures which can measure any type of relationship

(i) Karl Pearson’s coefficient of correlation

(ii) Spearman’s rank correlation

(iii) Scatter diagram

Answer:

Question 6. If precisely measured data are available the simple correlation coefficient is

(i) more accurate than rank correlation coefficient

(ii) less accurate than rank correlation coefficient

(iii) as accurate as the rank correlation coefficient

Answer:

Question 7. Why is r preferred to covariance as a measure of association?

Answer:

Question 8. Can r lie outside the –1 and 1 range depending on the type of data?

Answer:

Question 9. Does correlation imply causation?

Answer:

Question 10. When is rank correlation more precise than simple correlation coefficient?

Answer:

Question 11. Does zero correlation mean independence?

Answer:

Question 12. Can simple correlation coefficient measure any type of relationship?

Answer:

Question 13. Collect the price of five vegetables from your local market every day for a week. Calculate their correlation coefficients. Interpret the result.

Answer:

Question 14. Measure the height of your classmates. Ask them the height of their benchmate. Calculate the correlation coefficient of these two variables. Interpret the result.

Answer:

Question 15. List some variables where accurate measurement is difficult.

Answer:

Question 16. Interpret the values of r as 1, –1 and 0.

Answer:

Question 17. Why does rank correlation coefficient differ from Pearsonian correlation coefficient?

Answer:

Question 18. Calculate the correlation coefficient between the heights of fathers in inches (X) and their sons (Y)

X 65 66 57 67 68 69 70 72
Y 67 56 65 68 72 72 69 71

Answer:

Question 19. Calculate the correlation coefficient between X and Y and comment on their relationship:

X –3 –2 –1 1 2 3
Y 9 4 1 1 4 9

Answer:

Question 20. Calculate the correlation coefficient between X and Y and comment on their relationship

X 1 3 4 5 7 8
Y 2 6 8 10 14 16

Answer:



Extra Q & A



Multiple Choice Questions



Short Answer Type Questions



Long Answer Type Questions