Which of the following relationships represent a positive correlation between two variables Quizlet

Home>AP statistics>This page

Table of Contents Show

How to Interpret a Correlation Coefficient
Scatterplots and Correlation Coefficients
How to Calculate a Correlation Coefficient
Test Your Understanding
Frequently Asked Questions

Correlation coefficients measure the strength of association between two variables. The most common correlation coefficient, called the Pearson product-moment correlation coefficient, measures the strength of the linear association between variables measured on an interval or ratio scale.

Note: Your browser does not support HTML5 video. If you view this web page on a different browser (e.g., a recent version of Edge, Chrome, Firefox, or Opera), you can watch a video treatment of this lesson.

In this tutorial, when we speak simply of a correlation coefficient, we are referring to the Pearson product-moment correlation. Generally, the correlation coefficient of a sample is denoted by r, and the correlation coefficient of a population is denoted by ρ or R.

How to Interpret a Correlation Coefficient

The sign and the absolute value of a correlation coefficient describe the direction and the magnitude of the relationship between two variables.

A negative correlation means that if one variable gets bigger, the other variable tends to get smaller.

Keep in mind that the Pearson product-moment correlation coefficient only measures linear relationships. Therefore, a correlation of 0 does not mean zero relationship between two variables; rather, it means zero linear relationship. (It is possible for two variables to have zero linear relationship and a strong curvilinear relationship at the same time.)

Scatterplots and Correlation Coefficients

The scatterplots below show how different patterns of data produce different degrees of correlation.

Maximum positive correlation
(r = 1.0)

Strong positive correlation
(r = 0.80)

Zero correlation
(r = 0)

Maximum negative correlation
(r = -1.0)

Moderate negative correlation
(r = -0.43)

Strong correlation & outlier
(r = 0.71)

Several points are evident from the scatterplots.

How to Calculate a Correlation Coefficient

If you look in different statistics textbooks, you are likely to find different-looking (but equivalent) formulas for computing a correlation coefficient. In this section, we present several formulas that you may encounter.

The most common formula for computing a product-moment correlation coefficient (r) is given below.

Product-moment correlation coefficient. The correlation r between two variables is:

r = Σ (xy) / sqrt [ ( Σ x2 ) * ( Σ y2 ) ]

where Σ is the summation symbol, x = xi - x, xi is the x value for observation i, x is the mean x value, y = yi - y, yi is the y value for observation i, and y is the mean y value.

The formula below uses population means and population standard deviations to compute a population correlation coefficient (ρ) from population data.

Population correlation coefficient. The correlation ρ between two variables is:

ρ = [ 1 / N ] * Σ { [ (Xi - μX) / σx ]
* [ (Yi - μY) / σy ] }

where N is the number of observations in the population, Σ is the summation symbol, Xi is the X value for observation i, μX is the population mean for variable X, Yi is the Y value for observation i, μY is the population mean for variable Y, σx is the population standard deviation of X, and σy is the population standard deviation of Y.

The formula below uses sample means and sample standard deviations to compute a sample correlation coefficient (r) from sample data.

Sample correlation coefficient. The correlation r between two variables is:

r = [ 1 / (n - 1) ] * Σ { [ (xi - x) / sx ]
* [ (yi - y) / sy ] }

where n is the number of observations in the sample, Σ is the summation symbol, xi is the x value for observation i, x is the sample mean of x, yi is the y value for observation i, y is the sample mean of y, sx is the sample standard deviation of x, and sy is the sample standard deviation of y.

The interpretation of the sample correlation coefficient depends on how the sample data are collected. With a large simple random sample, the sample correlation coefficient is an unbiased estimate of the population correlation coefficient.

Each of the latter two formulas can be derived from the first formula. Use the first or second formula when you have data from the entire population. Use the third formula when you only have sample data, but want to estimate the correlation in the population. When in doubt, use the first formula.

Fortunately, you will rarely have to compute a correlation coefficient by hand. Many software packages (e.g., Excel) and most graphing calculators have a correlation function that will do the job for you.

Test Your Understanding

Problem 1

A national consumer magazine reported the following correlations.

The correlation between car weight and car reliability is -0.30.
The correlation between car weight and annual maintenance cost is 0.20.

Which of the following statements are true?

I. Heavier cars tend to be less reliable.II. Heavier cars tend to cost more to maintain.

III. Car weight is related more strongly to reliability than to maintenance cost.

(A) I only(B) II only(C) III only(D) I and II only

(E) I, II, and III

Solution

The correct answer is (E). The correlation between car weight and reliability is negative. This means that reliability tends to decrease as car weight increases. The correlation between car weight and maintenance cost is positive. This means that maintenance costs tend to increase as car weight increases.

The strength of a relationship between two variables is indicated by the absolute value of the correlation coefficient. The correlation between car weight and reliability has an absolute value of 0.30. The correlation between car weight and maintenance cost has an absolute value of 0.20. Therefore, the relationship between car weight and reliability is stronger than the relationship between car weight and maintenance cost.

If you would like to cite this web page, you can use the following text:

Berman H.B., "Correlation Coefficient", [online] Available at: https://stattrek.com/statistics/correlation URL [Accessed Date: 7/16/2022].

A correlation coefficient, often expressed as r, indicates a measure of the direction and strength of a relationship between two variables. When the r value is closer to +1 or -1, it indicates that there is a stronger linear relationship between the two variables.

Correlational studies are quite common in psychology, particularly because some things are impossible to recreate or research in a lab setting. Instead of performing an experiment, researchers may collect data to look at possible relationships between variables. From the data they collect and its analysis, researchers then make inferences and predictions about the nature of the relationships between variables.

A correlation is a statistical measurement of the relationship between two variables. Remember this handy rule: The closer the correlation is to 0, the weaker it is. The closer it is to +/-1, the stronger it is.

Correlation strength ranges from -1 to +1.

A correlation of +1 indicates a perfect positive correlation, meaning that both variables move in the same direction together.

A correlation of –1 indicates a perfect negative correlation, meaning that as one variable goes up, the other goes down.

A zero correlation suggests that the correlation statistic does not indicate a relationship between the two variables. This does not mean that there is no relationship at all; it simply means that there is not a linear relationship. A zero correlation is often indicated using the abbreviation r = 0.

Scattergrams (also called scatter charts, scatter plots, and scatter diagrams) are used to plot variables on a chart to observe the associations or relationships between them. The horizontal axis represents one variable, and the vertical axis represents the other.

Scatter Plot diagram.

Each point on the plot is a different measurement. From those measurements, a trend line can be calculated. The correlation coefficient is the slope of that line. When the correlation is weak (r is close to zero), the line is hard to distinguish. When the correlation is strong (r is close to 1), the line will be more apparent.

Correlations can be confusing, and many people equate positive with strong and negative with weak. A relationship between two variables can be negative, but that doesn't mean that the relationship isn't strong.

A weak positive correlation indicates that, although both variables tend to go up in response to one another, the relationship is not very strong. A strong negative correlation, on the other hand, indicates a strong connection between the two variables, but that one goes up whenever the other one goes down.

For example, a correlation of -0.97 is a strong negative correlation, whereas a correlation of 0.10 indicates a weak positive correlation. A correlation of +0.10 is weaker than -0.74, and a correlation of -0.98 is stronger than +0.79.

Correlation does not equal causation. Just because two variables have a relationship does not mean that changes in one variable cause changes in the other. Correlations tell us that there is a relationship between variables, but this does not necessarily mean that one variable causes the other to change.

An oft-cited example is the correlation between ice cream consumption and homicide rates. Studies have found a correlation between increased ice cream sales and spikes in homicides. However, eating ice cream does not cause you to commit murder. Instead, there is a third variable: heat. Both variables increase during summertime.

An illusory correlation is the perception of a relationship between two variables when only a minor relationship—or none at all—actually exists. An illusory correlation does not always mean inferring causation; it can also mean inferring a relationship between two variables when one does not exist.

For example, people sometimes assume that, because two events occurred together at one point in the past, one event must be the cause of the other. These illusory correlations can occur both in scientific investigations and in real-world situations.

Stereotypes are a good example of illusory correlations. Research has shown that people tend to assume that certain groups and traits occur together and frequently overestimate the strength of the association between the two variables.

For example, suppose a man holds the mistaken belief that all people from small towns are extremely kind. When the individual meets a very kind person, his immediate assumption might be that the person is from a small town, despite the fact that kindness is not related to city population.

Psychology research makes frequent use of correlations, but it's important to understand that correlation is not the same as causation. This is a frequent assumption among those not familiar with statistics and assumes a cause-effect relationship that might not exist.

Frequently Asked Questions

How do you find the correlation coefficient?

You can calculate the correlation coefficient in a few different ways, with the same result. The general formula is rXY=COVXY/(SX SY), which is the covariance between the two variables, divided by the product of their standard deviations:
How do you calculate a correlation coefficient in Excel?

In the cell in which you want the correlation coefficient to appear, enter =CORREL(A2:A7,B2:B7), where A2:A7 and B2:B7 are the variable lists to compare. Press Enter.
How do you find a linear correlation coefficient?

Finding the linear correlation coefficient requires a long, difficult calculation, so most people use a calculator or software such as Excel or a statistics program.
How do you interpret a correlation coefficient?

Correlations range from -1.00 to +1.00. The correlation coefficient (expressed as r ) shows the direction and strength of a relationship between two variables. The closer the r value is to +1 or -1, the stronger the linear relationship between the two variables is.
What is the difference between correlation and causation?

Correlations indicate a relationship between two variables, but one doesn't necessarily cause the other to change.