What is the correlation coefficient?

The correlation coefficient (r) is a measure that quantifies the strength and the direction of a linear statistical relationship between two variables. There are several different measures of correlation, depending on the type of data being evaluated. They all range between -1 and 1. A coefficient of 0 indicates that there is no linear correlation between the two variables, although a nonlinear relationship may still exist. Values between 0 and +1 indicate a positive correlation: as one variable increases, the other tends to increase as well. Values between 0 and -1, on the other hand, indicate a negative correlation: as one variable increases, the other tends to decrease. The threshold at which a correlation can be considered important is always a matter of context, although an absolute value of r of 0.8 or more is often regarded as a strong relationship. In general, the absolute value of the correlation coefficient increases with the degree of association between the two variables.
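To make the sign and the range of r concrete, here is a minimal sketch in Python. It assumes the Pearson product-moment formula, which is the usual choice for linear relationships; the function name pearson_r and the small data sets are hypothetical and only serve as an illustration.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    dev_x = [a - mean_x for a in x]
    dev_y = [b - mean_y for b in y]
    # Sum of cross-products and sums of squares of the deviations.
    s_xy = sum(a * b for a, b in zip(dev_x, dev_y))
    s_xx = sum(a * a for a in dev_x)
    s_yy = sum(b * b for b in dev_y)
    if s_xx == 0 or s_yy == 0:
        raise ValueError("r is undefined when one variable is constant")
    return s_xy / sqrt(s_xx * s_yy)


# Hypothetical illustration data, not taken from any study.
x = [1, 2, 3, 4, 5]
rising = [2, 4, 6, 8, 10]    # increases with x
falling = [10, 8, 6, 4, 2]   # decreases as x increases
u_shaped = [4, 1, 0, 1, 4]   # y = (x - 3)**2: a perfect but nonlinear relationship

print(f"rising:   r = {pearson_r(x, rising):.2f}")    # 1.00
print(f"falling:  r = {pearson_r(x, falling):.2f}")   # -1.00
print(f"u-shaped: r = {pearson_r(x, u_shaped):.2f}")  # 0.00
```

The last case shows why a coefficient of 0 should be read as "no linear correlation" rather than "no relationship at all".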

One of the most frequently used measures is the Spearman rank correlation coefficient. Its advantage is that it does not require any further assumptions about the distribution of the data, and it is applicable to data that is at least on an ordinal scale. Because it is calculated from ranks, it captures monotonic relationships in general, not only strictly linear ones. It cannot, however, differentiate between dependent and independent variables, and it cannot capture non-monotonic relationships between two variables, such as a U-shaped one.
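The idea behind the Spearman coefficient can be sketched in a few lines of Python: replace every value by its rank (tied values share the average of their ranks) and then compute the ordinary Pearson coefficient on the ranks. The function names and the data below are hypothetical, and this is an illustrative sketch rather than a validated implementation.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    s_yy = sum((b - mean_y) ** 2 for b in y)
    return s_xy / sqrt(s_xx * s_yy)


def average_ranks(values):
    """Ranks starting at 1; tied values receive the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rank correlation: the Pearson coefficient of the ranks."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Hypothetical illustration data.
x = [1, 2, 3, 4, 5]
cubic = [1, 8, 27, 64, 125]   # monotonic but not linear
u_shaped = [4, 1, 0, 1, 4]    # not monotonic

print(f"cubic:    Pearson = {pearson_r(x, cubic):.3f}, Spearman = {spearman_rho(x, cubic):.3f}")
print(f"u-shaped: Spearman = {spearman_rho(x, u_shaped):.3f}")
```

For the cubic data the rank-based coefficient reaches 1 while the Pearson coefficient stays below 1; for the U-shaped data the Spearman coefficient is 0 even though the two variables are clearly related.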

It is very important to remember that a correlation between two variables does not imply a causal relationship between them! A third variable could be involved that influences both of them. Correlations of this kind are called spurious correlations. An experimental setup can show cause and effect (causation), but a correlation coefficient can only indicate an association. Studies tend to overinterpret correlation coefficients, drawing causal conclusions when only correlational evidence was shown. Furthermore, only a statistical test can show whether a correlation is statistically significant.
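As an illustration of what such a significance test can look like, here is a minimal sketch of a permutation test in Python. This is only one of several possible tests and is our choice for the example; the function names, the number of permutations, the seed and the data are all hypothetical assumptions. The idea: if there were no association, shuffling one variable should produce correlations as extreme as the observed one fairly often.

```python
import random
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    s_yy = sum((b - mean_y) ** 2 for b in y)
    return s_xy / sqrt(s_xx * s_yy)


def permutation_p_value(x, y, n_permutations=2000, seed=0):
    """Two-sided permutation p-value for the null hypothesis of no association."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    observed = abs(pearson_r(x, y))
    shuffled = list(y)
    at_least_as_extreme = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)  # break any pairing between x and y
        # Small tolerance guards against floating-point rounding.
        if abs(pearson_r(x, shuffled)) >= observed - 1e-12:
            at_least_as_extreme += 1
    # Add-one correction so the estimated p-value is never exactly 0.
    return (at_least_as_extreme + 1) / (n_permutations + 1)


# Hypothetical, nearly linear illustration data.
x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [2.1, 3.9, 6.2, 7.8, 10.1, 12.2, 13.8, 16.1]

print(f"r = {pearson_r(x, y):.3f}")
print(f"permutation p-value = {permutation_p_value(x, y):.4f}")
```

Note that even a very small p-value only speaks against "no association". It says nothing about whether one variable causes the other.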

Another interesting aspect of the correlation coefficient is that it can be used to evaluate how much of the variance of one variable can be explained by the other variable. For this we use the square of the correlation coefficient (r²), also called the coefficient of determination. For example, a correlation coefficient of 0.3 indicates that 9% (0.3² = 0.09) of the overall variance can be explained from a statistical perspective, while the other 91% remains unexplained.
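The link between r² and "explained variance" can be checked numerically. The following Python sketch assumes a simple least-squares line fitted to hypothetical data and compares the squared Pearson coefficient with the share of the variance of y that the line explains; the function name and the data are made up for illustration.

```python
from math import sqrt


def r_and_explained_share(x, y):
    """Return (r, share of the variance of y explained by a least-squares line in x)."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    s_yy = sum((b - mean_y) ** 2 for b in y)  # total sum of squares of y
    r = s_xy / sqrt(s_xx * s_yy)
    # Least-squares line y = intercept + slope * x.
    slope = s_xy / s_xx
    intercept = mean_y - slope * mean_x
    # Residual sum of squares: the part of the variation the line does not explain.
    ss_res = sum((b - (intercept + slope * a)) ** 2 for a, b in zip(x, y))
    return r, 1 - ss_res / s_yy


# Hypothetical illustration data.
x = [1, 2, 3, 4, 5, 6]
y = [2, 1, 4, 3, 6, 5]

r, share = r_and_explained_share(x, y)
print(f"r = {r:.3f}, r squared = {r * r:.3f}, explained share = {share:.3f}")

# The example from the text: r = 0.3 corresponds to 9% explained variance.
print(f"{0.3 ** 2:.0%}")
```

For a straight-line fit the two quantities agree, which is why the squared correlation coefficient can be read as the proportion of explained variance.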

If you are interested in a visual presentation of this topic, please feel free to follow us on our GCP Mindset YouTube channel! If you would like to know more about how we could implement statistics in your clinical trial, send us an email at statistics@gcp-service.com.
