What is the scatter, dispersion and shape of a distribution?

When presenting the data descriptively, the distribution of single variables can include the following parameters: Scatter parameters such as the mean, median, and mode; dispersion parameters, which are the range, quartiles, variance, and standard deviation; and to define the shape of the distribution also skewness and kurtosis.

The mean is calculated by summing all occurring values and dividing the sum by the number of. The median is simply the value that is in the middle of the sequence of values, ordered by size. The mode on the other hand is the value which occurs most often in the data. So, these three aspects give us a good impression of the average participant, but we do not know yet how the data is spread. For that, we can look at the dispersion parameters.

The range of a distribution states between which minimum and maximum the values in our sample group are distributed. The quartiles, referred to as Q1, Q2 and Q3, are the values below which 25%, 50% or 75% of the data is found, respectively. And a measure for the diversity in a sample is the variance, which indicates how far from the mean value the data is dispersed on average. But usually, what is used when presenting the data, is the standard deviation, which is simply the square root of the variance. It tells you, how large the dispersion of the individual observation from the mean is. With all this information, we have some idea of the spread of the data.

The skewness of a distribution curve gives an indication of how similar the curve looks like to the left and right from the mean value. If there are more data points below the mean than above, then the distribution is shifted towards the left. The right tail of the distribution curve would conversely be longer, which is why it is then called right-skewed or right-tailed. A skew occurs when there are extreme values on one end of the curve impacting the location of the mean. In these cases, describing the data using the median may be better than using the mean.

The kurtosis describes how closely a distribution resembles a normal distribution. Each value being observed has a certain probability that can be summarized using a curve. If we have a deviation of zero, the shape of the distribution equals a normal distribution. If the deviation is below zero, it means the distribution shows fewer and less extreme values than the normal distribution. The curve may seem to appear flatter at the extremes. Conversely, a deviation of more than zero means that the curve has more pronounced tails, and therefore has more extreme values than the normal distribution.

If you are interested in a visual presentation of this topic, please feel free to follow us on GCP Mindset YouTube channel! If you would like to know more about how we could implement statistics in your clinical trial, send us a mail at at statistics@gcp-service.com.

More To Explore

General

Challenges in Document Management for Clinical Trials

Document management in clinical trials is a critical component in the quest for new and improved medical treatments. It involves the organization, storage, retrieval, and

22. May 2024

General

The Impact of Feasibility on Clinical Trial Outcomes

Clinical trials are the cornerstone of medical advancements, providing the crucial link between laboratory research and real-world application. These trials test the safety and efficacy

14. May 2024

Manage Cookie Consent

To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behaviour or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.

Functional Functional Always active

The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.

Preferences Preferences

The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.

Statistics Statistics

The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.

Marketing Marketing

The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.

Manage options Manage services Manage {vendor_count} vendors Read more about these purposes

View preferences

{title} {title} {title}