Wager Mage
Photo by Michael Burrows Pexels Logo Photo: Michael Burrows

What is the most accurate measure of spread?

The interquartile range (IQR) is the difference between the upper (Q3) and lower (Q1) quartiles, and describes the middle 50% of values when ordered from lowest to highest. The IQR is often seen as a better measure of spread than the range as it is not affected by outliers.

How do I pay back my signing bonus?
How do I pay back my signing bonus?

If the signing bonus is repaid the same year as it was received, the employee need only pay the net amount. The employer can then receive the state...

Read More »
What is America's lucky number?
What is America's lucky number?

In the US, as well as in many Western cultures, the numbers 3, 7 and 12 are considered lucky. Dec 23, 2021

Read More »

Measures of Spread

What are measures of spread?

Measures of spread describe how similar or varied the set of observed values are for a particular variable (data item). Measures of spread include the range, quartiles and the interquartile range, variance and standard deviation.

When can we measure spread?

The spread of the values can be measured for quantitative data, as the variables are numeric and can be arranged into a logical order with a low end value and a high end value.

Why do we measure spread?

Summarising the dataset can help us understand the data, especially when the dataset is large. As discussed in the Measures of Central Tendency page, the mode, median, and mean summarise the data into a single value that is typical or representative of all the values in the dataset, but this is only part of the 'picture' that summarises a dataset. Measures of spread summarise the data in a way that shows how scattered the values are and how much they differ from the mean value.

For example:

Dataset A Dataset B 4, 5, 5, 5, 6, 6, 6, 6, 7, 7, 7, 8 1, 2, 3, 4, 5, 6, 6, 7, 8, 9, 10, 11 The mode (most frequent value), median (middle value*) and mean (arithmetic average) of both datasets is 6. (*note, the median of an even numbered data set is calculated by taking the mean of the middle two observations). If we just looked at the measures of central tendency, we may assume that the datasets are the same. However, if we look at the spread of the values in the following graph, we can see that Dataset B is more dispersed than Dataset A. Used together, the measures of central tendency and measures of spread help us to better understand the data

What does each measure of spread tell us?

The range is the difference between the smallest value and the largest value in a dataset.

Calculating the Range

Dataset A 4, 5, 5, 5, 6, 6, 6, 6, 7, 7, 7, 8 The range is 4, the difference between the highest value (8 ) and the lowest value (4). Dataset B 1, 2, 3, 4, 5, 6, 6, 7, 8, 9, 10, 11 The range is 10, the difference between the highest value (11 ) and the lowest value (1). Dataset A 0 1 2 3 4 5 6 7 8 9 10 11 12 13 Dataset B 0 1 2 3 4 5 6 7 8 9 10 11 12 13 On a number line, you can see that the range of values for Dataset B is larger than Dataset A. Quartiles divide an ordered dataset into four equal parts, and refer to the values of the point between the quarters. A dataset may also be divided into quintiles (five equal parts) or deciles (ten equal parts). Quartiles 25% of values Q1 25% of values Q2 25% of values Q3 25% of values

Is paid VPN better than free?
Is paid VPN better than free?

The only strength of a free VPN is that it's free. People typically don't expect high-end service when they get something without paying. But if...

Read More »
Should you hit on a 12 or 13 in blackjack?
Should you hit on a 12 or 13 in blackjack?

Blackjack: When to hit Ultimately, the game's main aim is to beat the dealer's hand. While it's not advised, some players choose to hit when they...

Read More »

The lower quartile (Q1) is the point between the lowest 25% of values and the highest 75% of values. It is also called the 25th percentile . The second quartile (Q2) is the middle of the data set. It is also called the 50th percentile , or the median . The upper quartile (Q3) is the point between the lowest 75% and highest 25% of values. It is also called the 75th percentile .

Calculating Quartiles

Dataset A 4 5 5 Q1 5 6 6 Q2 6 6 7 Q3 7 7 8 As the quartile point falls between two values, the mean (average) of those values is the quartile value:

Q1 = (5+5) / 2 = 5

Q2 = (6+6) / 2 = 6

Q3 = (7+7) / 2 = 7

Dataset B 1 2 3 Q1 4 5 6 Q2 6 7 8 Q3 9 10 11 As the quartile point falls between two values, the mean (average) of those values is the quartile value:

Q1 = (3+4) / 2 = 3.5

Q2 = (6+6) / 2 = 6

Q3 = (8+9) / 2 = 8.5 The interquartile range (IQR) is the difference between the upper (Q3) and lower (Q1) quartiles, and describes the middle 50% of values when ordered from lowest to highest. The IQR is often seen as a better measure of spread than the range as it is not affected by outliers . Interquartile Range 25% of values Q1 25% of values Q2 25% of values Q3 25% of values

Calculating the Interquartile Range

The IQR for Dataset A is = 2

IQR = Q3 - Q1

= 7 - 5

= 2

The IQR for Dataset B is = 5

IQR = Q3 - Q1

= 8.5 - 3.5

= 5

The variance and the standard deviation are measures of the spread of the data around the mean. They summarise how close each observed data value is to the mean value. In datasets with a small spread all values are very close to the mean, resulting in a small variance and standard deviation. Where a dataset is more dispersed, values are spread further away from the mean, leading to a larger variance and standard deviation. The smaller the variance and standard deviation, the more the mean value is indicative of the whole dataset. Therefore, if all values of a dataset are the same, the standard deviation and variance are zero. The standard deviation of a normal distribution enables us to calculate confidence intervals. In a normal distribution, about 68% of the values are within one standard deviation either side of the mean and about 95% of the scores are within two standard deviations of the mean. The population Variance σ 2 (pronounced sigma squared ) of a discrete set of numbers is expressed by the following formula:

where:

Is 13 8 good odds?
Is 13 8 good odds?

The 13-8 betting odds probability is a 61.90 per cent probability of a particular outcome and 38.10 per cent probability of another outcome. The...

Read More »
How much does a maxed out Vigilante sell for?
How much does a maxed out Vigilante sell for?

Can you sell the Vigilante in GTA Online? Yes, you can sell the Vigilante at MOC / Avenger Workshop for a resale price of $2,250,000 (60% of the...

Read More »

X i represents the ith unit, starting from the first observation to the last

μ represents the population mean

N represents the number of units in the population

The Variance of a sample s 2 (pronounced s squared ) is expressed by a slightly different formula:

where:

x i represents the ith unit, starting from the first observation to the last

x̅ represents the sample mean

n represents the number of units in the sample

The standard deviation is the square root of the variance. The standard deviation for a population is represented by σ , and the standard deviation for a sample is represented by s. Calculating the Population Variance σ 2 and Standard Deviation σ Dataset A Calculate the population mean ( μ ) of Dataset A. (4 + 5 + 5 + 5 + 6 + 6 + 6 + 6 + 7 + 7 + 7 + 8) / 12

mean ( μ ) = 6

Calculate the deviation of the individual values from the mean by subtracting the mean from each value in the dataset = -2, -1, -1, -1, 0, 0, 0, 0, 1, 1, 1, 2

Square each individual deviation value

= 4, 1, 1, 1, 0, 0, 0, 0, 1,1,1, 4

Calculate the mean of the squared deviation values

=

(4 + 1 +1 +1 + 0 + 0 + 0 + 0 +1 +1 +1 + 4) / 12

Variance σ 2 = 1.17

Calculate the square root of the variance

Standard deviation σ = 1.08 Dataset B Calculate the population mean ( μ ) of Dataset B. (1 + 2 + 3 + 4 + 5 + 6 + 6 + 7 + 8 + 9 + 10 + 11) / 12

mean ( μ ) = 6

Calculate the deviation of the individual values from the mean by subtracting the mean from each value in the dataset = -5, -4, -3, -2, -1, 0, 0, 1, 2, 3, 4, 5,

Square each individual deviation value

= 25, 16, 9, 4, 1, 0, 0, 1, 4, 9, 16, 25

Calculate the mean of the squared deviation values

=

(25 + 16 + 9 + 4 + 1 + 0 + 0 + 1 + 4 + 9 + 16 + 25) / 12

Variance σ 2 = 9.17

Calculate the square root of the variance

Standard deviation σ = 3.03

The larger Variance and Standard Deviation in Dataset B further demonstrates that Dataset B is more dispersed than Dataset A.

Return to Statistical Language Homepage

Further information:

External links:

easycalculation.com - Standard Deviation calculator

easycalculation.com - Five Number Summary calculator

Is being a sports bookie legal?
Is being a sports bookie legal?

No. Sports betting remains illegal in both retail and online contexts in California. Though Californians had two different ballot options to...

Read More »
How do I know what team to bet on?
How do I know what team to bet on?

When oddsmakers release a betting line on a game, the first thing they do is decide which team should be the favorite and which should be the...

Read More »
Which app contains all TV channels?
Which app contains all TV channels?

YuppTV is one of the well-known Android apps, it is also available on Android TV. The TV app comes with a dedicated Live TV section. Live TV...

Read More »
Does rugby pay more than soccer?
Does rugby pay more than soccer?

The lowest earning players earn R240 000 per year all the way to R2 million per year. There are 30 players who earn a minimum of R35 000 per month...

Read More »