Grade 9 → Statistics ↓
Measures of Central Tendency
Measures of central tendency are statistics that describe the center or average of a dataset. In simple terms, they tell us where most of the values in a dataset are concentrated. Generally, we discuss three main measures of central tendency: mean, median, and mode.
Meaning
The mean is what most people commonly refer to as the "average." It is calculated by adding up all the numbers in a dataset and then dividing by the number of values in that dataset.
Formula of mean
Mean = (Sum of all values) / (Total number of values)
Consider the dataset: 3, 5, 7, 9, 11.
To find the mean:
Total = 3 + 5 + 7 + 9 + 11 = 35 Number of values = 5 Mean = 35 / 5 = 7
In the above figure, each blue circle represents a data point and the green circle represents the position of the mean (average).
Median
When the numbers in a data set are arranged in order, the median is the middle value, either from lowest to highest or highest to lowest. If the number of values is odd, the median is the middle number. If the number of values is even, the median is the average of the two middle numbers.
Example 1 (odd number of values): Consider the dataset: 5, 3, 8, 1, 7.
First, order the dataset: 1, 3, 5, 7, 8.
The median value is the third number:
Median = 5
Example 2 (even number of values): Consider the dataset: 22, 15, 30, 17.
First, sort the dataset: 15, 17, 22, 30.
The median is the average of the two middle numbers, 17 and 22:
Median = (17 + 22) / 2 = 19.5
In the above figure, the red circle represents the median, which shows its central position in the ordered data set.
Mode
The mode is the number that appears most often in the dataset. A dataset can have one mode, more than one mode, or no mode if no numbers are repeated.
Consider the dataset: 4, 1, 2, 4, 3, 4, 5.
In this dataset, the number 4 appears most often:
Mode = 4
For the dataset: 6, 2, 6, 3, 5, 5, 7:
The numbers 6 and 5 both appear twice:
Mode = 6, 5 (bimodal)
The red circles indicate the modes of sample datasets with more than one mode.
Comparison of mean, median and mode
Each measure of central tendency provides different insights and works better in different situations.
- Mean: Best for datasets without outliers (extreme values) as it considers all values.
- Median: Useful for skewed datasets or when outliers are present because it indicates the middle of the dataset.
- Mode: Valued for determining the most common value, especially in categorical data.
Working example
Let's compare the mean, median, and mode with a more complex example:
Consider the dataset: 2, 3, 5, 7, 10, 3, 9, 2, 3, 11.
First, order the dataset: 2, 2, 3, 3, 3, 5, 7, 9, 10, 11.
Mean:
Total = 2 + 2 + 3 + 3 + 3 + 5 + 7 + 9 + 10 + 11 = 55 Total number of values = 10 Mean = 55 / 10 = 5.5
Median:
Median = (5 + 7) / 2 = 6
Mode:
Mode = 3 (appears most frequently)
So, for this example:
The mean is 5.5, the median is 6, and the mode is 3.
Conclusion
Measures of central tendency are key components of descriptive statistics. By understanding the difference between the mean, median, and mode, we can better analyze datasets to find patterns and make predictions. Practice with different datasets to see how these calculations can differ and what they reveal about the data.