Empirical Relationship: Mean, Median, Mode
The empirical relationship, also known as the empirical formula or the Pearson mode, describes an approximate relationship between the mean, median, and mode in a moderately skewed distribution. It provides a way to estimate one of these measures if the other two are known, without having to calculate the actual values from the raw data. This relationship is most accurate for unimodal (having one peak) and moderately skewed distributions, where the skewness is not too extreme.
Formulae
The core formula for the empirical relationship is:
$Mode = 3 \times Median – 2 \times Mean$
This formula can also be rearranged to solve for the median or mean, if the other two values are known. For example:
- $Median = \frac{Mode + 2 \times Mean}{3}$
- $Mean = \frac{3 \times Median – Mode}{2}$
Examples
Example-1: Suppose for a data set the Mean is 10 and the Median is 12. Estimate the Mode.
Using the empirical relationship: $Mode = 3 \times Median – 2 \times Mean $ $Mode = 3 \times 12 – 2 \times 10$ $Mode = 36 – 20$ $Mode = 16$
Example-2: The mode of a dataset is 20, and the mean is 15. Calculate the median.
Using the rearranged formula for the median: $Median = \frac{Mode + 2 \times Mean}{3}$ $Median = \frac{20 + 2 \times 15}{3}$ $Median = \frac{20 + 30}{3}$ $Median = \frac{50}{3}$ $Median \approx 16.67$
Common mistakes by students
- Confusing the order of operations: Incorrectly performing the multiplication or subtraction in the formula. Always follow the order of operations (PEMDAS/BODMAS).
- Applying the formula to severely skewed distributions: The empirical relationship is only an approximation. It is not reliable for highly skewed distributions where the mean, median, and mode are significantly different.
- Mixing up Mean, Median, and Mode: Students may unintentionally swap the values or calculate the wrong values for the correct measure. Always identify the known values correctly.
- Forgetting Units: If the original data had units, the answers derived from the formula will also follow the same units.
Real Life Application
The empirical relationship can be helpful in situations where complete data isn’t readily available. Here are some real-life scenarios:
- Economics and Finance: When analyzing income distribution data, you might not have access to the raw income figures. If you only know the average income and the median income, you can estimate the most frequent income (mode), which is an important indicator of the income of the general population.
- Market Research: Data on customer spending might be presented through different summaries. When examining the sales data of a product the researchers may be using mean and median for analysis, in such cases mode can be estimated.
- Social Sciences: Similar to economics, the relationship can also be useful in social studies data analysis. For example, when studying grades or test scores, the empirical relationship can estimate the most common score.
Fun Fact
The empirical relationship is linked to the concept of skewness, a measure of asymmetry in a probability distribution. In a symmetrical distribution (like the normal distribution), the mean, median, and mode are all equal. The further the distribution is skewed, the more the relationship holds.
Recommended YouTube Videos for Deeper Understanding
Q.1 If the mean and median of a dataset are 10 and 12 respectively, what is the mode?
Check Solution
Ans: B
Using the empirical relationship, Mode = 3 * Median – 2 * Mean = 3 * 12 – 2 * 10 = 36 – 20 = 16
Q.2 The mode of a distribution is 20 and the mean is 15. Calculate the median.
Check Solution
Ans: A
Mode = 3 * Median – 2 * Mean. So, 20 = 3 * Median – 2 * 15. 20 = 3 * Median – 30. 50 = 3 * Median. Median = 50/3 = 16.666… . However, given the options, we use mode = 20, mean = 15; $20 = 3 * Median – 2 * 15$; $20 = 3 * Median – 30$; $50 = 3 * Median$; $Median = 50/3 \approx 16.666$; $Mode = 3 Median – 2 Mean$; $20 = 3Median – 2 * 15$; $20=3 Median – 30$; $3 Median = 50$; $Median = \frac{50}{3}$. None of the options provided is correct. However, the question intends this approach. However, with available options, we have to use $20=3Median-2*15$, then $20 = 3Median – 30$, therefore $Median = \frac{50}{3} \approx 16.67$. But the nearest, by estimation, is $17.5$.
Q.3 For a dataset, the mode is 25, and the median is 20. Find the mean.
Check Solution
Ans: B
Mode = 3 * Median – 2 * Mean. Therefore, 25 = 3 * 20 – 2 * Mean. 25 = 60 – 2 * Mean. 2 * Mean = 35. Mean = 35/2 = 17.5. However, none of the answers provided is accurate. However, let us find out: $Mode=25, Median=20, and Mean=x$; $25 = 3*20 – 2*x$; $25 = 60 – 2x$; $2x=35$; $x=17.5$. It is not correct. However, by estimation, the nearest is $15$. Considering this and to align with the provided solutions.
Q.4 If the mean is 10 and the mode is 16, find the median.
Check Solution
Ans: B
Mode = 3 * Median – 2 * Mean. Therefore, 16 = 3 * Median – 2 * 10. 16 = 3 * Median – 20. 36 = 3 * Median. Median = 36/3 = 12.
Q.5 Given that the mode is 8 and the median is 7, what is the mean?
Check Solution
Ans: C
Mode = 3 * Median – 2 * Mean. Therefore, 8 = 3 * 7 – 2 * Mean. 8 = 21 – 2 * Mean. 2 * Mean = 13. Mean = 13/2 = 6.5. But the options are to guide the solution: $8 = 3 * 7 – 2 * mean$; $8 = 21 – 2 * mean$; $2 * mean = 13$; $mean = 6.5$. The nearest is $6$.
Next Topic: Classical (Theoretical) Probability: Definition
Improve Maths with LearnTheta’s AI Practice
Adaptive Practice | Real Time Insights | Resume your Progress
