Statistics — Mean, Median, Mode for Grouped Data (Class 9–10)

Mean by direct, assumed mean and step-deviation methods. Median using ogive. Mode formula. NCERT Class 10 Statistics with solved examples.

CBSE-9 CBSE-10 15 min read Classes 9, 10

What Statistics Is Actually About

Most students treat this chapter as “apply formula, get answer.” That works for board exams, but you’ll understand it much better if you see the real picture first.

Statistics is about summarising a large dataset into a single representative number. When we have 30 students’ marks and the principal asks “how did the class do?”, we can’t list all 30 numbers. We pick one number that represents the group — that’s what mean, median, and mode do.

The catch is that Class 10 NCERT moves from individual data (Class 9) to grouped data — data organised into class intervals like 10–20, 20–30, etc. This requires different formulas, and that’s where most students lose marks.

We’ll cover all three methods for mean, both types of median calculation (formula + ogive), the mode formula, and the exam patterns for CBSE Class 9 and 10.


Key Terms and Definitions

Class Interval: A range of values grouped together. In 10–20, the lower boundary is 10 and upper is 20.

Class Width (h): The difference between upper and lower limits. For 10–20, h=10h = 10.

Midpoint / Class Mark (xix_i): The middle value of a class interval.

xi=lower limit+upper limit2x_i = \frac{\text{lower limit} + \text{upper limit}}{2}

Frequency (fif_i): Number of observations falling in that class.

Cumulative Frequency: Running total of frequencies. If frequencies are 5, 8, 12, then cumulative frequencies are 5, 13, 25.

Assumed Mean (aa): A value we choose to simplify calculations — always the midpoint of the middle class interval.

Deviation (did_i): di=xiad_i = x_i - a

Step Deviation (uiu_i): ui=xiahu_i = \dfrac{x_i - a}{h}


Mean of Grouped Data

There are three methods. CBSE Class 10 expects you to know all three — they specify which one to use in each question.

Method 1: Direct Method

xˉ=fixifi\bar{x} = \frac{\sum f_i x_i}{\sum f_i}

Use when: The midpoints (xix_i) and frequencies are small numbers. If you’re getting three-digit products everywhere, switch to assumed mean.

Steps:

  1. Find midpoint xix_i of each class.
  2. Multiply: fi×xif_i \times x_i for each row.
  3. Sum the fif_i column and the fixif_i x_i column separately.
  4. Divide.

Method 2: Assumed Mean Method

xˉ=a+fidifi\bar{x} = a + \frac{\sum f_i d_i}{\sum f_i}

where di=xiad_i = x_i - a

Why this works: We’re essentially calculating how far the actual mean is from our assumed mean, then adding that correction. The algebra is identical to the direct method — this is just a computational shortcut.

Use when: Midpoints are large (like 150, 160, 170…) and direct method means multiplying large numbers.

Method 3: Step Deviation Method

xˉ=a+(fiuifi)×h\bar{x} = a + \left(\frac{\sum f_i u_i}{\sum f_i}\right) \times h

where ui=xiahu_i = \dfrac{x_i - a}{h}

Use when: All class widths are equal (which they almost always are in CBSE problems). This gives the smallest numbers to work with.

In CBSE exams, step deviation method saves the most time. If the question says “find mean using step deviation method,” you must use this — switching methods costs marks.


Median of Grouped Data

Formula Method

Median=l+(n2cff)×h\text{Median} = l + \left(\frac{\frac{n}{2} - cf}{f}\right) \times h
  • ll = lower limit of median class
  • nn = total frequency
  • cfcf = cumulative frequency of class before the median class
  • ff = frequency of the median class
  • hh = class width

Finding the median class: Calculate n2\frac{n}{2}. The class whose cumulative frequency first exceeds this value is the median class.

Example: If n=50n = 50, we look for 502=25\frac{50}{2} = 25. Find which class has the cumulative frequency that crosses 25.

The most common mistake: using cfcf of the median class itself instead of the class before it. The formula needs the cumulative frequency that came before we entered the median class.

Ogive Method (Graphical)

An ogive (pronounced “oh-jive”) is a cumulative frequency curve. CBSE Class 10 asks you to draw it and find the median graphically.

Steps for “Less Than” Ogive:

  1. Write upper limits of each class.
  2. Write cumulative frequencies against each upper limit.
  3. Plot points: (upper limit, cumulative frequency).
  4. Join with a smooth curve — never straight lines between points.
  5. Draw a horizontal line at n2\frac{n}{2} on the y-axis.
  6. Where it meets the ogive, drop a vertical line to the x-axis.
  7. That x-value is the median.

For “More Than” Ogive: Use lower limits and “more than” cumulative frequencies. The intersection of both ogives also gives the median.

In CBSE 10 board exams, the ogive question is worth 3–4 marks. You must: correctly plot all points, join them as a smooth curve (not straight lines), and clearly mark where you read off the median.


Mode of Grouped Data

Mode=l+(f1f02f1f0f2)×h\text{Mode} = l + \left(\frac{f_1 - f_0}{2f_1 - f_0 - f_2}\right) \times h
  • ll = lower limit of modal class
  • f1f_1 = frequency of modal class
  • f0f_0 = frequency of class before modal class
  • f2f_2 = frequency of class after modal class
  • hh = class width

Modal class: The class with the highest frequency.

Why this formula?: Mode in grouped data can’t be read directly — we don’t know exact values. This formula interpolates within the modal class, assuming the mode is closer to whichever neighbouring class has a larger frequency.

If you’re unsure which is f0f_0 and which is f2f_2: f0f_0 is always the class before (lower class interval), f2f_2 is always the class after. Don’t swap them — it changes the answer.


Solved Examples

Example 1 — Easy (CBSE Class 10)

Find the mean of the following data using the direct method:

Marks0–1010–2020–3030–4040–50
Students510253010

Solution:

Classxix_ifif_ifixif_i x_i
0–105525
10–201510150
20–302525625
30–4035301050
40–504510450
Total802300
xˉ=230080=28.75\bar{x} = \frac{2300}{80} = 28.75

Example 2 — Medium (CBSE Class 10 Board Pattern)

Find the mean marks using step deviation method:

Marks10–2525–4040–5555–7070–8585–100
Students237666

Solution: Take a=62.5a = 62.5 (middle class midpoint), h=15h = 15.

Classxix_ifif_iui=xi62.515u_i = \frac{x_i - 62.5}{15}fiuif_i u_i
10–2517.52–3–6
25–4032.53–2–6
40–5547.57–1–7
55–7062.5600
70–8577.5616
85–10092.56212
Total30–1
xˉ=62.5+130×15=62.50.5=62\bar{x} = 62.5 + \frac{-1}{30} \times 15 = 62.5 - 0.5 = 62

Example 3 — Hard (Finding Both Median and Mode)

The following table shows daily wages:

Wages (₹)100–120120–140140–160160–180180–200
Workers12148610

Find the median and mode.

Finding Median:

ClassFrequencyCumulative Frequency
100–1201212
120–1401426
140–160834
160–180640
180–2001050

n=50n = 50, so n2=25\frac{n}{2} = 25.

Cumulative frequency first exceeds 25 in the class 120–140. So median class is 120–140.

Here l=120l = 120, cf=12cf = 12, f=14f = 14, h=20h = 20.

Median=120+251214×20=120+1314×20=120+18.57=138.57\text{Median} = 120 + \frac{25 - 12}{14} \times 20 = 120 + \frac{13}{14} \times 20 = 120 + 18.57 = 138.57

Finding Mode:

Highest frequency is 14 (class 120–140). Modal class = 120–140.

l=120l = 120, f1=14f_1 = 14, f0=12f_0 = 12, f2=8f_2 = 8, h=20h = 20.

Mode=120+14122(14)128×20=120+28×20=120+5=125\text{Mode} = 120 + \frac{14 - 12}{2(14) - 12 - 8} \times 20 = 120 + \frac{2}{8} \times 20 = 120 + 5 = 125

Exam-Specific Tips

CBSE Class 10 Board Exams

Statistics typically carries 5–6 marks in CBSE Class 10 boards — one 3-mark and one or two 2-mark questions. The ogive question (draw + find median graphically) appears almost every year. PYQs from 2019, 2020, 2022, and 2023 all had it.

  • Show your complete table — even if the final answer is wrong, you get step marks for correct fixif_i x_i columns.
  • For ogive: scale matters. Label both axes clearly. The examiner checks that your curve is smooth, not pointy.
  • The empirical relationship Mode=3Median2Mean\text{Mode} = 3\text{Median} - 2\text{Mean} appears as a 1-mark verification question. Memorise it.

CBSE Class 9

In Class 9, statistics covers ungrouped data and frequency polygons. The shift to grouped data formulas happens in Class 10. Class 9 board exams will ask for mean/median/mode of raw data and drawing bar graphs or frequency polygons.

Relationship Between Mean, Median, Mode

Mode=3×Median2×Mean\text{Mode} = 3 \times \text{Median} - 2 \times \text{Mean}

This is an approximate relationship that holds for moderately skewed distributions. CBSE uses it to verify answers or as a short 1-mark question.


Common Mistakes to Avoid

Mistake 1: Wrong cumulative frequency in median formula. cfcf in the median formula is the cumulative frequency of the class before the median class, not the median class itself. Write the cumulative frequency column first, identify the median class, then pick cfcf from the row above it.

Mistake 2: Confusing f0f_0 and f2f_2 in mode formula. f0f_0 is the frequency of the class with a smaller class interval (comes before), f2f_2 comes after. Swapping them gives a different (wrong) answer that you can’t recover from.

Mistake 3: Using class boundaries instead of midpoints. For the mean, use the midpoint xi=lower+upper2x_i = \frac{\text{lower} + \text{upper}}{2}, never the lower or upper limit alone.

Mistake 4: Drawing straight lines instead of a smooth curve for ogive. An ogive is a smooth S-shaped curve. Straight-line segments between points is a frequency polygon, not an ogive. This is an explicit marking criterion.

Mistake 5: Wrong median class identification. The median class is the one whose cumulative frequency first exceeds n2\frac{n}{2}. If n2=25\frac{n}{2} = 25 and cumulative frequencies are 12, 25, 34…, the median class is the one with CF = 34 (the one that crosses 25), not the one with CF = 25.


Practice Questions

Q1. Find the mean of the following data using the assumed mean method:

Class0–2020–4040–6060–8080–100
Frequency6810124

Take a=50a = 50 (midpoint of 40–60), h=20h = 20.

xix_ifif_idi=xi50d_i = x_i - 50fidif_i d_i
106–40–240
308–20–160
501000
701220240
90440160
400

xˉ=50+040=50\bar{x} = 50 + \frac{0}{40} = 50


Q2. Find the mode of: 10–20 (4 students), 20–30 (7), 30–40 (12), 40–50 (8), 50–60 (5).

Modal class = 30–40 (highest frequency 12).

l=30l = 30, f1=12f_1 = 12, f0=7f_0 = 7, f2=8f_2 = 8, h=10h = 10.

Mode=30+1272(12)78×10=30+59×10=30+5.56=35.56\text{Mode} = 30 + \frac{12 - 7}{2(12) - 7 - 8} \times 10 = 30 + \frac{5}{9} \times 10 = 30 + 5.56 = 35.56

Q3. The mean of a distribution is 28 and mode is 16. Find the median using the empirical formula.

Mode = 3 Median – 2 Mean

16=3×Median2×2816 = 3 \times \text{Median} - 2 \times 28

16=3×Median5616 = 3 \times \text{Median} - 56

3×Median=723 \times \text{Median} = 72

Median = 24


Q4. Find median from: 0–10 (3), 10–20 (5), 20–30 (9), 30–40 (5), 40–50 (3).

n=25n = 25, n2=12.5\frac{n}{2} = 12.5.

Cumulative frequencies: 3, 8, 17, 22, 25.

CF first exceeds 12.5 at 17. Median class = 20–30.

l=20l = 20, cf=8cf = 8, f=9f = 9, h=10h = 10.

Median=20+12.589×10=20+5=25\text{Median} = 20 + \frac{12.5 - 8}{9} \times 10 = 20 + 5 = 25

Q5. Ages of 40 patients: 5–14 (6), 15–24 (11), 25–34 (21), 35–44 (23), 45–54 (14), 55–64 (5). Find mean using step deviation.

Note: classes are not of equal upper–lower difference if written as above. Treat as continuous: 4.5–14.5, etc. Mid-values: 9.5, 19.5, 29.5, 39.5, 49.5, 59.5. Take a=29.5a = 29.5, h=10h = 10.

Wait — re-reading, this is a standard NCERT-type problem. Take midpoints as 9.5, 19.5, 29.5, 39.5, 49.5, 59.5. Choose a=29.5a = 29.5.

xix_ifif_iuiu_ifiuif_i u_i
9.56–2–12
19.511–1–11
29.52100
39.523123
49.514228
59.55315
8043

xˉ=29.5+4380×10=29.5+5.375=34.87534.9\bar{x} = 29.5 + \frac{43}{80} \times 10 = 29.5 + 5.375 = 34.875 \approx 34.9


Q6. If fi=50\sum f_i = 50 and fiui=7\sum f_i u_i = -7 with a=25a = 25 and h=5h = 5, find the mean.

xˉ=25+750×5=250.7=24.3\bar{x} = 25 + \frac{-7}{50} \times 5 = 25 - 0.7 = 24.3

Q7. A dataset has mean 35 and median 33. Estimate the mode.

Mode = 3 × Median – 2 × Mean = 3(33) – 2(35) = 99 – 70 = 29


Q8. Draw a less-than ogive for the data below and find the median:

Height (cm)140–145145–150150–155155–160160–165
Girls511141010

Less-than cumulative frequencies:

Less thanCF
1455
15016
15530
16040
16550

Plot these 5 points: (145, 5), (150, 16), (155, 30), (160, 40), (165, 50). Draw smooth curve.

n2=25\frac{n}{2} = 25. Draw horizontal line at CF = 25. It meets the curve above approximately 152.

Verify with formula: n=50n = 50, n2=25\frac{n}{2} = 25. Median class = 150–155 (CF crosses 25 at 30).

Median=150+251614×5=150+4514=150+3.21=153.21\text{Median} = 150 + \frac{25 - 16}{14} \times 5 = 150 + \frac{45}{14} = 150 + 3.21 = 153.21 cm.


Frequently Asked Questions

Which mean method should I use when the question doesn’t specify?

Direct method is fine for simple numbers. For large midpoints or when class width is uniform, step deviation is faster and less error-prone. In a board exam, use what makes the arithmetic cleanest — and always show your table in full.

Can mode have two values?

Yes — if two classes tie for highest frequency, the data is bimodal. The formula applies to each modal class separately. CBSE rarely tests this, but know the term.

Why do we use midpoints instead of actual values?

In grouped data, we don’t have individual values — only that 10 students scored between 30 and 40. The midpoint (35) is our best estimate of what those 10 students actually scored on average.

How is the ogive different from a frequency polygon?

A frequency polygon plots frequency against midpoints — it shows where data is concentrated. An ogive plots cumulative frequency against upper (or lower) limits — it tells you “how many observations fall below this value.” The shape is different: ogive is S-shaped, frequency polygon is wave-shaped.

The empirical formula gives an approximation, not exact values. When is it exact?

The formula Mode = 3 Median – 2 Mean holds exactly for a perfectly symmetric or normally distributed dataset. For real data, it’s an approximation. CBSE uses it as an exact relationship for problem-solving — just apply it without worrying about this.

What’s the difference between less-than and more-than ogive?

Less-than ogive: plot upper class limits vs cumulative frequency (increasing curve). More-than ogive: plot lower class limits vs “more than” cumulative frequency (decreasing curve). Their intersection gives the median. Both are ascending/descending S-curves.

How many marks is Statistics in Class 10 boards?

Statistics falls under the “Statistics and Probability” unit, which carries 11 marks in the CBSE Class 10 board exam. Statistics alone (Chapter 14) typically accounts for 6–7 marks.

Is step deviation method only for equal class widths?

Technically yes — the formula divides by hh, which assumes uniform class width. If class widths vary, use the assumed mean method instead. CBSE problems almost always have equal class widths, so this rarely comes up in practice.

Practice Questions