Practice Questions

Statistics

1
easySubjective

A shoe company wants to decide which shoe size to produce the most. Justify which measure of central tendency (mean, median, or mode) is the most appropriate for this decision.

2
easySubjective

Analyze the class interval 22.537.522.5 - 37.5 and calculate its class mark.

3
easySubjective

Examine the following frequency distribution and identify the modal class.

Class IntervalFrequency
10-208
20-3015
30-4019
40-5012
50-607
4
easySubjective

Define the term 'class mark' as used in the context of grouped data.

5
easySubjective

Identify the term used for the class interval with the highest frequency in a grouped data distribution.

6
easySubjective

Calculate the class size and the class mark for the class interval 100120100-120.

7
easySubjective

Propose a modification to the class intervals of the data set 50-59, 60-69, 70-79 to make it suitable for median calculation and justify the necessity of this change.

8
easySubjective

State the formula for calculating the mean of grouped data using the Direct Method.

9
easySubjective

Explain the meaning of each symbol in the formula for calculating the mode of grouped data: Mode=l+(f1f02f1f0f2)×h\text{Mode} = l + \left(\frac{f_{1} - f_{0}}{2f_{1} - f_{0} - f_{2}}\right) \times h

10
mediumSubjective

For a frequency distribution, the sum of frequencies is N=60N=60. Analyze the cumulative frequency table below and determine the median class.

MarksCumulative Frequency
Below 105
Below 2012
Below 3025
Below 4041
Below 5060
11
mediumSubjective

For a moderately skewed distribution, the mean is calculated to be 28.528.5 and the median is 3030. Apply the empirical relationship between mean, median, and mode to calculate the mode.

12
mediumSubjective

The following table shows the daily income of 50 families in a locality. Calculate the mean daily income using the direct method.

Daily Income (in ₹)Number of Families (fif_i)
100-12012
120-14014
140-1608
160-1806
180-20010
13
mediumSubjective

The distribution below shows the number of runs scored by batsmen in a cricket season. Calculate the mode of the runs scored.

Runs ScoredNumber of Batsmen
2000-30005
3000-400010
4000-500018
5000-60009
6000-70004
14
mediumSubjective

The weights of 60 students in a class are given in the following distribution. Calculate the median weight.

Weight (in kg)Number of Students
40-455
45-5010
50-5520
55-6015
60-656
65-704
15
mediumSubjective

A dataset has a calculated Mean of 25 and a Mode of 20. A student uses the empirical formula 3 Median=Mode+2 Mean3 \text{ Median} = \text{Mode} + 2 \text{ Mean} to find the median. Critique the reliability of this method. Calculate the median using this formula and explain why it is considered an approximation and not an exact value.

16
mediumSubjective

Recall and state the empirical relationship between the three measures of central tendency: mean, median, and mode.

17
mediumSubjective

What is a 'cumulative frequency distribution'?

18
mediumSubjective

Describe the first two steps to convert a 'less than' type cumulative frequency distribution into a normal frequency distribution.

19
mediumSubjective

Explain the purpose of choosing an 'assumed mean' in the Assumed Mean Method for calculating the mean of grouped data.

20
mediumSubjective

Consider the following frequency distribution table. From this table, identify: (a) The modal class (b) The median class

Class IntervalFrequencyCumulative Frequency
10-2055
20-301217
30-401532
40-50840
50-60646
21
mediumSubjective

Summarize the Step-deviation method for finding the mean of grouped data by listing the key steps and the final formula.

22
mediumSubjective

Summarize the complete procedure for calculating the median of a grouped frequency distribution. List all the steps from beginning to end.

23
mediumSubjective

Describe the three different methods for calculating the mean of grouped data. For each method, state its name, its formula, and explain briefly when it is most appropriate to use.

24
mediumSubjective

The following distribution gives the daily expenditure on milk of 30 households in a locality. Calculate the mean daily expenditure by using the assumed mean method.

Expenditure (in ₹)No. of Households
10-305
30-506
50-708
70-907
90-1104
25
mediumSubjective

The following table gives the production yield per hectare of wheat of 100 farms of a village. Calculate the mean production yield using the step-deviation method.

Production Yield (in kg/ha)Number of Farms
50-552
55-608
60-6512
65-7024
70-7538
75-8016
26
mediumSubjective

A student calculates the mean of a grouped data set and finds it is lower than the lowest class limit. Critique this result. Is it possible? Justify your answer.

27
mediumSubjective

To calculate the mean using the assumed mean method, a student proposes choosing the assumed mean 'a' as 0. Formulate an argument explaining if this is a valid choice and evaluate its usefulness.

28
mediumSubjective

The class marks (xix_i) of a distribution are 125, 175, 225, 275, 325, 375 and their corresponding frequencies (fif_i) are 20, 15, 25, 30, 12, 8. Justify which method (Direct, Assumed Mean, or Step-Deviation) is most suitable for finding the mean and explain why it is more efficient than the others in this case. You do not need to calculate the mean.

29
mediumSubjective

For the given frequency distribution, a student claims the median is 45 because the middle value of the class intervals is 40-50. Evaluate this claim. If it is incorrect, identify the median class and justify your choice.

Class Interval10-2020-3030-4040-5050-60
Frequency81015125
30
mediumSubjective

Design a plan to determine the average time (in minutes) students in your school spend on social media daily. Your plan should propose the data collection method, the structure of the grouped frequency table you would use (including class intervals), and justify which measure of central tendency would be most appropriate to report your findings.

31
mediumSubjective

The monthly incomes of employees in two startups, 'Innovate Inc.' and 'Tech Forward', are given below.

Innovate Inc.: Mean Income = ₹70,000; Median Income = ₹45,000; Modal Income = ₹40,000.

Tech Forward: Mean Income = ₹50,000; Median Income = ₹48,000; Modal Income = ₹52,000.

A financial analyst claims that 'Innovate Inc.' is a better company to work for because its mean income is much higher. Critique this claim. Justify which company likely offers a more equitable salary structure for a typical employee, using the given measures of central tendency to support your argument.

32
hardSubjective

Consider the following data for the weights of students:

Weight (kg)40-4545-5050-5555-6060-65
No. of students581566

If 5 students from the '40-45' kg class were mistakenly recorded and their actual weight is in the '60-65' kg class, evaluate how this correction would impact the mean and the modal class. Justify your answer without calculating the exact new mean.

33
hardSubjective

For a grouped frequency distribution with unequal class sizes, evaluate the statement: "The step-deviation method cannot be used." Is this statement always true? Justify.

34
hardSubjective

Describe what the 'mean' and 'median' of a data set represent and explain the main difference in how they are affected by extreme values (outliers).

35
hardSubjective

Create a discrete frequency distribution with 5 distinct integer values and a total frequency of 10, such that the mean is exactly 5 and the median is 4. Justify your created distribution.

36
hardSubjective

Formulate a grouped frequency distribution with 5 classes and a total frequency of 50 that is skewed to the right. Create the distribution table. Then, calculate the mean, median, and mode for your created data. Justify that the relationship Mean > Median > Mode holds true for your distribution, and explain why this relationship is characteristic of a right-skewed distribution.

37
hardSubjective

The mean of the following frequency distribution is 2525. Analyze the data to find the value of the missing frequency pp.

Class IntervalFrequency
0-105
10-2018
20-3015
30-40pp
40-506
38
hardSubjective

The median of the distribution below is 35. The total number of observations is 170. Create the complete frequency distribution by finding the missing frequencies f1f_1 and f2f_2. After finding the frequencies, evaluate the modal class and calculate the mode for the completed data.

Class0-1010-2020-3030-4040-5050-6060-70
Frequency1020f1f_140f2f_22515
39
hardSubjective

The formula for the mode of a grouped data is Mode=l+(f1f02f1f0f2)×h\text{Mode} = l+\left(\frac{f_{1}-f_{0}}{2 f_{1}-f_{0}-f_{2}}\right) \times h. Formulate a real-world scenario represented by a grouped frequency distribution where the modal class is 40-50, but the calculated mode is less than 45. Justify your formulation with appropriate values for l,f1,f0,f2,l, f_1, f_0, f_2, and hh.

40
hardSubjective

The data regarding the heights of 50 girls of Class X is given below. Analyze the data to calculate the median, mean and mode for the given distribution.

Height (in cm)Number of Girls
120-1302
130-1408
140-15012
150-16020
160-1708
41
hardSubjective

The median of the distribution given below is 4646. The total frequency is 230. Analyze the data to find the values of the missing frequencies xx and yy.

Class IntervalFrequency
10-2012
20-3030
30-40xx
40-5065
50-60yy
60-7025
70-8018
42
hardSubjective

The lengths of 40 leaves of a plant are measured correct to the nearest millimeter. The data obtained is represented in the table below. The class intervals are discontinuous. First, convert the distribution to a continuous frequency distribution. Then, calculate the median length of the leaves.

Length (in mm)Number of Leaves
118-1263
127-1355
136-1449
145-15312
154-1625
163-1714
172-1802
43
hardSubjective

Explain the concept of 'measures of central tendency'. Describe the three main measures (mean, median, and mode) and provide a real-life example for each where it would be the most suitable measure to use.

44
hardSubjective

Explain why it is necessary for class intervals to be continuous when calculating the median or mode of grouped data.