Statistics Project

This statistics project explores measures of central tendency, including mean, median, and mode, through various datasets and methods such as histograms and ogives. It includes detailed analyses of diabetic patient data and salary distributions, highlighting the practical applications of statistics in real-life scenarios. The project concludes with insights on the significance of statistical methods in health studies and economics.

Uploaded by

aayushpillai


📑 STATISTICS PROJECT

Page 1: ACKNOWLEDGMENT
I sincerely thank my Mathematics teacher for her valuable guidance, motivation, and support
throughout the completion of this project. Her encouragement helped me to understand and
apply statistical concepts effectively.

I am also grateful to my parents and friends for their continuous help and inspiration during this
project.

Page 2: INDEX
Page     Content
1        Acknowledgment
2        Index
3        Introduction
4-6      Types of Measures of Central Tendency (Advantages & Disadvantages)
7-9      Problem 1 – Diabetic Patients Data (Step Deviation, Histogram, Ogive)
10-12    Ogive-based Problem (Median, Quartiles, Deciles, Modal Class)
13-14    Analysis & Comparison of Results
15-18    Problem 2 – Salaries (Median, Interquartile Range, Mode)
19-20    Problem 3 – Missing Frequencies
21       Conclusion
22       Bibliography

Page 3: INTRODUCTION
Statistics is the branch of mathematics that deals with the collection, analysis, interpretation,
and presentation of data. In real life, data is often large and unorganized, and it becomes
necessary to represent it in a simplified form for proper understanding and decision-making.

One important tool of statistics is the measure of central tendency, which is a single
representative value that gives an idea about the central or typical value in a set of data.

The three most common measures of central tendency are:

● Mean (Average)

● Median (Middle value)

● Mode (Most frequent value)

This project aims to explore these measures using different datasets and statistical methods,
including step deviation, histograms, ogives, median, quartiles, deciles, and missing frequency
problems.
Pages 4-6: TYPES OF MEASURES OF CENTRAL TENDENCY
1. Mean

● Definition: The mean is the sum of all data values divided by the number of values.

● Advantages:

1. Easy to calculate and understand.

2. Considers all observations in the dataset.

3. Useful for further algebraic and statistical analysis.

● Disadvantages:

1. Heavily affected by extreme values (outliers).

2. Not suitable for qualitative data.

2. Median

● Definition: The median is the middle value of an ordered dataset. For grouped data, it is
obtained using interpolation.

● Advantages:

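1. Largely unaffected by extreme values (outliers).

2. Well suited to skewed data.

● Disadvantages:

1. Depends only on the middle position, so it ignores the magnitude of most data values.

2. Not suited to further algebraic treatment; for example, the medians of two groups cannot be combined into the median of the pooled data.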

3. Mode

● Definition: The mode is the most frequently occurring value in the dataset.

● Advantages:

1. Very simple to understand and apply.

2. Suitable for categorical and qualitative data.

● Disadvantages:

1. Mode may not exist, or there may be more than one.

2. Does not use all data values.
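
The contrast between the three measures can be illustrated with a short sketch. The sample values below are hypothetical (they do not come from the project's data); the example only uses Python's standard `statistics` module to show how a single outlier moves the mean much more than the median or mode.

```python
import statistics

# Hypothetical sample (not from the project): five typical values.
values = [4, 5, 5, 6, 7]
# The same sample with one extreme value appended.
with_outlier = values + [60]

mean_before = statistics.mean(values)
mean_after = statistics.mean(with_outlier)

median_before = statistics.median(values)
median_after = statistics.median(with_outlier)

# multimode returns every most-frequent value, so it also covers
# the "more than one mode" case mentioned above.
modes_after = statistics.multimode(with_outlier)

print("mean:  ", mean_before, "->", mean_after)
print("median:", median_before, "->", median_after)
print("mode(s) with outlier:", modes_after)
```

In this sketch the mean rises from 5.4 to 14.5 when the outlier is added, while the median only moves from 5 to 5.5 and the mode stays at 5.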

Pages 7-9: PROBLEM 1 – DIABETIC PATIENTS
Data Table (cumulative frequencies):

Age (years)        <10   <20   <30   <40   <50   <60   <70   <80
No. of Patients      2    11    27    37    45    55    65    72

Step Deviation Method (Mean)

1. Convert cumulative data to class intervals:

0–10, 10–20, 20–30, 30–40, 40–50, 50–60, 60–70, 70–80

2. Class width (h) = 10

3. Take assumed mean A = 45 (class 40–50 midpoint).

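4. Compute deviations: $u = \dfrac{x - A}{h}$, where $x$ is the class midpoint.

Calculation table (class frequencies are the successive differences of the cumulative counts):

Class     Midpoint x    f     u = (x − 45)/10    f·u
0–10          5          2         −4             −8
10–20        15          9         −3            −27
20–30        25         16         −2            −32
30–40        35         10         −1            −10
40–50        45          8          0              0
50–60        55         10          1             10
60–70        65         10          2             20
70–80        75          7          3             21
Total                   72                       −26

$\bar{x} = A + h \times \dfrac{\sum f u}{\sum f} = 45 + 10 \times \dfrac{-26}{72} \approx 41.39$

The approximate mean age of diabetic patients is 41.39 years.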
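
The step-deviation steps listed above can be checked with a minimal sketch. It takes only the cumulative counts from the data table, the class width h = 10, and the assumed mean A = 45 from the text; the variable names are illustrative choices, not part of the project.

```python
# Cumulative "less than" counts from the Problem 1 data table.
upper_limits = [10, 20, 30, 40, 50, 60, 70, 80]
cumulative = [2, 11, 27, 37, 45, 55, 65, 72]

h = 10   # class width
A = 45   # assumed mean (midpoint of the 40-50 class)

# Class frequencies are successive differences of the cumulative counts.
freqs = [c - p for c, p in zip(cumulative, [0] + cumulative[:-1])]

# Midpoint of each class, then the step deviation u = (x - A) / h.
midpoints = [upper - h / 2 for upper in upper_limits]
u_vals = [(x - A) / h for x in midpoints]

sum_f = sum(freqs)
sum_fu = sum(f * u for f, u in zip(freqs, u_vals))

# Step-deviation formula: mean = A + h * (sum of f*u) / (sum of f).
mean_age = A + h * sum_fu / sum_f

print("frequencies:", freqs)
print("sum f =", sum_f, " sum fu =", sum_fu)
print("mean age ~", round(mean_age, 2))
```

The result agrees with the direct calculation Σfx / Σf = 2980 / 72, which is a useful cross-check that the choice of A does not affect the answer.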

Histogram

(Graphical representation – insert histogram with Age on X-axis, Frequency on Y-axis.)

Ogive

(Graphical representation – insert ogive with cumulative frequency vs. upper class limits.)
Pages 10-12: OGIVE-BASED PROBLEM

1. Frequency distribution (class width 10)

Upper-cumulative frequencies read from the ogive:
at 10 → 5, 20 → 15, 30 → 29, 40 → 50, 50 → 75, 60 → 109, 70 → 145, 80 → 172, 90 → 188,
100 → 200.

Frequency = successive differences:

Class (marks)    Cumulative Freq (CF)    Class Freq (f)
0–10                      5                    5
10–20                    15                   10
20–30                    29                   14
30–40                    50                   21
40–50                    75                   25
50–60                   109                   34
60–70                   145                   36
70–80                   172                   27
80–90                   188                   16
90–100                  200                   12
Total                   200                  200

Total students N = 200.
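
As a quick check of the "successive differences" step, the sketch below rebuilds the class frequencies from the cumulative values read off the ogive and confirms that accumulating them again gives back the same cumulative column. The variable names are illustrative.

```python
from itertools import accumulate

# Cumulative frequencies read from the ogive at upper limits 10, 20, ..., 100.
upper_limits = list(range(10, 101, 10))
cumulative = [5, 15, 29, 50, 75, 109, 145, 172, 188, 200]

# Each class frequency is the CF at its upper limit minus the CF before it.
freqs = [c - p for c, p in zip(cumulative, [0] + cumulative[:-1])]

# Round trip: running totals of the frequencies must reproduce the CF column.
rebuilt = list(accumulate(freqs))

for upper, cf, f in zip(upper_limits, cumulative, freqs):
    print(f"{upper - 10:>3}-{upper:<3}  CF={cf:<4} f={f}")
print("N =", sum(freqs))
```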

(i) Scale of the graph
● X-axis (Marks): 1 small square = 1 mark (1 big square = 10 marks).

● Y-axis (Number of students): 1 small square = 2 students (1 big square = 20 students).

(These are the scales consistent with the axis labelling and the point at (100,200).)

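(ii) Median and median class

● Median position = N/2 = 200/2 = 100th observation.

● CF at 50 = 75, at 60 = 109 → median lies in class 50–60 (class frequency f = 34).

Use linear interpolation, with $l$ the lower boundary of the class, $cf$ the cumulative frequency before it, $f$ the class frequency, and $h$ the class width:

$\text{Median} = l + \dfrac{N/2 - cf}{f} \times h = 50 + \dfrac{100 - 75}{34} \times 10 \approx 57.35$

Median ≈ 57.35 marks. Median class = 50–60.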

(iii) Upper quartile (Q3) and its class

● Q3 position = 3N/4 = 3 × 200/4 = 150th observation.

● CF at 70 = 145, at 80 = 172 → Q3 lies in class 70–80 (f = 27).

Interpolation:

$Q_3 = 70 + \dfrac{150 - 145}{27} \times 10 \approx 71.85$

Q3 ≈ 71.85 marks. Upper-quartile class = 70–80.

(iv) First decile

● D1 position = N/10 = 200/10 = 20th observation.

● CF at 20 = 15, at 30 = 29 → D1 lies in class 20–30 (f = 14).

Interpolation:

$D_1 = 20 + \dfrac{20 - 15}{14} \times 10 \approx 23.57$

First decile D1 ≈ 23.57 marks (class 20–30).
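
The median, upper quartile, and first decile all use the same interpolation rule, only with a different position. A minimal sketch of that rule is given below; `grouped_quantile` is a hypothetical helper name, and the frequencies are the ones from the table in this problem.

```python
def grouped_quantile(position, lower_bounds, freqs, width):
    """Interpolated value at a cumulative position in grouped data.

    Finds the first class whose cumulative frequency reaches `position`,
    then applies  l + (position - cf_before) / f * width.
    """
    cf_before = 0
    for lower, f in zip(lower_bounds, freqs):
        if f > 0 and cf_before + f >= position:
            return lower + (position - cf_before) / f * width
        cf_before += f
    raise ValueError("position exceeds the total frequency")


lower_bounds = list(range(0, 100, 10))           # 0, 10, ..., 90
freqs = [5, 10, 14, 21, 25, 34, 36, 27, 16, 12]  # class frequencies
width = 10
N = sum(freqs)                                   # 200

median = grouped_quantile(N / 2, lower_bounds, freqs, width)
q3 = grouped_quantile(3 * N / 4, lower_bounds, freqs, width)
d1 = grouped_quantile(N / 10, lower_bounds, freqs, width)

print("Median ~", round(median, 2))
print("Q3     ~", round(q3, 2))
print("D1     ~", round(d1, 2))
```

The three printed values match the hand calculations in parts (ii) to (iv): about 57.35, 71.85, and 23.57 marks.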

(v) Modal class

The class with maximum frequency is 60–70 (frequency = 36).
Modal class = 60–70.

(vi) Number of students who scored 95% or more

95% of 100 = 95 marks. We need the number of students with marks ≥ 95.

From the CF: CF at 90 = 188, CF at 100 = 200. The 90–100 class has 12 students.
Assuming uniform distribution inside that class,

$\text{CF at } 95 \approx 188 + \dfrac{95 - 90}{10} \times 12 = 194$

So number scoring ≥ 95 ≈ 200 − 194 = 6.
Approximately 6 students scored 95% or more.
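
Reading a count off the ogive at a mark that is not a class boundary is the reverse of the quantile calculation: the mark is known and the cumulative frequency is interpolated. A small sketch under the same uniform-distribution assumption follows; `cumulative_below` is a hypothetical helper name.

```python
def cumulative_below(x, upper_limits, cumulative):
    """Estimate how many observations lie below x by joining the
    ogive points with straight lines (uniform spread within a class)."""
    prev_limit, prev_cf = 0, 0
    for limit, cf in zip(upper_limits, cumulative):
        if x <= limit:
            share = (x - prev_limit) / (limit - prev_limit)
            return prev_cf + share * (cf - prev_cf)
        prev_limit, prev_cf = limit, cf
    return cumulative[-1]


upper_limits = list(range(10, 101, 10))
cumulative = [5, 15, 29, 50, 75, 109, 145, 172, 188, 200]

below_95 = cumulative_below(95, upper_limits, cumulative)
at_least_95 = cumulative[-1] - below_95

print("estimated CF at 95:", below_95)
print("students with 95 or more:", at_least_95)
```

This reproduces the estimate above: about 194 students below 95 marks, leaving about 6 at 95 or more.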
Pages 13-14: ANALYSIS OF PROBLEM 1 & 2
● By comparing Mean, Median, and Mode, we can see how different measures of central
tendency provide slightly different insights into the same dataset.

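● From the Problem 1 table, the class frequencies are 2, 9, 16, 10, 8, 10, 10, 7, so the 20–30 age group has the highest number of diabetic patients (16 of 72). The 60–70 class is the modal class of the marks distribution in the ogive-based problem, not of the patient data.

● Possible contributing factors to diabetes in general: age-related health decline, less physical activity, poor dietary habits, and hereditary factors.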

● The graphical methods (Histogram, Ogive) visually confirm these results.

Pages 15-18: PROBLEM 2 – SALARIES
Data Table:

Salary (₹ '000)    12    27    33    42    51    56    58    62    70
No. of Persons     49   128    63    15     6     7     4     2     1

i) Median Salary

(Step-by-step working here with N=275. Show cumulative frequencies, identify median class,
then calculate median.)

ii) Interquartile Range (IQR)

(Show detailed calculation using interpolation for Q1 and Q3 positions.)

iii) Modal Salary

(Show working by identifying the modal class and substituting values.)
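
The working for this problem is not shown in the project. As one possible reading, the sketch below treats the table as a discrete frequency distribution, that is, each salary is an exact value rather than a class interval. This is an assumption: the notes above mention classes and interpolation, which would need class boundaries that the table does not give. Positions follow the common textbook rule (N + 1)/2, (N + 1)/4, and 3(N + 1)/4, which are whole numbers here because N = 275.

```python
import statistics

# Salary values in thousands of rupees and the number of persons at each.
salaries = [12, 27, 33, 42, 51, 56, 58, 62, 70]
persons = [49, 128, 63, 15, 6, 7, 4, 2, 1]

# Assumption: discrete data. Expand to one entry per person;
# the list is already in ascending order.
data = [s for s, n in zip(salaries, persons) for _ in range(n)]
N = len(data)


def value_at(position):
    """Observation at a 1-based whole-number position in the ordered data."""
    return data[position - 1]


median = value_at((N + 1) // 2)      # 138th observation
q1 = value_at((N + 1) // 4)          # 69th observation
q3 = value_at(3 * (N + 1) // 4)      # 207th observation
iqr = q3 - q1
mode = statistics.mode(data)         # salary with the largest frequency

print("N =", N)
print("median =", median, " Q1 =", q1, " Q3 =", q3, " IQR =", iqr)
print("mode =", mode)
```

Under this discrete reading the median and mode are both ₹27,000 and the interquartile range is ₹6,000. A grouped-data treatment with class intervals would give different, interpolated values.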

Pages 19-20: PROBLEM 3 – MISSING FREQUENCIES
Given:

● Σf = 120

● Mean = 50

Class Interval    Frequency
0–20                  17
20–40                  a
40–60                 32
60–80                  b
80–100                19

● Class midpoints: 10, 30, 50, 70, 90

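Equation of mean:

$\dfrac{17(10) + 30a + 32(50) + 70b + 19(90)}{120} = 50 \;\Rightarrow\; 30a + 70b = 2520 \;\Rightarrow\; 3a + 7b = 252$

Also:

$17 + a + 32 + b + 19 = 120 \;\Rightarrow\; a + b = 52$

Solve simultaneously for a and b.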
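
The two conditions (total frequency 120 and mean 50) form a pair of linear equations in a and b. A minimal sketch of the solution by substitution is given below, using exact fractions so that any non-integer answer would be visible; the variable names are illustrative.

```python
from fractions import Fraction

total = 120                      # given: sum of all frequencies
mean = 50                        # given: mean of the distribution
midpoints = [10, 30, 50, 70, 90]
known = {0: 17, 2: 32, 4: 19}    # class index -> known frequency
# Indices 1 and 3 hold the unknown frequencies a and b.

known_f = sum(known.values())
known_fx = sum(f * midpoints[i] for i, f in known.items())

# a + b = s              (from the total frequency)
# x_a*a + x_b*b = t      (from mean * total = sum of f*x)
s = total - known_f
t = total * mean - known_fx
x_a, x_b = midpoints[1], midpoints[3]

# Substitute a = s - b into the second equation and solve for b.
b = Fraction(t - x_a * s, x_b - x_a)
a = s - b

print("a + b =", s, "  30a + 70b =", t)
print("a =", a, " b =", b)
```

The sketch gives a = 28 and b = 24, which satisfy both a + b = 52 and 30a + 70b = 2520.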

Page 21: CONCLUSION
This project gave me practical knowledge of how statistics is applied to real-life situations. I
learned how to calculate mean, median, mode, quartiles, and deciles using different methods. I
also understood how to draw and interpret histograms and ogives.

Through the problems on diabetic patients and salary distribution, I realized the importance of
statistics in health studies, economics, and social sciences.

Page 22: BIBLIOGRAPHY

1. NCERT Mathematics Textbook (Class X)

2. R.S. Aggarwal – Statistics and Probability

3. Online Resources: Khan Academy, BYJU’s, PhysicsWallah, Vedantu

4. Teacher’s classroom notes
