0% found this document useful (0 votes)

32 views34 pages

Central Tendency - Lecture Notes

The document provides an overview of central tendency in statistics, including definitions and calculations for mean, median, and mode. It also discusses weighted mean, mean of grouped data, and variability measures such as range, variance, and standard deviation. Additionally, it covers quartiles and interquartile range, including methods for identifying outliers.

Uploaded by

Fahim Ansari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views34 pages

Central Tendency - Lecture Notes

Uploaded by

Fahim Ansari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 34

3.

Data Distribution
Chapter 1 - Central Tendency
Statistics

Descriptive Inferential
Statistics Statistics

Central Tendency
Central Tendency
Descriptive Statistics Outcome is one value

Deﬁnition:

a descriptive summary of a dataset through a single value that reﬂects

the center of the data distribution

To tell us

How the center of distribution?

3. Data Distribution
Chapter 2 - Understanding Mean, Median, Mode
Central Tendency

Name Age

John 33

Mark

Susan

Joe
28

46
? Average
= Mean

… … Average age of employees in the data distribution (data set)

Bob 29
Statistics

Descriptive Inferential
Statistics Statistics

Central Tendency

Mean Median Mode

= Average
Calculating Mean

Name Age

John 33

Mark 28
33 + 28 + 25 + 46 + 32 + 29 + 42 + 21
Susan 25
8
Joe 46

Ema 32

Bob 29 Mean value = 32

Keith 42

Julia 21
Calculating Mean

Day Sales

Sunday 9500

Monday 100
9500 + 100 + 50 + 150 + 100 + 150 + 100
Tuesday 50
7
Wednesday 150

Thursday 100 More than 10 times compared

to most of the sales
Friday 150 Mean value = 1450
Saturday 100
Mean
● Average value of a data series

○ e.g. data[100, 200, 50, 150], then mean is 100+200+50+150 / 4 = 125

● Outlier(s) in our data set can mislead mean value

○ e.g. data[9500, 100, 50, 350], then mean is 9500 + 100 + 50 + 350 / 4 = 2500

○ in above example, mean value is too far high then most of our data in the dataset

● Formula,

○ Mean = 𝝨x/N
Median
● Middle value in our dataset
● Must sort out the data from low to high
○ e.g. data[150, 50, 600, 200, 350]
○ sorted_data[50, 150, 200, 350, 600]
○ therefore, median value is 200
● Formula
○ {(n + 1) ÷ 2}th value

● if n is even, the median is calculated by averaging the two middle values

○ e.g. data[150, 50, 600, 200, 350, 100]
○ sorted_data[50, 100, 150, 200, 350, 600]
○ therefore, media value is 150+200/2 = 175
Mode
● Value that frequently appears in the data set
○ e.g. data[100, 50, 200, 50, 150]
○ therefore, mode is 50 as it appears twice in the data set
● Can be more than one mode in a single data set
○ e.g. data[100, 50, 200, 50, 150, 100]
○ therefore, mode is 50 and 100

● Some data set do not have mode if there is no repeating number

○ e.g. data[50, 100, 150, 200, 250, 300]
● Sorting is preferred as it helps visually
3. Data Distribution
Chapter 3 - Understanding Weighted Mean
and Mean of Grouped Data
Statistics

Descriptive Inferential
Statistics Statistics

Central Tendency

Mean Median Mode

= Average
Weighted Mean

Weighted:

some part of the data is more important than other

calculating mean based on each weight

Weighted Mean - Example

Assessment Type Scores/Marks Weight in Percentage

Mid-term Exam 95 15%

Practical Project 85 35%

Final Exam 82 50%

● Year end score is computed based on:

○ Get 15% of overall score from mid-term exam
○ Get 35% of overall score from practical project
○ Get 50% of overall score from ﬁnal exam
Weighted Mean - Calculation

Assessment Type Scores/Marks Weight Weight x Score

Mid-term Exam 95 0.15 14.25

Practical Project 85 0.35 29.75

Final Exam 82 0.50 41

Grade Point 85

● To calculate the weighted mean

∑w i . x i
Sum (Multiply weight of each value with its value)
WA =
∑w i
○
○ Sum (weight of each value)
○ Then divide w = weight of each value
x = data value
Mean of Grouped Data - Example

Frequency Distribution

Sales Group No. of Days

0-2 11

3-5 8

6-8 5

9-11 3

12-14 1

15-17 2

Mean/Avg = 150/30 = 5
Mean of Grouped Data - Calculation

Sales Midpoint (x) No. of Days f.x

∑fi . xi
Group (frequency, f) GM =
0-2 1 11 11 ∑fi
3-5 4 8 32
f = frequency of each group
6-8 7 5 35 x = midpoint
9-11 10 3 30
153
12-14 13 1 13
30
15-17 16 2 32
Grouped Mean = 5.1
30 153
3. Data Distribution
Chapter 4 - Variability
Statistics

Descriptive Inferential
Statistics Statistics

Central
Tendency Variability

Mean Median Mode Range Variance Standard Deviation

3. Data Distribution
Chapter 5 - Understanding Range, Variance
and Standard Deviation
Statistics

Descriptive Inferential
Statistics Statistics

Central
Tendency Variability

Mean Median Mode Range Variance Standard Deviation

Range - Example & Calculation

● the difference between the largest number and the smallest number
○ e.g. data[100, 50, 200, 50, 150]
○ therefore, range is 200 - 50 = 150

● same outlier effect as mean

○ e.g. data[100, 50, 9000, 50, 100]
○ therefore, range is 9000 - 50 = 8950

● Formula
○ Range = Largest number - Smallest number

● sorting is preferred as it helps visually

Variance - Calculation

● Calculating Variance

○ Step 1 - calculate the mean value

○ Step 2 - subtract mean value from each data point

○ Step 3 - get the squared value for each subtracted value

○ Step 4 - calculate the average of each squared value

Variance - Example
● E.g. data[15, 17, 16, 14, 18, 16] ● Step 1 - calculate the mean value
● Step 2 - subtract mean value from each data point
Step 1 ● Step 3 - get the squared value for each subtracted value
● Step 4 - calculate the average of each squared value
mean = 15+17+16+14+18+16 / 6 = 96/6 = 16

Step 2 & 3

15-16 = -1 => (-1)2 = 1 Step 4

17-16 = 1 => (1)2 = 1
1 + 1 + 0 + 4 + 4 + 0 = 10
2
16-16 = 0 => (0) = 0
n = 6, therefore VAR = 10/6 = 1.67
2
14-16 = -2 => (-2) = 4
18-16 = 2 => (2)2 = 4
16-16 = 0 => (0)2 = 0
Variance - Interpretation

● If the variance is too small, then our data is very close to the mean
○ E.g. data[15, 17, 16, 14, 18, 16]
○ Variance = 1.67, Mean = 16
○ Since the value of variance is small, each data point is not much far from mean

● If the variance is large, then our data is very far from the mean
○ E.g. data[13, 3, 40, 12, 3, 25]
○ Variance = 170, Mean = 16
○ Since the value of variance is large, each data point is considered far from mean
Standard Deviation - Interpretation

● the value of standard deviation shows us how far each data is deviated
from the mean
● Formula
○ take the square root of Variance

● E.g. data[15, 17, 16, 14, 18, 16]

○ VAR = 1.67
○ SD = √1.67 = 1.29
3. Data Distribution
Chapter 6 - Understanding Quartiles and
Interquartiles Range
Quartiles

● Divide the data set into four equal segments after arranging in ascending order
Quartiles

● First quartile, denoted as Q1

○ splits off the lowest 25% of data from the highest 75%
● Second quartile, denoted as Q2
○ cuts data set in half, median value
● Third quartile, denoted as Q3
○ splits off the highest 25% of data from the lowest 75%
Quartiles - Calculating Q1, Q2, Q3

● Step 1: Arrange data in ascending order

● Step 2: Find the median value, i.e. Q2
● Step 3: Find the median value of lower half of the data set, i.e. Q1
● Step 4: Find the median value of upper half of the data set, i.e. Q3
Quartiles - Example
● Step 1: Arrange data in ascending order
e.g. 7, 18, 16, 10, 2, 5, 13, 11, 3
● Step 2: Find the median value, i.e. Q2
Step 1 ● Step 3: Find the median value of lower half of the data set, i.e. Q1

2, 3, 5, 7, 10, 11, 13, 16, 18 ● Step 4: Find the median value of upper half of the data set, i.e. Q3

Step 2 Step 2

Median = 10 = Q2 Median Upper = (13+16)/2 = 14.5 = Q3

Step 2

Median Lower = (3+5)/2 = 4 = Q1

Interquartile Range (IQR)

● Measure spread of the center half of the data set

IQR = Q3 - Q1

● Useful to spot outliers

Any values that are more than:
Q3 + 1.5 IQR

Any values that are less than:

Q1 - 1.5 IQR
Finding Outliers - Example
● Step 1: Arrange data in ascending order
e.g. 11, 41, 44, 47, 51, 53, 57, 75
● Step 2: Find the median value, i.e. Q2
Sort First
● Step 3: Find the median value of lower half of the data set, i.e. Q1
11, 41, 44, 47, 51, 53, 57, 75 ● Step 4: Find the median value of upper half of the data set, i.e. Q3

Q3
Q3 + 1.5 IQR
(53+57)/2 = 55
55 + (1.5 x 12.5) = 73.75
Q1

(41+44)/2 = 42.5 Q1 - 1.5 IQR

IQR
42.5 - (1.5 x 12.5) = 23.75
55-42.5 = 12.5

Exploring Numerical Data - Students
No ratings yet
Exploring Numerical Data - Students
97 pages
Math264 Numerical Measures Apaydın
No ratings yet
Math264 Numerical Measures Apaydın
64 pages
Representation of Data - 1.1.4
No ratings yet
Representation of Data - 1.1.4
6 pages
Lecture 04
No ratings yet
Lecture 04
88 pages
Intro to Central Tendency Basics
No ratings yet
Intro to Central Tendency Basics
13 pages
03 Numerical Description
No ratings yet
03 Numerical Description
52 pages
Probability Theory & Statistics: Describing Data: Numerical
No ratings yet
Probability Theory & Statistics: Describing Data: Numerical
36 pages
Descriptive Statistics 1
No ratings yet
Descriptive Statistics 1
63 pages
Measusres of Locations
No ratings yet
Measusres of Locations
52 pages
Chapter 1
No ratings yet
Chapter 1
44 pages
2 Measures of Location - Dispersion
No ratings yet
2 Measures of Location - Dispersion
61 pages
Ken Black QA ch03
0% (1)
Ken Black QA ch03
61 pages
Lecture 06-Describing Data Visual Information
No ratings yet
Lecture 06-Describing Data Visual Information
49 pages
Basic 1
No ratings yet
Basic 1
60 pages
2 Descriptives
No ratings yet
2 Descriptives
43 pages
Measures of Location and VARIATION For 1 Variable
No ratings yet
Measures of Location and VARIATION For 1 Variable
44 pages
Statistics for Data Analysis
No ratings yet
Statistics for Data Analysis
59 pages
Week 6+7+8
No ratings yet
Week 6+7+8
37 pages
Data Analytics TB
No ratings yet
Data Analytics TB
1,944 pages
Chapter 3
No ratings yet
Chapter 3
17 pages
Lecture 2b - Describing Data-Numerical
No ratings yet
Lecture 2b - Describing Data-Numerical
47 pages
Statistics For Data Science
No ratings yet
Statistics For Data Science
26 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
38 pages
AGA 3842-2022-2023. Descriptive Statistics
No ratings yet
AGA 3842-2022-2023. Descriptive Statistics
101 pages
Discriptive Statistics
No ratings yet
Discriptive Statistics
50 pages
Discriptive Statistics
No ratings yet
Discriptive Statistics
23 pages
FDSA Unit 2
No ratings yet
FDSA Unit 2
44 pages
Measure of Variation
No ratings yet
Measure of Variation
50 pages
Introduction To Measuring The Central Tendency:: Practice Example
No ratings yet
Introduction To Measuring The Central Tendency:: Practice Example
14 pages
Part 2-Chapter 3 - Describing Data - Edit
No ratings yet
Part 2-Chapter 3 - Describing Data - Edit
46 pages
3-Measures of Dispersion
No ratings yet
3-Measures of Dispersion
33 pages
Descriptive Statistics W25
No ratings yet
Descriptive Statistics W25
41 pages
2 Basic Statistics Unit-II Class
No ratings yet
2 Basic Statistics Unit-II Class
28 pages
Lec1 Statistics
No ratings yet
Lec1 Statistics
30 pages
Descriptive Statistics Guide
No ratings yet
Descriptive Statistics Guide
5 pages
Quantitative Methods For Management
No ratings yet
Quantitative Methods For Management
118 pages
Stat 1101 4 7
No ratings yet
Stat 1101 4 7
18 pages
Measures
No ratings yet
Measures
8 pages
2 - Chapter 1 - Measures of Central Tendency and Variation New
100% (1)
2 - Chapter 1 - Measures of Central Tendency and Variation New
18 pages
Lesson 3.2 Measures of Central Tendency Position and Variation
No ratings yet
Lesson 3.2 Measures of Central Tendency Position and Variation
62 pages
Basic Business Statistics: Concepts & Applications: Activity 4+ 5 + 6 Descriptive Statistics and Graphical Analysis
No ratings yet
Basic Business Statistics: Concepts & Applications: Activity 4+ 5 + 6 Descriptive Statistics and Graphical Analysis
33 pages
Numerical Measures: Bf1206-Business Mathematics SEMESTER 2 - 2016/2017
No ratings yet
Numerical Measures: Bf1206-Business Mathematics SEMESTER 2 - 2016/2017
25 pages
Business Statistics CH
No ratings yet
Business Statistics CH
37 pages
EECM3724 Unit 1 Ch3 Slides 2022
No ratings yet
EECM3724 Unit 1 Ch3 Slides 2022
48 pages
Social Science Statistics (June-Aug) 2025-Topic 2
No ratings yet
Social Science Statistics (June-Aug) 2025-Topic 2
21 pages
Central Tendency Variation Outliers
No ratings yet
Central Tendency Variation Outliers
59 pages
Work Book Related To Mean, Median, Mode
No ratings yet
Work Book Related To Mean, Median, Mode
14 pages
Lec006 - Measures of Dispersion
No ratings yet
Lec006 - Measures of Dispersion
42 pages
MCS Lecture 3
No ratings yet
MCS Lecture 3
57 pages
Godinez Kizzha G Asynchronous Output 3
No ratings yet
Godinez Kizzha G Asynchronous Output 3
7 pages
St130: Basic Statistics Week 3: Lecture: School of Computing Information and Mathematical Sciences
No ratings yet
St130: Basic Statistics Week 3: Lecture: School of Computing Information and Mathematical Sciences
62 pages
Jerome Statistics
No ratings yet
Jerome Statistics
12 pages
Lec 1 Probability
No ratings yet
Lec 1 Probability
34 pages
Chapter 1
100% (1)
Chapter 1
75 pages
2 Stats Intro 14022024 105150am
No ratings yet
2 Stats Intro 14022024 105150am
19 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
24 pages
Dsbda Unit 2
No ratings yet
Dsbda Unit 2
155 pages
Lecture 02 - Exploratory Data and Descriptive Statistics
No ratings yet
Lecture 02 - Exploratory Data and Descriptive Statistics
27 pages
T or False 1. Informatics Is The Science of Processing Data For Storage and Retrieval
No ratings yet
T or False 1. Informatics Is The Science of Processing Data For Storage and Retrieval
5 pages
Student Performance PowerBI Full Report
No ratings yet
Student Performance PowerBI Full Report
25 pages
"Smart Bus Pass System Using QR Code": Computer Science and Engineering
80% (5)
"Smart Bus Pass System Using QR Code": Computer Science and Engineering
46 pages
Databricks Certified Data Engineer Professional Dumps by Ball 21-03-2024 10qa Ebraindumps
100% (1)
Databricks Certified Data Engineer Professional Dumps by Ball 21-03-2024 10qa Ebraindumps
19 pages
COSC 6335 Data Mining (Dr. Eick) Solution Sketches Midterm Exam October 25, 2012
No ratings yet
COSC 6335 Data Mining (Dr. Eick) Solution Sketches Midterm Exam October 25, 2012
11 pages
Concept of Schema Instance & Data Independance
No ratings yet
Concept of Schema Instance & Data Independance
7 pages
Alteryx Workflow Tools Overview
No ratings yet
Alteryx Workflow Tools Overview
16 pages
CS8492-Database Management Systems Department of CSE: Relational Databases
No ratings yet
CS8492-Database Management Systems Department of CSE: Relational Databases
18 pages
Extc Sem 7 Bda R-2016
No ratings yet
Extc Sem 7 Bda R-2016
4 pages
Three Schema Architecture in DBMS
No ratings yet
Three Schema Architecture in DBMS
14 pages
The SQL Tutorial For Data Analysis v2
No ratings yet
The SQL Tutorial For Data Analysis v2
103 pages
Enqueue Waits: Locks Thanks To Doug Burns For Much of The Row Lock Example
No ratings yet
Enqueue Waits: Locks Thanks To Doug Burns For Much of The Row Lock Example
66 pages
Azure MySQL Infographic - Final
No ratings yet
Azure MySQL Infographic - Final
2 pages
Resume Zubair
No ratings yet
Resume Zubair
5 pages
Active Directory Domain Controller
No ratings yet
Active Directory Domain Controller
33 pages
Project Report On Business Intelligence
No ratings yet
Project Report On Business Intelligence
64 pages
PHP Basics for Web Developers
No ratings yet
PHP Basics for Web Developers
51 pages
SQL Commands for Sales and Client Management
No ratings yet
SQL Commands for Sales and Client Management
94 pages
Kv2 Computer Science
No ratings yet
Kv2 Computer Science
184 pages
2 CC2024 MV Encode
No ratings yet
2 CC2024 MV Encode
2 pages
Statistics Concepts for Students
No ratings yet
Statistics Concepts for Students
2 pages
Database Normalization Guide
No ratings yet
Database Normalization Guide
23 pages
Im97k Mlv0u
No ratings yet
Im97k Mlv0u
74 pages
What Is Search GPT and What Is It Used For?
No ratings yet
What Is Search GPT and What Is It Used For?
6 pages
CS322 - Lec 3 - S25
No ratings yet
CS322 - Lec 3 - S25
42 pages
Pending Papers: PC-3, PC-8, PC-16
No ratings yet
Pending Papers: PC-3, PC-8, PC-16
3 pages
Information Technology NSC P2 QP Sept 2021 Eng
No ratings yet
Information Technology NSC P2 QP Sept 2021 Eng
13 pages
Venkateshwaran Gopal: Professional
No ratings yet
Venkateshwaran Gopal: Professional
5 pages
Ooad UNIT 5 Notes
No ratings yet
Ooad UNIT 5 Notes
29 pages
Maruthisai Sambaraju Datastage Developer With Netezza
No ratings yet
Maruthisai Sambaraju Datastage Developer With Netezza
9 pages