MATH211 ProbabilityAndStatistics
MATH211 ProbabilityAndStatistics
Spring - 2023
1
These course slides should not be reproduced nor used by others (without
permission).
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 1 / 463
Lecture Outline I
1 Lecture 1: Introduction
1.2 Background
1.8 Variable
1.11 Homework
2.5 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 3 / 463
Lecture Outline III
3.4 Histogram
3.6 Exercises
3.7 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 4 / 463
Lecture Outline IV
4.5 Median
4.6 Mode
4.9 Boxplots
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 5 / 463
Lecture Outline V
4.10 Exercises
4.11 Homework
5.4 Exercises
5.5 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 6 / 463
Lecture Outline VI
8.2 Skewness
9.2 Events
9.6 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 8 / 463
Lecture Outline VIII
10.1 Joint and Marginal Probabilities
10.3 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 9 / 463
Lecture Outline IX
14.2 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 11 / 463
Lecture Outline XI
18.1 Interval Estimation of µ1 − µ2
18.2 Determining the Sample Size
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 12 / 463
Lecture Outline XII
21.2 Covariance
21.4 Exercises
22.3 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 15 / 463
Lecture 1: Introduction
Lecture 1: Introduction
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 16 / 463
Lecture 1: Introduction
1 Lecture 1: Introduction
1.2 Background
1.8 Variable
1.11 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 17 / 463
Lecture 1: Introduction Text and Reference Book List
Text Book
Reference list
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 18 / 463
Lecture 1: Introduction Text and Reference Book List
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 19 / 463
Lecture 1: Introduction Text and Reference Book List
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 20 / 463
Lecture 1: Introduction Text and Reference Book List
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 21 / 463
Lecture 1: Introduction Text and Reference Book List
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 22 / 463
Lecture 1: Introduction Text and Reference Book List
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 23 / 463
Lecture 1: Introduction Text and Reference Book List
source: https://www.newagebd.net/article/172787/
big-budget-with-high-bank-borrowing-proposed
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 24 / 463
Lecture 1: Introduction Text and Reference Book List
Myocardial Infarction
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 25 / 463
Lecture 1: Introduction Text and Reference Book List
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 26 / 463
Lecture 1: Introduction Text and Reference Book List
Chapter-1: Introduction to
Statistics
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 27 / 463
Lecture 1: Introduction What is Statistics?
What is Statistics??
. Statistics
▸ it is a science of data collection, organization, summarization,
classication, comparison and drawing of inferences about the
population
▸ also provides tools for prediction and forecasting using data and
statistical models
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 28 / 463
Lecture 1: Introduction What is Statistics?
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 29 / 463
Lecture 1: Introduction What is Statistics?
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 30 / 463
Lecture 1: Introduction Limitation of Statistics
Limitation of Statistics
Any Limitation!
statistics is always not suited to the study of qualitative phenomenon
privacy???
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 31 / 463
Lecture 1: Introduction What is Data?
What is Data?
Data
data are individual units of information
these facts and gures are collected, analyzed, and summarized for
presentation and interpretation
all the data collected in a particular study are referred to as the data
set for the study
Datum
datum (a singular form of "data") is a single piece of information
Raw data
data are collected by survey, census etc.
Types of data
1 quantitative data ex: Family size, Score, Age
▸ nominal-level data
▸ ordinal-level data
Types of data based on sources
1 primary data
2 secondary data
Types of data based on the purposes of statistical analysis
1 time series data
2 cross-sectional data
3 Pooled data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 33 / 463
Lecture 1: Introduction What is Data?
for the purpose of statistical analysis, distinguishing time series data and
cross-sectional data are meaningful
1 cross-sectional data
▸ are data collected at the same or approximately the same point in time
for example, company's prot, students prole we collect at the
same time
Note: in this course most of the data will be considered as cross-sectional
data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 34 / 463
Lecture 1: Introduction What is Data?
the data in Table 1.1 are cross-sectional because they describe the
ve variables for the 25 mutual funds at the same point in time
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 35 / 463
Lecture 1: Introduction What is Data?
▸ for example, the time series in Figure 1.1 shows the U.S. average price
per gallon of conventional regular gasoline between 2012 and 2018
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 36 / 463
Lecture 1: Introduction What is Data?
3 pooled data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 37 / 463
Lecture 1: Introduction Population & Sample
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 38 / 463
Lecture 1: Introduction Population & Sample
Random Sample
▸ is a representative part of the target population which has been drawn
randomly
Census
▸ method to collect data on the entire population
▸ As per as 2022 census, Bangladesh have a population of 165,158,616
people, of which 81,712,824 are male, while 83,347,206 are female. As
many as 113,063,587 of them live in rural areas and 52,009,072 live in
Urban.
Sample survey
▸ method for collecting data about the random sample
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 39 / 463
Lecture 1: Introduction Population & Sample
Statistical methods
can be used to summarize or describe a collection of statistical
methods statistics data; this is called descriptive statistics
allows to make predictions (inferences ) from that data; this is called
inferential statistics
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 40 / 463
Lecture 1: Introduction Variable
Variable
. is a characteristics of interest for the elements
. it is also called feature, or factor that vary or change respondent to
respondent
. denoted by X, Y , Z, or denoted by rst letter (e.g., Score (S ), Age
(A))
Types of Variable
1 qualitative variable (also known as categorical variable)
2 quantitative variable (also known as numerical variable)
▸ discrete variable
▸ continuous variable
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 41 / 463
Lecture 1: Introduction Variable
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 42 / 463
Lecture 1: Introduction Level or Scales of Measurement
1 nominal
2 ordinal
3 interval
4 ratio
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 43 / 463
Lecture 1: Introduction Level or Scales of Measurement
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 44 / 463
Lecture 1: Introduction Level or Scales of Measurement
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 45 / 463
Lecture 1: Introduction Level or Scales of Measurement
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 46 / 463
Lecture 1: Introduction Level or Scales of Measurement
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 47 / 463
Lecture 1: Introduction Level or Scales of Measurement
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 48 / 463
Lecture 1: Introduction Computer and Statistical packages
Minitab
Excel
R language
MATLAB
Strata
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 49 / 463
Lecture 1: Introduction Homework
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 50 / 463
Lecture 1: Introduction Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 51 / 463
Lecture 2: Summarizing Raw Data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 52 / 463
Lecture 2: Summarizing Raw Data
2.5 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 53 / 463
Lecture 2: Summarizing Raw Data
Dot Plot
Histogram
Ogive
Frequency Distribution
a frequency distribution is a tabular summary of data showing the number
(frequency) of items in each of several nonoverlapping classes
Solution:
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 56 / 463
Lecture 2: Summarizing Raw Data Graphical Presentation of data
Data Presentation
Types of diagram/chart:
. bar diagram
. pie diagram
. line diagram
. frequency polygon
. cumulative frequency polygon or Ogive
. histogram
. stem-and-leaf plot
. box plot
. scatter diagram
. ⋯
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 57 / 463
Lecture 2: Summarizing Raw Data Bar Diagram
Bar Diagram
bar diagram (also called bar chart) is a graphical device for displaying
categorical data summarized in a frequency, relative frequency, or percent
frequency distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 58 / 463
Lecture 2: Summarizing Raw Data Bar Diagram
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 59 / 463
Lecture 2: Summarizing Raw Data Pie Chart
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 60 / 463
Lecture 2: Summarizing Raw Data Pie Chart
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 61 / 463
Lecture 2: Summarizing Raw Data Pie Chart
Example
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 62 / 463
Lecture 2: Summarizing Raw Data Pie Chart
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 63 / 463
Lecture 2: Summarizing Raw Data Pie Chart
Step 5: Decide the starting point: the lower class limit or class boundary
should cover the smallest value in the raw data.
Step 6: Tally and count the observations under each interval
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 65 / 463
Lecture 2: Summarizing Raw Data Pie Chart
Example
Let's take an example to understand how to construct a frequency
distribution. Suppose we have a weekly expenditure of 30 students.
Constructing a frequency distribution table for the following numbers of
observations:
423, 369, 387, 411, 393, 394, 371, 377, 389, 409, 392, 408, 431, 401, 363,
391, 405, 382, 400, 381, 399, 415, 428, 422, 396, 372, 410, 419, 386, 390
363 369 371 372 377 381 382 386 387 389 390 391 392 393 394
396 399 400 401 405 408 409 410 411 415 419 422 423 428 431
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 66 / 463
Lecture 2: Summarizing Raw Data Pie Chart
2 Step 2:
Minimum Observation is 363 and maximum observation is 431
4 Step 4:
Maximum observation - Minimum observation
h≥
Number of class
431 − 363
≥ = 11.33 ≈ 12
6
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 67 / 463
Lecture 2: Summarizing Raw Data Pie Chart
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 68 / 463
Lecture 2: Summarizing Raw Data Pie Chart
Supplementary Exercises
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 69 / 463
Lecture 2: Summarizing Raw Data Pie Chart
90 88 78
87 69 93
56 78 57
67 85 46
95 59 89
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 71 / 463
Lecture 2: Summarizing Raw Data Pie Chart
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 72 / 463
Lecture 2: Summarizing Raw Data Homework
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 73 / 463
Lecture - 3: Summarizing Raw Data (Conti...)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 74 / 463
Lecture - 3: Summarizing Raw Data (Conti...)
3.4 Histogram
3.6 Exercises
3.7 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 75 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Line Diagram
Line Diagram
a line diagram is a graph that shows related information by drawing a
continuous line between all the points on a grid (usually, as change over
time)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 76 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Line Diagram
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 77 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Frequency Polygons
Frequency Polygons
a frequency polygon is a graph constructed by using lines to join the
midpoints of each class interval. The heights of the points represent the
frequencies.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 78 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Ogive or Cumulative Frequency Polygons
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 79 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Ogive or Cumulative Frequency Polygons
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 80 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Ogive or Cumulative Frequency Polygons
it looks very much like a bar chart, but there are important
dierences between them
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 82 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Histogram
Histogram
to draw a histogram, we sometimes use the mid-value of the class on the
X-axis and frequency (or percentages of relative frequency) on the Y-axis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 83 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Histogram
data about the Basketball teams and the heights of the players
the coach is comparing the heights of his team, the Markwell Cougars
to the rival team, the Sampson Hawks
Markwell Cougars:
170, 172, 175, 176, 176, 176, 178, 181, 182, 183, 183, 183, 185, 185,
187, 188, 188, 189, 190, 195
Sampson Hawks:
169, 175, 176, 176, 178, 179, 180, 183, 183, 186, 186, 186, 187, 187,
187, 187, 187, 188, 190, 191, 192
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 85 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Histogram
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 86 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Histogram
90 88 78
87 69 93
56 78 57
67 85 46
95 59 89
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 87 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Histogram
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 88 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Histogram
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 89 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Stem-and-Leaf Plot
Stem-and-Leaf Plot
A stem-and-leaf Plot is a special table where each data value is split into a
"stem" (the rst digit or digits) and a "leaf" (usually the last digit). Like
in this example:
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 90 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Stem-and-Leaf Plot
Stem-and-Leaf Plot
the "stem" values are listed down, and the "leaf" values go right (or
left) from the stem values
the "stem" is used to group the scores and each "leaf" shows the
individual scores within each group
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 91 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Stem-and-Leaf Plot
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 92 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Exercises
90 88 78
87 69 93
56 78 57
67 85 46
95 59 89
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 93 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Exercises
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 94 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Exercises
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 95 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Exercises
Supplementary Exercises
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 96 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Exercises
Exercises
1 What do you mean by Statistics? Write down some applications of
Statistics in engineering eld.
2 What is meant by population and sample? Discuss the dierent types
of data.
3 What do you mean by sampling (or sampling technique)? Discuss the
dierent types of sampling technique for selecting a sample.
4 What are the points to be borne in mind in the formation of
frequency table? Choosing appropriate class-intervals, form a
frequency table for the following data:
10.2 0.5 5.2 6.1 3.1 6.7 8.9 7.2 8.9
5.4 3.6 9.2 6.1 7.3 2.0 1.3 6.4 8.0
4.3 4.7 12.4 8.6 13.1 3.2 9.5 7.6 4.0
5.1 8.1 1.1 11.5 3.1 6.8 7.0 8.2 2.0
3.1 6.5 11.2 12.0 5.1 10.9 11.2 8.5 2.3
3.4 5.2 10.7 4.9 6.2
Draw the histogram, frequency curve and stem-and-leaf for the above
data.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 97 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Exercises
Exercises: Ross(2020)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 98 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Homework
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 99 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Summarizing bivariate data
▸ A tabular summary of data for two variables. The classes for one
variable are represented by the rows; the classes for the other variable
are represented by the columns.
Graphical method- scatter diagram
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 100 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Summarizing bivariate data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 101 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Summarizing bivariate data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 102 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Summarizing bivariate data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 103 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Scatter Diagram
Scatter Diagram
a scatter diagram is a graphic tool used to display the relationship between
two variables
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 104 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Practice Problem in the Classroom
Strength
Presence of outliers
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 105 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Practice Problem in the Classroom
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 106 / 463
Lecture - 3: Summarizing Raw Data (Conti...) Practice Problem in the Classroom
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 107 / 463
Lecture - 4: Measures of Central Tendency
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 108 / 463
Lecture - 4: Measures of Central Tendency
4.5 Median
4.6 Mode
4.9 Boxplots
4.10 Exercises
4.11 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 109 / 463
Lecture - 4: Measures of Central Tendency
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 110 / 463
Lecture - 4: Measures of Central Tendency Measures of Central Tendency
Questions
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 111 / 463
Lecture - 4: Measures of Central Tendency Arithmetic Mean
Arithmetic Mean
1
n
x1 + x2 + ⋯ + xn
x̄ = (∑ xi ) =
n i=1 n
ex: the arithmetic mean of ve values: 4, 36, 45, 50, 75 is:
x1 + x2 + ⋯ + x5 4 + 36 + 45 + 50 + 75 210
x̄ = = = = 42
5 5 5
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 112 / 463
Lecture - 4: Measures of Central Tendency Geometric Mean
Geometric Mean
1
n n 1
x̄ = (∏ xi ) = (x1 x2 ⋯xn ) n
i= 1
ex: the geometric mean of ve values: 4, 36, 45, 50, 75 is:
1 1 √
5
x̄ = (x1 x2 ⋯x5 ) 5 = (4 × 36 × 45 × 50 × 75) 5 = 24 300 000 = 30
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 113 / 463
Lecture - 4: Measures of Central Tendency Harmonic Mean
Harmonic Mean
Harmonic mean (HM)
is an average which is useful for sets of numbers which are dened in
relation to some unit, for example speed (distance per unit of time).
n
1
− 1
x̄ = n (∑ )
i= 1 xi
ex: the harmonic mean of the ve values: 4, 36, 45, 50, 75 is
5 − 1
1 5 5
x̄ = 5 (∑ ) = 1 1 1 1 1 = 1 = 15
i= 1 xi 4 + 36 + 45 + 50 + 75 3
Relationship between AM, GM, and HM
AM ≥ GM ≥ HM
equality holds if and only if all the elements of the given sample are equal
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 114 / 463
Lecture - 4: Measures of Central Tendency Harmonic Mean
1 Consider a sample with data values of 10, 20, 12, 17, and 16.
Compute the arithmetic mean, geometric mean and harmonic mean
and comment on your results.
2 Consider a sample with data values of 10, 20, 12, 17,10, and 16.
Compute the arithmetic mean, geometric mean and harmonic mean
and comment on your results.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 115 / 463
Lecture - 4: Measures of Central Tendency Harmonic Mean
it is unique
...
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 116 / 463
Lecture - 4: Measures of Central Tendency Harmonic Mean
Merits of Mean
rigidly dened by algebraic formula
easy to calculate and simple to understand
based on all observations of the given data
capable of being treated mathematically hence it is widely used in statistical
analysis
can be computed even if the derailed distribution is not known but some of
the observation and number of the observation are known
least aected by the uctuation of sampling
for every kind of numeric data mean can be calculated
...
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 117 / 463
Lecture - 4: Measures of Central Tendency Harmonic Mean
Demerits of Mean
1 an not be computed for qualitative data like data on intelligence honesty and
smoking habit etc.
2 too much aected by extreme observations and hence it is not adequately
represent data consisting of some extreme point
3 can not be computed when class intervals have open ends
4 if any one of the data is missing then mean can not be calculated
5 ...
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 118 / 463
Lecture - 4: Measures of Central Tendency Median
Median
is the value separating the higher half from the lower half of a data
sample (a population or a probability distribution)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 119 / 463
Lecture - 4: Measures of Central Tendency Median
1 Consider a sample with data values of 10, 20, 12, 17, and 16.
Compute the mean and median.
2 Consider a sample with data values of 10, 20, 12, 17, 10 and 16.
Compute the mean and median.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 120 / 463
Lecture - 4: Measures of Central Tendency Median
Merits of Median
1 easy to calculate and simple to understand
2 well dened an ideal average should be
3 can also be computed in case of frequency distribution with open
ended classes
4 not aected by extreme values and also interdependent of range or
dispersion of the data
5 can be determined graphically
6 proper average for qualitative data where items are not measured but
are scored
7 only suitable average when the data are qualitative & it is possible to
rank various items according to qualitative characteristics.
8 ...
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 121 / 463
Lecture - 4: Measures of Central Tendency Median
Demerits of Median
1 for computing median data needs to be arranged in ascending or
descending order
2 not based on all the observations of the data
3 can not be given further algebraic treatment
4 aected by uctuation of sampling
5 not accurate when the data size is not large
6 ...
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 122 / 463
Lecture - 4: Measures of Central Tendency Mode
Mode
is simply the number which appears most often
⧫ To nd the mode, or modal value, it is best to put the numbers in order.
hen count how many of each number. A number that appears most often
is the mode.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 123 / 463
Lecture - 4: Measures of Central Tendency Mode
1 Consider a sample with data values of 10, 12, 12, 16, and 18.
Compute the mode.
2 Consider a sample with data values of 10, 12, 12, 16, 16 and 18.
Compute the mode and comments on your results.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 124 / 463
Lecture - 4: Measures of Central Tendency Quartiles and Percentiles
n+1
Qk = ( × k) th observation k = 1 , 2, 3
4
n+1
Pk = ( × k) th observation k = 1, 2, . . . , 99
100
n+1
Dk = ( × k) th observation k = 1, 2, . . . , 9
10
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 125 / 463
Lecture - 4: Measures of Central Tendency Quartiles and Percentiles
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 126 / 463
Lecture - 4: Measures of Central Tendency Quartiles and Percentiles
Example
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 127 / 463
Lecture - 4: Measures of Central Tendency Quartiles and Percentiles
Example
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 128 / 463
Lecture - 4: Measures of Central Tendency Five-Number summaries and Boxplot
Five-Number Summary
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 129 / 463
Lecture - 4: Measures of Central Tendency Boxplots
Boxplot
a boxplot is a standardized way of displaying the distribution of data
based on a ve number summary (minimum, rst quartile (Q1),
median, third quartile (Q3), and maximum)
it can tell you about your outliers and what their values are
it can also tell you if your data is symmetrical, how tightly your data
is grouped, and if and how your data is skewed.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 130 / 463
Lecture - 4: Measures of Central Tendency Boxplots
Boxplot
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 131 / 463
Lecture - 4: Measures of Central Tendency Boxplots
Example
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 132 / 463
Lecture - 4: Measures of Central Tendency Boxplots
Example
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 133 / 463
Lecture - 4: Measures of Central Tendency Boxplots
Example
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 134 / 463
Lecture - 4: Measures of Central Tendency Boxplots
Detecting Outliers
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 135 / 463
Lecture - 4: Measures of Central Tendency Boxplots
Example
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 136 / 463
Lecture - 4: Measures of Central Tendency Exercises
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 138 / 463
Lecture - 5: Measures of Dispersion
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 139 / 463
Lecture - 5: Measures of Dispersion
5.4 Exercises
5.5 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 140 / 463
Lecture - 5: Measures of Dispersion Measure of Dispersion
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 141 / 463
Lecture - 5: Measures of Dispersion Measure of Dispersion
A Measure of Dispersion
Dispersion
. is the state of getting dispersed, i. e. spread of data
. is contrasted with location or central tendency, and together they are
the most used properties of distributions
. the measure which express the scattering of observation in terms of
distances i.e., range, quartile deviation
. the measure which expresses the variations in terms of the average of
deviations of observations like mean deviation and standard deviation
. spread can also be shown in graphs: dot plots, boxplots, and stem
and leaf plots have a greater distance with samples that have a larger
dispersion and vice versa
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 142 / 463
Lecture - 5: Measures of Dispersion Measure of Dispersion
Absolute Measures
1 range: maximum value - minimum value
2 interquartile range (IQR): Q3 − Q1
3 semi-interquartile range : (Q3 − Q1 )/2
4 variance (or standard deviation)
5 mean absolute dierence (also known as Gini mean absolute
dierence)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 143 / 463
Lecture - 5: Measures of Dispersion Absolute Measures
Absolute Measures
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 144 / 463
Lecture - 5: Measures of Dispersion Absolute Measures
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 145 / 463
Lecture - 5: Measures of Dispersion Absolute Measures
Practice in Classroom
1 Consider a sample with data values of 10, 20, 12, 17, and Compute
the range and interquartile range.
2 Consider a sample with data values of 10, 20, 12, 17, and Compute
the range, interquartile range and Semi-interquartile range.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 146 / 463
Lecture - 5: Measures of Dispersion Absolute Measures
Variance
the variance (and standard deviation) is a measure of how widely
values are dispersed from the average value (the mean)
▸ for population
1 N 2 1 n 2
σ2 = 2
∑ (Xi − µ) = (∑ Xi − Nµ )
N i=1 N i=1
▸ for sample
1 n 2 1 n
s2 = ∑ (xi − x̄) = (∑ xi2 − nx̄ 2 )
n − 1 i=1 n − 1 i=1
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 147 / 463
Lecture - 5: Measures of Dispersion Absolute Measures
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 148 / 463
Lecture - 5: Measures of Dispersion Absolute Measures
Standard Deviation
is a square root of a variance and sample standard deviation is
denoted by s
for sample
¿ ¿
Á
s =Á
À 1
n
2 Á
∑ (xi − x̄) = Á
À 1
n
(∑ xi2 − nx̄ 2 )
n − 1 i=1 n−1 i= 1
the major characteristics of the standard deviation are:
1 it is in the same units as the original data
2 it is the square root of the average squared distance from the mean
3 it cannot be negative
4 it is the most widely reported measure of dispersion
advantages & disadvantages?
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 149 / 463
Lecture - 5: Measures of Dispersion Absolute Measures
Practice in Classroom
1 Consider a sample with data values of 10, 20, 12, 17, and Compute
the range and interquartile range.
2 Consider a sample with data values of 10, 20, 12, 17, and Compute
the variance and standard deviation.
3 Consider a sample with data values of 27, 25, 20, 15, 30, 34, 28, and
25. Compute the range, interquartile range, variance, and standard
deviation.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 150 / 463
Lecture - 5: Measures of Dispersion Absolute Measures
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 151 / 463
Lecture - 5: Measures of Dispersion Absolute Measures
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 152 / 463
Lecture - 5: Measures of Dispersion Relative Measures
Coecient of Variation
the coecient of variation (CV) is a statistical measure of the
dispersion of data points in a data series around the mean
the coecient of variation represents the ratio of the standard
deviation to the mean, and it is a useful statistic for comparing the
degree of variation from one data series to another, even if the means
are drastically dierent from one another
▸ for sample
s
cv = × 100%
x̄
advantages & disadvantages?
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 153 / 463
Lecture - 5: Measures of Dispersion Exercises
Exercises
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 154 / 463
Lecture - 5: Measures of Dispersion Homework
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 155 / 463
Lecture - 6: Review Class
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 156 / 463
Lecture - 6: Review Class
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 157 / 463
Lecture - 7: Class Test - 1
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 158 / 463
Lecture - 7: Class Test - 1
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 159 / 463
Lecture - 8: Measures of Shape of the Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 160 / 463
Lecture - 8: Measures of Shape of the Distribution
8.2 Skewness
8.8 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 161 / 463
Lecture - 8: Measures of Shape of the Distribution Shape of the Distribution
▸ skewness
▸ kurtosis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 162 / 463
Lecture - 8: Measures of Shape of the Distribution Shape of the Distribution
Skewness
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 163 / 463
Lecture - 8: Measures of Shape of the Distribution Shape of the Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 164 / 463
Lecture - 8: Measures of Shape of the Distribution Skewness
Skewness
skewness characterizes the degree of asymmetry of a distribution
around its mean
n n
xi − x̄ 3
skewness: s1 = ∑( )
(n − 1)(n − 2) i=1 s
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 165 / 463
Lecture - 8: Measures of Shape of the Distribution Skewness
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 166 / 463
Lecture - 8: Measures of Shape of the Distribution Skewness
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 167 / 463
Lecture - 8: Measures of Shape of the Distribution Skewness
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 168 / 463
Lecture - 8: Measures of Shape of the Distribution Skewness
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 169 / 463
Lecture - 8: Measures of Shape of the Distribution Skewness
Kurtosis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 170 / 463
Lecture - 8: Measures of Shape of the Distribution Skewness
Kurtosis
n(n + 1) n
xi − x̄ 4 3(n − 1)2
s2 = ∑( ) −
(n − 1)(n − 2)(n − 3) i=1 s (n − 2)(n − 3)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 171 / 463
Lecture - 8: Measures of Shape of the Distribution Skewness
Kurtosis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 172 / 463
Lecture - 8: Measures of Shape of the Distribution Mean for grouped data
Exercises
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 173 / 463
Lecture - 8: Measures of Shape of the Distribution Mean for grouped data
∑ki=1 fi xi f1 x1 + f2 x2 + ⋯ + fk xk
x̄ = =
∑ki=1 fi ∑ki=1 fi
example:
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 174 / 463
Lecture - 8: Measures of Shape of the Distribution Mean for grouped data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 175 / 463
Lecture - 8: Measures of Shape of the Distribution Weighted Mean
Weighted Mean
w1 x1 + w2 x2 + ⋯ + wk xk
x̄w =
w1 + w2 + ⋯ + wk
Self Exercise: Springers sold 95 Antonelli men's suits for the regular price
of $400. For the spring sale, the suits were reduced to $200 and 126 were
sold. At the nal clearance, the price was reduced to $100 and the
remaining 79 suits were sold.
(b). Springers paid $200 a suit for the 300 suits. Comment on the store's
prot per suit if a salesperson receives a $25 commission for each one
sold.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 176 / 463
Lecture - 8: Measures of Shape of the Distribution Weighted Mean
xi weight (wi )
3.2 6
2.0 3
2.5 2
5.0 8
(ii). Compute the sample mean of the four data values without weighting.
Note the dierence in the results provided by the two computations.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 177 / 463
Lecture - 8: Measures of Shape of the Distribution Median for Group Data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 178 / 463
Lecture - 8: Measures of Shape of the Distribution Median for Group Data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 179 / 463
Lecture - 8: Measures of Shape of the Distribution Median for Group Data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 180 / 463
Lecture - 8: Measures of Shape of the Distribution Mode for the Group data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 181 / 463
Lecture - 8: Measures of Shape of the Distribution Mode for the Group data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 182 / 463
Lecture - 8: Measures of Shape of the Distribution Mode for the Group data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 183 / 463
Lecture - 8: Measures of Shape of the Distribution Standard Deviation for Group Data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 184 / 463
Lecture - 8: Measures of Shape of the Distribution Standard Deviation for Group Data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 185 / 463
Lecture - 8: Measures of Shape of the Distribution Standard Deviation for Group Data
Exercises: Ross(2020)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 186 / 463
Lecture - 8: Measures of Shape of the Distribution Standard Deviation for Group Data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 187 / 463
Lecture - 8: Measures of Shape of the Distribution Homework
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 188 / 463
Lecture - 8: Measures of Shape of the Distribution Homework
Chapter - 4: Introduction to
Probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 189 / 463
Lecture - 9: Introduction to probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 190 / 463
Lecture - 9: Introduction to probability
9.2 Events
9.6 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 191 / 463
Lecture - 9: Introduction to probability
Chapter 3: Introduction to
probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 192 / 463
Lecture - 9: Introduction to probability
What are the chances that sales will decrease if we increase prices?
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 193 / 463
Lecture - 9: Introduction to probability Sample Space and Events
Sample Space
an experiment is any action whose outcome is subject to uncertainty
(randomness) i.e., outcome is not predictable in advance
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 194 / 463
Lecture - 9: Introduction to probability Sample Space and Events
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 195 / 463
Lecture - 9: Introduction to probability Sample Space and Events
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 196 / 463
Lecture - 9: Introduction to probability Sample Space and Events
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 197 / 463
Lecture - 9: Introduction to probability Events
Events
Events
an event is a subset of the sample space
for example, S = {1, 3, 4} and A = {4, 1}, since A is a subset of the
sample space S and hence, A in an event
Complements of Events:
the complement of an event A, is the event consisting of everything in
the sample space S that is not contained within the event A
′
the notation A is used for the complement of A
for example, a six-sides dice has a sample space S = {1, 2, 3, 4, 5, 6}
if event A = {1, 3, 6} then complementary of event A = { 2, 4, 5}
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 198 / 463
Lecture - 9: Introduction to probability Probabilities Dened on Events
1 An experiment has ve outcomes, I, II, III, IV, and V. If P(I) = 0.13,
P(II) = 0.24, P(III) = 0.07, and P(IV) = 0.38, what is P(V)?
2 An experiment has ve outcomes, I, II, III, IV, and V. If P(I) = 0.08,
P(II) = 0.20, and P(III) = 0.33, what are the possible values for the
probability of outcome V? If outcomes IV and V are equally likely,
what are their probability values?
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 200 / 463
Lecture - 9: Introduction to probability Counting Rules, Combinations, and Permutations
Tree Diagram
A tree diagram is a graphical representation that helps in visualizing a
multiple-step experiment.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 202 / 463
Lecture - 9: Introduction to probability Counting Rules, Combinations, and Permutations
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 204 / 463
Lecture - 9: Introduction to probability Counting Rules, Combinations, and Permutations
Permutations
A third counting rule that is permutations. It allows one to compute the
number of experimental outcomes when n objects are to be selected from
a set of N objects where the order of selection is important.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 205 / 463
Lecture - 9: Introduction to probability Approaches Assigning Probabilities
(i). classical,
(iii). subjective
the classical and empirical methods are objective and are based on
information and data
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 206 / 463
Lecture - 9: Introduction to probability Approaches Assigning Probabilities
Classical Probability
the classical probability is based on the assumption that the outcomes
of an experiment are equally likely
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 207 / 463
Lecture - 9: Introduction to probability Approaches Assigning Probabilities
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 208 / 463
Lecture - 9: Introduction to probability Approaches Assigning Probabilities
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 209 / 463
Lecture - 9: Introduction to probability Approaches Assigning Probabilities
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 210 / 463
Lecture - 9: Introduction to probability Approaches Assigning Probabilities
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 211 / 463
Lecture - 9: Introduction to probability Approaches Assigning Probabilities
Complementary Event
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 212 / 463
Lecture - 9: Introduction to probability Approaches Assigning Probabilities
Example of Events
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 213 / 463
Lecture - 9: Introduction to probability Approaches Assigning Probabilities
Intersections of Events
contains common outcomes between two events
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 214 / 463
Lecture - 9: Introduction to probability Approaches Assigning Probabilities
for example, A = {1, 3, 4}, B = {5}, since event A and B don't have
any common outcome, so they are Mutually Exclusive Events
Unions of Events
consists of the outcomes that are contained within at least one of the
events A and B
for example, suppose A = {1, 3, 4}, B = {5}, then A ∪ B = {1, 3, 4, 5}
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 215 / 463
Lecture - 9: Introduction to probability Approaches Assigning Probabilities
Unions of Events
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 216 / 463
Lecture - 9: Introduction to probability Homework
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 217 / 463
Lecture - 10: Basic rules for computing probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 218 / 463
Lecture - 10: Basic rules for computing probability
10.3 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 219 / 463
Lecture - 10: Basic rules for computing probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 220 / 463
Lecture - 10: Basic rules for computing probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 221 / 463
Lecture - 10: Basic rules for computing probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 222 / 463
Lecture - 10: Basic rules for computing probability
▸ Pr(girl) = 13
30 , Pr(A-grade student) =
9
30 and
5
Pr(girl ∩ A-grade student) = 30
▸ therefore,
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 223 / 463
Lecture - 10: Basic rules for computing probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 224 / 463
Lecture - 10: Basic rules for computing probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 225 / 463
Lecture - 10: Basic rules for computing probability Joint and Marginal Probabilities
example:
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 226 / 463
Lecture - 10: Basic rules for computing probability Joint and Marginal Probabilities
Another Example
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 227 / 463
Lecture - 10: Basic rules for computing probability Joint and Marginal Probabilities
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 228 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Pr(A∣B) = Pr(A)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 229 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Continuous Assessment:
1 what the conditional probability when two events A and B are
mutually exclusive?
2 what the conditional probability that an event B is contained within
an event A, that is B ⊂A
Back to the Previous Example
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 231 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Solution of (a).
Continuous Assessment:
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 232 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Continuous Assessment:
(a). If somebody rolls a fair die without showing you but announces that
the result is even, then what is the probability of scoring a 6?
(b). What is the probability that at least one 6 is obtained on the two
dice?
(c). Suppose that somebody rolls the two dice without showing you but
announces that at least one 6 has been scored. What is the
probability that the red die scored a 6?
(d). If a red die and a blue die are thrown. What is the probability that
the red die scores a 6 given that an exactly one 6 has been score?
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 233 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Solution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 234 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 235 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 236 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 237 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 238 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Example
Suppose that somebody secretly rolls two fair six-sided dice, and we
wish to compute the probability that the face-up value of the rst one
is 2, given the information that their sum is no greater than 5.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 239 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 240 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 241 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 242 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 243 / 463
Lecture - 10: Basic rules for computing probability Conditional Probability
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 244 / 463
Lecture - 10: Basic rules for computing probability Homework
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 245 / 463
Lecture - 11: Review Class
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 246 / 463
Lecture - 11: Review Class
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 247 / 463
Lecture - 12: Mid-term test
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 248 / 463
Lecture - 12: Mid-term test
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 249 / 463
Lecture - 12: Mid-term test
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 250 / 463
Lecture 13: Normal Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 251 / 463
Lecture 13: Normal Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 252 / 463
Lecture 13: Normal Distribution Random Variable
Random Variable
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 253 / 463
Lecture 13: Normal Distribution Random Variable
random variable
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 254 / 463
Lecture 13: Normal Distribution Random Variable
Example
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 255 / 463
Lecture 13: Normal Distribution Random Variable
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 256 / 463
Lecture 13: Normal Distribution Random Variable
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 257 / 463
Lecture 13: Normal Distribution Random Variable
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 258 / 463
Lecture 13: Normal Distribution Random Variable
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 259 / 463
Lecture 13: Normal Distribution Random Variable
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 260 / 463
Lecture 13: Normal Distribution Random Variable
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 261 / 463
Lecture 13: Normal Distribution Random Variable
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 262 / 463
Lecture 13: Normal Distribution Random Variable
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 263 / 463
Lecture 13: Normal Distribution Random Variable
Practice in Classroom
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 264 / 463
Lecture 13: Normal Distribution Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 265 / 463
Lecture 13: Normal Distribution Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 266 / 463
Lecture 13: Normal Distribution Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 267 / 463
Lecture 13: Normal Distribution Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 268 / 463
Lecture 13: Normal Distribution Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 269 / 463
Lecture 13: Normal Distribution Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 270 / 463
Lecture 14: Standard Normal Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 271 / 463
Lecture 14: Standard Normal Distribution
14.2 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 272 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 273 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 274 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 275 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 276 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 277 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 278 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 279 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 280 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 281 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 282 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 283 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Practice in Classroom
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 284 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Practice in Classroom
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 285 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Practice in Classroom
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 286 / 463
Lecture 14: Standard Normal Distribution Standard Normal Probability Distribution
Practice in Classroom
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 287 / 463
Lecture 14: Standard Normal Distribution Homework
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 288 / 463
Lecture 15: Class Test - 2
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 289 / 463
Lecture 15: Class Test - 2
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 290 / 463
Lecture 15: Class Test - 2
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 291 / 463
Lecture 16: Random Sampling Methods
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 292 / 463
Lecture 16: Random Sampling Methods
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 293 / 463
Lecture 16: Random Sampling Methods Population & Sample
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 294 / 463
Lecture 16: Random Sampling Methods Census & Survey
Random Sample
▸ is a representative part of the target population which has been drawn
randomly
Census
▸ method to collect data on the entire population
▸ As per as 2022 census, Bangladesh have a population of 165,158,616
people, of which 81,712,824 are male, while 83,347,206 are female. As
many as 113,063,587 of them live in rural areas and 52,009,072 live in
Urban.
▸ Bangladesh Population Census: 1974, 1981, 1991, 2001, 2011, 2022
▸ advantages of a Census accurate and complete information
▸ according to the UN, the census is featured by the following aspects:
(i) Individual enumeration, (ii) Universality within a dened territory,
(iii) Simultaneity and (iv) Dened periodicity.
Sample survey
▸ method for collecting data about the random sample
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 295 / 463
Lecture 16: Random Sampling Methods Census & Survey
Disadvantages of a Census
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 296 / 463
Lecture 16: Random Sampling Methods Census & Survey
▸ saving of time
▸ reduced cost
▸ detailed study
▸ accuracy of result
▸ administrative convenience
▸ impossibility of the use of census method, etc.
Disadvantages/Limitations of Sampling
▸ chances of bias
▸ diculties in selecting truly a representative sample
▸ in adequate knowledge in the subject
▸ changeability of sampling units
▸ impossibility of sampling, etc.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 297 / 463
Lecture 16: Random Sampling Methods Census & Survey
Sampling Error
▸ the error, which arises entirely due to sampling and no other reasons
can be attributed to cause such error, is called sampling error
▸ is the dierence between a sample statistic and its corresponding
population parameter
Non-sampling Error
▸ non-sampling error can be attributed to sources other than sampling
and they may be random or non-random
Non-response Error
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 298 / 463
Lecture 16: Random Sampling Methods Census & Survey
Sampling Units
▸ Sample units are the members of the population from which
measurements are taken during sampling
▸ examples
∎ all rickshaw puller of J.U campus
∎ all under ve children of J.U
∎ all sex worker of a brothel
Sampling Frame
▸ a sampling frame is a representation of the elements of the target
population
▸ it consist of a least or a set of narrations for identify the target
population
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 299 / 463
Lecture 16: Random Sampling Methods Census & Survey
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 300 / 463
Lecture 16: Random Sampling Methods Sampling Methods
Sampling Methods
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 301 / 463
Lecture 16: Random Sampling Methods Sampling Methods
Collection of Data
Sampling methods
1 simple random sampling
⇒ same/equal chance to select sampling unit
4 cluster sampling
⇒ selection of all (or some) sampling unit from selected cluster(s)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 302 / 463
Lecture 16: Random Sampling Methods Sampling Methods
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 303 / 463
Lecture 16: Random Sampling Methods Sampling Methods
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 304 / 463
Lecture 16: Random Sampling Methods Sampling Methods
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 305 / 463
Lecture 16: Random Sampling Methods Simple Random Sampling
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 306 / 463
Lecture 16: Random Sampling Methods Simple Random Sampling
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 307 / 463
Lecture 16: Random Sampling Methods Simple Random Sampling
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 308 / 463
Lecture 16: Random Sampling Methods Simple Random Sampling
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 309 / 463
Lecture 16: Random Sampling Methods How to draw a simple random sample?
lottery method
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 310 / 463
Lecture 16: Random Sampling Methods How to draw a simple random sample?
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 311 / 463
Lecture 16: Random Sampling Methods How to draw a simple random sample?
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 312 / 463
Lecture 16: Random Sampling Methods How to draw a simple random sample?
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 313 / 463
Lecture 16: Random Sampling Methods How to draw a simple random sample?
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 314 / 463
Lecture 16: Random Sampling Methods How to draw a simple random sample?
Then, in the given random number table, starting with the rst
number and moving row wise (or column wise or diagonal wise) to
pick out the numbers in pairs, one by one, ignoring those numbers
which are greater than 50, until a selection of 10 numbers is made.
Selected row-wise sample numbers: 27, 15, 45, 11, 02, 14, 18, 07, 39,
31
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 315 / 463
Lecture 16: Random Sampling Methods How to draw a simple random sample?
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 316 / 463
Lecture 16: Random Sampling Methods How to draw a simple random sample?
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 317 / 463
Lecture 17: Interval estimation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 318 / 463
Lecture 17: Interval estimation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 319 / 463
Lecture 17: Interval estimation Interval Estimation
Estimation
Point estimation
It is the single best value. For example, mean and SD of total marks
for a course of IUB students are point estimates because these are
single value
Interval estimation
Interval estimation is the use of sample data to calculate an interval
of possible (or probable) values of an unknown population parameter,
in contrast to point estimation, which is a single number.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 320 / 463
Lecture 17: Interval estimation Interval Estimation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 321 / 463
Lecture 17: Interval estimation Condence interval for µ based on large samples
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 322 / 463
Lecture 17: Interval estimation Condence interval for µ based on large samples
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 323 / 463
Lecture 17: Interval estimation Condence interval for µ based on large samples
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 324 / 463
Lecture 17: Interval estimation Condence interval for µ based on large samples
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 325 / 463
Lecture 17: Interval estimation Condence interval for µ based on large samples
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 326 / 463
Lecture 17: Interval estimation Condence interval for µ based on large samples
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 327 / 463
Lecture 17: Interval estimation Condence interval for µ based on large samples
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 328 / 463
Lecture 17: Interval estimation Condence interval for µ based on large samples
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 329 / 463
Lecture 17: Interval estimation Condence interval for µ based on small samples
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 330 / 463
Lecture 17: Interval estimation Condence interval for µ based on small samples
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 331 / 463
Lecture 17: Interval estimation Condence interval for µ based on small samples
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 332 / 463
Lecture 17: Interval estimation Condence interval for µ based on small samples
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 333 / 463
Lecture 17: Interval estimation Condence interval for µ based on small samples
When sample size is less than 30, i.e., n < 30, the sampling distribution of
the sample is a student's-t distribution. The student's-t distribution is
very similar to the standard normal distribution.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 334 / 463
Lecture 17: Interval estimation Condence interval for µ based on small samples
Problem 1
Solution:
Let X is height of the students. It is given that X ∼ N(µ, 3.6”) and n = 20.
95% Condence interval for µ
s s
x̄ − t α ,(n−1) √ ≤ µ ≤ x̄ + t α ,(n−1) √
2 n 2 n
Here, x̄ = 67.3”, s = 3.6”, n = 25, α = 1 − 0.95 = 0.05, α/2 = 0.025 and
t α
n
,(n− 1) = t0.025,(19) = 2.093. Thus
3.6 3.6
67.3 − 2.093 × √ ≤ µ ≤ 67.3 + 2.093 × √ ⇒ 65.61 ≤ µ ≤ 68.98
20 20
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 336 / 463
Lecture 17: Interval estimation Condence interval for µ based on small samples
Home Work
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 337 / 463
Lecture 17: Interval estimation Condence interval for variance and standard deviation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 338 / 463
Lecture 17: Interval estimation Condence interval for variance and standard deviation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 339 / 463
Lecture 17: Interval estimation Condence interval for variance and standard deviation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 340 / 463
Lecture 17: Interval estimation Condence interval for variance and standard deviation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 341 / 463
Lecture 17: Interval estimation Condence interval for variance and standard deviation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 342 / 463
Lecture 17: Interval estimation Condence interval for variance and standard deviation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 343 / 463
Lecture 17: Interval estimation Condence interval for variance and standard deviation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 344 / 463
Lecture 17: Interval estimation Condence interval for variance and standard deviation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 345 / 463
Lecture 17: Interval estimation Condence interval for variance and standard deviation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 346 / 463
Lecture 18: Interval Estimations about Two Population Means
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 347 / 463
Lecture 18: Interval Estimations about Two Population Means
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 348 / 463
Lecture 18: Interval Estimations about Two Population Means Interval Estimation of µ1 − µ2
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 349 / 463
Lecture 18: Interval Estimations about Two Population Means Interval Estimation of µ1 − µ2
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 350 / 463
Lecture 18: Interval Estimations about Two Population Means Interval Estimation of µ1 − µ2
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 351 / 463
Lecture 18: Interval Estimations about Two Population Means Interval Estimation of µ1 − µ2
Exercises
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 352 / 463
Lecture 18: Interval Estimations about Two Population Means Interval Estimation of µ1 − µ2
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 353 / 463
Lecture 18: Interval Estimations about Two Population Means Interval Estimation of µ1 − µ2
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 354 / 463
Lecture 18: Interval Estimations about Two Population Means Determining the Sample Size
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 355 / 463
Lecture 18: Interval Estimations about Two Population Means Determining the Sample Size
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 356 / 463
Lecture 18: Interval Estimations about Two Population Means Determining the Sample Size
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 357 / 463
Lectures 19: Tests of hypothesis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 358 / 463
Lectures 19: Tests of hypothesis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 359 / 463
Lectures 19: Tests of hypothesis
Tests of hypothesis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 360 / 463
Lectures 19: Tests of hypothesis Problems & Motivation
Myocardial Infarction
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 361 / 463
Lectures 19: Tests of hypothesis Problems & Motivation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 362 / 463
Lectures 19: Tests of hypothesis The Null and Alternative Hypotheses
Hypothesis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 363 / 463
Lectures 19: Tests of hypothesis The Null and Alternative Hypotheses
Exercise
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 364 / 463
Lectures 19: Tests of hypothesis The Null and Alternative Hypotheses
Exercise
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 365 / 463
Lectures 19: Tests of hypothesis The Null and Alternative Hypotheses
Exercise
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 366 / 463
Lectures 19: Tests of hypothesis Key Components of Test of Hypothesis
types of error
In Reality
H0 is TRUE H0 is FALSE
Correct Decision Type II error
Decision
θ̂ − θ0
test statistic =
̂
se(θ)
▸ Z -test statistic
▸ t -test statistic
▸ F -test statistic
▸ χ2 -test statistic
▸ ⋯
(4). Critical region: is the region of values that corresponds to the
rejection of the null hypothesis at some chosen probability level
▸ acceptance region
▸ rejection region
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 368 / 463
Lectures 19: Tests of hypothesis Key Components of Test of Hypothesis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 369 / 463
Lectures 19: Tests of hypothesis Key Components of Test of Hypothesis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 370 / 463
Lectures 19: Tests of hypothesis Key Components of Test of Hypothesis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 371 / 463
Lectures 19: Tests of hypothesis The p -value
The p -value
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 372 / 463
Lectures 19: Tests of hypothesis Basic Steps for Test of Hypothesis
H0 ∶ µ = 89
vs. Ha ∶ µ > 89
we generalize this situation and consider the following one-side test
⎧
⎪
⎪ H0 ∶ µ = µ0
⎨
⎪ µ > µ0
⎩ Ha ∶
⎪
Step 2: select the level of signicance level α = 5% , i.e., α = 0.05
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 374 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
92.2 − 89
zcal = √ = 0.9237
12/ 12
x̄ − µ0
zcal = √ ∼ N ( 0, 1) under H0
σ/ n
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 376 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Step 5: decision
since,
zcal = 0.9237 < 1.645
we can not reject H0 at signicance level 5%
Alternatively
we nd the same conclusion from the p -value
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 377 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Exercise
As hypotheses, we get
⎧
⎪
⎪ H0 ∶ µ=1
⎨
⎪
⎩ Ha ∶
⎪ µ>1
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 378 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Exam Question-1
Individuals ling federal income tax returns prior to March 31 had an
average refund of $1056. Consider the population of last minute lers who
mail their returns during the last 5 days of the income tax period typically
April 10 to April 15. A researcher suggests that one of the reasons
individuals wait until the last 5 days to le their returns is that on average
those individuals have a lower refund than early llers.
(b). Using 5% level of signicance, what is the critical value for the test
statistic and what is the rejection rule?
(c). For a sample of 400 individuals who led a return between April 10
and April 15, the sample mean refund was $910 and the sample
standard deviation was $1600. Compute the value of the test statistic.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 379 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 380 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 381 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Exam Question-2
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 382 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 383 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 384 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Exam Question-3
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 385 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 386 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 387 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 388 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Exam Question-4
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 389 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 390 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 391 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 392 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 393 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 394 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 395 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 396 / 463
Lectures 19: Tests of hypothesis Testing on a Single Mean
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 397 / 463
Lectures 20: Tests of hypothesis (Conti...)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 398 / 463
Lectures 20: Tests of hypothesis (Conti...)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 399 / 463
Lectures 20: Tests of hypothesis (Conti...)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 400 / 463
Lectures 20: Tests of hypothesis (Conti...)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 401 / 463
Lectures 20: Tests of hypothesis (Conti...)
▸ 2
known variance σ
▸ unknown variance σ 2
2 X has a general distribution, but we have a large sample size (n ≥ 30)
Hypothesis H0 ∶ µ = µ0 vs. H1 ∶ µ ≠ µ0
Assumption σ 2 is known σ 2 is unknown
any n n < 30 n ≥ 30
Test statistic zcal = (x̄−µ0 )
√
σ/ n
tcal = (x̄−µ0 )
σ
√
̂/ n
zcal = (x̄−µ0 )
σ
√
̂/ n
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 402 / 463
Lectures 20: Tests of hypothesis (Conti...)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 403 / 463
Lectures 20: Tests of hypothesis (Conti...) Testing on a Single Proportion
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 404 / 463
Lectures 20: Tests of hypothesis (Conti...) Testing on a Single Proportion
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 405 / 463
Lectures 20: Tests of hypothesis (Conti...) Testing on a Single Proportion
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 406 / 463
Lectures 20: Tests of hypothesis (Conti...) Testing on a Single Proportion
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 407 / 463
Lectures 20: Tests of hypothesis (Conti...) Testing on a Single Proportion
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 408 / 463
Lecture 21: Correlation Analysis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 409 / 463
Lecture 21: Correlation Analysis
21.2 Covariance
21.4 Exercises
21.6 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 410 / 463
Lecture 21: Correlation Analysis Scatter Diagram
Scatter Diagram
a scatter diagram is a graphic tool used to display the relationship between
two variables
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 411 / 463
Lecture 21: Correlation Analysis Covariance
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 412 / 463
Lecture 21: Correlation Analysis Covariance
Sample Covariance
what is the nature of the relationship between x and y?
sample covariance:
∑(xi − x̄)(yi − xy
¯)
sxy =
n−1
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 413 / 463
Lecture 21: Correlation Analysis Covariance
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 414 / 463
Lecture 21: Correlation Analysis Correlation Coecient
Correlation analysis
▸▸ a group of techniques to measure the relationship between two variables
Pearson's Correlation coecient
is a measure of the strength of the linear relationship between two
variables
n
∑ (xi − x̄)(yi − ȳ )
i=1
rxy =√ √
n n
∑ (xi − x̄)2 ∑ (yi − ȳ )2
i=1 i=1
n
1
where x̄ = n ∑ xi (the sample mean); and analogously for ȳ
i=1
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 415 / 463
Lecture 21: Correlation Analysis Correlation Coecient
n
∑ xi yi − nx̄ ȳ
i=1
rxy =¿ ¿
Á n Á n
À( ∑ x 2 − nx̄ 2 ) Á
Á À( ∑ y 2 − nȳ 2 )
i= 1 i
i= 1 i
sxy
=
sx sy
√
sx = 1 ∑ n
2
where
n−1 i=1(xi − x̄) (the sample standard deviation); and
analogously for sy
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 416 / 463
Lecture 21: Correlation Analysis Correlation Coecient
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 417 / 463
Lecture 21: Correlation Analysis Correlation Coecient
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 418 / 463
Lecture 21: Correlation Analysis Correlation Coecient
Correlation Coecient
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 419 / 463
Lecture 21: Correlation Analysis Correlation Coecient
Correlation Coecient
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 420 / 463
Lecture 21: Correlation Analysis Correlation Coecient
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 421 / 463
Lecture 21: Correlation Analysis Correlation Coecient
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 422 / 463
Lecture 21: Correlation Analysis Exercises
Exercises
1 The following sample of observations were randomly selected.
x 4 5 3 6 10
y 4 6 5 7 7
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 423 / 463
Lecture 21: Correlation Analysis Exercises
Exercises
Car Age (years) Selling Price ($000) Car Age (years) Selling Price ($000)
1 9 8.1 7 8 7.6
2 7 6.0 8 11 8.0
3 11 3.6 9 10 8.0
4 12 4.0 10 12 6.0
5 8 5.0 11 6 8.6
6 7 10.0 12 6 8.0
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 424 / 463
Lecture 21: Correlation Analysis Test for Signicance Using Correlation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 425 / 463
Lecture 21: Correlation Analysis Test for Signicance Using Correlation
Another Example
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 426 / 463
Lecture 21: Correlation Analysis Test for Signicance Using Correlation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 427 / 463
Lecture 21: Correlation Analysis Test for Signicance Using Correlation
Example
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 428 / 463
Lecture 21: Correlation Analysis Test for Signicance Using Correlation
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 429 / 463
Lecture 21: Correlation Analysis Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 430 / 463
Lecture 21: Correlation Analysis Homework
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 430 / 463
Lecture 21: Correlation Analysis Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 431 / 463
Lecture 22: Regression Analysis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 432 / 463
Lecture 22: Regression Analysis
22.3 Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 433 / 463
Lecture 22: Regression Analysis Concept of Regression Analysis
Regression Analysis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 434 / 463
Lecture 22: Regression Analysis Concept of Regression Analysis
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 435 / 463
Lecture 22: Regression Analysis Linear Regression Model
yi = β0 + β1 xi + εi ; i = 1, 2, . . . , n
ŷi = b0 + b1 xi
b0 = ȳ − b1 x̄
n
∑ xi yi − nx̄ ȳ
b1 =
i= 1
∑ xi2 − nx̄ 2
n
i= 1
these estimators are also called ordinary least square (OLS) estimators
the expression of b1 can also be written as
n
∑ (xi − x̄)(yi − ȳ )
b1 =
i= 1
∑ (xi − x̄)2
n
i= 1
because
n n n n
2 2
∑(xi − x̄) = ∑ xi − nx̄
2 and ∑(xi − x̄)(yi − ȳ ) = ∑ xi yi − nx̄ ȳ
i= 1 i= 1 i= 1 i= 1
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 437 / 463
Lecture 22: Regression Analysis Linear Regression Model
the sign of b1 and the sign of r, the correlation coecient, are always
the same
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 438 / 463
Lecture 22: Regression Analysis Linear Regression Model
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 439 / 463
Lecture 22: Regression Analysis Linear Regression Model
Example
To illustrate the least squares method, suppose data were collected from a
sample of 10 Armand's Pizza Parlor restaurants located near college
campuses. For the i th observation or restaurant in the sample, xi is the
size of the student population (in thousands) and yi is the quarterly sales
(in thousands of dollars).
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 440 / 463
Lecture 22: Regression Analysis Linear Regression Model
(i). Show the relationship between the size of student population and the
quarterly sales by using scatter diagram. Make a comment on the
scatter diagram.
(ii). Write down the regression model and estimated regression equation.
(ii). Find the least square estimates and write the estimated regression
equation. Interpret the results.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 441 / 463
Lecture 22: Regression Analysis Linear Regression Model
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 442 / 463
Lecture 22: Regression Analysis Linear Regression Model
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 443 / 463
Lecture 22: Regression Analysis Linear Regression Model
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 444 / 463
Lecture 22: Regression Analysis Linear Regression Model
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 445 / 463
Lecture 22: Regression Analysis Linear Regression Model
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 446 / 463
Lecture 22: Regression Analysis Linear Regression Model
ŷ = 60 + 5 × 16 = 140
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 447 / 463
Lecture 22: Regression Analysis Linear Regression Model
Classroom Practice
1 The following sample of observations were randomly selected.
x 4 5 3 6 10
y 4 6 5 7 7
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 448 / 463
Lecture 22: Regression Analysis Linear Regression Model
n
∑ xi yi − nx̄ ȳ
b1 =
i= 1 = 0.3630 and b0 = ȳ − b1 x̄ = 3.7671
∑ xi2 − nx̄ 2
n
i=1
ŷ = b0 + b1 x = 3.7671 + 0.3630x
when x is 7.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 449 / 463
Lecture 22: Regression Analysis Linear Regression Model
Classroom Practice
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 450 / 463
Lecture 22: Regression Analysis Linear Regression Model
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 451 / 463
Lecture 22: Regression Analysis Homework
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 452 / 463
Lecture 23: Regression Analysis (Conti...)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 453 / 463
Lecture 23: Regression Analysis (Conti...)
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 454 / 463
Lecture 23: Regression Analysis (Conti...) Coecient of Determination (R 2 )
Coecient of Determination
the total sum of squares (proportional to the variance of the data):
SStot = ∑(yi − ȳ )2 ,
i
The sum of squares of residuals, also called the residual sum of
squares: SSres = ∑(yi − ŷi )2 = ∑ ε2i
i i
the coecient of determination is the proportion of the total variation
in the dependent variable Y that is explained, or accounted for, by
the variation in the independent variable X
2
it is denoted by R and dened by
R2 ≡ 1 −
SSres
SStot
Suppose R 2 = 0.49. This implies that 49% of the variability of the
dependent variable has been accounted for, and the remaining 51% of
the variability is still unaccounted for
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 455 / 463
Lecture 23: Regression Analysis (Conti...) Coecient of Determination (R 2 )
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 456 / 463
Lecture 23: Regression Analysis (Conti...) Adjusted R 2
Adjusted R 2
n−1
R̄ 2 = 1 − (1 − R 2 )
n−p−1
where p is the total number of independent variables in the model
(not including the constant term), and n is the sample size
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 457 / 463
Lecture 23: Regression Analysis (Conti...) Adjusted R 2
Exercises (Conti...)
1 The following sample of observations were randomly selected.
x 4 5 3 6 10
y 4 6 5 7 7
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 458 / 463
Lecture 23: Regression Analysis (Conti...) Adjusted R 2
Classroom Practice
1 The following sample of observations were randomly selected.
No. of TV Commercials (x ) 2 5 1 3 4 1 5 3 4 2
Total Sales(y ) 50 57 41 54 54 38 63 48 59 46
(a). Find the linear relationship between number of TV Commercials and total
sales.
(b). Fit a model y on x .
(c). Find the coecient of determination. Interpret your ndings.
(d). Find the adjusted R 2 . Interpret your ndings.
(e). Predict (or forecast) total sales when x = 5.
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 459 / 463
Lecture 23: Regression Analysis (Conti...) Homework
Homework
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 460 / 463
Lecture 24: Review Class
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 461 / 463
Lecture 24: Review Class
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 462 / 463
Lecture 24: Review Class Syllabus for the Final Exam and Marks Distribution
Dr. Md. Rezaul Karim (Assoticate Professor, Department of Statistics, JU) Statistics Spring - 2023 463 / 463