DOTE 2011 | Fall 2024
@ CUHK Business School
Statistical Analysis for Business Decisions
Sample Distribution
Yunduan Lin
Assistant Professor
Department of Decisions, Operations and Technology
CUHK Business School
Agenda
Statistical Analysis for Business Decisions
01 Law of Large Numbers
o Population and sample
o Property of sample mean
02 Central Limit Theorem
o Approximation of sample mean
Homework 1 – 1(d)
KURT function in Excel:
Returns the sample excess kurtosis
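As a quick check, here is a minimal Python sketch of the same quantity; it assumes Excel's documented formula for KURT and uses scipy's bias-corrected kurtosis only for comparison.

import numpy as np
from scipy.stats import kurtosis

def kurt_excel(x):
    # Sample excess kurtosis, following the formula documented for Excel's KURT:
    # n(n+1)/((n-1)(n-2)(n-3)) * sum(((x - mean)/s)^4) - 3(n-1)^2/((n-2)(n-3))
    x = np.asarray(x, dtype=float)
    n = len(x)
    s = x.std(ddof=1)  # sample standard deviation
    z4 = np.sum(((x - x.mean()) / s) ** 4)
    return (n * (n + 1) / ((n - 1) * (n - 2) * (n - 3)) * z4
            - 3 * (n - 1) ** 2 / ((n - 2) * (n - 3)))

data = [2, 4, 4, 4, 5, 5, 7, 9]
print(kurt_excel(data))                         # should match KURT(...) in Excel
print(kurtosis(data, fisher=True, bias=False))  # scipy's bias-corrected value, for comparison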
Homework 1 – 3(b)
o We are asking about the value of a conditional probability
o Use Bayes' theorem (or you can also start from the definition of conditional probability); the formula is written out below
o Some terms in the equation are not directly given.
o There is also some information in the statement that has not been used yet. How do we relate them?
▪ "A and B both happen" versus "A happens but B does not"
o Still, some terms in the equation are not directly given, but they are easy to derive.
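For reference, Bayes' theorem in the form that is useful here:

$$P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B)} = \frac{P(B \mid A)\,P(A)}{P(B \mid A)\,P(A) + P(B \mid A^c)\,P(A^c)}$$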
Homework 1 – 3(c)
o Either A or B = union (the case where both A and B happen is counted only once); see the formula below
o Both A and B = intersection
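In symbols, these are related by the inclusion-exclusion formula:

$$P(A \cup B) = P(A) + P(B) - P(A \cap B)$$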
Homework 1 – 3(e)
o How to interpret these sentences?
o Define the events: A - has the disease; B - has a positive report
o What do these numbers mean, and what is the problem asking for? (See the setup below.)
▪ 90% of those who have the disease will get a positive result -> a fact: P(B | A) = 0.90
▪ 10% of those who do not have the disease will get a positive result -> a fact: P(B | A^c) = 0.10
▪ The probability that a person has the disease given a positive report -> what we care about: P(A | B)
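Putting these pieces into Bayes' theorem gives the target quantity (the prevalence P(A) is given in the homework statement and is not reproduced on this slide):

$$P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B \mid A)\,P(A) + P(B \mid A^c)\,P(A^c)} = \frac{0.90\,P(A)}{0.90\,P(A) + 0.10\,\bigl(1 - P(A)\bigr)}$$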
Quiz 1 - 1
Combinations (true or false):
o Choosing r objects from n objects implies that (n-r) objects remain.
o Choosing (n-r) objects from n objects implies that r objects remain.
o So every way of choosing r objects corresponds to exactly one way of choosing (n-r) objects, and the two counts are equal (see the identity below).
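In symbols, this is the symmetry of binomial coefficients:

$$\binom{n}{r} = \binom{n}{n-r} = \frac{n!}{r!\,(n-r)!}$$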
Quiz 1 - 2
Pick one number from 1 to 1000 (1 and 1000 included). Suppose every number is
equally likely to be chosen. What is the probability that the number picked is not divisible
by either 2 or 5?
o Sample space = {1, 2, …, 1000}
o Every other integer is divisible by 2, so there are 500 integers divisible by 2.
o Every fifth integer is divisible by 5, so there are 200 integers divisible by 5.
o Every tenth integer is divisible by both 2 and 5, so there are 100 such integers.
o There are 500 + 200 - 100 = 600 integers divisible by either 2 or 5.
o Therefore, the required probability is (1000 - 600)/1000 = 400/1000 = 0.4 (verified by the quick check below).
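A quick brute-force check of the count (a throwaway sketch):

count = sum(1 for k in range(1, 1001) if k % 2 != 0 and k % 5 != 0)
print(count, count / 1000)  # 400 numbers, probability 0.4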
Quiz 1 - 3
A student has to sell 2 books from a collection of 6 math, 7 science, and 4 economics
books. How many choices are possible if both books are to be on the same subject?
There are 3 cases (summed up after this list):
o Two math books
o Two science books
o Two economics books
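Counting each case with combinations and adding them up:

$$\binom{6}{2} + \binom{7}{2} + \binom{4}{2} = 15 + 21 + 6 = 42 \text{ choices}$$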
Recap - Discrete Random Variable
Mean, variance, and PMF for each distribution (see the summary after this list)
Bernoulli
o Binary outcome
Binomial
o Count of successes for repeated independent trials
Poisson
o Count of events over a continuous time interval
o The binomial approaches the Poisson when n is very large and p is very small
o Can be used to approximate the binomial and is easy to calculate, because it has only 1 parameter
o Its PMF involves the constant e ≈ 2.718
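For reference, the standard formulas for these distributions are:

$$\text{Bernoulli}(p):\ P(X=1)=p,\ P(X=0)=1-p, \qquad \mu = p,\ \ \sigma^2 = p(1-p)$$
$$\text{Binomial}(n,p):\ P(X=x)=\binom{n}{x}p^x(1-p)^{n-x}, \qquad \mu = np,\ \ \sigma^2 = np(1-p)$$
$$\text{Poisson}(\lambda):\ P(X=x)=\frac{\lambda^x e^{-\lambda}}{x!}, \qquad \mu = \lambda,\ \ \sigma^2 = \lambda$$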
Recap - Continuous Random Variable
Mean, variance, and PDF for each distribution (see the summary after this list)
Exponential
o Time between independent random events
o Poisson: event count -> exponential: time between events
o Memoryless property: for the exponential distribution, we have $P(X > s + t \mid X > s) = P(X > t)$
o e.g., the life of a light bulb
Normal
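For reference, the standard formulas for these distributions are:

$$\text{Exponential}(\lambda):\ f(x)=\lambda e^{-\lambda x},\ x \ge 0, \qquad \mu = \frac{1}{\lambda},\ \ \sigma^2 = \frac{1}{\lambda^2}$$
$$\text{Normal}(\mu,\sigma^2):\ f(x)=\frac{1}{\sqrt{2\pi}\,\sigma}\,e^{-\frac{(x-\mu)^2}{2\sigma^2}}, \qquad \text{mean } \mu,\ \ \text{variance } \sigma^2$$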
Population and Sample
Population
o The objects we would like to know about
o e.g., age and incomes of individuals in a city, satisfaction level of consumers
Sample
o A subset of the population
Goal of Inference
Use a representative sample (the small picture) to make an educated guess about the
population (the big picture)
Population and Sample
Population
o represented by bar chart/histogram
o summarized by (relative) frequency table f(x)
o mean: μ; variance: σ²
Sample
o an observation from the population
Random Sample
o A random draw from the population
o A random variable whose probability function is the same as the frequency table f(x)
o For a sample of size n, we write X1, X2, . . . , Xn
Simple Random Sample - Definition
Simple Random Sample: the most basic random sample
o Each element has an equal probability of being selected.
o Each element is selected independently.
Explanation:
X1, …, Xn is a simple random sample if
o X1, …, Xn are independent random variables, and
o X1, …, Xn follow the same probability function: P(x) (probability mass function) or f(x) (probability density function)
Simple Random Sample - Property
Consider a population with mean μ and variance σ².
Property of Simple Random Sample:
If X1, …, Xn is a simple random sample, then
o E[Xi] = μ and Var(Xi) = σ² for every i = 1, …, n
A simple random sample in fact has an even stronger property:
o Each observation Xi follows the same distribution as the population, i.e., it has the same
probability function f(x) as the population.
o This includes all summary statistics, not only the mean and variance.
Other Sampling Methods
Simple random sample is simple but difficult to achieve in practice:
o Online surveys likely exclude seniors who do not use the internet often
o Samples from offline surveys are likely to be dependent due to geographical correlation (e.g.,
economic condition, location preference)
o An advanced sampling method to reduce sampling error: stratified sampling - divide the population into
subgroups (strata), draw a simple random sample within each subgroup, and take a weighted average
across subgroups
Statistics - Definition
Statistic:
A function of the sample X1, ..., Xn
o Data summary
o Data reduction (simplification)
Examples: sample mean, sample variance
Sample Mean - Definition
Sample mean: the mean of a sample,
$$\bar{X} = \frac{X_1 + X_2 + \cdots + X_n}{n} = \frac{1}{n}\sum_{i=1}^{n} X_i$$
It is useful for guessing the population mean.
o The sample mean varies sample by sample.
o The sample mean is therefore itself a random variable. Hence, we can also derive the expectation
and variance of the sample mean.
Sample Mean - Expectation
Expectation of sample mean: $E[\bar{X}] = \mu$
The expectation of the sample mean is the population mean.
Intuition:
o If we sample many times, the average of all the sample means is the population mean
o This nice property is known as unbiasedness (see next chapter)
Sample Mean - Expectation
Average of sample means: rolling a die (infinitely) many times
o Amy rolls a die 5 times; the mean of Amy's sample is computed from her 5 results.
o Charlie rolls a die 10 times; the mean of Charlie's sample is computed from his 10 results.
o Averaging many such sample means gets close to the population mean of 3.5 (see the simulation sketch below).
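A minimal simulation sketch of this intuition (not from the slides), showing that the average of many sample means of fair-die rolls approaches the population mean 3.5:

import numpy as np

rng = np.random.default_rng(0)

def average_of_sample_means(sample_size, num_samples=10_000):
    # Draw many samples of fair-die rolls and average their sample means.
    rolls = rng.integers(1, 7, size=(num_samples, sample_size))
    return rolls.mean(axis=1).mean()

print(average_of_sample_means(5))   # Amy: samples of size 5, close to 3.5
print(average_of_sample_means(10))  # Charlie: samples of size 10, close to 3.5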
Sample Mean - Expectation Example
Example:
Consider a population with three numbers: 1, 2, and 3, each with the same probability.
o Population mean: μ = (1 + 2 + 3)/3 = 2
o Consider a sample with size n = 1: the sample mean can be one of {1, 2, 3} with the same probability.
The expectation of the sample mean for n = 1 is (1 + 2 + 3)/3 = 2.
o Consider a sample with size n = 2: the sample mean can be one of the following 9 equally likely
results. The expectation of the sample mean for n = 2 is the average of the 9 entries, which is again 2.
x1\x2 1 2 3
1 1 1.5 2
2 1.5 2 2.5
3 2 2.5 3
The sample size can be larger (even larger than 3), and then there are more possibilities.
Sample Mean – Expectation Proof
Key fact used: the linearity of expectation (the expectation of a sum equals the sum of the
expectations); the full derivation is written out below.
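The omitted derivation is the standard one-line argument:

$$E[\bar{X}] = E\!\left[\frac{1}{n}\sum_{i=1}^{n} X_i\right] = \frac{1}{n}\sum_{i=1}^{n} E[X_i] = \frac{1}{n}\cdot n\mu = \mu$$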
Sample Mean - Variance
Variance of sample mean: $\mathrm{Var}(\bar{X}) = \sigma^2 / n$
o It is not the sample variance!
o It is the population variance divided by the sample size.
Standard error of the sample mean: $\sigma / \sqrt{n}$
o The standard deviation of a statistic is often called its standard error.
o The standard error of the sample mean is simply its standard deviation, $\sqrt{\mathrm{Var}(\bar{X})} = \sigma/\sqrt{n}$.
Sample Mean - Variance Example
Example:
Consider a population with three numbers: 1, 2, and 3, each with the same probability.
o Population mean: μ = 2; population variance: σ² = [(1-2)² + (2-2)² + (3-2)²]/3 = 2/3
o Consider a sample with size n = 1: the expectation of the sample mean is 2.
Therefore, the variance of the sample mean is [(1-2)² + (2-2)² + (3-2)²]/3 = 2/3 = σ²/1.
o Consider a sample with size n = 2: the 9 equally likely sample means below have expectation 2. Therefore, the
variance of the sample mean is 1/3 = σ²/2 (confirmed by the enumeration sketch after the table).
x1\x2 1 2 3
1 1 1.5 2
2 1.5 2 2.5
3 2 2.5 3
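A short enumeration check of these numbers (a throwaway sketch, not from the slides):

import numpy as np

population = np.array([1, 2, 3])
# All 9 equally likely samples of size 2 (with replacement) and their sample means.
means = np.array([(x1 + x2) / 2 for x1 in population for x2 in population])
print(population.var())  # population variance: 2/3
print(means.mean())      # expectation of the sample mean: 2.0
print(means.var())       # variance of the sample mean: 1/3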
Sample Mean – Variance Proof
Key facts used: the scaling property of variance, Var(aX) = a² Var(X), and that the variance of a
sum equals the sum of the variances when the terms are independent; the derivation is below.
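The omitted derivation:

$$\mathrm{Var}(\bar{X}) = \mathrm{Var}\!\left(\frac{1}{n}\sum_{i=1}^{n} X_i\right) = \frac{1}{n^2}\sum_{i=1}^{n} \mathrm{Var}(X_i) = \frac{1}{n^2}\cdot n\sigma^2 = \frac{\sigma^2}{n}$$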
Sample Mean – Large Samples
When the sample size gets larger:
o As the sample size n grows, the variance of the sample mean, σ²/n, shrinks.
o Moreover, the variance vanishes as n goes to infinity, that is, $\mathrm{Var}(\bar{X}) = \sigma^2/n \to 0$.
o As the variance vanishes when n gets larger, the sample mean eventually gets very close to the population
mean, that is, $\bar{X} \approx \mu$ for large n.
Law of Large Numbers
Let X1, . . . , Xn be a random sample from a distribution with mean μ and variance σ².
Law of large numbers:
For any ε > 0, when n is sufficiently large, $\bar{X}$ is within ε of μ with probability close to 1.
Or more rigorously,
$$\lim_{n \to \infty} P\bigl(|\bar{X} - \mu| \le \varepsilon\bigr) = 1 \quad \text{for every } \varepsilon > 0$$
Loosely speaking, when the sample size is large, the variation disappears and the sample mean becomes the
population mean. Or: with a larger sample, the sample mean is closer to the population mean, and it can be
as close as we want. (A small simulation sketch follows.)
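A minimal simulation sketch of the law of large numbers (not from the slides): the running sample mean of draws from an Exponential(λ) population settles at the population mean 1/λ.

import numpy as np

rng = np.random.default_rng(1)
lam = 2.0  # rate of the Exponential population; population mean is 1/lam = 0.5
draws = rng.exponential(scale=1 / lam, size=100_000)
running_mean = np.cumsum(draws) / np.arange(1, draws.size + 1)

for n in (10, 100, 1_000, 100_000):
    print(n, running_mean[n - 1])  # approaches 0.5 as n grows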
Law of Large Numbers
Markov inequality
o Consider a nonnegative random variable X. Then, for all t > 0,
$$E[X] \ge E\bigl[X \cdot \mathbf{1}\{X \ge t\}\bigr] \ge t\,P(X \ge t)$$
o Hence, we get the Markov inequality: $P(X \ge t) \le E[X]/t$.
Chebyshev inequality
o Consider the nonnegative random variable (X - μ)². Then, by the Markov inequality,
$$P\bigl((X-\mu)^2 \ge t^2\bigr) \le \frac{E[(X-\mu)^2]}{t^2} = \frac{\sigma^2}{t^2}$$
o Hence, we get the Chebyshev inequality: $P(|X - \mu| \ge t) \le \sigma^2/t^2$.
Law of Large Numbers
o As we have the Chebyshev inequality, $P\bigl(|\bar{X} - \mu| \ge \varepsilon\bigr) \le \mathrm{Var}(\bar{X})/\varepsilon^2$.
o Then, since $\mathrm{Var}(\bar{X}) = \sigma^2/n$, we have $P\bigl(|\bar{X} - \mu| \ge \varepsilon\bigr) \le \dfrac{\sigma^2}{n\varepsilon^2} \to 0$.
o Taking the limit on both sides, we arrive at the law of large numbers.
Sample Mean – Large Samples
When the sample size gets larger:
o The law of large numbers says that the sample mean is eventually close to μ.
o But the sample mean itself is still a random variable. What is the distribution function of the sample
mean when n becomes larger?
o Here we mean the distribution of the sample mean, RATHER THAN the distribution of a single observation.
o The answer: always (approximately) a normal distribution, regardless of what the population looks like.
Sample Mean - Distribution Example
Example:
Consider population has three numbers: 1, 2, and 3, each with the same probability.
Let's look at the CDF of the sample mean for different sample sizes.
[Figure: CDFs of the sample mean for n = 1, 2, 10, 100, 1000, and 10000; as n grows, the CDF approaches that of a normal distribution.]
Central Limit Theorem
Central limit theorem:
The sample mean approximately follows a normal distribution when the sample is large enough.
When n gets large, we have
$$\bar{X} \approx N\!\left(\mu, \frac{\sigma^2}{n}\right) \qquad \text{or} \qquad \frac{\bar{X} - \mu}{\sigma/\sqrt{n}} \approx N(0, 1)$$
Rule of thumb: sample size n is at least 35. (A small simulation sketch follows.)
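A minimal simulation sketch (not from the slides): sample means drawn from a very non-normal population still behave like a normal distribution once n is moderately large.

import numpy as np

rng = np.random.default_rng(2)
n, num_samples = 50, 20_000

# Population: Exponential(1), heavily skewed, with mean 1 and variance 1.
sample_means = rng.exponential(scale=1.0, size=(num_samples, n)).mean(axis=1)

print(sample_means.mean())  # close to the population mean 1
print(sample_means.std())   # close to sigma / sqrt(n) = 1 / sqrt(50), about 0.141
# Under normality, about 95% of sample means fall within 1.96 standard errors of the mean:
se = 1 / np.sqrt(n)
print(np.mean(np.abs(sample_means - 1) <= 1.96 * se))  # close to 0.95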
Central Limit Theorem - Example
Example:
Consider a population with mean 5 and variance 64. Consider a sample of size 100. What is the
probability that the sample mean is no more than 4?
No matter what the distribution of the population is, we can use the normal distribution to approximate
the distribution of the sample mean when the sample size is 100.
By the central limit theorem, we have
$$P(\bar{X} \le 4) = P\!\left(\frac{\bar{X} - 5}{8/\sqrt{100}} \le \frac{4 - 5}{8/\sqrt{100}}\right) \approx P(Z \le -1.25) \approx 0.1056$$
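A one-line check of this value (a throwaway sketch using scipy):

from scipy.stats import norm
print(norm.cdf(4, loc=5, scale=8 / 100 ** 0.5))  # about 0.1056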
Central Limit Theorem - Binary Variable
Example:
Consider a population that follows a Bernoulli distribution, which means that each element in the
population is either 0 or 1, and the probability of having 1 (success) is p.
o Population mean: μ = p
o Population variance: σ² = p(1 - p)
Central limit theorem for binary variable:
When n gets large, we have
$$\bar{X} \approx N\!\left(p, \frac{p(1-p)}{n}\right) \qquad \text{or, for the total count,} \qquad \sum_{i=1}^{n} X_i \approx N\bigl(np,\ np(1-p)\bigr)$$
Rule of thumb: good approximation when np and n(1−p) are at least 5.
Central Limit Theorem - Binary Variable
Comparison between binomial distribution and its normal approximation:
[Figure: binomial PMF versus its normal approximation for n = 1, 2, 5, 10, 30, and 100; the approximation improves as n grows.]
Central Limit Theorem - Binary Variable Example
Example:
Let X follow a binomial distribution with n = 100 and p = 0.6. What is the probability that X is less than
55?
Check first that np = 100(0.6) = 60 and n(1−p) = 100(0.4) = 40 are both at least 5.
We can use the normal approximation (worked out in the sketch below):
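A sketch of the computation, without a continuity correction, followed by a quick scipy check against the exact binomial probability (the slide's own numbers are not reproduced here):

$$P(X < 55) \approx P\!\left(Z < \frac{55 - 60}{\sqrt{100 \cdot 0.6 \cdot 0.4}}\right) = P(Z < -1.02) \approx 0.15$$

from scipy.stats import binom, norm
print(binom.cdf(54, 100, 0.6))                               # exact P(X < 55) = P(X <= 54)
print(norm.cdf(55, loc=60, scale=(100 * 0.6 * 0.4) ** 0.5))  # normal approximation, about 0.15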
A Feedback Form for the Entire Term
https://docs.google.com/forms/d/e/1FAIpQLSfsEgnMFLypI_KW6GF7j_FXtVY5E4Jrmf2P_BDwaG8GXWDc0A/viewform?usp=sf_link