
Sampling Unit 7

This document discusses cluster sampling methodology. It defines what a cluster is and provides examples of clusters. It then discusses simple one-stage cluster sampling and multi-stage cluster sampling. It also covers the two key reasons for using cluster sampling: feasibility and cost-effectiveness.

Uploaded by

yonasante2121


Chapter 7: Cluster Sampling

7.1 Definition

A cluster, when used in sample survey methodology, can be defined as any sampling
unit, containing a set of elements, treated as a single unit for the purpose of selecting a
sample. The unit can be geographical, temporal, or spatial in nature. Some practical
examples of clusters are as follows.
Cluster           Listing unit   Elementary unit   Application
City block        Household      Person            Estimation of total persons in a city
School            Classroom      Student           Estimation of mean academic achievement among students in a district
Week              Day            Day               Estimation of all days having maximum rainfall or temperature
District/Woreda   Hospital       Patient           Estimation of the proportion discharged dead in a particular state
Village           Farm           Farm              Estimation of production

For example, a list containing the 52 calendar weeks can be compiled and a sample of the
weeks can be selected from this list. For each of the weeks selected in the sample, a
sample of days can be selected, and on each sample day measurements of rainfall or
temperature can be made.
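
The week-then-day selection described above can be sketched in Python. This is only an illustration: the function name `two_stage_week_day_sample`, the sample sizes, and the use of simple random sampling at both stages are assumptions, not part of the original notes.

```python
import random

def two_stage_week_day_sample(n_weeks, n_days_per_week, seed=None):
    """Select weeks first, then days within each selected week."""
    rng = random.Random(seed)
    # First-stage frame: the 52 calendar weeks (the clusters).
    weeks = list(range(1, 53))
    sampled_weeks = sorted(rng.sample(weeks, n_weeks))
    sample = {}
    for week in sampled_weeks:
        # Second-stage frame: the 7 days of this selected week only.
        days = list(range(1, 8))
        sample[week] = sorted(rng.sample(days, n_days_per_week))
    return sample

# Hypothetical sizes: 4 weeks, 2 measurement days in each.
sample = two_stage_week_day_sample(n_weeks=4, n_days_per_week=2, seed=7)
for week, days in sample.items():
    print(f"week {week}: measure rainfall or temperature on days {days}")
```

Rainfall or temperature would then be measured only on the days returned for each selected week.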

Cluster sampling is any sampling plan that uses a frame consisting of clusters of listing
units. The sampling plan is often characterized in terms of the number of stages involved,
working down from larger clusters to smaller ones. We can select a sample of clusters by
simple random sampling or by systematic sampling. We can also group the clusters into
strata and take a stratified random sample of clusters.
• Simple One-Stage Cluster Sampling: a sampling plan in which clusters are chosen by
simple random sampling in a single step, and every listing unit within each of the
selected clusters is included in the sample.
• Multi-Stage Cluster Sampling: a sampling plan in which sampling is carried out in
several stages. That is, at each stage a sample of clusters is selected from within the
clusters chosen at the previous stage. More than one sampling frame might be
involved in the process.
After the first stage of sampling, the sampling frame is compiled from only those clusters
chosen in the sample. Once the sample clusters are selected at the first stage, the listing of
second stage sampling units is compiled only for the sample clusters. Likewise, if there
are more than two stages of sampling, sampling units at any later stage are listed only for
those sampling units selected at the previous stage.
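
A minimal sketch of simple one-stage cluster sampling follows. The frame of city blocks and households is hypothetical, and `one_stage_cluster_sample` is an illustrative name; the point is only that listing units are looked up for the selected clusters and for no others.

```python
import random

# Hypothetical frame: city blocks (clusters) and the households they contain.
frame = {
    "block_A": ["A1", "A2", "A3"],
    "block_B": ["B1", "B2", "B3"],
    "block_C": ["C1", "C2", "C3"],
    "block_D": ["D1", "D2", "D3"],
}

def one_stage_cluster_sample(frame, m, seed=None):
    """SRS of m clusters; every listing unit in a chosen cluster is kept."""
    rng = random.Random(seed)
    chosen = rng.sample(sorted(frame), m)
    # Listing units are needed only for the clusters that were selected.
    return {cluster: list(frame[cluster]) for cluster in chosen}

picked = one_stage_cluster_sample(frame, m=2, seed=1)
for cluster, households in picked.items():
    print(cluster, households)
```

In a multi-stage design the inner step would draw a further sample from each selected cluster instead of keeping all of its units.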

Why is cluster sampling widely used? The two most important reasons why cluster
sampling is so widely used in practice are feasibility and economy.
Cluster sampling is often the only feasible method of sampling because the only sampling
frames readily available for the target population are lists of clusters. This is especially
true for the surveys of human populations for which the household serves as the listing
unit. It is almost never feasible in terms of time and resources to compile a list of
households for any sizable population for the sole purpose of conducting a survey.
However, lists of higher clusters (geographical units) can be compiled relatively easily,
and these can serve as the sampling frame.
Cluster sampling is often the most economical form of sampling. Not only are listing
costs almost always lowest for cluster sampling, but also traveling costs are often lowest.

One disadvantage of cluster sampling is that the standard errors of estimates obtained
from cluster sampling designs are often high compared with those obtained from samples
of the same number of listing units chosen by other sampling designs. Another problem is
that the costs and problems of statistical analysis are greater.
In this section, we treat only a simple one-stage cluster sampling having clusters of equal
size.

7.2 Simple One-Stage Cluster Sampling (Cluster of Equal Size)

Suppose that a population has M clusters and L units in each cluster. The total number of
population units is N = ML, and all L units of a selected cluster are included in the
sample. The structure of the clusters with their observations is shown below.
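$$
\begin{array}{c|cccccc}
\text{Units} & \text{Cluster } 1 & 2 & \cdots & i & \cdots & M \\ \hline
1 & Y_{11} & Y_{21} & \cdots & Y_{i1} & \cdots & Y_{M1} \\
2 & Y_{12} & Y_{22} & \cdots & Y_{i2} & \cdots & Y_{M2} \\
\vdots & \vdots & \vdots & & \vdots & & \vdots \\
j & Y_{1j} & Y_{2j} & \cdots & Y_{ij} & \cdots & Y_{Mj} \\
\vdots & \vdots & \vdots & & \vdots & & \vdots \\
L & Y_{1L} & Y_{2L} & \cdots & Y_{iL} & \cdots & Y_{ML} \\ \hline
\text{C.T.} & Y_1 & Y_2 & \cdots & Y_i & \cdots & Y_M \\
\text{C.M.} & \bar{Y}_1 & \bar{Y}_2 & \cdots & \bar{Y}_i & \cdots & \bar{Y}_M
\end{array}
$$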
where C.T. is the cluster total and C.M. is the cluster mean. The following notation is
used for the population:

$Y_{ij}$ = value obtained for listing unit $j$ in population cluster $i$ ($i = 1, 2, \ldots, M$; $j = 1, 2, \ldots, L$),

$Y_i = \sum_{j=1}^{L} Y_{ij}$ = aggregate of characteristic $y$ for the $i$th population cluster,

$Y = \sum_{i=1}^{M} Y_i = \sum_{i=1}^{M}\sum_{j=1}^{L} Y_{ij}$, the population total for characteristic $y$,

$\bar{Y}_i = \dfrac{Y_i}{L} = \dfrac{\sum_{j=1}^{L} Y_{ij}}{L}$, the mean for cluster $i$,

$\bar{Y} = \dfrac{Y}{N} = \dfrac{\sum_{i=1}^{M} Y_i}{ML} = \dfrac{1}{M}\sum_{i=1}^{M}\left(\dfrac{\sum_{j=1}^{L} Y_{ij}}{L}\right) = \dfrac{\sum_{i=1}^{M} \bar{Y}_i}{M}$, the mean per population unit, or the mean of
the M cluster means.

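Population Variance:

$S^2 = \dfrac{\sum_{i=1}^{M}\sum_{j=1}^{L} (Y_{ij} - \bar{Y})^2}{N - 1} = \dfrac{\sum_{i=1}^{M}\sum_{j=1}^{L} (Y_{ij} - \bar{Y})^2}{ML - 1}$, the population variance for SRS,

$S_i^2 = \dfrac{\sum_{j=1}^{L} (Y_{ij} - \bar{Y}_i)^2}{L - 1}$, the population variance within cluster $i$,

$S_a^2 = \dfrac{\sum_{i=1}^{M} (\bar{Y}_i - \bar{Y})^2}{M - 1}$, the variance of cluster means.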
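
The population quantities defined above can be computed directly. The sketch below uses a small hypothetical population (M = 3 clusters of L = 4 units); the variable names are illustrative, and `statistics.variance` is used because it divides by one less than the number of values, matching the divisors N - 1, L - 1 and M - 1.

```python
from statistics import mean, variance

# Hypothetical population: M = 3 clusters, L = 4 listing units each.
population = [
    [2.0, 3.0, 4.0, 3.0],
    [5.0, 6.0, 5.0, 4.0],
    [1.0, 2.0, 1.0, 4.0],
]
M, L = len(population), len(population[0])
N = M * L

cluster_totals = [sum(c) for c in population]      # Y_i
cluster_means = [t / L for t in cluster_totals]    # mean of cluster i
pop_total = sum(cluster_totals)                    # Y
pop_mean = pop_total / N                           # mean per population unit

all_units = [y for c in population for y in c]
S2 = variance(all_units)                           # S^2, divisor N - 1
S2_within = [variance(c) for c in population]      # S_i^2, divisor L - 1
S2_a = variance(cluster_means)                     # S_a^2, divisor M - 1

print(cluster_totals, cluster_means, pop_mean, S2, S2_within, S2_a)
```

Because the clusters are of equal size, `pop_mean` coincides with the mean of the cluster means.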
Since the units within a cluster are generally not a random grouping of the population, the
degree of homogeneity between any two units within a cluster can be measured by the
intra-cluster correlation ($\rho_w$). Thus the variance of cluster means can be expressed
in terms of $\rho_w$. The correlation coefficient between elements in the same cluster is
expressed as:

$$\rho_w = \frac{E\left[(Y_{ij} - \bar{Y})(Y_{ik} - \bar{Y})\right]}{\sqrt{E(Y_{ij} - \bar{Y})^2 \, E(Y_{ik} - \bar{Y})^2}}, \quad i = 1, 2, \ldots, M; \; j, k = 1, 2, \ldots, L, \; j \neq k.$$

Expressing the variance of cluster means ($S_a^2$) in terms of the intra-cluster correlation
gives:

$$S_a^2 = \frac{S^2 (ML - 1)\left[\rho_w (L - 1) + 1\right]}{L^2 (M - 1)} \approx \frac{S^2 \left[\rho_w (L - 1) + 1\right]}{L}.$$

The approximation is valid when the number of clusters M is large, since then
$(ML - 1)/[L(M - 1)] \approx 1$. (Verify)
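
One way to check the relation between $S_a^2$ and the intra-cluster correlation numerically is sketched below on a hypothetical population. It assumes the expectations in the definition of the correlation are taken over all within-cluster pairs with j < k (numerator) and over all ML units (denominator); under that reading the first expression is an exact identity.

```python
from itertools import combinations
from statistics import mean, variance

# Hypothetical population: M = 3 clusters, L = 4 listing units each.
population = [
    [2.0, 3.0, 4.0, 3.0],
    [5.0, 6.0, 5.0, 4.0],
    [1.0, 2.0, 1.0, 4.0],
]
M, L = len(population), len(population[0])
all_units = [y for c in population for y in c]
Ybar = mean(all_units)
S2 = variance(all_units)                          # divisor ML - 1
S2_a = variance([mean(c) for c in population])    # divisor M - 1

# Numerator: average cross-product over all within-cluster pairs (j < k).
pair_products = [
    (c[j] - Ybar) * (c[k] - Ybar)
    for c in population
    for j, k in combinations(range(L), 2)
]
# Denominator: average squared deviation over all ML units.
rho_w = mean(pair_products) / mean([(y - Ybar) ** 2 for y in all_units])

exact = S2 * (M * L - 1) * (rho_w * (L - 1) + 1) / (L ** 2 * (M - 1))
approx = S2 * (rho_w * (L - 1) + 1) / L

print(rho_w, S2_a, exact, approx)
```

With only three clusters the approximate expression differs noticeably from `S2_a`, while `exact` reproduces it.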

7.3 Estimation from Sample of Clusters:

Suppose a sample of m clusters, each containing L elements, is drawn from M clusters by
simple random sampling. The sample cluster data structure will be as follows.

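$$
\begin{array}{c|cccccc}
\text{Units} & \text{Cluster } 1 & 2 & \cdots & i & \cdots & m \\ \hline
1 & y_{11} & y_{21} & \cdots & y_{i1} & \cdots & y_{m1} \\
2 & y_{12} & y_{22} & \cdots & y_{i2} & \cdots & y_{m2} \\
\vdots & \vdots & \vdots & & \vdots & & \vdots \\
j & y_{1j} & y_{2j} & \cdots & y_{ij} & \cdots & y_{mj} \\
\vdots & \vdots & \vdots & & \vdots & & \vdots \\
L & y_{1L} & y_{2L} & \cdots & y_{iL} & \cdots & y_{mL} \\ \hline
\text{Total} & y_1 & y_2 & \cdots & y_i & \cdots & y_m \\
\text{Mean} & \bar{y}_1 & \bar{y}_2 & \cdots & \bar{y}_i & \cdots & \bar{y}_m
\end{array}
$$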

Notation for sample:

$y_{ij}$ = value obtained for listing unit $j$ in sample cluster $i$ ($i = 1, 2, \ldots, m$; $j = 1, 2, \ldots, L$),

$y_i = \sum_{j=1}^{L} y_{ij}$, the aggregate value for the $i$th sample cluster,

$\bar{y}_i = \dfrac{y_i}{L} = \dfrac{\sum_{j=1}^{L} y_{ij}}{L}$, the mean for sample cluster $i$,

$\bar{y}_{ce} = \dfrac{\sum_{i=1}^{m}\sum_{j=1}^{L} y_{ij}}{n} = \dfrac{\sum_{i=1}^{m} y_i}{mL} = \dfrac{\sum_{i=1}^{m} \bar{y}_i}{m}$, the sample mean per element, or the mean of the m
cluster means.

Sampling fraction: $f = \dfrac{n}{N} = \dfrac{mL}{ML} = \dfrac{m}{M}$.

Theorem 7.1: A simple random sample of m clusters, each containing L elements, is
drawn from M clusters in the population. Then the sample mean per element $\bar{y}_{ce}$ is an
unbiased estimate of $\bar{Y}$ with variance

$$Var(\bar{y}_{ce}) = \frac{1 - f}{m} \cdot \frac{S^2 (ML - 1)}{L^2 (M - 1)} \left[\rho_w (L - 1) + 1\right] \approx \frac{1 - f}{m} \cdot \frac{S^2}{L} \left[\rho_w (L - 1) + 1\right].$$

Prove this theorem.
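
Before attempting the proof, the statement can be checked by brute force on a small hypothetical population: enumerate every possible simple random sample of m clusters and compare the exact mean and variance of the estimator with the population mean and with $(1 - f) S_a^2 / m$, which is the theorem's first expression once $S_a^2$ is written in terms of $S^2$ and the intra-cluster correlation. The data and names below are illustrative assumptions.

```python
from itertools import combinations
from statistics import mean, pvariance, variance

# Hypothetical population: M = 4 clusters of L = 4 units; sample m = 2 clusters.
population = [
    [2.0, 3.0, 4.0, 3.0],
    [5.0, 6.0, 5.0, 4.0],
    [1.0, 2.0, 1.0, 4.0],
    [3.0, 3.0, 2.0, 6.0],
]
M, L, m = len(population), len(population[0]), 2
cluster_means = [mean(c) for c in population]
Ybar = mean(cluster_means)            # population mean (equal cluster sizes)
S2_a = variance(cluster_means)        # divisor M - 1
f = m / M

# Every SRS of m clusters is equally likely, so averaging over all
# combinations gives the exact expectation and variance of the estimator.
estimates = [mean(s) for s in combinations(cluster_means, m)]
expected = mean(estimates)
exact_var = pvariance(estimates)      # mean squared deviation from the expectation
formula_var = (1 - f) / m * S2_a

print(expected, Ybar, exact_var, formula_var)
```

The enumeration is a numerical check for one population, not a substitute for the proof.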

Corollary: For the population total, an unbiased estimate and its variance are,
respectively,

$$\hat{Y}_{ce} = N \bar{y}_{ce} = ML\,\bar{y}_{ce} \quad \text{and} \quad Var(\hat{Y}_{ce}) = N^2\,\frac{(1 - f)}{m}\,S_a^2 = M^2 L^2\,\frac{(1 - f)}{m}\,S_a^2.$$

Estimation of the variance from a sample:

If the sample variance between cluster means is given by

$$s_a^2 = \frac{\sum_{i=1}^{m} (\bar{y}_i - \bar{y}_{ce})^2}{m - 1},$$

then the estimated variance of the mean $\bar{y}_{ce}$ is

$$var(\bar{y}_{ce}) = \frac{1 - f}{m}\,s_a^2, \quad \text{and its standard error is} \quad s.e.(\bar{y}_{ce}) = \sqrt{\frac{1 - f}{m}}\;s_a.$$
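
The estimator and its estimated standard error can be wrapped in a small helper. This is a sketch under the section's assumptions (equal cluster sizes, simple random sampling of clusters); the function name `cluster_mean_estimate` and the example data are hypothetical.

```python
from math import sqrt
from statistics import mean, variance

def cluster_mean_estimate(sample_clusters, M):
    """Return (y_ce, estimated variance, standard error) for equal-size clusters.

    sample_clusters: list of m lists, each holding the L values of one
    sampled cluster. M: number of clusters in the population.
    """
    m = len(sample_clusters)
    cluster_means = [mean(c) for c in sample_clusters]
    y_ce = mean(cluster_means)            # mean of the m cluster means
    s2_a = variance(cluster_means)        # divisor m - 1
    f = m / M                             # sampling fraction
    var_y_ce = (1 - f) / m * s2_a
    return y_ce, var_y_ce, sqrt(var_y_ce)

# Hypothetical data: m = 3 sampled clusters of L = 2 units, M = 30 clusters.
y_ce, var_y_ce, se = cluster_mean_estimate(
    [[4.0, 6.0], [7.0, 9.0], [1.0, 3.0]], M=30
)
print(y_ce, var_y_ce, se)
```

Only the cluster means enter the variance estimate; the spread of units inside each cluster does not appear because every unit of a selected cluster is observed.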

7.4 Comparison of Cluster of Equal Size with SRS

Consider a simple random sample of size n taken from a population of N units, where
n = mL and N = ML. If $Var(\bar{y}_{ce}) < Var(\bar{y})$, then cluster sampling is more efficient than
SRS. Show that this is true when $\rho_w < -\dfrac{1}{N - 1}$, or $\rho_w < 0$ for large N.
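
One possible route to this result is outlined below. It assumes the standard SRS variance $Var(\bar{y}) = (1 - f) S^2 / n$ with n = mL, which is not restated in this chapter, and divides the variance in Theorem 7.1 by it.

```latex
\begin{align*}
\operatorname{Var}(\bar{y}) &= \frac{1-f}{mL}\,S^2
  && \text{(SRS of } n = mL \text{ units)} \\
\frac{\operatorname{Var}(\bar{y}_{ce})}{\operatorname{Var}(\bar{y})}
  &= \frac{ML-1}{L(M-1)}\,\bigl[1+(L-1)\rho_w\bigr] \\
\frac{\operatorname{Var}(\bar{y}_{ce})}{\operatorname{Var}(\bar{y})} < 1
  &\iff 1+(L-1)\rho_w < \frac{L(M-1)}{ML-1} \\
  &\iff (L-1)\rho_w < -\frac{L-1}{ML-1} \\
  &\iff \rho_w < -\frac{1}{ML-1} = -\frac{1}{N-1} \qquad (L > 1)
\end{align*}
```

As N grows, $-1/(N-1)$ tends to 0, so for large N the condition reduces to a negative intra-cluster correlation.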
Example

An agricultural extension agent wishes to estimate the average farm size (in hectares) per
household in a given community. In this particular community there are 4000 households
living in 400 geographical clusters of 10 households each. Because of fund constraints, a
simple random sample of four clusters is selected. The household data from all 4 selected
cluster samples are given below.

Cluster Farm size (in hectares)

1 1.0, 2.0, 1.5, 2.2, 3.0, 3.5, 1.0, 4.1, 1.6, 1.0
2 1.1, 3.1, 2.2, 2.8, 3.5, 1.0, 4.4, 1.1, 1.0, 2.3
3 2.3, 1.0, 1.2, 1.4, 1.0, 3.2, 2.1, 1.5, 3.3, 1.0
4 1.6, 1.2, 3.0, 2.0, 1.3, 5.0, 0.2, 2.2, 3.5, 1.0

i. Estimate the average farm size per household for the community
ii. Find the standard error of the estimate
iii. Estimate the total area under crops, assuming that all the farmland of the households
is cultivated.
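Solution:

(i) First, find the cluster means $\bar{y}_i = \dfrac{\sum_{j=1}^{L} y_{ij}}{L}$, where L = 10 and $y_{ij}$ represents
the characteristic, farm size in hectares, of households.

$$\bar{y}_1 = \frac{1.0 + 2.0 + 1.5 + 2.2 + 3.0 + 3.5 + 1.0 + 4.1 + 1.6 + 1.0}{10} = \frac{20.9}{10} = 2.09$$

$$\bar{y}_2 = \frac{1.1 + 3.1 + \cdots + 2.3}{10} = \frac{22.5}{10} = 2.25$$

$$\bar{y}_3 = \frac{2.3 + 1.0 + \cdots + 1.0}{10} = \frac{18.0}{10} = 1.80$$

$$\bar{y}_4 = \frac{1.6 + 1.2 + \cdots + 1.0}{10} = \frac{22.0}{10} = 2.20$$

Note that the ten values listed for cluster 4 add up to 21.0, not 22.0; the calculations
that follow keep the total of 22.0.

The overall estimate for the community is obtained by

$$\bar{y}_{ce} = \frac{\sum_{i=1}^{m} \bar{y}_i}{m}, \quad \text{where } m = 4, \; i = 1, 2, 3, 4.$$

$$\bar{y}_{ce} = \frac{2.09 + 2.25 + 1.80 + 2.20}{4} = \frac{8.34}{4} = 2.085 \approx 2.1 \text{ hectares}.$$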
(ii) To find the standard error of $\bar{y}_{ce}$, we must first calculate the variance of $\bar{y}_{ce}$:

$$var(\bar{y}_{ce}) = \frac{1 - f}{m}\,s_a^2, \quad \text{where } f = \frac{m}{M}, \; M = 400 \text{ and } s_a^2 = \frac{\sum_{i=1}^{m} (\bar{y}_i - \bar{y}_{ce})^2}{m - 1}$$

$$var(\bar{y}_{ce}) = \frac{1 - \frac{4}{400}}{4} \left[\frac{(2.09 - 2.085)^2 + (2.25 - 2.085)^2 + (1.8 - 2.085)^2 + (2.2 - 2.085)^2}{4 - 1}\right]$$

$$= \frac{1 - 0.01}{4} \left[\frac{0.000025 + 0.027225 + 0.081225 + 0.013225}{3}\right] = \frac{0.99}{4} \left[\frac{0.1217}{3}\right] = 0.01004$$

$$s.e.(\bar{y}_{ce}) = \sqrt{0.01004} = 0.1002$$
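
The arithmetic in parts (i) and (ii) can be reproduced with a few lines of Python. The sketch starts from the four cluster means as reported in the solution (2.09, 2.25, 1.80, 2.20) rather than from the raw household data; the variable names are illustrative.

```python
from math import sqrt
from statistics import mean, variance

# Cluster means as reported in the worked solution.
cluster_means = [2.09, 2.25, 1.80, 2.20]
M, m = 400, 4                          # clusters in population / in sample

y_ce = mean(cluster_means)             # estimated mean farm size per household
s2_a = variance(cluster_means)         # between-cluster variance, divisor m - 1
f = m / M                              # sampling fraction
var_y_ce = (1 - f) / m * s2_a
se = sqrt(var_y_ce)

print(round(y_ce, 3), round(var_y_ce, 5), round(se, 4))
```

The results agree with the hand calculation to the precision shown in the text.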
(iii) Estimate of total area cultivated:

$$\hat{Y}_{ce} = N \bar{y}_{ce} = ML\,\bar{y}_{ce} = 400 \times 10 \times 2.085 = 8340 \text{ hectares}$$

If a 95% confidence interval is required, it is calculated as follows. A 95% confidence
interval for the population mean $\bar{Y}$ (Z = 1.96):

$$\bar{Y} = \bar{y}_{ce} \pm Z_{\alpha/2}\; s.e.(\bar{y}_{ce})$$

$$\Rightarrow \bar{Y} = 2.085 \pm 1.96 \times 0.1002$$

$$\Rightarrow \bar{Y} = 2.085 \pm 0.196$$

$$1.889 \le \bar{Y} \le 2.281$$

We are 95% confident that the actual average farm size per household in the community
lies between 1.889 and 2.281 hectares.
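
The total in part (iii) and the confidence interval can be checked in the same way. The sketch takes the estimate 2.085 and the standard error 0.1002 from the solution as given inputs and uses the normal quantile 1.96 quoted in the text; the standard error of the total, obtained by scaling by N = ML as the corollary's variance formula implies, is an addition not computed in the original example.

```python
M, L = 400, 10
N = M * L
y_ce, se = 2.085, 0.1002        # values taken from parts (i) and (ii)
z = 1.96                        # normal quantile used in the text

total_hat = N * y_ce            # estimated total area, in hectares
se_total = N * se               # s.e. of the total (variance scales by N^2)

half_width = z * se
ci_mean = (y_ce - half_width, y_ce + half_width)

print(round(total_hat, 1), round(se_total, 1))
print(round(ci_mean[0], 3), round(ci_mean[1], 3))
```

The interval for the mean matches the limits 1.889 and 2.281 stated above.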
