Get started
Sampling Techniques
Seema Singh
Jul 26, 2018 · 6 min read
Sampling helps a lot in research. It is one of the
most important factors which determines the
accuracy of your research/survey result. If
anything goes wrong with your sample then it
will be directly re:ected in the ;nal result.
There are lot of techniques which help us to
gather sample depending upon the need and
situation. This blog post tries to explain some of
those techniques.
To start with, let’s have a look on some basic
terminology
Population
Sample
Sampling
Population is the collection of the elements
which has some or the other characteristic in
common. Number of elements in the
population is the size of the population.
Sample is the subset of the population. The
process of selecting a sample is known as
sampling. Number of elements in the sample is
the sample size.
Sampling
There are lot of sampling techniques which are
grouped into two categories as
Probability Sampling
Non- Probability Sampling
The diGerence lies between the above two is
whether the sample selection is based on
randomization or not. With randomization,
every element gets equal chance to be picked
up and to be part of sample for study.
Probability Sampling
This Sampling technique uses randomization to
make sure that every element of the population
gets an equal chance to be part of the selected
sample. It’s alternatively known as random
sampling.
Simple Random Sampling
Strati2ed sampling
Systematic sampling
Cluster Sampling
Multi stage Sampling
Simple Random Sampling: Every element has
an equal chance of getting selected to be the
part sample. It is used when we don’t have any
kind of prior information about the target
population.
For example: Random selection of 20 students
from class of 50 student. Each student has
equal chance of getting selected. Here
probability of selection is 1/50
Single Random Sampling
Strati6ed Sampling
This technique divides the elements of the
population into small subgroups (strata) based
on the similarity in such a way that the
elements within the group are homogeneous
and heterogeneous among the other subgroups
formed. And then the elements are randomly
selected from each of these strata. We need to
have prior information about the population to
create subgroups.
Strati9ed Sampling
Cluster Sampling
Our entire population is divided into clusters or
sections and then the clusters are randomly
selected. All the elements of the cluster are
used for sampling. Clusters are identi;ed using
details such as age, sex, location etc.
Cluster sampling can be done in following
ways:
· Single Stage Cluster Sampling
Entire cluster is selected randomly for
sampling.
Single Stage Cluster Sampling
· Two Stage Cluster Sampling
Here ;rst we randomly select clusters and then
from those selected clusters we randomly select
elements for sampling
Two Stage Cluster Sampling
Systematic Clustering
Here the selection of elements is systematic and
not random except the ;rst element. Elements
of a sample are chosen at regular intervals of
population. All the elements are put together in
a sequence ;rst where each element has the
equal chance of being selected.
For a sample of size n, we divide our population
of size N into subgroups of k elements.
We select our ;rst element randomly from the
;rst subgroup of k elements.
To select other elements of sample, perform
following:
We know number of elements in each group is
k i.e N/n
So if our ;rst element is n1 then
Second element is n1+k i.e n2
Third element n2+k i.e n3 and so on..
Taking an example of N=20, n=5
No of elements in each of the subgroups is N/n
i.e 20/5 =4= k
Now, randomly select ;rst element from the
;rst subgroup.
If we select n1= 3
n2 = n1+k = 3+4 = 7
n3 = n2+k = 7+4 = 11
Systematic Clustering
Multi-Stage Sampling
It is the combination of one or more methods
described above.
Population is divided into multiple clusters and
then these clusters are further divided and
grouped into various sub groups (strata) based
on similarity. One or more clusters can be
randomly selected from each stratum. This
process continues until the cluster can’t be
divided anymore. For example country can be
divided into states, cities, urban and rural and
all the areas with similar characteristics can be
merged together to form a strata.
Multi-Stage Sampling
Non-Probability Sampling
It does not rely on randomization. This
technique is more reliant on the researcher’s
ability to select elements for a sample. Outcome
of sampling might be biased and makes di\cult
for all the elements of population to be part of
the sample equally. This type of sampling is also
known as non-random sampling.
Convenience Sampling
Purposive Sampling
Quota Sampling
Referral /Snowball Sampling
Convenience Sampling
Here the samples are selected based on the
availability. This method is used when the
availability of sample is rare and also costly. So
based on the convenience samples are selected.
For example: Researchers prefer this during
the initial stages of survey research, as it’s quick
and easy to deliver results.
Purposive Sampling
This is based on the intention or the purpose of
study. Only those elements will be selected
from the population which suits the best for the
purpose of our study.
For Example: If we want to understand the
thought process of the people who are
interested in pursuing master’s degree then the
selection criteria would be “Are you interested
for Masters in..?”
All the people who respond with a “No” will be
excluded from our sample.
Quota Sampling
This type of sampling depends of some pre-set
standard. It selects the representative sample
from the population. Proportion of
characteristics/ trait in sample should be same
as population. Elements are selected until exact
proportions of certain types of data is obtained
or su\cient data in diGerent categories is
collected.
For example: If our population has 45%
females and 55% males then our sample should
re:ect the same percentage of males and
females.
Referral /Snowball Sampling
This technique is used in the situations where
the population is completely unknown and
rare.
Therefore we will take the help from the ;rst
element which we select for the population and
ask him to recommend other elements who will
;t the description of the sample needed.
So this referral technique goes on, increasing
the size of population like a snowball.
Referral /Snowball Sampling
For example: It’s used in situations of highly
sensitive topics like HIV Aids where people will
not openly discuss and participate in surveys to
share information about HIV Aids.
Not all the victims will respond to the questions
asked so researchers can contact people they
know or volunteers to get in touch with the
victims and collect information
Helps in situations where we do not have the
access to su\cient people with the
characteristics we are seeking. It starts with
;nding people to study.
. . .
Hope now you all have a good idea about
sampling and it’s techniques.
Thanks for reading!
Data Science Statistics Sample Population
Probability
1.3K claps
WRITTEN BY
Seema Singh
Data Science Aspirant | Learner
Follow
Towards Data Science
Sharing concepts, ideas, and codes.
Follow
See responses (14)
More From Medium
More from Towards Data Science
12 Things I Learned During My
First Year as a Machine Learning
Engineer
Daniel Bourke in Tow… Science
Jul 6 · 11 min read
Science 4.6K
More from Towards Data Science
How a simple mix of object-
oriented programming can sharpen
your deep learning prototype
Tirthajyoti Sark… Data Science
Data
Jul 6 ·Science
11 min read 2.1K
More from Towards Data Science
5 Useful Statistics Data Scientists
Need to Know
George Seif in Towar… Science
Jul 11 · 7 min read
Science 664