Sampling - definition
• Measuring a small portion of something and then
making a general statement about the whole thing.
• Process of selecting a number of units for a study in such
a way that the units represent the larger group from
which they are selected.
Sampling
• A sample is a set of data chosen from a population and
is a subset of the population.
• A sampling unit - individual member of a sample.
• A sampling frame - list of all members of a population.
• A parameter is a characteristic of the population. (Eg.
population mean, population variance and population
proportion.)
• A statistic is a characteristic of the sample.
• The information obtained from the sample about the
parameter (e.g. sample mean, sample variance and
sample proportion.)
Why sample?
• Very difficult to study each and every unit of the
population when population unit are heterogeneous.
• Chance of inaccuracy
• Finance, time
Characteristics of good samples
• Representative
• Accessible
• Low cost
• Must be valid
• Precision
• Accuracy
• Reasonable
Disadvantages of sampling
• If sampling is biased, the conclusion may not be valid
and reliable.
• If the population is very large and there are many
sections and subsections, the sampling procedure
becomes very complicated.
• If the researcher does not possess the necessary skill and
technical knowledge affects sampling procedure.
Steps in sampling process
• Define the population
• Identify the sampling frame
• Select a sampling design or procedure
• Determine the sample size
• Draw the sample
Sampling methods - Classification
Probability Sampling
• Each and every unit of the population has equal chance
for selection as a sampling unit by means of some
systematic way
• Also called formal sampling or random sampling
• Probability samples are more accurate
• Probability samples allows to estimate the accuracy of
the sample
• Randomization is a feature of the selection process
rather than an assumption
• More complex, time consuming and more costly
Non-probability sampling
• The sample is not a proportion of the population
• No system in selecting the sample, depends on the
situation.
• No assurance is given that each item has a chance of
being included as a sample
• An assumption that there is an even distribution of
characteristics within the population, any sample would
be representative.
Types of Probability Sampling
• Simple Random Sampling
• Stratified Sampling
• Cluster Sampling
• Systematic Sampling
• Multistage Sampling
Simple Random Sampling
• The purest form of probability sampling
• Assures each element in the population has an equal
chance of being included in the sample.
• Also called the lottery or raffle type of sampling.
• This may be used if the population has no differentiated
levels, sections, or classes.
• Done with or without replacement
Types of Simple Random Sample
• With replacement
• Without replacement
With replacement
• The unit once selected has the chance for again
selection
Without replacement
• The unit once selected can not be selected again
Methods of SRS
• Tippet method
• Lottery method
• Random table
Advantages of SRS
• Minimal knowledge of population is needed.
• Easy to analyze data
Disadvantages
▪ High cost, low frequency of use
▪ Requires sampling frame
▪ Does not use researchers’ expertise
▪ Larger risk of random error than stratified
Steps
• Define the population
• Determine the desired sample size
• List all the members of the population
• Assign each of the individuals on the list a consecutive
number from zero to the required number, e.g. 01-99 or 001-
249
• Select an arbitrary number in the table of random numbers
(Close your eyes and point)
• For the selected number, look at only the appropriate
number of digits
• If the selected number corresponds to the number assigned
to any individual in the population, then that individual is in
the sample
• Repeat the steps until the desired sample size is reached.
Stratified Random Sampling
• Population is divided into two or more groups called
strata, according to some criterion.
• Geographic location, grade level, age, or income, and
subsamples are randomly selected from each strata.
• Elements within each strata are homogeneous, but are
heterogeneous across strata.
Advantage
• Assures representation of all groups in sample
population needed
• Characteristics of each stratum can be estimated and
compare
• Reduces variability
Disadvantage
• Requires accurate information on proportions of
each stratum
• Stratified lists costly to prepare
Steps involves in stratified sampling:
• Define the population
• Determine the desired sample size
• Identify the variable and subgroups (strata) for which you
want to guarantee appropriate representation (either
proportion or equal)
• Classify all members of the population as members of
one of the identified subgroups
• Randomly select (using table of random numbers) an
appropriate number of individuals from subgroups.
Cluster sampling
• The population is divided into subgroups (clusters) like
families.
• A simple random sample is taken from subgroups and
then all members of the cluster selected are surveyed.
• Also called as multistage cluster sampling
• Used when the population is so big or the geographical
area of the research is so large.
Steps in cluster sampling
• Define the population
• Determine the desired sample size
• Identify and define a logical cluster
• Obtain, or make a list of all clusters in the population
• Estimate the average number of population members per
cluster
• Determine the number of clusters needed by dividing the
sample size by the estimated size of the cluster
• Randomly select the needed number of clusters (using a table
of random numbers)
• Include in the sample all population members in selected
cluster
Systematic Random Sampling
• Arrange all units in the sampling frame based on some
variable and then every kth number on the list is selected.
• Subjects or respondents in the study are arrayed or
arranged in some systematic or logical manner.
• Gaps between elements are equal and constant
• Also called as interval sampling
Steps in systematic sampling:
• Define the population
• Determine the desired sample size
• Obtain a list (preferably randomized) of the population
• Determine K by dividing the size of the population
desired sample size
• Select some random place at the top of the population
list
• Starting at that point, take every kth name on the list until
desired sample size is reached
• If the end of the list is reached before the desired sample
is reached, go back to the top of the list.
Advantage
• Moderate cost
• External validity high; internal validity high
• Simple to draw sample; easy to perform
Disadvantage
• Sample becomes biased if the persons in the list
belong to a class by themselves whereas the
investigation requires that all sectors of the
population are to be involved.
Non Probability Sampling
• The probability of each case being selected from the
total population is not known.
• All have NOT equal chance of being selected.
• Sampling units are chosen on the basis of personal
judgment or convenience.
• There are NO statistical techniques for measuring
random sampling error in a non-probability sample.
Non Probability Sampling
• Involves non random methods in selection of sample.
• Selection depend upon situation.
• Considerably less expensive.
Types of non probability sampling
• Purposive Sampling
• Quota sampling (larger populations)
• Snowball sampling
• Self-selection sampling
• Convenience sampling
Purposive Sampling
• Also called judgment sampling
• Selects the sample based on some appropriate
characteristics.
• Reject the sample, which is not fit for a particular
profile.
• Start with a purpose in mind.
Merits
• Sample are chosen well based on some criteria.
• Meet the specific objective
Demerits
• Bias selection of sample may occur
• Time consuming process
Quota sampling
• Population is divided into cells on the basis of relevant
control characteristics.
• A convenient sample is drawn for each cell until the
quota is met.
• It is entirely non random and it is normally used for
interview surveys.
Self selection sampling
• It occurs when you allow each case usually individuals, to
identify their desire to take part in the research.
• Publish your need for cases, either by advertising
through appropriate media or by asking them to take
part.
• Collect data from those who respond.
Convenience sampling
• Called as Accidental / Incidental Sampling
• Selecting those cases that are easiest to obtain.
• Sample most available are chosen.
• It is done at the “convenience” of the researcher.
Snowball sampling
• Used in cases where there is no precalculated list of
target population details.
• Samples have traits that are rare to find.
• People who are not willing to be identified.