Unit II Question Bank With Hints and Answers
Unit II Question Bank With Hints and Answers
Types of data
7) z = (x – μ) / σ - Z-score
UNIT II
PART
Uses:
A frequency distribution helps us to detect any pattern in the data (assuming a pattern exists) by superimposing
some order on the inevitable variability among observations.
Marks
obtained in
No. of Students (Frequency)
Test (class
intervals)
0–5 3
6 – 10 4
11 – 15 5
16 – 20 8
Marks
obtained in
No. of Students (Frequency)
Test (class
intervals)
Total 20
When observations are sorted into classes of single values the result is referred to as a frequency distribution for
ungrouped data. In the ungrouped frequency distribution table, we don't make class intervals, we write the accurate
frequency of individual data.
5 3
10 4
15 5
18 4
20 4
Total 20
7. What is Histograms?
A histogram is the most commonly used graph to show frequency distributions. It is a bar-type graph for
quantitative data. The common boundaries between adjacent bars emphasize the continuity of the data, as with
continuous variables.
13. What if a distribution have More than one mode or no mode at all?
Distributions can have more than one mode (or no mode at all). Distributions with two obvious peaks, even
though they are not exactly the same height, are referred to as bimodal. Distributions
with more than two peaks are referred to as multimodal. The presence of more than one mode might reflect important
differences among subsets of data.
Frequency distribution gives a clear picture of the distribution of values. By organizing data in a
distribution table, researchers can identify impossible values and the location of scores in a distribution.
A frequency distribution shows how high or low measurements are.
Types of Frequency Distributions
There are three types of frequency distribution:
Categorical frequency distribution.
Grouped frequency distribution.
Ungrouped frequency distribution
Categorical frequency distribution
Categorical frequency distribution is the distribution frequency of classifiable values such as blood
type or educational level.
Here is an example of a categorical frequency distribution table:
Relative
X = Blood type f
frequency
A 7 0.35 or 35%
B 4 0.20 or 20%
AB 6 0.30 or 30%
O 2 0.10 or 10%
A+ 1 0.05 or 5%
In a frequency distribution, researchers can also compute relative frequencies.
Relative frequency: shows how often a score occurs within total frequencies in a distribution table. To
get the relative frequency of a score in a frequency distribution, divide a score's frequency by the total
number of frequencies.
To find the relative frequency of the first row, divide 7 by 20 (the total number of outcomes), which is
equal to 0.35 or 35%.
Frequency distributions also include cumulative relative frequencies.
Cumulative relative frequency: the sum of prior relative frequencies in a distribution table. To find
the cumulative relative frequency of a score in distribution frequency, combine its relative frequency
with all relative frequencies above it.
X = Blood
f Relative frequency Cumulative relative frequency
type
Here are a few guidelines for the distribution frequency of grouped data:
96–98 3
93–95 3
90–92 4
87–89 3
84–86 3
81–83 3
78–80 3
75–77 3
Ungrouped frequency distribution
Ungrouped frequency distribution is the distribution frequency of ungrouped data listed as individual
values in a distribution table. This type of frequency distribution is ideal for a small set of values.
X f
7 1
6 2
5 1
4 3
3 2
2 4
1 3
In this frequency distribution, X represents the number of children in a household, and f is the number
of families with said number of children. Here, we can see that four homes have two children, and one
has seven children
Central tendency refers to identifying the central position of the given data set. Central tendency has
3 important measures that are Mean, Median, and Mode.
Mean: Mean, here the arithmetic mean is the most used measure of central tendency. With both discrete
and continuous data set, a mean can be obtained. To obtain the mean of a set of numbers, you have to
sum up all the numbers and then divide it using the total numbers. In short, it is taking out the average of
all numbers. It can also be said that it is the ratio of the sum of all observations to the total no of
observations.
Median: When you arrange a given set of numbers from smaller to biggest, the middle number is said to
be the median. Arrange any given series in either ascending or descending order or the middle value
then is termed as the median. To find the median the data can be arranged in either descending order or
ascending order. In geometry, it is defined as the centre or midpoint of a polygon.
Mode: The most repeated number in a given set of observations is the mode or it can also be said that
the Number or Value which have the highest frequency in a given series of numbers. It is also known as
the modal value. It is a part of 3 central tendencies apart from the median and Mean. It is the highest bar
if presented in a histogram or a chart form. If there is no repeated number in a given series then no
Mode will exist for that series
For Example
We have a set of numbers that is 4, 8, 2, 1, 1, 4, 3, 1. Find the mean, median, and mode.
Solution:
Mean:
8 + 4 + 2 + 1 + 1 + 4 + 3 + 1 = 24 and 24/8 = 3
Median:
2 + 3/2 = 2.5 (after arranging the numbers in ascending order as 1, 1, 1, 2, 3, 4, 4, 8 and middle terms
are 2 and 3 as total number of terms are 8 which is even)
Mode:
3. because it is present 3 times in the sequence
3. Analyze how graph can be used to represent qualitative and quantitative data?
There are two types of data that we can collect:
Qualitative data describes a subject, and cannot be expressed as a number.
Quantitative data defines a subject and is expressed as a number (it can be quantified) that can be
analyzed. There are two types of quantitative data continuous and discrete.
Graphs
Qualitative data are words describing a characteristic of the individual. There are several different
graphs that are used for qualitative data. These graphs include bar graphs, Pareto charts, and pie charts.
Pie charts and bar graphs are the most common ways of displaying qualitative data. A spreadsheet
program like Excel can make both of them. The first step for either graph is to make a frequency or
relative frequency table. A frequency table is a summary of the data with counts of how often a data
value (or category) occurs.
Example
Example
Suppose you have the following data for which type of car students at a college drive?
Ford, Chevy, Honda, Toyota, Toyota, Nissan, Kia, Nissan, Chevy, Toyota, Honda, Chevy, Toyota, Nissan,
Ford, Toyota, Nissan, Mercedes, Chevy, Ford, Nissan, Toyota, Nissan, Ford, Chevy, Toyota, Nissan,
Honda, Porsche, Hyundai, Chevy, Chevy, Honda, Toyota, Chevy, Ford, Nissan, Toyota, Chevy, Honda,
Chevy, Saturn, Toyota, Chevy, Chevy, Nissan, Honda, Toyota, Toyota, Nissan
Step 1 : Construct Frequency table and calculate Relative frequency
Table 2.1.22.1.2: Relative Frequency Table for Type of Car Data
Chevy 12 0.24
Honda 6 0.12
Toyota 12 0.24
Nissan 10 0.20
Other 5 0.10
Total 50 1.00
the relative frequency of each category is just the frequency divided by the total. As an example for Ford
category:
relative frequency =550=0.10
There are several different types of graphs that can be used: bar chart, pie chart, and Pareto charts.
Bar graphs or charts consist of the frequencies on one axis and the categories on the other axis. Then you
draw rectangles for each category with a height (if frequency is on the vertical axis) or length (if frequency
is on the horizontal axis) that is equal to the frequency. All of the rectangles should be the same width, and
there should be equally width gaps between each bar.
A pie chart is where you have a circle and you divide pieces of the circle into pie shapes that are
proportional to the size of the relative frequency. There are 360 degrees in a full circle. Relative frequency
is just the percentage as a decimal. All you have to do to find the angle by multiplying the relative
frequency by 360 degrees. Remember that 180 degrees is half a circle and 90 degrees is a quarter of a
circle
Quantitative graphs are used to present and summarize numerical information coming from
the study of a categorical quantitative variable.
The most frequently used types of quantitative graphs are:
Bar chart
Pictogram
Pie chart
EXAMPLE
A teacher records scores on a 20-point quiz for the 30 students in his class. The scores are:
19 20 18 18 17 18 19 17 20 18 20 16 20 15 17 12 18 19 18 19 17 20 18 16 15 18 20 5 0 0
These scores could be summarized into a frequency table by grouping like values:
Score Frequency
0 2
5 1
12 1
15 2
16 2
17 4
18 8
19 4
20 6
Using the table from the first example, it would be possible to create a standard bar chart from this summary,
like we did for categorical data:
However, since the scores are numerical values, this chart doesn’t really make sense; the first and
second bars are five values apart, while the later bars are only one value apart. It would be more correct to treat
the horizontal axis as a number line. This type of graph is called a histogram. A histogram is like a bar graph,
but where the horizontal axis is a number line.
Other graph types such as pie charts are possible for quantitative data. The usefulness of different graph
types will vary depending upon the number of intervals and the type of data being represented. For example, a
pie chart of our weight data is difficult to read because of the quantity of intervals we used.
FREQUENCY POLYGON
An alternative representation is a frequency polygon. A frequency polygon starts out like a histogram,
but instead of drawing a bar, a point is placed in the midpoint of each interval at height equal to the frequency.
Typically the points are connected with straight lines to emphasize the distribution of the data.
EXAMPLE
This graph makes it easier to see that reaction times were generally shorter for the larger target, and that the
reaction times for the smaller target were more spread out.
4. Generate the ungrouped and grouped frequency table for the
following data
90,92,87,88,87,92,98,90,90,87,87,88,88,89,90,87,89,92,92,92,98,90
,95,87,87
(i) How many people scored 98?
(ii) How many people scored 90 or less?
(iii) What proportion scored 87?
5. (i) Calculate the sum of square population standard deviation for the given
x data value 13,10,11,7,9,11,9
(ii) Calculate the sample standard deviation for the given data 7,3,1,0,4
Answer
(i) SS = ΣX2 - ((ΣX)2 / N)
X X2
13 169
10 100
11 121
7 49
9 81
11 121
9 81
Total 70 722
X X2
7 49
3 9
1 1
0 0
4 16
Total 15 75
6. Suppose the IQ score have a bell shaped distribution with a mean of 100 and standard
deviation of 15 then calculate the following,
(i) What percentage of people should have an IQ score between 85 and 115 .
(ii) What percentage of people should have an IQ score between 70 and 130
(iii) What percentage of people should have an IQ score more than 130
(iv) A person with an IQ score greater than 145 is considered as genius. Does the
empirical rule support this statement
Solution
Mean μ= 100
Std.Deviation σ =15
The empirical rule in statistics, also known as the 68 95 99 rule, states that for normal distributions,
68% of observed data points will lie inside one standard deviation of the mean, 95% will fall within
two standard deviations, and 99.7% will occur within three standard deviations.
IQ scores are normally distributed with a mean of 100 and a standard deviation of 15.