ASSIGNMENT FOR JUNE 2020 EXAMINATION
Course Name: Decision Science
Session: Jan- June 2020 /Semester 2
Answer 1.
S. No Variable Type Data Type
1 Gender Nominal Level
2 Education Background Ordinal Level
3 Satisfaction Ordinal Level
4 Motivation Ordinal Level
5 Exchange Rate Interval Level
6 Gold Price Interval Level
7 Preference of cars Ordinal Level
8 Teachers Feedback Ordinal Level
9 Grades in Post-Graduation Ordinal Level
10 Marital Status Nominal Level
11 Quality of services Ordinal Level
12 Age Group Ordinal level
13 GDP Ratio Level
14 Interest Rate Interval Level
15 Twitter Comments Ordinal Level
16 Facebook Pictures Ordinal Level
Answer 2 (a)
Frequency Distribution, Frequency and Cumulative Frequency.
Frequency Relative Cumulative
Range (F) frequency (F/N) Frequency
30 under 40 41 0.197115385 36
40 under 50 24 0.115384615 63
50 under 60 33 0.158653846 95
60 under 70 32 0.153846154 128
70 under 80 22 0.105769231 149
80 under 90 26 0.125 175
90 under 100 30 0.144230769 208
208 1
Frequency distribution: is the summary of data presented in the form of class intervals and frequencies.
Relative frequency: is the proportion of total frequency that is in any given class interval in a frequency
distribution.
Cumulative frequency: is the running total of frequencies through the classes of a frequency distribution.
Answer 2 (b)
Mean, Median, Quartiles & Mode
Frequency
Range (F) Mid-Point (M) MxF
30 under 40 36 35 1260
40 under 50 27 45 1215
50 under 60 32 55 1760
60 under 70 33 65 2145
70 under 80 21 75 1575
80 under 90 26 85 2210
90 under 100 33 95 3135
N= 208 13300
(i) Mean of grouped data is the average of group of numbers and is computed by summing mid-
point multiplied by frequency dividing by the number of values.
Mean= ∑M*F/ N
= 13300/208
= 63.94
(ii) Median of grouped data: N/2 – Cfp
L + -------------------- (W)
fmed
Frequency Relative Cumulative
Range (F) frequency (F/N) Frequency
30 under 40 36 0.197115385 36
40 under 50 27 0.115384615 63
50 under 60 32 0.158653846 95
60 under 70 33 0.153846154 128
70 under 80 21 0.105769231 149
80 under 90 26 0.125 175
90 under 100 33 0.144230769 208
208 1
N= 208 W=10 Cfp = 95 L= 60 Fmed = 33 (From Above table)
208/2 - 95
= 60 + ------------ (10) = 62.72 (Median)
33
(iii) Mode is the class mid-point of the modal class. Modal class is the class interval with the greatest
frequency which is 30-under 40 for given data. The class midpoint of thus modal class is 35. Thus,
the mode for the given data is 35.
(iv) Quartiles, from above answer (ii) & corresponding table
Q2 is Median = 62.72 (Quartile 2)
For Q1, N/4 = 52, so Cfp =36 , L= 40 , Fmed = 27 , W=10
Therefore, by formula, Q1 = 45.92 (Quartile 1)
For Q3, 3N/4 = 156, so Cfp= 149, L= 80, Fmed = 26, W=10
Therefore, by formula, Q3 = 82.69 (Quartile3)
Answer 2 (c)
Variance and standard Deviation
Frequency Mid-Point
Range (f) (M) M*f M-µ (M-µ)2 f(M-µ)2
30 under 40 36 35 1260 -28.94 837.5236 30150.85
40 under 50 27 45 1215 -18.94 358.7236 9685.537
50 under 60 32 55 1760 -8.94 79.9236 2557.555
60 under 70 33 65 2145 1.06 1.1236 37.0788
70 under 80 21 75 1575 11.06 122.3236 2568.796
80 under 90 26 85 2210 21.06 443.5236 11531.61
90 under 100 33 95 3135 31.06 964.7236 31835.88
Total=208 13300 88367.31
µ=∑ f*M/∑ f = 13300/208= 63.94
Variance = σ2 = ∑ f(M-µ)2/N = 88367.31/ 208 = 424.84
Standard deviation = σ = √424.84 = 20.6
Answer 3 (a)
(i) Range is the difference between largest value of a data set and the smallest value of a data set.
Range= Highest -Lowest = 98-30
=68 (Range)
(ii) Interquartile Range from Answer 2(b) iv,
Q3- Q1 = 82.69 – 45.92
= 36.77 (Interquartile range)
(iii) Z- Scores
From the given data set, Mean = 63.35 & Standard deviation = 20.71
Using formula, Z Score = x- (mean) / Std.Dev for each value the z scores can be calculated.
Below is a list of all Z scores & Corresponding Plot for the same.
-1.610 -1.272 -0.886 -0.452 -0.065 0.369 0.949 1.335
-1.562 -1.224 -0.886 -0.452 -0.065 0.369 0.949 1.383
-1.562 -1.224 -0.838 -0.452 -0.065 0.418 0.997 1.383
-1.562 -1.224 -0.838 -0.452 -0.017 0.466 1.045 1.383
-1.514 -1.224 -0.790 -0.403 -0.017 0.514 1.045 1.383
-1.514 -1.176 -0.790 -0.403 -0.017 0.514 1.094 1.383
-1.466 -1.176 -0.741 -0.403 -0.017 0.562 1.094 1.432
-1.466 -1.176 -0.741 -0.403 -0.017 0.562 1.094 1.432
-1.466 -1.176 -0.741 -0.355 0.031 0.611 1.094 1.432
-1.466 -1.176 -0.741 -0.355 0.031 0.611 1.142 1.432
-1.466 -1.128 -0.693 -0.307 0.031 0.611 1.190 1.480
-1.466 -1.128 -0.645 -0.307 0.031 0.659 1.190 1.480
-1.417 -1.128 -0.645 -0.258 0.128 0.659 1.190 1.480
-1.417 -1.128 -0.596 -0.258 0.128 0.659 1.190 1.480
-1.417 -1.128 -0.596 -0.210 0.128 0.659 1.190 1.528
-1.417 -1.079 -0.596 -0.210 0.128 0.659 1.239 1.528
-1.417 -1.031 -0.548 -0.210 0.176 0.707 1.239 1.528
-1.417 -1.031 -0.500 -0.162 0.176 0.756 1.239 1.528
-1.369 -1.031 -0.500 -0.162 0.224 0.756 1.239 1.528
-1.369 -1.031 -0.500 -0.162 0.224 0.804 1.287 1.577
-1.321 -0.983 -0.500 -0.114 0.224 0.804 1.287 1.577
-1.321 -0.983 -0.500 -0.114 0.224 0.804 1.287 1.625
-1.321 -0.983 -0.500 -0.114 0.273 0.901 1.335 1.673
-1.321 -0.934 -0.500 -0.114 0.273 0.901 1.335 1.673
-1.321 -0.934 -0.500 -0.114 0.321 0.949 1.335 1.673
-1.321 -0.886 -0.452 -0.065 0.321 0.949 1.335 1.673
Z SCORES PLOT
2.000
1.500
1.000
0.500
0.000
0 50 100 150 200 250
-0.500
-1.000
-1.500
-2.000
(iv) Skewness & Kurtosis
Using the Question data Set and The formulas below in excel , the skewness and kurtosis are found:
Skewness: =SKEW.P (B1 : B208) { Range of all values } it came out as = 0.096965
Skewness is when distribution is asymmetrical or lacks symmetry. In above value it says the skewness
is negative that means the curve is more distributed towards the negative right-hand side.
Kurtosis: = KURT (B1 : B208) { Range of all values } it came out as = -1.28542
Kurtosis defines the amount of peakedness of a distribution. In above case the kurtosis points to a more
Platykurtic distribution, where the spread is flatter. It will have thinner tails since excess kurtosis value is
negative.
(v) Distribution Curve & Data analysis.
Distribution Curve
0.025
0.02
0.015
0.01
0.005
0
0 50 100 150 200 250
➢ The curve is Platykurtic in kurtosis and Skewed in positive way. It means the Tails are longer and
the inclination is flat for the curve.
➢ It means the mean is behind the mode and median in the curve & there is no normal distribution
Answer 3 (b)
(i) Histogram for the data
Histogram
40
36
35 33 33
32
30
27
26
25
21
frequency
20
15
10
0
frequency
performance scores of employees class inetrvals
30- under 40 40-under 50 50-under 60 60-under 70 70-under 80 80-under 90 90-under 100
(ii) BOX plot Diagram
(iii) Frequency Polygon
Frequency Polygons of given data
40
35
30
25
frequency
20
15
10
5
0
35 45 55 65 75 85 95
Performance score of employees ( Mid point of class intervals)
(iv) O give Diagram
O give for given data
250
200 208
175
150 149
128
100 95
63
50
36
0 0
30- under 40 40-under 50 50-under 60 60-under 70 70-under 80 80-under 90 90-under
100
cumulative frequency