North South University: Assignment Using IBM SPS' Course Code: Bus 511.3 Course Title: Business Statistics Spring 2021
North South University: Assignment Using IBM SPS' Course Code: Bus 511.3 Course Title: Business Statistics Spring 2021
Submitted by Submitted to
a.
(I) these following variables are qualitative and quantitative:
3. Township is qualitative.
4. Garage is qualitative.
b.
Township
N Valid 82
Missing 0
2
Township
Cumulative
Frequency Percent Valid Percent Percent
Interpretation: 25.6% of the homes have been sold in Gulshan. On the other hand, the next
highest is 23.2% which sold in Niketan. The lowest in Bashundhara. Only 14.6 % have been sold
in Bashundhara.
3
Relative Frequency table for ‘Township’
We convert Class frequencies to Relative class frequencies to show the fraction of the total
number of observations in each class. Relative frequencies captures the relationship between a
class total & the total number of observations. Here, the fraction of homes sold at the gulshan is
0.257 found by 21 divided by 82 & the lowest fraction is 0.146 which sold at bashundhara.
C.
Class interval,
4
Selling price
Interpretation:
5
The lowest selling price is $100 & the highest selling price is $380.
The largest Concentration or the highest frequency is the range 180-220. So there are 26
observations.
The selling price from home ranged between $100 and $380.
Majority of the selling price that is 61 out of 82 where 180-300
Only 2 sale were below 140 & only 1 is above 340.
Interpretation:
6
The largest concentration or highest frequency is in the $180000 up to $220000 class.
There are 27 observations.
Majority of the selling price, f = 27 + 21 + 13 = 61, out of 82 where in the range of
$180000 to $300000.
Only 02 selling price were below $140000.
Only 01 selling price were more than or above $340000.
Mean x̅ = ( Σfxi ) / n
=18360/ 82
=223.90
= $223900
82 / 2 = 41th
=178.095
= $178095.24
7
Variance S2 = Σ(xi- x̅)2 / n -1
Selling Price ($000)
= Σ(1680-223.90)2 / 81
= 26175.64
= $26175644.57
=
(√ 209866.82)/81
Mean 226.27
= 50.901
Median 219.00 = $50901
= 3 (223900– 178095.24) /
50901 Std. Deviation 48.007
= 2.697
Variance 2.3053
There is a Positive skewness in the
distribution.
Skewness .421
d.
Std. Error of Skewness .266
Range 220
Minimum 129
Maximum 349
Percentiles 10 172.50
20 182.80
8
25 192.50
Interpretation:
2. Median: 50% of the selling prices were below $219000 and 50% were above it.
3. Mode: $180000 is the most appeared selling price in the series of data.
4. Range: The range is distorted because the highest value $349000 is extremely large compare
to the lowest value which is $129000 which means range is $220000.
5. Standard Deviation: Here the standard deviation $48008 which is higher so it indicates that
the data are spread out over a large range of values.
9
6. Variance: As the variance here $2305.3 is very high means that the data are well spread out
from the mean and from one another.
7 ( Q1): 25 % of the selling prices are below $192500 and 75% are above it.
8. (Q2): 50% of the selling prices are below $219000 and 50% are above it.
9. (Q3) : 75% of the selling prices are below $256500 and 50% are above it.
10. D1 : 10% of the homes sold below $172500 & 90% above it.
11.D2 : 20% of the of the selling price which is D2 are below $182800 and 70% above it.
12. D3: 30% of the of the selling price which is D3 are below $197000 and 70% above it.
14. D4: 40 % of the of the selling price which is D4 are below $211200 and 60% above it.
15. D5: 50% of the of the selling price which is D5 are below $219000 and 50% above it.
16. D6: 60% of the of the selling price which is D6 are below $231400 and 40% above it.
17.D7: 70% of the homes sold below $250000 & 30% above it.
18.D8: 80% of the homes sold below $271800 & 90% above it.
19.D9: 90% of the homes sold below 297400 & 10% above it.
20. 85th percentile (P 85): 85% of the selling prices are below $276550 and 45% are above it
21.Skewness is .421for the variable selling price. The distribution is positively week skewed as it
is .421. which is near to 1. The standard error of skewness is 26.6%.
e.
Here the mean counted from grouped data is x̅=223.90 and mean from raw data is x̅=226.27. As
raw data mean is always more appropriate and group data mean provides approximate result I
will select mean from raw data (c) which is x̅=226.27.
f.
10
Chebyshev’s theorem is concerned with any set of distribution of values can have any
shapes.90% of the selling price lie within the mean (X̄) ± two standard deviations (S). The rule
used is called “Empirical Rule”, sometimes called “Normal Rule”, which establishes the mean
relationship between Arithmetic Mean and Standard Deviation, assuming that data set has
symmetrical distribution.
Normal distribution
Range = $192032
Range = $96016.
11
47.5% of the selling price of homes will be within $130254 to $226270.
g.
An outlier is a value that is inconsistent with the rest of the data. This value can be extremely
large or small compared to rest of the data set.
Q1 = (82 + 1) / 4 Q3 = 3 × (82 + 1) / 4
12
There is no outlier from the lower and upper part in the Selling prices of Homes.
Excercise:2
Interpretation: Bar chart is use for representing the qualitative data . here Township is a
qualitative variable. In the bar , the highest homes is 21 have been sold at gulshan. The majority
of the homes 56 out of 82 have been sold in gulshan , niketan & banani. The lowest is 12 homes
have been sold at bashundhara.
13
Chart 2: Pie chart for ‘Township’.
Interpretation: Pie chart is use for representing the qualitative data in percentages. here,
Township is a qualitative variable. In the bar , the highest homes is 25.61% have been sold at
gulshan. The majority of the homes 68.29% out of 100% have been sold in gulshan , niketan &
banani. The lowest is 14.63 % homes have been sold at bashundhara.
Exercise:3
Senior Citizens 18 32 33 40 45 47 53 58
74 77 80 80 82 87 88
14
Young Adults 28 31 37 40 41 42 49 50
51 55 57 59 82 84 86
a.
Since, 33.79% < 37.60%, the variation of CDs owned by Young Adult is less than that of Senior
Citizens.
b.
= 4th position
=40
=3((15+1)/4)th position
=48/4 th position
=12th position
= 59
15
75% of the CDs owned by Young Adults are below 59 & 25% are above it.
30% of the CDs owned by Young Adults are below 40.8 & 70% are above it.
iii. IQR: Q3 – Q1= 59 – 40 = 19, 19 is the variation of the middle 50% of the CDs owned by
Young Adults.
Iv. Median shall be calculated to find the appropriate measure of central tendency. Median is not
affected by extremely large or small values.
So, Median: 50
Exercise :4
I.D. 2016309660
A 4 93%
A- 3.7 90-92%
B+ 3.3 88%
B 3 85%
B- 2.7 81%
C+ 2.3 78%
16
C- 1.7 75%
D+ 1.3 68%
D 1
F 0
I 0
W 0
R 0
18
∑( x . w)
X̄ =
∑w
17
(x 1. w 1+ x 2. w 2+ x 3. w 3+ x 4. w 4 + x 5. w 5+ x 6. w 6+ x 7.w 7 + x 8. w 8)
= w 1+ w 2+w 3+ w 4 +w 5+ w 6+w 7 +w 8
Fall20 2.3
Spring 21 4
-------------
18