[go: up one dir, main page]

0% found this document useful (0 votes)
15 views36 pages

Oup 8

Download as pdf or txt
Download as pdf or txt
Download as pdf or txt
You are on page 1/ 36

DATA ANALYSIS USING SPSS

MBA 112 – Business Research Methods

Lab Report submitted to


DEPARTMENTOF MANAGEMENT
CENTRAL UNIVERSITY OF TAMILNADU
THIRUVARUR

Inthe
partialfulfillmentoftherequirements for
the Award of the Degree of
MASTER OF BUSINESS ADMINISTRATION
(Batch: 2022- 2024)
By

PRAKASH RAJ , P221836


PRIYADHARSHINI M, P221837
RAGAVENDRAN , P221838
SANJAY YADAV, P221839
SETHU VISHNU, P221840
August 2023

Submitted

Dr.R.VISALAKSHMI
AssistantProfessor
Departmentof Management
CENTRAL UNIVERSITY OF TAMIL NADU
Thiruvarur–610005
TABLE OF CONTENTS

S.no NAME OF THE EXPERIMENT Page no

1 FREQUENCIES

2. DESCRIPTIVE STATISTICS 1

3. DESCRIPTIVE STATISTICS 2

4. SIMPLE CORRELATION

5. MULTIPLE CORRELATION

6. SIMPLE REGRESSION

7. MULTIPLE REGRESSION

8. CHI SQUARE TEST

9. ONE SAMPLE T TEST

10. PAIRED SAMPLE T TEST


1 .FREQUENCIES

Frequency distributions are visual displays that organise and present


frequency counts so that the information can be interpreted more easily.
Frequency distributions can show absolute frequencies or relative
frequencies, such as proportions or percentages.

THE PROBLEM: To investigate the height and weight of gender using


the SPSS SOFTWARE which will analyse the data given and then can
conclude,
Steps involved in it are as follows:
Step1: Import excel data

Step 2: Analyse the data


ANALYSE → DESCRIPTIVE STATISTICS → FREQUENCIES
Step 3: Plot the variables:
Id,gender,hieght and weightand then click OK
Step 4: The output will be generated.

Frequencies.

Statistics
Height Weight
N Valid 10 10
Missing 0 0
Mean 65.80 133.00
Std. Error of Mean .757 4.243
Median 66.00 132.00
a
Mode 64 116a
Std. Deviation 2.394 13.416
Variance 5.733 180.000
Skewness .233 .278
Std. Error of Skewness .687 .687
Kurtosis -.369 -1.400
Std. Error of Kurtosis 1.334 1.334
Range 8 37
Minimum 62 116
Maximum 70 153
Sum 658 1330
Percentiles 10 62.20 116.20
20 64.00 118.80
30 64.00 122.60
40 64.80 125.60
50 66.00 132.00
60 66.00 137.20
70 67.40 142.20
80 68.00 149.60
90 69.80 152.80
a. Multiple modes exist. The smallest value is shown
HISTOGRAM
Frequency Table

Height
Cumulative
Frequency Percent Valid Percent Percent
Valid 62 1 10.0 10.0 10.0
64 3 30.0 30.0 40.0
66 3 30.0 30.0 70.0
68 2 20.0 20.0 90.0
70 1 10.0 10.0 100.0
Total 10 100.0 100.0

Weight
Cumulative
Frequency Percent Valid Percent Percent
Valid 116 1 10.0 10.0 10.0
118 1 10.0 10.0 20.0
122 1 10.0 10.0 30.0
124 1 10.0 10.0 40.0
128 1 10.0 10.0 50.0
136 1 10.0 10.0 60.0
138 1 10.0 10.0 70.0
144 1 10.0 10.0 80.0
151 1 10.0 10.0 90.0
153 1 10.0 10.0 100.0
Total 10 100.0 100.0
Step 4: The output will be generated
Descriptive Statistics

Std.
Rang Minim Maxi Deviatio Varia
N e um mum Sum Mean n nce Skewness Kurtosis
Statis Statis Statis Statis Statis Statis Std. Statis Statis Std. Statis Std.
tic tic tic tic tic tic Error Statistic tic tic Error tic Error
DATE 1959. 2001. 3.3E 1.980 149.6 -
169 42.0 .9409 12.2322 .001 .187 .371
1 1 5 E3 26 1.200
GDP 9747. 1024 6.0E 3.573 221.0 2873.15 8.255
169 496.1 .701 .187 -.751 .371
5 3.6 5 E3 122 81 E6
Valid N
169
(listwise)

INTERPRETATION
This data represents a group of individuals with varying heights and
weights. The height data seems to have a slightly right-skewed
distribution . The weight data also shows more variability compared to
height. The range of values, percentiles, and other measures provide a
comprehensive view of the dataset's characteristics.
2. DESCRIPTIVE STATISTICS 1
Descriptive statistics refers to a branch of statistics that involves
summarizing, organizing, and presenting data meaningfully and
concisely. It focuses on describing and analyzing a dataset's main
features and characteristics without making any generalizations or
inferences to a larger population.

THE PROBLEM: To investigate the date and gdp using the SPSS
SOFTWARE we will analyse the data given and then can conclude.

Steps involved in it are as follows:

STEP 1: Open SPSS: Launch the SPSS software on your computer.

STEP 2: Import data from excel

STEP 3 : ANALYSE DESCRIPTIVE DATA


STEP 4 : OUTPUT OF DESCRIPTIVE DATA

Descriptive Statistics

Maximu Std.
N Range Minimum m Mean Deviation Variance Skewness Kurtosis

Std. Std. Std.


Statistic Statistic Statistic Statistic Statistic Error Statistic Statistic Statistic Error Statistic Error

GDP 169 9747.5 496.1 10243.6 3.573E3 221.0122 2873.1581 8.255E6 .701 .187 -.751 .371

Valid N
169
(listwise)

INTERPRETATION:
The mean value is approximately 3572.739, with a relatively high
standard deviation of approximately 2873.1581. The distribution of
values appears to be slightly right-skewed and has less peakedness
compared to a normal distribution.
3.DESCRIPTIVE STATISTICS -2

Descriptive statistics is a set of methods used to summarize and describe the main
features of a dataset, such as its central tendency, variability, and distribution. These
methods provide an overview of the data and help identify patterns and relationships.

THE PROBLEM: To investigate the Number of Employees and Age using the SPSS
SOFTWARE we will analyse the data given and then can conclude.
Steps involved in it are as follows:

Step1: Open SPSS: Launch the SPSS software on your computer.

Step2: Import excel data

Step3:Analyse the descriptive data


Step4: Output of descriptive data

INTERPRETATION

The statistics describe the "Age" dataset, which contains 60 data points. The ages
range from 17 to 64, with an mean of 37.68. The distribution of ages appears to be
slightly left-skewed, indicating that there may be relatively more younger individuals
in the dataset. The summary provides insights into the distribution of ages in the
dataset, their spread, and central tendency.
4.SIMPLE CORRELATION

Simple correlation is a measure used to determine the strength and the direction of
the relationship between two variables

THE PROBLEM : To investigate the Income and Expenditure using the SPSS
SOFTWARE we will analyse the data given and then can conclude
Steps involved in it are as follows:

Step1: Click Analyze > Correlate >Bivariate... on the main menu, You will be
presented with the Bivariate Correlations dialogue box.

Step2: Transfer the variables Income and Expenditure into the Variables box by
dragging-and-dropping them or by clicking on them and then clicking on the button.
Step3: Output of simple correlation
OUTPUT

INTERPRETATION:

The data suggests that there is a significant positive correlation between


income and expenditure, meaning that higher levels of income are
associated with higher levels of expenditure.
5.MULTIPLE CORRELATIONS
A multiple correlation yields the maximum degree of liner relationship
that can be obtained between two or more independent variables and a
single dependent variable.

THE PROBLEM:
To investigate if Government Expenditure, Consumption, Investment has
a correlation with GDP
Using the SPSS SOFTWARE we will analyse the data given and then can
conclude
Steps involved in it are as follows:

STEP 1: IMPORT DATA FROM EXCEL

STEP 2 Analyse the date

ANALYSE → CORRELATE → BIVARIATE


Step 3 : plot the variables in the variables column and mark the
correlation coefficients Pearson and Spearman

Step 4 : The output will be generated .


OUTPUT

Descriptive Statistics

Std.
Mean Deviation N
GDP 1142153.22 233846.634 37
GOVTEXP 125973.30 32913.541 37
CONS 674818.65 136232.151 37
INV 374394.76 88811.183 37

Correlations

GOVTE
GDP XP CONS INV
GDP Pearson
1 .928** .964** .977**
Correlation
Sig. (2-tailed) .000 .000 .000
N 37 37 37 37
GOVTE Pearson
.928** 1 .885** .872**
XP Correlation
Sig. (2-tailed) .000 .000 .000
N 37 37 37 37
CONS Pearson
.964** .885** 1 .952**
Correlation
Sig. (2-tailed) .000 .000 .000
N 37 37 37 37
INV Pearson
.977** .872** .952** 1
Correlation
Sig. (2-tailed) .000 .000 .000
N 37 37 37 37
**. Correlation is significant at the 0.01 level (2-tailed).
Non parametric correlations :

Correlations

GDP GOVTEXP CONS INV


Spearman's GDP Correlation
rho Coefficient 1.000 .922** .965** .972**

Sig. (2-tailed) . .000 .000 .000


N 37 37 37 37
GOVTEXP Correlation
.922** 1.000 .877** .864**
Coefficient
Sig. (2-tailed) .000 . .000 .000
N 37 37 37 37
CONS Correlation
.965** .877** 1.000 .946**
Coefficient
Sig. (2-tailed) .000 .000 . .000
N 37 37 37 37
INV Correlation
.972** .864** .946** 1.000
Coefficient
Sig. (2-tailed) .000 .000 .000 .
N 37 37 37 37
**. Correlation is significant at the 0.01 level (2-tailed).

INTERPRETATION:
The data suggests that GDP, Government Expenditure, Consumption,
and Investment are closely interrelated, with strong positive correlations
among them. An increase in one of these variables is associated with an
increase in the others, pointing towards an interconnected economic
system.
6.SIMPLE REGRESSION
Simple Linear Regression is a type of Regression algorithms that models
the relationship between a dependent variable and a single independent
variable

THE PROBLEM: To identify the relationship between Age of a car and


Maintenance of a car
Using the SPSS SOFTWARE we will analyse the data given and then can
conclude

STEP 1: Import the data from excel

STEP 2 : Analyse the data

ANALYSE →REGRESSION → LINEAR REGRESSION


STEP 3 : Plot the Dependent and Independent variable in respective
columns
STEP 4: The output will be generated

OUTPUT

Variables Entered/Removeda
Mode Variables Variables
l Entered Removed Method
1 Maintenanc
. Enter
e costb
a. Dependent Variable: Age of a car
b. All requested variables entered.

Model Summaryb
Change Statistics
Std. R
Adjust Error Squar Durbi
R ed R of the e F Sig. F n-
Mod Squa Squar Estima Chan Chan df df Chan Wats
el R re e te ge ge 1 2 ge on
1 .99 212.7
a .982 .977 .284 .982 1 4 .000 1.669
1 61
a. Predictors: (Constant), Maintenance cost
b. Dependent Variable: Age of a car

ANOVAa
Sum of Mean
Model Squares df Square F Sig.
1 Regressio
17.177 1 17.177 212.761 .000b
n
Residual .323 4 .081
Total 17.500 5
a. Dependent Variable: Age of a car
b. Predictors: (Constant), Maintenance cost

Coefficientsa
Standardize 95.0%
Unstandardize d Confidence
d Coefficients Coefficients Interval for B
Lowe Uppe
r r
Std. Boun Boun
Model B Error Beta t Sig. d d
1 (Constant) .08
.528 .234 2.254 -.122 1.179
7
Maintenanc 14.58 .00
.435 .030 .991 .352 .518
e cost 6 0
a. Dependent Variable: Age of a car

Residuals Statisticsa
Minimu Maximu Std.
m m Mean Deviation N
Predicted Value 1.40 6.18 3.50 1.853 6
Residual -.398 .297 .000 .254 6
Std. Predicted
-1.134 1.447 .000 1.000 6
Value
Std. Residual -1.401 1.046 .000 .894 6
a. Dependent Variable: Age of a car

INTERPRETATION :

The data is examining the relationship between one independent and dependent
variable. A positive coefficient indicates that as the value of the independent variable
increases, the mean of the dependent variable also tends to increase.
7.MULTIPLE REGRESSIONS

Multiple regression is a statistical technique that can be used to analyze


the relationship between a single dependent variable and several
independent variables.

THE PROBLEM: To investigate if Government Expenditure,


Consumption, Investment has a significant impact on GDP
Using the SPSS SOFTWARE we will analyse the data given and then can
conclude
Steps involved in it are as follows:
Step1: Import excel data

Step 2: Analyse the data


ANALYSE → REGRESSION → LINEAR REGRESSION
Step 3: Plot the Dependent and independent variable
Dependent variable: GDP
Independent Variable: Government Expenditure, Consumption,
Investment and then click OK

Step 4: The output will be generated.


OUTPUT

Regression

Descriptive Statistics
Mean Std. Deviation N
GDP 1.14E6 233846.634 37
GOVTEXP 1.26E5 32913.541 37
CONS 6.75E5 136232.151 37
INV 3.74E5 88811.183 37
Correlations
GDP GOVTEXP CONS INV
Pearson Correlation GDP 1.000 .928 .964 .977
GOVTEXP .928 1.000 .885 .872
CONS .964 .885 1.000 .952
INV .977 .872 .952 1.000
Sig. (1-tailed) GDP . .000 .000 .000
GOVTEXP .000 . .000 .000
CONS .000 .000 . .000
INV .000 .000 .000 .
N GDP 37 37 37 37
GOVTEXP 37 37 37 37
CONS 37 37 37 37
INV 37 37 37 37

Variables Entered/Removedb

Model Variables Entered Variables Removed Method


1 INV, GOVTEXP,
. Enter
CONSa
a. All requested variables entered.
b. Dependent Variable: GDP
Model Summaryb
Change Statistics
Adjusted Std. Error R
R R of the Square F Sig. F Durbin-
Model R Square Square Estimate Change Change df1 df2 Change Watson
1 .991a .982 .980 32780.380 .982 599.683 3 33 .000 2.176
a. Predictors: (Constant), INV,
GOVTEXP, CONS
b. Dependent Variable: GDP

ANOVAb

Model Sum of Squares df Mean Square F Sig.


1 Regression 1.933E12 3 6.444E11 599.683 .000a
Residual 3.546E10 33 1.075E9
Total 1.969E12 36
a. Predictors: (Constant), INV, GOVTEXP, CONS
b. Dependent Variable: GDP

Coefficientsa
Unstandardized Standardized 95% Confidence
Coefficients Coefficients Interval for B
Lower Upper
Model B Std. Error Beta t Sig. Bound Bound
1 (Constant) 123496.030 29386.413 4.202 .000 63708.923 183283.136
GOVTEXP 1.918 .365 .270 5.260 .000 1.176 2.660
CONS .360 .141 .210 2.561 .015 .074 .646
INV 1.427 .205 .542 6.963 .000 1.010 1.843
a. Dependent Variable: GDP
Residuals Statisticsa
Minimum Maximum Mean Std. Deviation N
Predicted Value 7.28E5 1.51E6 1.14E6 231730.967 37
Residual -5.809E4 7.023E4 .000 31384.824 37
Std. Predicted Value -1.789 1.602 .000 1.000 37
Std. Residual -1.772 2.142 .000 .957 37
a. Dependent Variable: GDP

INTERPRETATION:

There is a significant difference between the Government Expenditure,


Consumption and investment on GDP . The analysis explores the
relationships between GDP and the other variables. The regression model
shows that these variables have a significant combined impact on GDP.
The correlations suggest strong positive linear relationships among the
variables.
8.CHI SQUARE TEST
The Chi-Square test is a statistical procedure for determining the
difference between observed and expected This test can also be used to
determine whether it correlates to the categorical variables in our data. It
helps to find out whether a difference between two categorical variables
is due to chance or a relationship between them.

The null and alternative hypothesis will be:

H 0 : There is no association between smoking and cancer

H 1 : There is an association between smoking and cancer.

Using the SPSS SOFTWARE we will analyse the data given and then
can conclude

Step 1: import the data from excel


STEP:2 Analyse the data
ANALYSE → REGRESSION → LINEAR REGRESSION

STEP 3 :Define your null and alternative hypotheses in a respective way.


STEP 4: OUTPUT WILL BE GENERATED AS FOLLOWS:

Cancer
Observed N Expected N Residual
1 27 25.0 2.0
2 23 25.0 -2.0
Total 50

Test Statistics
Cancer
Chi-Square .320a
df 1
Asymp. Sig. .572
a. 0 cells (0.0%) have
expected frequencies less than
5. The minimum expected cell
frequency is 25.0.

INTERPRETATION:
The provided data shows a comparison between observed and expected cancer cases in two
categories .The expected number of cases in the first category was 25.0. For the second
category, there were 23 observed cases of cancer . Positive residuals indicate over
performance, while negative residuals indicate underperformance compared to the expected
values
9.ONE PAIRED T TEST

The paired sample t-test, sometimes called the dependent sample t-test,
is a statistical procedure used to determine whether the mean difference
between two sets of observations is zero.

THE PROBLEM: To investigate the I’d and scores of the students


using the SPSS SOFTWARE we will analyse the data given and then can
conclude
Steps involved in it are as follows:
Step1: Import excel data

STEP 2: Analyses the data


ANALYSE → COMPARE MEANS AND PROPORTIONS → ONE
SAMPLE T TEST
STEP 3 :plot the variables

STEP 4: OUTPUT WILL BE GENERATED


One-Sample Statistics
N Mean Std. Deviation Std. Error Mean
Scores 11 31.300 1.6763 .5054

One-Sample Test
Test Value = 0
95% Confidence Interval of the
Difference
t df Sig. (2-tailed) Mean Difference Lower Upper
Scores 61.928 10 .000 31.3000 30.174 32.426

INTERPRETATION:

The dataset’s mean score is 31.300 with a relatively low variability and a small standard error,
suggesting that the average score is a reliable estimate of the population mean.

10. PAIRED SAMPLE T TEST

The Paired-Samples T Test procedure compares the means of two


variables for a single group. The procedure computes the differences
between values of the two variables for each case and tests whether the
average differs from 0.
THE PROBLEM: To investigate the difference between before sales and
after sales using the SPSS SOFTWARE we will analyse the data given
and then can conclude
Steps involved in it are as follows:
Step1: Import excel data

Step 2: Analyse the data


ANALYSE → COMPARE MEANS AND PROPORTIONS →
PAIRED SAMPLE T TEST
Step 3: plot the variables.

OUTPUT:

Paired Samples Statistics


Mean N Std. Deviation Std. Error Mean
Pair 1 Before Sales 14.50 10 5.836 1.845
After Sales 17.50 10 3.567 1.128

Paired Samples Correlations


N Correlation Sig.
Pair 1 Before Sales & After Sales 10 .841 .002

Paired Samples Test


Paired Differences
95% Confidence
Interval of the
Std. Std. Error Difference Sig. (2-
Mean Deviation Mean Lower Upper t df tailed)
Pair Before Sales -
-3.000 3.432 1.085 -5.455 -.545 -2.764 9 .022
1 After Sales

INTERPRETATION:

These paired sample statistics compare Before Sales and after


Sales data. The mean sales increased from 14.50 before to
17.50 after. The variability decreased from 5.836 to 3.567. This
indicates that, on average, sales increased after the observed
change.
GROUP 8

You might also like