Oup 8

DATA ANALYSIS USING SPSS
MBA 112 – Business Research Methods
Lab Report submitted to

DEPARTMENTOF MANAGEMENT
CENTRAL UNIVERSITY OF TAMILNADU
THIRUVARUR
Inthe
partialfulfillmentoftherequirements for
the Award of the Degree of
MASTER OF BUSINESS ADMINISTRATION
(Batch: 2022- 2024)
By
PRAKASH RAJ , P221836

PRIYADHARSHINI M, P221837
RAGAVENDRAN , P221838
SANJAY YADAV, P221839
SETHU VISHNU, P221840
August 2023
Submitted
Dr.R.VISALAKSHMI
AssistantProfessor
Departmentof Management
CENTRAL UNIVERSITY OF TAMIL NADU
Thiruvarur–610005
TABLE OF CONTENTS
S.no NAME OF THE EXPERIMENT Page no
1 FREQUENCIES
2. DESCRIPTIVE STATISTICS 1
4. SIMPLE CORRELATION
5. MULTIPLE CORRELATION
6. SIMPLE REGRESSION
7. MULTIPLE REGRESSION
8. CHI SQUARE TEST
9. ONE SAMPLE T TEST
10. PAIRED SAMPLE T TEST

1 .FREQUENCIES
Frequency distributions are visual displays that organise and present

frequency counts so that the information can be interpreted more easily.
Frequency distributions can show absolute frequencies or relative
frequencies, such as proportions or percentages.
THE PROBLEM: To investigate the height and weight of gender using

the SPSS SOFTWARE which will analyse the data given and then can
conclude,
Steps involved in it are as follows:
Step1: Import excel data
Step 2: Analyse the data

ANALYSE → DESCRIPTIVE STATISTICS → FREQUENCIES
Step 3: Plot the variables:
Id,gender,hieght and weightand then click OK
Step 4: The output will be generated.
Frequencies.
Statistics
Height Weight
N Valid 10 10
Missing 0 0
Mean 65.80 133.00
Std. Error of Mean .757 4.243
Median 66.00 132.00
a
Mode 64 116a
Std. Deviation 2.394 13.416
Variance 5.733 180.000
Skewness .233 .278
Std. Error of Skewness .687 .687
Kurtosis -.369 -1.400
Std. Error of Kurtosis 1.334 1.334
Range 8 37
Minimum 62 116
Maximum 70 153
Sum 658 1330
Percentiles 10 62.20 116.20
20 64.00 118.80
30 64.00 122.60
40 64.80 125.60
50 66.00 132.00
60 66.00 137.20
70 67.40 142.20
80 68.00 149.60
90 69.80 152.80
a. Multiple modes exist. The smallest value is shown
HISTOGRAM
Frequency Table
Height
Cumulative
Frequency Percent Valid Percent Percent
Valid 62 1 10.0 10.0 10.0
64 3 30.0 30.0 40.0
66 3 30.0 30.0 70.0
68 2 20.0 20.0 90.0
70 1 10.0 10.0 100.0
Total 10 100.0 100.0
Weight
Cumulative
Frequency Percent Valid Percent Percent
Valid 116 1 10.0 10.0 10.0
118 1 10.0 10.0 20.0
122 1 10.0 10.0 30.0
124 1 10.0 10.0 40.0
128 1 10.0 10.0 50.0
136 1 10.0 10.0 60.0
138 1 10.0 10.0 70.0
144 1 10.0 10.0 80.0
151 1 10.0 10.0 90.0
153 1 10.0 10.0 100.0
Total 10 100.0 100.0
Step 4: The output will be generated
Descriptive Statistics
Std.
Rang Minim Maxi Deviatio Varia
N e um mum Sum Mean n nce Skewness Kurtosis
Statis Statis Statis Statis Statis Statis Std. Statis Statis Std. Statis Std.
tic tic tic tic tic tic Error Statistic tic tic Error tic Error
DATE 1959. 2001. 3.3E 1.980 149.6 -
169 42.0 .9409 12.2322 .001 .187 .371
1 1 5 E3 26 1.200
GDP 9747. 1024 6.0E 3.573 221.0 2873.15 8.255
169 496.1 .701 .187 -.751 .371
5 3.6 5 E3 122 81 E6
Valid N
169
(listwise)
INTERPRETATION
This data represents a group of individuals with varying heights and
weights. The height data seems to have a slightly right-skewed
distribution . The weight data also shows more variability compared to
height. The range of values, percentiles, and other measures provide a
comprehensive view of the dataset's characteristics.
Descriptive statistics refers to a branch of statistics that involves
summarizing, organizing, and presenting data meaningfully and
concisely. It focuses on describing and analyzing a dataset's main
features and characteristics without making any generalizations or
inferences to a larger population.
THE PROBLEM: To investigate the date and gdp using the SPSS
SOFTWARE we will analyse the data given and then can conclude.
STEP 1: Open SPSS: Launch the SPSS software on your computer.
STEP 2: Import data from excel
STEP 3 : ANALYSE DESCRIPTIVE DATA

STEP 4 : OUTPUT OF DESCRIPTIVE DATA
Maximu Std.
N Range Minimum m Mean Deviation Variance Skewness Kurtosis
Std. Std. Std.

Statistic Statistic Statistic Statistic Statistic Error Statistic Statistic Statistic Error Statistic Error
GDP 169 9747.5 496.1 10243.6 3.573E3 221.0122 2873.1581 8.255E6 .701 .187 -.751 .371
Valid N
169
(listwise)
INTERPRETATION:
The mean value is approximately 3572.739, with a relatively high
standard deviation of approximately 2873.1581. The distribution of
values appears to be slightly right-skewed and has less peakedness
compared to a normal distribution.
3.DESCRIPTIVE STATISTICS -2
Descriptive statistics is a set of methods used to summarize and describe the main
features of a dataset, such as its central tendency, variability, and distribution. These
methods provide an overview of the data and help identify patterns and relationships.
THE PROBLEM: To investigate the Number of Employees and Age using the SPSS
SOFTWARE we will analyse the data given and then can conclude.
Step1: Open SPSS: Launch the SPSS software on your computer.
Step3:Analyse the descriptive data

Step4: Output of descriptive data
INTERPRETATION
The statistics describe the "Age" dataset, which contains 60 data points. The ages
range from 17 to 64, with an mean of 37.68. The distribution of ages appears to be
slightly left-skewed, indicating that there may be relatively more younger individuals
in the dataset. The summary provides insights into the distribution of ages in the
dataset, their spread, and central tendency.
4.SIMPLE CORRELATION
Simple correlation is a measure used to determine the strength and the direction of
the relationship between two variables
THE PROBLEM : To investigate the Income and Expenditure using the SPSS
SOFTWARE we will analyse the data given and then can conclude
Step1: Click Analyze > Correlate >Bivariate... on the main menu, You will be
presented with the Bivariate Correlations dialogue box.
Step2: Transfer the variables Income and Expenditure into the Variables box by
dragging-and-dropping them or by clicking on them and then clicking on the button.
Step3: Output of simple correlation
OUTPUT
INTERPRETATION:
The data suggests that there is a significant positive correlation between

income and expenditure, meaning that higher levels of income are
associated with higher levels of expenditure.
5.MULTIPLE CORRELATIONS
A multiple correlation yields the maximum degree of liner relationship
that can be obtained between two or more independent variables and a
single dependent variable.
THE PROBLEM:
To investigate if Government Expenditure, Consumption, Investment has
a correlation with GDP
Using the SPSS SOFTWARE we will analyse the data given and then can
conclude
STEP 1: IMPORT DATA FROM EXCEL
STEP 2 Analyse the date
ANALYSE → CORRELATE → BIVARIATE

Step 3 : plot the variables in the variables column and mark the
correlation coefficients Pearson and Spearman
Step 4 : The output will be generated .

OUTPUT
Std.
Mean Deviation N
GDP 1142153.22 233846.634 37
GOVTEXP 125973.30 32913.541 37
CONS 674818.65 136232.151 37
INV 374394.76 88811.183 37
Correlations
GOVTE
GDP XP CONS INV
GDP Pearson
1 .928** .964** .977**
Correlation
Sig. (2-tailed) .000 .000 .000
N 37 37 37 37
GOVTE Pearson
.928** 1 .885** .872**
XP Correlation
Sig. (2-tailed) .000 .000 .000
N 37 37 37 37
CONS Pearson
.964** .885** 1 .952**
Correlation
Sig. (2-tailed) .000 .000 .000
N 37 37 37 37
INV Pearson
.977** .872** .952** 1
Correlation
Sig. (2-tailed) .000 .000 .000
N 37 37 37 37
**. Correlation is significant at the 0.01 level (2-tailed).
Non parametric correlations :
Correlations
GDP GOVTEXP CONS INV

Spearman's GDP Correlation
rho Coefficient 1.000 .922** .965** .972**
Sig. (2-tailed) . .000 .000 .000

N 37 37 37 37
GOVTEXP Correlation
.922** 1.000 .877** .864**
Coefficient
Sig. (2-tailed) .000 . .000 .000
N 37 37 37 37
CONS Correlation
.965** .877** 1.000 .946**
Coefficient
Sig. (2-tailed) .000 .000 . .000
N 37 37 37 37
INV Correlation
.972** .864** .946** 1.000
Coefficient
Sig. (2-tailed) .000 .000 .000 .
N 37 37 37 37
**. Correlation is significant at the 0.01 level (2-tailed).
INTERPRETATION:
The data suggests that GDP, Government Expenditure, Consumption,
and Investment are closely interrelated, with strong positive correlations
among them. An increase in one of these variables is associated with an
increase in the others, pointing towards an interconnected economic
system.
6.SIMPLE REGRESSION
Simple Linear Regression is a type of Regression algorithms that models
the relationship between a dependent variable and a single independent
variable
THE PROBLEM: To identify the relationship between Age of a car and

Maintenance of a car
conclude
STEP 1: Import the data from excel
STEP 2 : Analyse the data
ANALYSE →REGRESSION → LINEAR REGRESSION

STEP 3 : Plot the Dependent and Independent variable in respective
columns
STEP 4: The output will be generated
OUTPUT
Variables Entered/Removeda
Mode Variables Variables
l Entered Removed Method
1 Maintenanc
. Enter
e costb
a. Dependent Variable: Age of a car
b. All requested variables entered.
Model Summaryb
Change Statistics
Std. R
Adjust Error Squar Durbi
R ed R of the e F Sig. F n-
Mod Squa Squar Estima Chan Chan df df Chan Wats
el R re e te ge ge 1 2 ge on
1 .99 212.7
a .982 .977 .284 .982 1 4 .000 1.669
1 61
a. Predictors: (Constant), Maintenance cost
b. Dependent Variable: Age of a car
ANOVAa
Sum of Mean
Model Squares df Square F Sig.
1 Regressio
17.177 1 17.177 212.761 .000b
n
Residual .323 4 .081
Total 17.500 5
b. Predictors: (Constant), Maintenance cost
Coefficientsa
Standardize 95.0%
Unstandardize d Confidence
d Coefficients Coefficients Interval for B
Lowe Uppe
r r
Std. Boun Boun
Model B Error Beta t Sig. d d
1 (Constant) .08
.528 .234 2.254 -.122 1.179
7
Maintenanc 14.58 .00
.435 .030 .991 .352 .518
e cost 6 0
Residuals Statisticsa
Minimu Maximu Std.
m m Mean Deviation N
Predicted Value 1.40 6.18 3.50 1.853 6
Residual -.398 .297 .000 .254 6
Std. Predicted
-1.134 1.447 .000 1.000 6
Value
Std. Residual -1.401 1.046 .000 .894 6
INTERPRETATION :
The data is examining the relationship between one independent and dependent
variable. A positive coefficient indicates that as the value of the independent variable
increases, the mean of the dependent variable also tends to increase.
7.MULTIPLE REGRESSIONS
Multiple regression is a statistical technique that can be used to analyze

the relationship between a single dependent variable and several
independent variables.
THE PROBLEM: To investigate if Government Expenditure,

Consumption, Investment has a significant impact on GDP
conclude

ANALYSE → REGRESSION → LINEAR REGRESSION
Step 3: Plot the Dependent and independent variable
Dependent variable: GDP
Independent Variable: Government Expenditure, Consumption,
Investment and then click OK
Step 4: The output will be generated.

OUTPUT
Regression
Mean Std. Deviation N
GDP 1.14E6 233846.634 37
GOVTEXP 1.26E5 32913.541 37
CONS 6.75E5 136232.151 37
INV 3.74E5 88811.183 37
Correlations
GDP GOVTEXP CONS INV
Pearson Correlation GDP 1.000 .928 .964 .977
GOVTEXP .928 1.000 .885 .872
CONS .964 .885 1.000 .952
INV .977 .872 .952 1.000
Sig. (1-tailed) GDP . .000 .000 .000
GOVTEXP .000 . .000 .000
CONS .000 .000 . .000
INV .000 .000 .000 .
N GDP 37 37 37 37
GOVTEXP 37 37 37 37
CONS 37 37 37 37
INV 37 37 37 37
Variables Entered/Removedb
Model Variables Entered Variables Removed Method

1 INV, GOVTEXP,
. Enter
CONSa
a. All requested variables entered.
b. Dependent Variable: GDP
Model Summaryb
Change Statistics
Adjusted Std. Error R
R R of the Square F Sig. F Durbin-
Model R Square Square Estimate Change Change df1 df2 Change Watson
1 .991a .982 .980 32780.380 .982 599.683 3 33 .000 2.176
a. Predictors: (Constant), INV,
GOVTEXP, CONS
ANOVAb
Model Sum of Squares df Mean Square F Sig.

1 Regression 1.933E12 3 6.444E11 599.683 .000a
Residual 3.546E10 33 1.075E9
Total 1.969E12 36
a. Predictors: (Constant), INV, GOVTEXP, CONS
Coefficientsa
Unstandardized Standardized 95% Confidence
Coefficients Coefficients Interval for B
Lower Upper
Model B Std. Error Beta t Sig. Bound Bound
1 (Constant) 123496.030 29386.413 4.202 .000 63708.923 183283.136
GOVTEXP 1.918 .365 .270 5.260 .000 1.176 2.660
CONS .360 .141 .210 2.561 .015 .074 .646
INV 1.427 .205 .542 6.963 .000 1.010 1.843
a. Dependent Variable: GDP
Residuals Statisticsa
Minimum Maximum Mean Std. Deviation N
Predicted Value 7.28E5 1.51E6 1.14E6 231730.967 37
Residual -5.809E4 7.023E4 .000 31384.824 37
Std. Predicted Value -1.789 1.602 .000 1.000 37
Std. Residual -1.772 2.142 .000 .957 37
a. Dependent Variable: GDP
INTERPRETATION:
There is a significant difference between the Government Expenditure,

Consumption and investment on GDP . The analysis explores the
relationships between GDP and the other variables. The regression model
shows that these variables have a significant combined impact on GDP.
The correlations suggest strong positive linear relationships among the
variables.
8.CHI SQUARE TEST
The Chi-Square test is a statistical procedure for determining the
difference between observed and expected This test can also be used to
determine whether it correlates to the categorical variables in our data. It
helps to find out whether a difference between two categorical variables
is due to chance or a relationship between them.
The null and alternative hypothesis will be:
H 0 : There is no association between smoking and cancer
H 1 : There is an association between smoking and cancer.
Using the SPSS SOFTWARE we will analyse the data given and then
can conclude
Step 1: import the data from excel

STEP:2 Analyse the data
ANALYSE → REGRESSION → LINEAR REGRESSION
STEP 3 :Define your null and alternative hypotheses in a respective way.

STEP 4: OUTPUT WILL BE GENERATED AS FOLLOWS:
Cancer
Observed N Expected N Residual
1 27 25.0 2.0
2 23 25.0 -2.0
Total 50
Test Statistics
Cancer
Chi-Square .320a
df 1
Asymp. Sig. .572
a. 0 cells (0.0%) have
expected frequencies less than
5. The minimum expected cell
frequency is 25.0.
INTERPRETATION:
The provided data shows a comparison between observed and expected cancer cases in two
categories .The expected number of cases in the first category was 25.0. For the second
category, there were 23 observed cases of cancer . Positive residuals indicate over
performance, while negative residuals indicate underperformance compared to the expected
values
9.ONE PAIRED T TEST
The paired sample t-test, sometimes called the dependent sample t-test,
is a statistical procedure used to determine whether the mean difference
between two sets of observations is zero.
THE PROBLEM: To investigate the I’d and scores of the students

using the SPSS SOFTWARE we will analyse the data given and then can
conclude
STEP 2: Analyses the data

ANALYSE → COMPARE MEANS AND PROPORTIONS → ONE
SAMPLE T TEST
STEP 3 :plot the variables
STEP 4: OUTPUT WILL BE GENERATED

One-Sample Statistics
N Mean Std. Deviation Std. Error Mean
Scores 11 31.300 1.6763 .5054
One-Sample Test
Test Value = 0
95% Confidence Interval of the
Difference
t df Sig. (2-tailed) Mean Difference Lower Upper
Scores 61.928 10 .000 31.3000 30.174 32.426
INTERPRETATION:
The dataset’s mean score is 31.300 with a relatively low variability and a small standard error,
suggesting that the average score is a reliable estimate of the population mean.
10. PAIRED SAMPLE T TEST
The Paired-Samples T Test procedure compares the means of two

variables for a single group. The procedure computes the differences
between values of the two variables for each case and tests whether the
average differs from 0.
THE PROBLEM: To investigate the difference between before sales and
after sales using the SPSS SOFTWARE we will analyse the data given
and then can conclude

ANALYSE → COMPARE MEANS AND PROPORTIONS →
PAIRED SAMPLE T TEST
Step 3: plot the variables.
OUTPUT:
Paired Samples Statistics

Mean N Std. Deviation Std. Error Mean
Pair 1 Before Sales 14.50 10 5.836 1.845
After Sales 17.50 10 3.567 1.128
Paired Samples Correlations

N Correlation Sig.
Pair 1 Before Sales & After Sales 10 .841 .002
Paired Samples Test

Paired Differences
95% Confidence
Interval of the
Std. Std. Error Difference Sig. (2-
Mean Deviation Mean Lower Upper t df tailed)
Pair Before Sales -
-3.000 3.432 1.085 -5.455 -.545 -2.764 9 .022
1 After Sales
INTERPRETATION:
These paired sample statistics compare Before Sales and after

Sales data. The mean sales increased from 14.50 before to
17.50 after. The variability decreased from 5.836 to 3.567. This
indicates that, on average, sales increased after the observed
change.
GROUP 8

Oup 8

Uploaded by

Document Informationclick to expand document information

Copyright:

Available Formats

Oup 8

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Oup 8

Uploaded by

Copyright:

Available Formats

DATA ANALYSIS USING SPSS

MBA 112 – Business Research Methods

Lab Report submitted to

PRAKASH RAJ , P221836

S.no NAME OF THE EXPERIMENT Page no

8. CHI SQUARE TEST

9. ONE SAMPLE T TEST

10. PAIRED SAMPLE T TEST

Frequency distributions are visual displays that organise and present

THE PROBLEM: To investigate the height and weight of gender using

Step 2: Analyse the data

Steps involved in it are as follows:

STEP 1: Open SPSS: Launch the SPSS software on your computer.

STEP 2: Import data from excel

STEP 3 : ANALYSE DESCRIPTIVE DATA

Std. Std. Std.

Step1: Open SPSS: Launch the SPSS software on your computer.

Step2: Import excel data

Step3:Analyse the descriptive data

The data suggests that there is a significant positive correlation between

STEP 1: IMPORT DATA FROM EXCEL

STEP 2 Analyse the date

ANALYSE → CORRELATE → BIVARIATE

Step 4 : The output will be generated .

GDP GOVTEXP CONS INV

Sig. (2-tailed) . .000 .000 .000

THE PROBLEM: To identify the relationship between Age of a car and

STEP 1: Import the data from excel

STEP 2 : Analyse the data

ANALYSE →REGRESSION → LINEAR REGRESSION

Multiple regression is a statistical technique that can be used to analyze

THE PROBLEM: To investigate if Government Expenditure,

Step 2: Analyse the data

Step 4: The output will be generated.

Model Variables Entered Variables Removed Method

Model Sum of Squares df Mean Square F Sig.

There is a significant difference between the Government Expenditure,

The null and alternative hypothesis will be:

H 0 : There is no association between smoking and cancer

H 1 : There is an association between smoking and cancer.

Step 1: import the data from excel

STEP 3 :Define your null and alternative hypotheses in a respective way.

THE PROBLEM: To investigate the I’d and scores of the students

STEP 2: Analyses the data

STEP 4: OUTPUT WILL BE GENERATED

10. PAIRED SAMPLE T TEST

The Paired-Samples T Test procedure compares the means of two

Step 2: Analyse the data

Paired Samples Statistics

Paired Samples Correlations

Paired Samples Test

These paired sample statistics compare Before Sales and after

You might also like