Introduction and Overview

The document outlines a Business Statistics course that spans 4 weeks and includes 12 sessions focused on data and descriptive statistics. Key topics include data types, collection methods, descriptive and inferential statistics, and measures of central tendency and dispersion. The course features assignments, a mid-term exam, and a final exam, with an emphasis on academic integrity and the use of AI.



1. Introduction to Data & Descriptive Statistics

Business Statistics
June 30, 2025

Parasuram Balasubramanian
Assistant Professor of Strategy
About the course

• 4 weeks, 12 sessions

• Sessions: lecture + exercise

• Assigned seating and name tags

• Attendance – Programs office policy

• Punctuality

• Office hours

– Thursday: 11 am – 12 pm, 2 – 3 pm

– After lectures, or email for appointment


• Reference books if needed (see course outline)

• Assignments: 3 (due on 10/07, 18/07, 25/07)

• Mid-term exam: written, on paper (July 12, Saturday)

• Final exam: written + MS Excel (July 28, Monday)

• Use of AI

• Honor code and academic integrity

Session outline
1. Introduction to Data & Descriptive Statistics
2. Introduction to Probability
3. Probability Distributions
4. Sampling and Estimation
5. Hypothesis Testing
6. Comparing Groups: independent and paired t-tests
7. Simple Linear Regression
8. Simple Linear Regression
9. Multiple Linear Regression
10. Multiple Linear Regression
11. Non-Linear Response
12. Miscellaneous Topics and Recap

Outline
• What is Business Statistics? Its importance in business decision-making.

• Types of data: categorical vs. numerical, scales of measurement (nominal, ordinal, interval, ratio)

• Data collection methods: surveys, experiments, observations, and sampling techniques

• Descriptive statistics: measures of central tendency (mean, median, mode) and dispersion (range, variance, standard deviation)

• Data visualizations

• In-class exercise

What is Statistics?

• Statistics: the collection, analysis, interpretation, and presentation of data

• Descriptive statistics: organize and summarize data

– Includes measures such as mean, median, standard deviation, and visualization tools

• Inferential statistics: a formal method for drawing conclusions from data

– Uses probability to determine confidence in the conclusions

– Includes hypothesis testing, confidence intervals, regression analysis

What is Business Statistics?
• Business statistics is the application of statistical techniques to analyze and interpret data for effective decision-making

• Involves collecting, summarizing, and interpreting quantitative information to measure performance, identify trends, and predict future outcomes

• Use cases

– to analyze sales data

– to understand customer behavior

– to improve operational efficiency

– to enable managers to make data-driven decisions

Types of data
• Numerical data – counts or measurements of attributes of a population

– Number of people in a town, amount of money, number of students in a university, stock price

– Discrete: number of students, number of children, number of stocks in a portfolio

– Continuous: height, weight, stock prices

• Categorical data

– Type of car: sedan, hatchback, SUV

– Movie genre: action, comedy, drama, kids

– Education level: dropout, high school, college, master’s, PhD

Methods of data collection
• Questionnaires, surveys

– Customer feedback, common in primary market research

• Experiments: hypothesis testing in controlled settings

– A/B testing in marketing campaigns, testing of new drugs

• Observational studies: collect data without intervening in what is being observed

– Tracking number of customers in a store

• Archival or secondary data sources

– Quantitative data such as stock prices, financial data

Descriptive vs Inferential statistics
• Descriptive statistics summarizes and provides a description of the sample (dataset)

– Includes measures such as mean, median, standard deviation, minimum, maximum

– Visualizations such as histograms, box plots

• Inferential statistics uses sample data to make an inference or prediction about a population

– Hypothesis testing, confidence intervals, regression analysis

Scales of measurement
Variable

• Categorical (qualitative)

– Nominal: categories without any particular order, e.g., color, marital status

– Ordinal: categories that can be ordered, e.g., rankings, education level

• Numerical (quantitative)

– Discrete: a variable that takes on distinct, countable values, e.g., number of steps, number of births

– Continuous: a variable that can take on any value within a range, with infinitely many possible values in that range, e.g., distance walked, weight of newborn babies
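
A minimal Python sketch of why the nominal vs. ordinal distinction matters in practice: ordinal categories can be ranked, so a "middle" category is meaningful. The category order, the responses, and the names EDUCATION_ORDER and respondents are hypothetical, chosen only for illustration.

```python
# Ordinal categories have a meaningful order, so they can be ranked.
# The order and the responses below are hypothetical illustration data.
EDUCATION_ORDER = ["dropout", "high school", "college", "master's", "PhD"]
respondents = ["college", "PhD", "high school", "college", "dropout"]

# Sort responses by their position in the category order.
ranked = sorted(respondents, key=EDUCATION_ORDER.index)

# With an odd number of responses, the middle one is the median category.
median_level = ranked[len(ranked) // 2]

print(ranked)
print(median_level)
```

For a nominal variable such as color there is no such order, so only counts and the mode are meaningful summaries.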
Descriptive statistics

Measures of central tendency

• Central tendency – extent to which all data values group around a typical or central value
– Mean, median, mode

• The mean is used most often, unless outliers or extreme values exist

• Since the median is not sensitive to outliers, it is also commonly used
– E.g., median home prices are often reported

• Mean and median together
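
The effect of an extreme value on the mean and the median can be sketched in a few lines of Python using the standard-library statistics module. The home prices and car types below are made-up numbers for illustration, not course data.

```python
import statistics

# Hypothetical home prices (in thousands); the last value is an extreme outlier.
prices = [250, 270, 300, 310, 320, 2500]

mean_price = statistics.mean(prices)      # pulled upward by the outlier
median_price = statistics.median(prices)  # middle of the sorted values

# The mode is the most frequent value; it also works for categorical data.
car_types = ["sedan", "SUV", "SUV", "hatchback"]
most_common_type = statistics.mode(car_types)

print(round(mean_price, 1), median_price, most_common_type)
```

Here the mean is more than twice the median, which is why reporting both together gives a quick signal that the data contain extreme values.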

Measures of dispersion

• Variation – amount of dispersion or scattering of values

• Standard deviation: numerical measure of the overall amount of variation in a dataset
– Can be used to determine whether data values are close to or far from the mean

• A small standard deviation means values are bunched around the mean

• If ‘x’ is a data value, then ‘x – mean’ is called its deviation
• Variance: average of the squared deviations; the standard deviation is its square root
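
The definitions above can be traced step by step in Python. This is a sketch on a small made-up dataset; it divides by the number of values, i.e. it treats the data as a whole population.

```python
import math

# Hypothetical data values, chosen so the arithmetic is easy to follow.
values = [2, 4, 4, 4, 5, 5, 7, 9]

mean = sum(values) / len(values)

# Deviation of each value from the mean; deviations always sum to zero.
deviations = [x - mean for x in values]

# Variance: average of the squared deviations (population form, divides by n).
variance = sum(d ** 2 for d in deviations) / len(values)

# Standard deviation: square root of the variance, in the data's own units.
std_dev = math.sqrt(variance)

print(mean, variance, std_dev)
```

Squaring keeps positive and negative deviations from cancelling out, and taking the square root brings the measure back to the original units.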

Skewness

[Figure: three distribution shapes: symmetric, left (negatively) skewed, and right (positively) skewed]

• Skewness: quantifies the degree to which a distribution’s tail extends toward one side

• The long, thin part of the curve (the tail) is the skewed portion

• What distribution does the income of a population follow?
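
One common way to put a number on skewness is the moment-based coefficient: the average of the cubed standardized deviations. The slides do not specify a formula, so this choice is an assumption, and the income figures are invented for illustration.

```python
import statistics

def skewness(values):
    """Moment-based skewness: mean of cubed standardized deviations."""
    m = statistics.fmean(values)
    s = statistics.pstdev(values)  # population standard deviation
    return sum(((x - m) / s) ** 3 for x in values) / len(values)

# Hypothetical incomes (in thousands): most are modest, a few are very high.
incomes = [20, 22, 25, 27, 30, 32, 35, 40, 60, 150]
symmetric = [1, 2, 3, 4, 5]

income_skew = skewness(incomes)       # positive: long right tail
symmetric_skew = skewness(symmetric)  # zero: no tail on either side

print(round(income_skew, 2), symmetric_skew)
print(statistics.fmean(incomes), statistics.median(incomes))
```

In the right-skewed income example the mean sits above the median, because the few very high values pull the mean toward the tail.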

Population parameters and sample statistics

Measure              Population Parameter   Sample Statistic
Mean                 μ                      X̄
Variance             σ²                     s²
Standard Deviation   σ                      s
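
The population and sample versions differ in their divisor: the population variance divides by n, while the sample variance conventionally divides by n – 1. A short sketch with Python's statistics module, using made-up data:

```python
import statistics

# Hypothetical data; treated first as a full population, then as a sample.
data = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

pop_var = statistics.pvariance(data)   # sigma squared: divides by n
samp_var = statistics.variance(data)   # s squared: divides by n - 1

pop_sd = statistics.pstdev(data)       # sigma
samp_sd = statistics.stdev(data)       # s

print(pop_var, round(samp_var, 3), pop_sd, round(samp_sd, 3))
```

The sample versions are slightly larger, and the gap shrinks as the number of observations grows.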

Quartiles
• Quartiles split sorted data into four segments with an equal number of values in each segment

25% | Q1 | 25% | Q2 | 25% | Q3 | 25%

• First quartile Q1: value for which 25% of observations are smaller and 75% are larger

• Q2: same as the median, with 50% of observations on either side

• Third quartile Q3: value for which 75% of observations are smaller and 25% are larger

• Interquartile range (IQR) = Q3 – Q1
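
Quartiles and the IQR can be computed with statistics.quantiles from the Python standard library. The data are made up, and method="inclusive" is one of several interpolation conventions; other tools may return slightly different quartiles for the same data.

```python
import statistics

# Hypothetical data values for illustration.
data = [10, 20, 30, 40, 50, 60, 70, 80, 90]

# n=4 returns the three cut points Q1, Q2, Q3.
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")

# Interquartile range: spread of the middle 50% of the data.
iqr = q3 - q1

print(q1, q2, q3, iqr)
```

Because the IQR ignores the lowest and highest quarters of the data, it is not affected by a few extreme values.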

Outliers

• Outliers are observations that lie far outside the typical range of a dataset

• They can arise from data entry errors, measurement issues, or genuine extreme events

• Why should we care about outliers?

– They skew summary statistics (mean, variance)

– They distort model estimates and weaken predictive accuracy

– However, outliers may signal data quality problems or important rare events

Detecting Outliers

• Boxplot (IQR rule): flag values < Q1 – 1.5·IQR or values > Q3 + 1.5·IQR

• Values more than 3 standard deviations from the mean can also be flagged as extreme

• Visualization: scatterplots, histograms

• Example: monthly sales data at the store level

• Confirm the true value before deciding to exclude an outlier or model it separately

• Use robust measures (median, trimmed mean) if genuine extreme values are business-relevant

• Should outliers be dropped?
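
Both detection rules can be sketched in Python on a small example. The monthly sales figures are invented, the quartiles use the "inclusive" interpolation convention, and the 3-standard-deviation rule here uses the sample standard deviation; all three are assumptions rather than the course's prescribed method.

```python
import statistics

# Hypothetical monthly sales for one store; 400 is a suspicious value.
sales = [120, 125, 130, 128, 135, 140, 138, 132, 127, 133, 129, 400]

# IQR rule: flag values outside [Q1 - 1.5*IQR, Q3 + 1.5*IQR].
q1, _, q3 = statistics.quantiles(sales, n=4, method="inclusive")
iqr = q3 - q1
lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr
iqr_outliers = [x for x in sales if x < lower or x > upper]

# 3-standard-deviation rule: flag values far from the mean.
mean = statistics.fmean(sales)
sd = statistics.stdev(sales)
z_outliers = [x for x in sales if abs(x - mean) / sd > 3]

print(iqr_outliers, z_outliers)
print(round(mean, 1), statistics.median(sales))
```

Flagging a value is only the first step: the single extreme month raises the mean well above the median, but whether it should be dropped depends on whether it is an error or a genuine event.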

Data visualization

• Histograms

• Scatter plot

• Box plot

• Bar graph, pie chart, dot plot, etc.

In-class exercise
• Dataset: “s1_hotels_Vienna.xlsx”

• Calculate descriptive statistics for hotel prices per night in Vienna

• Generate data visualizations

• Interpret your results
