0% found this document useful (0 votes)

12 views27 pages

FBA Module 2

The document covers foundational concepts in statistics, including definitions of population and sample, types of variables, measures of central tendency and dispersion, and probability distributions. It explains the significance of understanding data distributions and introduces hypothesis testing, outlining its steps and common errors. Additionally, it discusses the Central Limit Theorem and the properties of normal distribution.

Uploaded by

nilsa.vp

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views27 pages

FBA Module 2

Uploaded by

nilsa.vp

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 27

Module 2:

Statistical Foundation
Population & Sample

• Population: The entire set of individuals or

observations relevant to a particular study.

• Sample: A subset of the population selected for

analysis.

• Sampling is necessary when studying an entire

population is impractical due to time, cost, or
accessibility constraints.
Variables
• Definition: A characteristic, number, or quantity that can be
measured or quantified.

• Types of Variables:
• Qualitative (Categorical) Variables: Describe non-numerical
characteristics (e.g., gender, education level)
• .Quantitative Variables: Represent numerical data and can be further
divided into:
• Discrete Variables: Can take only specific values (e.g., number of
students in a class).
• Continuous Variables: Can take any value within a range (e.g., height,
weight).
Measures of central tendency
• Central tendency provides a summary of the dataset
using a single representative value.

• Mean (Arithmetic Average): Sum of all values divided

by the total number of values.

• Median: The middle value when data is arranged in

ascending or descending order.

• Mode: The most frequently occurring value in the

Measures of dispersion
Probability distributions
• A probability distribution describes how probabilities are
distributed over the values of a random variable.

• Types of Probability Distributions

• Discrete Probability Distribution: Applies to discrete

variables (e.g., binomial distribution, Poisson distribution).

• Continuous Probability Distribution: Applies to continuous

variables (e.g., normal distribution, exponential distribution).
Distribution
• Distribution refers to a mathematical expression that provides
an event's possible outcomes and how often they can occur.

• Ex: Rolling dice is a random experiment.

• A dice has six sides numbered from 1 to 6.

• When you roll the dice, the probability of getting 1 is an event

and that is one out of six (1/6)

• Similarly, the probability is one-sixth for all other values (2, 3,

4, 5, and 6).

• If you want to find the probability of getting a 7, it would be

zero, as it’s impossible to get such a value.
Event Probability
1 1/6
2 1/6
3 1/6
4 1/6
5 1/6
6 1/6

When plotted using a histogram, the distribution will provide a peculiar shape that
often helps you understand the distribution you are dealing with. In this case, you
will get a uniform distribution.
 Therefore, using this
probability distribution
you can know that the
possible values for a
dice roll are 1 to 6, with
the probability of
getting any value
between this range
being the same
 (in this case, it’s 1/6
which is roughly 0.17,
i.e., 17%).
 Every probability
distribution is
Frequency
Probability Distribution
Distribution

It records the likelihood that

It records how often an an event is to occur. It is
event occurs. It is based based on theoretical
on actual observations assumption of what should
happen
Suppose you are dealing with two dice
now.

In this case, what will be the probability

of getting the sum of two dice as 2?
(1,1) (2,1) (3,1) (4,1) (5,1) (6,1)

(1,2) (2,2) (3,2) (4,2) (5,2) (6,2)

(1,3) (2,3) (3,3) (4,3) (5,3) (6,3)

(1,4) (2,4) (3,4) (4,4) (5,4) (6,4)

(1,5) (2,5) (3,5) (4,5) (5,5) (6,5)

(1,6) (2,6) (3,6) (4,6) (5,6) (6,6)

• If you were to calculate the probability of each event, you need
to look at how often that outcome can occur.
• For example, the probability of getting a sum of two dice as 1 is
zero.
• The probability of getting the sum as 2 will be 1/36 because
this can only happen when both the dice return 1
• and of the 36 possible outcomes, there is only one such event
that returns the sum as 2.
• Similarly, the probability of getting the sum 3 would be 2/36
because of the 36 possible outcomes; only two such outcomes
return the sum as 3: (1,2) and (2,1).
• Therefore if we know the denominator, i.e., the count of
outcomes for each event, we can calculate the probabilities.
The total possible events and the probability for each event will differ, making the
distribution take different shapes, as shown below
Common Types of Data
Discrete Data
• When you roll a dice or pick a card from a deck

• you have a limited number of outcomes possible.

• This type of data is called Discrete Data

• Which can only take a specified number of values.

• For example, in rolling a dice

• The specified values are 1, 2, 3, 4, 5, and 6.

• Suppose you count the number of boys in a class; since the

value is countable, it is discrete
Continuous Data
• Continuous data is data that can take any value.

• Height, weight, temperature and length are all examples of continuous data.

• Some continuous data will change over time, the temperature in a room

throughout the day

• a person’s height has infinitely many values within a given interval.

• This type of data is called Continuous Data, which can have any value within

a given range. That range can be finite or infinite.

• Continuous data is measurable but not countable, hence, continuous.

• .
Types of Distribution
Distribution types can be divided into continuous and discreet distributions
Normal distribution
• Of the different types of distributions out there, the most
used distribution in statistics and data science is Normal, also
known as the Gaussian distribution.

• A normal distribution is a symmetrical distribution with a bell-

shaped curve, where most values are clustered around the
center and tapering off as you move away from the center.

• The unique property of normal distribution is that its mean,

medium, and mode are all equal.
Central Limit Theorem (CLT)
• You collect data from 100 individuals about their age and calculate
its mean

• And if you then repeat this process 1,000 times (a minimum of 30

samples are required for CLT to be true) and plot these means then
what you get is a sampling distribution.

• As per CLT, the mean of the sampling distribution and population

(from where the samples have been drawn) is equal.
• Also, the sampling distribution will follow a Gaussian distribution
regardless of the distribution of the population.
As Gaussian distribution follows a 68-95-99.7 rule which states that in such distribution, 68%
of values lie within one standard deviation from the mean, 95% within 2 and 99.7% within
three, it makes it easy to understand the probability of finding a value in the population.
Understanding Data Distributions

• When analysing data, it's important to understand the

distribution of the data. The distribution refers to how
the data is spread out or clustered around certain
values or ranges.
• By examining the distribution, we can gain insights into
the characteristics and patterns of the data, which can
be useful in making informed decisions and predictions.
• There are various types of data distributions, each with
its own unique properties and implications.
• Understanding these distributions is a fundamental
aspect of data analysis and can help us make more
accurate and meaningful interpretations of the data.
H y p o th e s is te s tin g & S ig n ifi c a n c e le v e ls

• Hypothesis testing is a statistical method used to make

decisions about population parameters based on sample data.
• Steps in Hypothesis Testing
1.State the Null () and Alternative () Hypothesis:
1. : No effect or no difference.
2. : Indicates a significant effect or difference.
2.Choose the Significance Level ():
1. Common values: 0.05 (5%) or 0.01 (1%).
3.Select the Appropriate Test:
1. Z-test, t-test, chi-square test, etc.
4.Compute the Test Statistic:
1. Compare with the critical value or use the p-value.
5.Make a Decision:
1. If p-value < , reject .
2. If p-value > , fail to reject .
Types of Errors in Hypothesis Testing

Probability Distributions-Sarin B
No ratings yet
Probability Distributions-Sarin B
20 pages
2466939-EDA and STATISTICS NOTES
No ratings yet
2466939-EDA and STATISTICS NOTES
15 pages
What Is Distribution?
No ratings yet
What Is Distribution?
4 pages
Statistics and Probability
No ratings yet
Statistics and Probability
43 pages
Unit II: Basic Data Analytic Methods
No ratings yet
Unit II: Basic Data Analytic Methods
38 pages
Notes
No ratings yet
Notes
29 pages
Lesson 4 Notes
No ratings yet
Lesson 4 Notes
14 pages
Lec # 2
No ratings yet
Lec # 2
22 pages
Prop Final 4
No ratings yet
Prop Final 4
119 pages
Intro to Descriptive Statistics
No ratings yet
Intro to Descriptive Statistics
51 pages
Unit 3 R As A Set of Statistical Tables
No ratings yet
Unit 3 R As A Set of Statistical Tables
31 pages
1 Intro-Statistics
No ratings yet
1 Intro-Statistics
61 pages
3 - Introduction To Inferential Statistics
No ratings yet
3 - Introduction To Inferential Statistics
32 pages
Section 4 - Analyze Phase
No ratings yet
Section 4 - Analyze Phase
179 pages
Class 10-Distribution in Data Science
No ratings yet
Class 10-Distribution in Data Science
22 pages
Probability and Statistics
No ratings yet
Probability and Statistics
8 pages
Intro to Statistics for Students
No ratings yet
Intro to Statistics for Students
28 pages
LQ1 Notes
No ratings yet
LQ1 Notes
15 pages
3 Statistical Distribution Functions
No ratings yet
3 Statistical Distribution Functions
4 pages
Intro to Statistics & Probability
100% (1)
Intro to Statistics & Probability
44 pages
Probability
No ratings yet
Probability
50 pages
Quality Control: Fundamentals of Statistics
No ratings yet
Quality Control: Fundamentals of Statistics
62 pages
Statistic S at Probabili TY: Teacher: Aldwin N. Petronio
No ratings yet
Statistic S at Probabili TY: Teacher: Aldwin N. Petronio
44 pages
Stats Review
No ratings yet
Stats Review
65 pages
Lecture Note On Biostatistics
No ratings yet
Lecture Note On Biostatistics
74 pages
Statistics Notes Part-2
No ratings yet
Statistics Notes Part-2
24 pages
Lecture Slides - Inferential Statistics
100% (1)
Lecture Slides - Inferential Statistics
42 pages
Ders 1
No ratings yet
Ders 1
34 pages
Reading Material Mod 3 Statistical Methods
No ratings yet
Reading Material Mod 3 Statistical Methods
15 pages
COM 201 - Inferential Statistics - 18032022-1
No ratings yet
COM 201 - Inferential Statistics - 18032022-1
58 pages
Probability Distributions and Hypothesis Testing
No ratings yet
Probability Distributions and Hypothesis Testing
9 pages
Statistics and Probability
No ratings yet
Statistics and Probability
12 pages
Probability Distribution
No ratings yet
Probability Distribution
10 pages
Unit 1 Ssmda Notes
No ratings yet
Unit 1 Ssmda Notes
35 pages
Statical Distriution Function
No ratings yet
Statical Distriution Function
8 pages
Normal Distribution Overview
No ratings yet
Normal Distribution Overview
19 pages
Submitted To: Mrs. Geetika Vashisht College of Vocational Studies University of Delhi
No ratings yet
Submitted To: Mrs. Geetika Vashisht College of Vocational Studies University of Delhi
36 pages
Statistics
No ratings yet
Statistics
36 pages
Statistics Part2
No ratings yet
Statistics Part2
28 pages
Decsci Reviewer CHAPTER 1: Statistics and Data
No ratings yet
Decsci Reviewer CHAPTER 1: Statistics and Data
7 pages
Lesson 02 Probability and Statistics
No ratings yet
Lesson 02 Probability and Statistics
127 pages
Statistics and Probability Reviewer
No ratings yet
Statistics and Probability Reviewer
10 pages
Probability & Testing in Data Analytics
No ratings yet
Probability & Testing in Data Analytics
70 pages
Probability and Statistics
No ratings yet
Probability and Statistics
5 pages
What Is Statistic
No ratings yet
What Is Statistic
129 pages
Statistics and Probability 2
No ratings yet
Statistics and Probability 2
16 pages
Qualitative Quantitative: Random Variable
No ratings yet
Qualitative Quantitative: Random Variable
4 pages
Key of Week1 - Lecture Notes
No ratings yet
Key of Week1 - Lecture Notes
10 pages
What Is Probability
No ratings yet
What Is Probability
8 pages
Sci Pi Statistics and Probability Handout
No ratings yet
Sci Pi Statistics and Probability Handout
4 pages
Research - Stats Notes
No ratings yet
Research - Stats Notes
44 pages
Distribution Prerequisite
No ratings yet
Distribution Prerequisite
11 pages
Descriptive Statistics and Probability Distributions: Session 1
No ratings yet
Descriptive Statistics and Probability Distributions: Session 1
34 pages
Probability & Statistics
No ratings yet
Probability & Statistics
108 pages
STATISTICS
No ratings yet
STATISTICS
9 pages
FBA Module 3
No ratings yet
FBA Module 3
41 pages
1.research Methodology-BBA S1M1
No ratings yet
1.research Methodology-BBA S1M1
65 pages
6.research Methodology-BBA S1M6
No ratings yet
6.research Methodology-BBA S1M6
64 pages
2.research Methodology-BBA S1M2
No ratings yet
2.research Methodology-BBA S1M2
22 pages
Stat Trek: Probability Distributions: Discrete vs. Continuous
No ratings yet
Stat Trek: Probability Distributions: Discrete vs. Continuous
3 pages
Robust Self-Scheduling Under Price Uncertainty Using Conditional Value-at-Risk
No ratings yet
Robust Self-Scheduling Under Price Uncertainty Using Conditional Value-at-Risk
7 pages
21mab204t - PQT - Unit 2, 3
No ratings yet
21mab204t - PQT - Unit 2, 3
23 pages
(Ebook PDF) Mind On Statistics 5th Edition Download
100% (4)
(Ebook PDF) Mind On Statistics 5th Edition Download
50 pages
2024 November Algebra 6 - OL
No ratings yet
2024 November Algebra 6 - OL
3 pages
Haar Measure on Compact Groups
No ratings yet
Haar Measure on Compact Groups
12 pages
Chapter 7 BRM
No ratings yet
Chapter 7 BRM
51 pages
Flow Matching Guide and Code
No ratings yet
Flow Matching Guide and Code
83 pages
Disentangling Classical and Bayesian Approaches To Uncertainty Analysis
No ratings yet
Disentangling Classical and Bayesian Approaches To Uncertainty Analysis
19 pages
Survival Analysis Approach To Reliability, Survivability
100% (1)
Survival Analysis Approach To Reliability, Survivability
20 pages
Engineering Prob & Stat Lecture Notes 6
No ratings yet
Engineering Prob & Stat Lecture Notes 6
12 pages
Statistics - Short Notes
No ratings yet
Statistics - Short Notes
11 pages
Syllabus
No ratings yet
Syllabus
52 pages
Probability and Computing 2nd Edition
100% (2)
Probability and Computing 2nd Edition
490 pages
(FREE PDF Sample) OpenIntro Statistics 4th Edition David Diez Ebooks
No ratings yet
(FREE PDF Sample) OpenIntro Statistics 4th Edition David Diez Ebooks
72 pages
03 Single Workstation Analysis With Solutions
No ratings yet
03 Single Workstation Analysis With Solutions
91 pages
Biostatistics Assignment
No ratings yet
Biostatistics Assignment
3 pages
Math11 SP Q3 M7
No ratings yet
Math11 SP Q3 M7
16 pages
Chapter 4. Distribution of Sample Statistics
No ratings yet
Chapter 4. Distribution of Sample Statistics
30 pages
18 - Expected Value
No ratings yet
18 - Expected Value
38 pages
Sequence Space Jacobian
No ratings yet
Sequence Space Jacobian
84 pages
Statistics Elect
No ratings yet
Statistics Elect
8 pages
Sma 2230 Probability and Statistics Ii
No ratings yet
Sma 2230 Probability and Statistics Ii
2 pages
Cot
No ratings yet
Cot
3 pages
Module Book-Business Statistics
No ratings yet
Module Book-Business Statistics
210 pages
Estimation of Claim Cost Data Using Zero Adjusted Gamma and Inverse Gaussian Regression Models
No ratings yet
Estimation of Claim Cost Data Using Zero Adjusted Gamma and Inverse Gaussian Regression Models
7 pages
Learning Plan (Stat)
No ratings yet
Learning Plan (Stat)
8 pages
Application Assisgnment
No ratings yet
Application Assisgnment
4 pages
l2 Mean Variance Standard D of Discrete PD 2
No ratings yet
l2 Mean Variance Standard D of Discrete PD 2
28 pages
Full Download Numerical and Statistical Methods For Civil Engineering Gujarat Technological University 2017 2nd Edition Ravish R Singh PDF
100% (2)
Full Download Numerical and Statistical Methods For Civil Engineering Gujarat Technological University 2017 2nd Edition Ravish R Singh PDF
57 pages

FBA Module 2

Uploaded by

FBA Module 2

Uploaded by

Module 2:

• Population: The entire set of individuals or

• Sample: A subset of the population selected for

• Sampling is necessary when studying an entire

• Mean (Arithmetic Average): Sum of all values divided

• Median: The middle value when data is arranged in

• Mode: The most frequently occurring value in the

• Types of Probability Distributions

• Discrete Probability Distribution: Applies to discrete

• Continuous Probability Distribution: Applies to continuous

• Ex: Rolling dice is a random experiment.

• A dice has six sides numbered from 1 to 6.

• When you roll the dice, the probability of getting 1 is an event

• Similarly, the probability is one-sixth for all other values (2, 3,

• If you want to find the probability of getting a 7, it would be

It records the likelihood that

In this case, what will be the probability

(1,2) (2,2) (3,2) (4,2) (5,2) (6,2)

(1,3) (2,3) (3,3) (4,3) (5,3) (6,3)

(1,4) (2,4) (3,4) (4,4) (5,4) (6,4)

(1,5) (2,5) (3,5) (4,5) (5,5) (6,5)

(1,6) (2,6) (3,6) (4,6) (5,6) (6,6)

• you have a limited number of outcomes possible.

• This type of data is called Discrete Data

• Which can only take a specified number of values.

• For example, in rolling a dice

• The specified values are 1, 2, 3, 4, 5, and 6.

• Suppose you count the number of boys in a class; since the

throughout the day

• a person’s height has infinitely many values within a given interval.

a given range. That range can be finite or infinite.

• Continuous data is measurable but not countable, hence, continuous.

• A normal distribution is a symmetrical distribution with a bell-

• The unique property of normal distribution is that its mean,

• And if you then repeat this process 1,000 times (a minimum of 30

• As per CLT, the mean of the sampling distribution and population

• When analysing data, it's important to understand the

• Hypothesis testing is a statistical method used to make

You might also like