[go: up one dir, main page]

0% found this document useful (0 votes)
32 views19 pages

Unit-4 Biostatistics Descriptive

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
32 views19 pages

Unit-4 Biostatistics Descriptive

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 19

Unit 4: The Normal distribution

Unit 4: The Normal distribution


Objectives:
 To understand The Normal and Standard Normal Distribution
 Explain Properties of standard normal curve
 Discuss characteristics of normal curve
 Discuss Empirical rule
 Define Normal Disrtibution
 What are the characteristics of Normal distribution
 Define Z-Score and standard normal distribution
 Understand the relationship of Z-score with standard deviation
 Discuss the empirical rule
 Discuss the application of of z-score
 Define and discuss central limit theorem with examples

Normal Distribution
Introduction
Normal distribution, often referred to as the Gaussian distribution, is a probability distribution that is
symmetric about the mean. It is characterized by its bell-shaped curve, where the mean, median, and
mode are all equal. The spread of the distribution is determined by its standard deviation, which
indicates how much the data varies from the mean.

Why Study Normal Distribution?

1. Prevalence in Data: While few real-world datasets are perfectly normal, many distributions
approximate normality. This is crucial because it allows us to apply statistical methods that assume
normality, even when the data isn’t exactly normal.

2. Central Limit Theorem: According to this theorem, the sampling distribution of the sample mean
will tend to be normally distributed, regardless of the shape of the population distribution, as long
as the sample size is sufficiently large (typically n ≥ 30). This is fundamental in statistics, as it
justifies the use of normal distribution in inferential statistics.

3. Applications in Statistics: Normal distribution plays a central role in estimation and inferential
statistics, such as:

 Hypothesis Testing: Many tests (e.g., t-tests, z-tests) rely on the assumption of normality for the
sampling distribution of the test statistic.
 Confidence Intervals: Calculating confidence intervals for means and proportions often uses the
normal distribution.
 Regression Analysis: Assumptions about residuals in regression models often include normality.

Characteristics of the Normal Curve


The shape of a normal distribution is influenced primarily by two parameters: the mean (μ) and the
standard deviation (σ).

Mean (μ):
The mean is the central point of the distribution and determines where the peak of the curve lies. In a
normal distribution, the mean, median, and mode are all equal and located at the center of the curve.
Changing the mean shifts the entire curve left or right along the x-axis without altering its shape.

Standard Deviation (σ):


The standard deviation measures the dispersion or spread of the data around the mean. It quantifies
how much individual data points typically deviate from the mean.
Unit 4: The Normal distribution

A larger standard deviation results in a flatter and wider curve, indicating that the data points are more
spread out. Conversely, a smaller standard deviation produces a steeper and narrower curve, indicating
that the data points are closer to the mean.

Larger σ: More spread out; the curve is flatter, indicating greater variability in the data.
Smaller σ: Less spread out; the curve is taller and narrower, indicating that the data points cluster more
closely around the mean.
These characteristics of the normal distribution are essential for understanding how data behaves and
for making statistical inferences.

The Normal Distribution


Purposes of normal distribution:

The normal distribution serves several important purposes in statistics:


1. Foundation for Statistical Inference
Hypothesis Testing: Many statistical tests, such as t-tests and ANOVAs, assume that the data are
normally distributed. This allows researchers to make inferences about population parameters based on
sample statistics.
Confidence Intervals: Normal distribution is used to construct confidence intervals around sample
estimates, enabling researchers to quantify the uncertainty of their estimates.
2. Central Limit Theorem
This theorem states that the means of sufficiently large samples from any population will be
approximately normally distributed, regardless of the population's distribution. This property justifies
the use of the normal distribution in various statistical procedures, even when the underlying data are
not normally distributed.
3. Standardization
The normal distribution allows for the transformation of different datasets into a common scale (Z-
scores). This standardization makes it easier to compare different data sets and assess their relative
positions within a distribution.
4. Modeling Real-World Phenomena
Many natural and social phenomena, such as heights, weights, and test scores, tend to be normally
distributed. Understanding this distribution helps in modeling and predicting outcomes in fields like
psychology, biology, economics, and quality control.
5. Probabilistic Interpretation
The normal distribution provides a clear way to calculate probabilities. By using Z-scores, statisticians
can determine the likelihood of observing a value within a certain range, which is essential for decision-
making.
6. Descriptive Statistics
The normal distribution simplifies the description of data. Parameters like the mean, median, and mode
are all equal in a perfectly normal distribution, providing a concise summary of the data.
7. Quality Control and Six Sigma
In manufacturing and quality assurance, normal distribution helps in monitoring processes and ensuring
that products meet specifications. Control charts and process capability analysis often rely on the
properties of normal distribution.
8. Error Analysis
In many measurement processes, errors tend to be normally distributed due to the central limit
theorem, aiding in the assessment of measurement accuracy and precision.
9. Data Transformation
When data do not meet normality assumptions, transformations (like logarithmic or square root
transformations) can be applied to help the data approximate a normal distribution, facilitating analysis.
10. Facilitating Communication
The normal distribution is a well-understood concept that allows statisticians and researchers to
communicate findings effectively, as it is widely recognized and applied across various disciplines.
Unit 4: The Normal distribution

Normal curves can differ in location or shape


Normal curves, which represent the normal distribution, can vary in two primary ways: location and
shape.

1. Location
The location of a normal curve is determined by its mean (μ):
Mean (μ): This is the central point of the distribution where the peak of the curve occurs. Changing the
mean shifts the entire curve left or right along the horizontal axis.
For example, if you have a normal distribution with a mean of 100 and another with a mean of 120, the
curve with the mean of 120 will be shifted to the right, while the one with the mean of 100 will be
shifted to the left. Both curves retain their bell-shaped form, but their center points differ.

2. Shape
The shape of a normal curve is determined by its standard deviation (σ):
Standard Deviation (σ): This measures the spread or dispersion of the data around the mean. A larger
standard deviation results in a flatter and wider curve, while a smaller standard deviation leads to a
steeper and narrower curve.
For instance, consider two normal distributions: one with a standard deviation of 10 and another with a
standard deviation of 20. The curve with the standard deviation of 10 will be tall and narrow, indicating
that the data points are closely clustered around the mean. In contrast, the curve with the standard
deviation of 20 will be shorter and wider, showing that the data points are more spread out.

Combining Location and Shape


When both the mean and standard deviation are considered, you can describe any normal distribution
in terms of its location and shape:
Location Shift: If the mean increases or decreases, the entire curve shifts horizontally without changing
its shape.
Shape Change: If the standard deviation increases or decreases, the curve’s height and width change,
but the mean remains constant.
Look at the Graph..
Unit 4: The Normal distribution

Understanding and Utilizing Non Gaussian Distributions


Definition of Non-Gaussian Distributions
Non-Gaussian distributions are probability distributions that do not follow the bell-shaped curve of
the normal distribution. These distributions can have different shapes, including skewed, heavy-tailed,
or multimodal forms.
Look at the grapn

 Properties of the Standard Normal Curve


 The standard normal curve is a specific type of normal distribution that has a mean (μ) of 0 and a
standard deviation (σ) of 1.

 Bell-Shaped:
 The standard normal curve has a distinct bell shape, which reflects the distribution of data. Most of
the values cluster around the mean, with fewer values occurring as you move away from the
center. This shape represents the probabilities of different outcomes, with the highest probabilities
occurring near the mean.
Unit 4: The Normal distribution

 Asymptotic:
 The tails of the curve approach, but never actually touch, the horizontal axis. This property means
that there is always a small probability of extreme values occurring, no matter how far from the
mean you go. In practical terms, this indicates that while extreme values are rare, they are still
possible.

 Perfectly Symmetrical:
 The standard normal curve is symmetric around the mean. This means that the left half of the
curve is a mirror image of the right half. As a result, the probabilities of values equidistant from the
mean are the same, reflecting a balanced distribution of data.

 Mean, Median, and Mode at the Center:


 In the standard normal curve, the mean (μ), median, and mode are all located at the center of the
curve, specifically at the peak (dome). This property emphasizes the central tendency of the data,
indicating that most observations are concentrated around the mean.

 Half the Values on Each Side:


 When the curve is divided into two equal halves at the highest point (the mean), exactly 50% of the
values lie to the left and 50% to the right. This property is crucial for statistical analysis, particularly
in hypothesis testing and confidence intervals, where the concept of the median being at the
center is fundamental.

 Two Determinants: Mean (μ) and Standard Deviation (σ):


 The shape and position of the standard normal curve are determined by two parameters:

 Mean (μ): For the standard normal curve, μ = 0. This parameter shifts the curve left or right on
the x-axis.

 Standard Deviation (σ): For the standard normal curve, σ = 1. This parameter affects the width
and height of the curve. A smaller standard deviation results in a steeper curve, while a larger
standard deviation makes the curve flatter and wider.

These properties of the standard normal curve are essential for understanding how data behaves and
for conducting statistical analyses. The standard normal distribution serves as a reference for converting
any normal distribution into a standard form using z-scores, which facilitates comparison across
different datasets and applications in inferential statistics.

The beauty of the normal curve: EMPIRICAL RULE


The Empirical Rule (68-95-99.7 Rule)

The Empirical Rule is a key property of the normal distribution that describes how data is distributed
around the mean. It applies to any normal distribution, regardless of the specific values of the mean (μ)
and standard deviation (σ).

Area within One Standard Deviation (μ ± σ):


Approximately 68% of the data falls within one standard deviation of the mean. This means that if you
take a normal distribution with a mean of μ and a standard deviation of σ, around 68% of the values will
lie between μ - σ and μ + σ.
Interpretation: This range captures a significant portion of the data, indicating that most values are
close to the mean.

Area within Two Standard Deviations (μ ± 2σ):


About 95% of the data falls within two standard deviations of the mean. Specifically, this means that
roughly 95% of the values lie between μ - 2σ and μ + 2σ.
Unit 4: The Normal distribution

Interpretation: This wider range includes nearly all the typical observations, highlighting that values
further from the mean are much less common.

Area within Three Standard Deviations (μ ± 3σ):


Approximately 99.7% of the data falls within three standard deviations of the mean. This
encompasses the interval from μ - 3σ to μ + 3σ.
Interpretation: Almost all values in a normal distribution will fall within this range, indicating that
extreme values are rare.

Visual Representation
Graph: If you were to plot the normal curve, you would see the peak at μ. The intervals (μ - σ to μ + σ),
(μ - 2σ to μ + 2σ), and (μ - 3σ to μ + 3σ) would be highlighted to show the areas representing 68%, 95%,
and 99.7% of the data, respectively.

Importance of the Empirical Rule

 Statistical Analysis: The Empirical Rule is invaluable in statistics for making predictions
about data and understanding variability. It helps in:
 Identifying Outliers: Values that fall outside of three standard deviations can be
considered outliers.
 Confidence Intervals: When estimating population parameters based on sample data, the
rule aids in constructing confidence intervals.
 Quality Control: In manufacturing and other fields, it helps set acceptable limits for
processes.

The Empirical Rule elegantly illustrates the characteristics of the normal distribution and provides a
framework for understanding the spread of data. By recognizing that most data points are clustered
around the mean, analysts can make informed decisions and interpretations in various fields, from
psychology to finance.

Normal distribution is commonly associated with the 68-95-99.7 rule, or empirical rule, which you can
see in the image below. Sixty-eight percent of the data is within one standard deviation (σ) of the mean
(μ), 95 percent of the data is within two standard deviations (σ) of the mean (μ), and 99.7 percent of
the data is within three standard deviations (σ) of the mean (μ).
Unit 4: The Normal distribution

Purpose of the Empirical Rule

Describing Data Distribution:


The empirical rule summarizes how data is distributed around the mean in a normal distribution. It
states that:
Approximately 68% of data falls within one standard deviation (σ) of the mean (μ).
About 95% of data falls within two standard deviations.
Roughly 99.7% of data falls within three standard deviations.

Quick Estimation:
The rule allows for quick assessments of data variability and the proportion of data within certain
ranges without complex calculations. This is especially useful in initial data analysis.

Identifying Outliers:
By knowing how much data falls outside the standard deviations, analysts can identify potential
outliers. For instance, data points that lie more than three standard deviations from the mean may be
considered unusual or outliers.

Guiding Statistical Inference:


The empirical rule helps in making inferences about a population based on sample data. It provides a
benchmark for understanding how sample statistics relate to population parameters.

Simplifying Probabilities:
It aids in probability assessments for normally distributed data, enabling statisticians to easily calculate
probabilities of outcomes within certain intervals.

Supporting Quality Control:


In manufacturing and quality assurance, the empirical rule is used to monitor processes and ensure
products meet specifications. It helps determine acceptable levels of variation.

Facilitating Communication:
Unit 4: The Normal distribution

The empirical rule is widely understood in statistics, making it easier for statisticians and researchers to
communicate findings and interpretations of data distribution.

Modeling and Prediction:


In fields such as finance, psychology, and biology, understanding the empirical rule aids in modeling and
predicting outcomes based on the properties of normal distributions.

Example:
How good is rule for real data
Check some example data: The mean weight of the 120 runners = 127.8
The standard deviation (SD) = 15.5

Solution:
Mean weight (μ): 127.8 pounds
Standard deviation (σ): 15.5 pounds
Sample size (n): 120 runners

Applying the Empirical Rule


Using the mean and standard deviation, we can apply the Empirical Rule to determine the
distribution of the weights of these runners:

One Standard Deviation (μ ± σ):


Lower Bound: μ−σ =127.8−15.5=112.3
Upper Bound: μ+σ =127.8+15.5=143.3
Interpretation: Approximately 68% of the runners will have weights between 112.3 pounds and 143.3
pounds.

Two Standard Deviations (μ ± 2σ):


Lower Bound: μ−2σ=127.8−2(15.5)=127.8−31=96.8
Upper Bound: μ+2σ=127.8+2(15.5)=127.8+31=158.8
Interpretation: Approximately 95% of the runners will have weights between 96.8 pounds and 158.8
pounds.

Three Standard Deviations (μ ± 3σ):


Lower Bound: μ−3σ=127.8−3(15.5)=127.8−46.5=81.3
Upper Bound: μ+3σ=127.8+3(15.5)=127.8+46.5=174.3
Interpretation: Approximately 99.7% of the runners will have weights between 81.3 pounds and 174.3
pounds.

So;
68% of runners weigh between 112.3 and 143.3 pounds.
95% of runners weigh between 96.8 and 158.8 pounds.
99.7% of runners weigh between 81.3 and 174.3 pounds.

This distribution provides valuable insights into the weight of the runners and helps identify the range
of typical weights within this population.
Unit 4: The Normal distribution
Unit 4: The Normal distribution

Example
Suppose SAT scores roughly follows a normal distribution in the U.S. population of college-bound
students (with range restricted to 200-800), and the average math SAT is 500 with a standard deviation
of 50, then:
1. What % of students will have scores between 450 and 550
2. What % will be between 400 and 600
3. What % will be between 350 and 650

Solution:
To analyze the SAT scores following a normal distribution with a mean (μ) of 500 and a standard
deviation (σ) of 50, we can use the properties of the normal distribution and the Empirical Rule, as well
as the z-score formula.

Given Data:
Mean (μ) = 500
Standard Deviation (σ) = 50

Z-Score Calculation
The z-score for a value X can be calculated using the formula:
Z= X−μ/ σ

1. Percentage of Students with Scores Between 450 and 550


For 450: Z450 = 450−500/50 = −50/50 =−1
For 550: Z550 =550−500/50 = 50/50 = 1
Using the Z-table or standard normal distribution:
The area between Z=−1 and Z=1
corresponds to approximately 68% of the data (Empirical Rule).

2. Percentage of Students with Scores Between 400 and 600


For 400: Z400 = 400−500/50 =−100/50 = -2
For 600: Z600 =600−500/50 =100/50 =2
Using the Z-table:
Unit 4: The Normal distribution

The area between Z=−2 and Z=2z


corresponds to approximately 95% of the data (Empirical Rule).

3. Percentage of Students with Scores Between 350 and 650


For 350: Z350 = 350−500/50 =−150/50 =−3
For 650: Z650 = 650−500/50 /=150/50 =3
Using the z-table:
The area between Z=−3 and Z=3
corresponds to approximately 99.7% of the data (Empirical Rule).

So the Results:
Percentage of students with scores between 450 and 550: 68%
Percentage of students with scores between 400 and 600: 95%
Percentage of students with scores between 350 and 650: 99.7%

These percentages illustrate how SAT scores are distributed around the mean in a typical normal
distribution for college-bound students.

Standard Normal Distribution (Z)


The Standard Normal Distribution is a specific form of the normal distribution with:
Mean (μ) = 0
Standard Deviation (σ) = 1

1. Transforming to Standard Normal Distribution


To convert any normal distribution into the standard normal distribution, we use the Z-score formula:
Z=X−μ/σ

Where:
Z is the Z-score, representing the number of standard deviations a data point (X) is from the mean (μ).
X is the value from the original distribution.
μ is the mean of the original distribution.
σ is the standard deviation of the original distribution.

2. Purpose of Z-Scores
The Z-score helps standardize different normal distributions, making it easier to compare and analyze
data. It indicates how far and in what direction a value deviates from the mean.

3. Using Z-Scores for Probability


Once you have a Z-score, you can determine the probability associated with that score. This is done
using a Z-table (standard normal table), which provides the cumulative probability (area under the
curve) for Z-scores.
Cumulative Probability: This indicates the probability that a randomly selected value from the
distribution is less than a given Z-score.

Example Calculation:
Suppose we want to find the probability that a value from a normal distribution (mean = 50, standard
deviation = 10) is less than 60.
Calculate the Z-score:
Z=60−50/10=1.0
Look up Z = 1.0 in the Z-table:
The table shows that the cumulative probability for Z = 1.0 is approximately 0.8413. This means there is
an 84.13% chance that a randomly selected value is less than 60.

4. Tables and Computational Tools


Unit 4: The Normal distribution

Historically, statisticians created Z-tables to list cumulative probabilities for Z-scores. This saved time
because it eliminated the need to integrate the probability density function of the normal distribution
for each calculation.
Integration: The area under the curve of the normal distribution is calculated using integration, which
can be complex. Instead, the pre-calculated values in the Z-table allow for quick reference.
Computers: Nowadays, statistical software and calculators can compute probabilities for any Z-score
instantly. Functions in software like R, Python, or Excel provide tools to calculate Z-scores and
probabilities without needing to refer to a table.

The Standard Normal Distribution is essential in statistics because it allows for the standardization of
any normal distribution. By using Z-scores and pre-calculated tables (or computational tools), we can
efficiently find probabilities, perform hypothesis testing, and conduct other statistical analyses.
The Standardized Normal Curve  The z-score  Tells us how many standard deviations there are
between the selected score (x) and the Mean (µ)

The Standard Normal Distribution


This table gives the area under the standardized normal curve from 0 to z, as shown by the shaded
portion of the following figure.

Examples:
If z is the standard normal random variable, then

Prob (0 ≤ z ≤ 1.32) = 0.4066


Prob (z ≥ 1.32) = 0.5000 – 0.4066 = 0.0934
Prob (z ≤ 1.32) = Prob (z ≤ 0) + Prob (0 ≤ z ≤ 1.32)
= 0.5000 + 0.4066 = 0.9066
Prob (z ≤ 21.32) = Prob (z ≥ 1.32) = 0.0934 (by symmetry)
Unit 4: The Normal distribution

To find the probability of getting a math SAT score of 575 or less, given a mean (μ\muμ) of 500 and a
standard deviation (σ\sigmaσ) of 50, you can follow these steps:

Calculate the Z-score:


Z=X−μ/σ
where X is the score you're interested in (575).

Z=575−500/50=75/50=1.5
Look up the Z-score in the standard normal distribution table: A Z-score of 1.50 corresponds to a
cumulative probability of approximately 0.9332. This means that about 93.32% of scores fall below 575.
Interpret the result: Therefore, the probability of scoring 575 or less on the math SAT is approximately
0.9332, or 93.32%.
Unit 4: The Normal distribution

Another Example
Suppose the scores in a test have a normal distribution with mean 70 and standard deviation equal to 5.
assume the students need to get at least 60 in the test to pass the course.
What percent of the students who take this test are able to pass the course?

To find the percentage of students who pass the test (score at least 60), we can use the properties of
the normal distribution.

Identify the parameters:


Mean (μ\muμ) = 70
Standard deviation (σ\sigmaσ) = 5
Passing score = 60

The z-score formula is:


Z =X−μ/σ
Z Score = 60 - 70 / 5 = - 2

The area under the curve between mean and -2 is 0.4772, and to the right side is 0.5, we add the
0.4772 and 0.5 and it comes to 0.9772. thus 97.72% students are able to pass.

Contd …
What percent of the students who take this test are able to pass the course with a score of at least
65?
Unit 4: The Normal distribution

Solution

Formula Z Score Mean is 70 and S.D. is 5


Z =X−μ/σ

Z Score = 65 - 70 / 5 = - 1 The area corresponding to -1 is 0.3413, thus adding 0.5 and 0.3413 is 0.8413
or 84.13% students are able to get at least 65 or more score

Students getting more than 65% marks

Contd …

What percent of the students who take this test are able to get score between 60 and 75? Mean = 70,
S.D. = 5

Formula; Z Score
Z= X - / 

Z Score1 = X1 - µ / S.D 60 – 70 / 5 = -2 From table Area1 = 0.4772


Z Score2 = X2 - µ / S.D 75 – 70 / 5 = 1 From table Area2 = 0.3413
Adding the 2 areas under the curve we get 0.8185 or 81. 85% students score between 60 and 75

Given Values:
Mean (μ\muμ) = 70
Standard deviation (σ\sigmaσ) = 5
Calculate Z-scores:

For X1=60
Z1= X1−μ/ σ = 60−70/5 = 5−10 = −2

For X2=75
Z2 = X2−μ/ σ = 75−70/5 = 5/5 = 1
Z Score1 = X1 - µ / S.D 60 – 70 / 5 = -2 From tableArea1 = 0.4772
Z Score2 = X2 - µ / S.D 75 – 70 / 5 = 1 From table Area2 = 0.3413

Adding the 2 areas under the curve we get 0.8185 or 81. 85% students score between 60 and 75

Example :
Z-Score Population Mean IQ Score = 100 and Standard deviation = 15

Formula:- Z = (X – μ) / σ If IQ score of a person is 110, how do you compare him to the rest of people
To compare the IQ score of 110 to the rest of the population using the Z-score

Use the Z-score formula:


Z=X−μ/σ

Substitute the values into the formula:


Z = 110−100/15 = 10/15 = 0.67

Interpret the Z-score: A Z-score of approximately 0.67 means that an IQ score of 110 is 0.67 standard
deviations above the mean.
Unit 4: The Normal distribution

In terms of comparison, a Z-score of 0.67 suggests that the individual scored higher than about 74.9% of
the population (using a standard normal distribution table or calculator). This indicates that the person
has an IQ that is above average compared to the general population.

Example:
Z-Score Population Mean IQ Score = 100 and Standard deviation = 15

Formula:-
Z = (X – μ) / σ

If IQ score of a person is 110, how do you compare him to the rest of people? Z = (110 – 100) / 15 Z =
0.67

With an IQ of 110, your Z-Score is + 0.67


• Go to Z-Score Table and find ‘z’ score for .67
• The area between the z-score and the mean is .2486
• We know that the area up to the mean is .50, and we know that our score was above the mean, so we
add the two numbers to get a total area of .7486
• Therefore, your percentile rank in terms of IQ is 74.86, which means that you scored equal to or
greater than 74.86% of the population

Example:
Assume that the mean weight of American males between 21 and 30 years old is 156 pounds with a
standard deviation of 8 pounds.

If you select a male at random, what is the probability that his weight will be 160 pounds or more

Z= x-/ 
Z Score = 160 - 156 / 8
Z Score = 4/ 8 = 0.5 on the Z-table = 0.1915
0.5 – 0.1915 = 0.3085

Probability that mean weight of American males between 21 and 30 years old will be 160 pounds or
more is about 31%

Tutorial
If birth weights in a population are normally distributed with a mean of 109 oz and a standard
deviation of 13 oz,

a. What is the chance of obtaining a birth weight of 131 oz or heavier when sampling birth records at
random?
b. What is the chance of obtaining a birth weight of 121 or lighter?

Answer a. What is the chance of obtaining a birth weight of 141 oz or heavier when
sampling birth records at random?

Z= x-/  Z= 131-109/13 = 1.69


2.46 corresponds to a right tail (greater than) area of: P(Z≥)1.69 = (.45)= thus subtracting 0.45 from 0.5
we get 0.05 or 5 %
Unit 4: The Normal distribution

Answer b. What is the chance of obtaining a birth weight of 121 or lighter?

Z= x-/  Z= 121-109/ 13= .92


Z of .92 corresponds to a left tail area of: P(Z ≤ .92)
= .3212 or 32.12% and adding with 50% we get = 82.12%

Example:

1: How to determine what proportion of a normal population lies above/below a certain level The
average height of Hobbit = 120 cm If distribution of Hobbit heights is normal with mean = 120 cm, SD =
20 What is probability of finding a Hobbit taller than 125 cm??

Solution:

Given:
Mean height (μ) = 120 cm
Standard deviation (σ) = 20 cm
Height of interest (X) = 125 cm

Step 1: Calculate the Z-score

Using the Z-score formula:


Z=X−μ/σ

Substituting the values:


Z= 125-120/20 = 5/20 = 0.25

Step 2: Find the probability corresponding to the Z-score


Now, we look up the Z-score of 0.25 in the Z-table. The Z-table gives us the area to the left of the Z-
score:

P(Z<0.25) = 0.5987

Step 3: Calculate the probability of finding a Hobbit taller than 125 cm


To find the probability of a height greater than 125 cm, we calculate:
P(X>125)=1−P(Z<0.25)=1−0.5987=0.4013

So ; The probability of finding a Hobbit taller than 125 cm is approximately 40.13%.

Example:
If the cholesterol level of men in the community is normally distributed with a mean of 160 and
standard deviation of 30.
Identify any outliers from among the following measurements. a) 248 b) 255 c) 197 d) 300 e) 99 f) 79 g)
89 h) 59

Solution:
Given:
Mean (μ) = 160
Standard deviation (σ) = 30

Step 1: Calculate the Z-score for each measurement


Unit 4: The Normal distribution

The Z-score formula is:


Z=X−μ/σ

Now we will calculate the Z-score for each of the provided cholesterol levels.
For 248:
Z = 248-160/30 = 88/30= 2.93

For 255:
Z = 255−160/30= 95/30= 3.17

For 197:
Z=197−160/30= 37/30= 1.23

For 300:
Z=300−160/30= 140/30= 4.67

For 99:
Z= 99−160/30 = 30−61=−2.03

For 79:
Z= 79−160/30 = 30−81= −2.70

For 89:
Z= 89−160/30 = 30−71= −2.37

For 59:
Z= 59−160/30 = 30−101= −3.37

Step 2: Identify outliers


Typically, any Z-score greater than 3 or less than -3 can be considered an outlier.

248: Z = 2.93 (not an outlier)


255: Z = 3.17 (outlier)
197: Z = 1.23 (not an outlier)
300: Z = 4.67 (outlier)
99: Z = -2.03 (not an outlier)
79: Z = -2.70 (not an outlier)
89: Z = -2.37 (not an outlier)
59: Z = -3.37 (outlier)

So the outliers in the provided cholesterol levels are:


255, 300, 59

Example:

If the cholesterol level of men in the community is normally distributed with a mean of 160 and
standard deviation of 30.
Find % of people who have cholesterol level 170 and < 170
Find % of people who have cholesterol level 130 and

Solution:
Unit 4: The Normal distribution

To find the percentages of people with specific cholesterol levels, we will use the Z-score formula and
the standard normal distribution table.

Given:
Mean (μ) = 160
Standard deviation (σ) = 30
Step 1: Calculate Z-scores

For cholesterol level 170:


Z170=170−160/30 = 10/30 ≈ 0.33

For cholesterol level 130:


Z130=130−160/30 = −30/30 =−1
For cholesterol level 190:
Z190= 190−160/30 = 30/30=1

Step 2: Find the probabilities using Z-scores


Using the Z-table (or a standard normal distribution calculator):

Percentage of people with cholesterol level < 170:


For Z170≈0.33 the area to the left is approximately 0.6293 (or 62.93%).
Therefore, the percentage with cholesterol level ≥ 170 is:
100%−62.93%= 37.07%

Percentage of people with cholesterol level < 130:


For Z130=−1, the area to the left is approximately 0.1587 (or 15.87%).
So, the percentage with cholesterol level ≥ 130 is:
100%−15.87%≈84.13%

Percentage of people with cholesterol level between 130 and 190:


For Z130=−1, the area to the left is 0.1587.
For Z190=1, the area to the left is approximately 0.8413 (or 84.13%).
Therefore, the percentage of people between 130 and 190 is:
0.8413 - 0.1587 = 0.6826 (or 68.26%)

Summary of Results:
% of people with cholesterol level ≥ 170: 37.07%
% of people with cholesterol level < 130: 15.87%
% of people with cholesterol level between 130 and 190: 68.26%

Add the chapter of central limit theorem

You might also like