Q4 Lessons 3 4 Random Variables Prob Distribution
Q4 Lessons 3 4 Random Variables Prob Distribution
Quarter IV
LESSONS 3 – 4
GENERALIZATION:
Random variable – is the set whose elements are the numbers assigned to the outcomes of a
random event or an experiment. It is usually denoted by a capital letter such as A, whose
elements are denoted by lower case letters such as 𝑎1, 𝑎2, 𝑎3 and so on.
Example: If a coin is tossed twice, the set of all possible outcomes (S) of the
experiment is:
TT 2
TH 1
HT 1
HH 0
From the above, instead of writing number of tails, we can denote it as set A whose elements are
0, 1, and 2. In symbols,
𝑨 = {𝟎, 𝟏, 𝟐} The
A is called the random variable.
Exercise: Cite and example of a real life situation that illustrates a random
variable.
GENERALIZATION:
Discrete Random variable – has a finite number of elements. It takes on countable number of
distinct values that can be represented by a whole number. These values usually arise from
“COUNTS”.
Continuous Random variable – has an infinite number of elements that CANNOT be represented
by whole numbers. These numbers usually arise from “MEASUREMENTS”.
1|Page
Statistics and Probability
Quarter IV
LESSONS 3 – 4
Example:
a. S = Scores of students in General Mathematics summative test
b. T = Dropout rate (%) in Iloilo National High School
c. U = The weight of selected Statistics and Probability books
d. V = The number of newly hired teachers in INHS - Senior High School
Answer:
a. S is a discrete random variable.
b. T is a continuous random variable.
c. U is a continuous random variable.
d. V is a discrete random variable.
GENERALIZATION:
The set of all possible values for a given random variable is called the range space.
Example: Find the possible values of each random variable. a.
X = number of heads in tossing two fair coins
b. Y = sum of two number that appears in throwing a pair of dice Answer:
a. X = {0, 1, 2}
b. Y = {2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12}
IV. Illustrates a probability distribution for a discrete random variable and its properties. ---
M11/12SP-IIIa-4
GENERALIZATION:
Probability Distribution is also known as probability mass function. It is a table showing all
possible values of a discrete random variable together with their corresponding probabilities.
2|Page
Statistics and Probability
Quarter IV
LESSONS 3 – 4
a.
Each probability value ranges from 0 to 1.
b.
The sum of all individual probabilities in the distribution is equal to 1.
The sample space would be 𝑆 = {𝐻𝐻𝐻, 𝐻𝐻𝑇, 𝐻𝑇𝐻, 𝑇𝐻𝐻, 𝐻𝑇𝑇, 𝑇𝐻𝑇, 𝑇𝑇𝐻, 𝑇𝑇𝑇} with 8 total
number of outcomes.
Exercise: Box A and Box B both contain the numbers 1, 2, 3, 4. Write the
probability distribution of the SUM when one number from each box is taken at
time, with replacement.
Answers to Exercises:
1: (Answers may vary)
3|Page
Statistics and Probability
Quarter IV
LESSONS 3 – 4
IV:
2 = (1, 1)
3 = (1, 2) (2, 1) There are 16 possible outcomes.
4 = (1, 3) (3, 1) (2, 2)
5 = (1, 4) (4, 1) (2, 3) (3, 2)
6 = (2, 4) (4, 2) (3, 3)
7 = (3, 4) (4, 3)
8 = (4, 4)
Hence the probability distribution is as follows:
𝑥 2 3 4 5 6 7 8
𝑓(𝑥)
V: Computes probabilities
corresponding to a given random variable.
--- M11/12SP-IIIa-6
GENERALIZATION:
The Support S of a random variable is the set of values that the random variable can take.
Probability mass function (PMF) is the probability distribution of a discrete random variable.
The PMF of x is denoted by f(x) and satisfies the following basic properties.
Property 1, states that for every element x in the Support S, all the probabilities must be positive.,
while according to Property 2, the sum of all probabilities for all x in the Support S must be
equal to 1.
Example 1. Suppose a random variable X can only take the four values 0, 1, 2, and
3. If each value has equal probability. What is the PMF?
4|Page
Statistics and Probability
Quarter IV
LESSONS 3 – 4
Example 2. Let f(x) = kx, for x = 1, 2, 3, 4. Find k so that f(x) satisfies the two
properties of being a PMF.
4
1 1 2 3 4 10
To check if the property 2 is satisfied, ∑ 𝑃(𝑋 = 𝑥) = 10 𝑥 = 10 + 10 + 10 + 10 = 10 = 1
5|Page
Statistics and Probability
Quarter IV
LESSONS 3 – 4
Exercise 2. Let f(x) = kx for x = 2, 3, 4. Find k so that the function of f(x) satisfies
the two properties of pmf.
GENERALIZATION:
The mean of a discrete random variable X is also called the expected value of X. It is the
weighted average of all the values that the random variable X would assume in the long run. The
expected value of X is the average of all outcomes that is likely to be obtained if the trial is
repeated over and over again. The expected value is denoted by E(X).
𝐸(𝑋) = ∑[𝑥𝑃(𝑥)]
x 0 1 2
Solution:
x P(x) xP(x)
0 0.25 0
1 0.50 0.50
2 0.25 0.50
∑[𝑥𝑃(𝑥)] = 1.00
The expected value is 1. So the average number of college graduates in the households of the
small town is 1.
Exercise. A random variable X has this probability distribution.
x 1 2 3 4
6|Page
Statistics and Probability
Quarter IV
LESSONS 3 – 4
GENERALIZATION:
The variance of a random variable X is denoted by 2 . It can likewise be written as Var(X). The
variance of a random variable is the expected value of the square of the difference between the
assumed value of random variable and the mean.
Where x = outcome
= population mean = E(X)
P(x) = probability of the outcome
Larger the value of the variance, the farther are the values of X from the mean.
Standard deviation is the square root of the variance;
x P(x)
1 0.15
2 0.25
3 0.30
4 0.15
5 0.10
6 0.05
Solution:
Steps: 1 Find the E(X).
2. Subtract the expected value from each outcome. Square each difference.
3. Multiply each difference by the corresponding probability.
4. Sum up all the figures obtained in step 3.
x P(x) xP(x) x- (𝑥 − 𝜇)2 (𝑥 − 𝜇)2 𝑃(𝑥)
1 0.15 0.15 1 – 2.95 = - 1.95 3.8025 0.570375
7|Page
Statistics and Probability
Quarter IV
LESSONS 3 – 4
x 0 1 2 3 4
P(x) 0.1 0.2 0.3 0.3 0.1
Exercise 2. Solve the variance and standard deviation of Exercise 1 using this
alternative formula.
Answers to Exercises:
V: Exercise 1. Property 1 and 2 are both satisfied, hence f(x) is indeed a pmf
Exercise 2. k = 1/9, both properties 1 and 2 are satisfied, hence f(x) is a pmf
VI: Exercise. E(X) = 2.85
GENERALIZATION:
From the previous discussion, you have learned how to compute for the mean and variance of a
discrete random variable.
This time, you will be able to interpret the result.
Example: Suppose four tiles numbered 1, 2, 3 and 4 are in a jar. A tile is picked
and returned in the jar 15 times. The results are as follow:
Number of
Tile Probability
times picked
1 2
2 4
3 8
4 1
8|Page
Statistics and Probability
Quarter IV
LESSONS 3 – 4
Solution:
x P(x) x ∙ P(x)
1 .13 .13
2 .27 .54
3 .53 1.59
4 .07 .28
𝑬(𝑿) = 𝟐. 𝟓𝟒
Interpret:
This means that for every tile picked from the jar, the number in the tile is in
average 2.54.
IX: Solves problems involving mean and variance of a discrete random variable.
--- M11/12SP-IIIb-4
GENERALIZATION:
After learning to solve for mean and variance of discrete random variables, let us apply what you
have learned in solving real life situations. It is best when you apply what you have learned in
real life so that you will be able to interpret results.
Example 1: Sandra is paid ₱20 whenever the results of tossing two coins are both
heads but she pays back ₱10 whenever the results are not both heads. What is the
expected gain per toss?
Solution:
There are four outcomes in tossing two coins in
1 3 which only 1 is a HH. The other results are HT, TH,
𝐸(𝑋) = ₱20 ( ) + {−₱10 ( )} = ₱5 − ₱7.50 = −₱2.5
4 4 TT. The probability of both heads is 1 out of 4 or
and the probability of not both heads is . Therefore,
Sandra’s expected gain per toss is ₱2.50.
Interpretation: This means that Sandra would lose ₱2.50 per toss.
Example 2: The number of phones sold per day at a retail store varies as shown in
the given probability distribution below. Find the expected number of mobile phones
that will be sold in one day.
X 30 33 38 40 50
9|Page
Statistics and Probability
Quarter IV
LESSONS 3 – 4
Solution:
𝐸(𝑋) = 30(0.2) + 33(0.2) + 38(0.35) + 40(0.23) + 50(0.02) = 𝟑𝟔. 𝟏
Interpretation: The number of mobile phones that will be sold in one day is 36.
Exercise: Find the estimated number of male teachers hired per elementary school
in a certain city. Determine its mean and interpret the result.
GENERALIZATION:
A continuous random variable is considered normal when its values are distributed normally, that
is, when the majority of the values are close to the expected value with only very few values that
are extremely smaller and extremely larger.
Normal Distribution is a distribution of a continuous random variable whose graph is a bell-
shaped curve called the NORMAL CURVE.
The standard deviation affects both the width and the height of the curve. It precisely describes
the spread of the normal curve.
The greater the variance/standard deviation, the more spread out the distribution is.
10 | P a g e
Statistics and Probability
Quarter IV
LESSONS 3 – 4
Now, take a look at the list of the properties of the Normal Distribution.
a. It is a bell-shaped curve.
b. The distribution is unimodal.
c. The total area under the curve is 1.
d. Its curve is symmetric about the y-axis.
e. Its curve is asymptotic with respect to the x-axis.
f. The mean, median, and mode coincide at the center of
the distribution.
Answers to Exercises:
XI: 𝐸(𝑋) = 1.9 ≈ 2
This means that the number of nuts in a chocolate bar is approximately 2.
X: 𝐸(𝑋) = 4.13
The number of teachers hired per elementary school in a certain city is 4.
XI: Identifies regions under the normal curve corresponding to different standard normal
values. --- M11/12SP-IIIc-3
GENERALIZATION:
Finding the shaded area of the polygon is different from finding the area of the shaded region in
the normal distribution. For polygons, we use formulas and simple calculations to find the
shaded region however, in the normal distribution, we use the z-table to locate the z-value.
A specific proportion of the area of the region under the curve can be calculated manually using
the formula
Where,
Y represents the height of the curve at a particular value of X
X represents any score in the distribution
σ represents the standard deviation of the population
μ represents the population mean
π = 3.1416
𝑒 = 2.7183
11 | P a g e
Statistics and Probability
Quarter IV
LESSONS 3 – 4
However, this formula is now rarely used because of the readily available z-Table which
displays all the area of the region under the curve given a z-value.
STEPS in finding the area under the normal curve given a z-value:
1. Express the given z-value into a three-digit number.
2. Using the z-Table, find the first 2 digits on the first column.
3. Find the third digit on the first row on the right.
4. Read the area for probability at the intersection of the row (first 2-digit number) and
column (third digit number). The value observed at the intersection indicates the area of
the given z-value.
When z is negative, we simply ignore the negative sign and proceed as before.
The negative sign informs us that the region we are interested in is found on the left side of
the mean. Areas are positive values.
Finding the area that corresponds to z = 2 is the same as finding the area between
z = 0 and z = 2.
Note: Take a look at the z-table on the UPPER RIGHT.
Steps Solution
12 | P a g e
Statistics and Probability
Quarter IV
LESSONS 3 – 4
Steps Solution
z = 1.45
(it is already expressed to a
a. Express the given zvalue
3 digit
to a 3 digit number
number, so just copy the
given)
b. In the table, find the first
two digit number on the (1.4)
first Column
c. Find the third digit on the
(.05)
first Row
d. Read the probability at
The area at the intersection
the intersection of row
is 0.4265
1.4 and column .05
Exercise: Find the area of the region under the curve that corresponds between the given
z-values.
1. z = 0.63 3. z = -2.08
2. z = 1.32 4. z = -1.70
XII: The learner identifies regions under the normal curve corresponding to different standard normal
values. --- M11/12SP-IIIc-3
GENERALIZATION:
Probability notations are commonly used to express a lengthy idea into symbols concerning the
normal curve.
The following are the most common probability notations used in studying concepts on the
normal curve.
P(a < z < b) z-value is between a and b
P(z> a) z-value is above a
P(z< a) z-value is below a where a and b are z-score values.
P(z = a) = 0 z-value is equal to a is 0.
This notation indicates that a z-value is equal to exactly one point on the curve.
With that single point, a line can be drawn signifying the probability can be
below or above it.
That is why, for a z-value to be exactly equal to a value its probability is equal to 0.
Let us familiarize some of the terms involved in Negative P(z < a) Positive P(z > a)
using notations.
“less than z” “greater than z”
“to the left of z” “to the right of z”
“below z” “above z”
“lower than” “more than z”
“under z” “at least z”
13 | P a g e
Statistics and Probability
Quarter IV
LESSONS 3 – 4
Locate from the z-Table the corresponding areas z = 2 has a corresponding area of 0.4772 z
of the given z-values. = 3 has a corresponding area of 0.4987
With the graph, decide on what operation will be
With the given graph, the operation to
used to identify the proportion of the area of the
be used is subtraction.
region. Use probability notation to avoid lengthy
P(2 < z <3) = 0.4987 – 0.4772 = 0.0215
expressions.
The required area between z = 2
Make a concluding statement.
and z = 3 is 0.0215.
Steps Solution
Draw a normal curve.
Locate the required z-value.
Shade the required region.
Locate from the z-Table the corresponding z = 1 has a corresponding area of 0.3413.
areas of the given z-value. This area signifies only from z = 0 to z = 1
With the given graph, the operation to be
used is addition.
With the graph, decide on what operation P(z < 1) = 0.5000+ 0.3413 = 0.8413
will be used to identify the proportion of the This is so because the area of the region
area of the region. Use probability notation from z = 0 to its left is 0.5 since it represents
to avoid lengthy expressions. half of the normal curve. With the property
that the curve has area equal to 1, therefore
half of its area signifies 0.5000 or 0.5.
Make a concluding statement. The required area below z = 1 is 0.8413.
Exercise: Solve for the proportion of the area of the following probability notations.
14 | P a g e
Statistics and Probability
Quarter IV
LESSONS 3 – 4
XIII: The learner converts a normal random variable to a standard normal variable and vice
versa. --- M11/12SP-IIIc-4
GENERALIZATION:
For any population, the mean and the standard deviation are fixed. This gives the way to
understand that for every raw score X, there corresponds an exactly one z-score value, and vice
versa. Therefore, if we wish to find the percentage associated with X, we must find its matched
z-value using the z formula.
The area of the region under the normal curve represents the probability or percentage or
proportion of a given measurement value. It is computed by subtracting the measurement
X to the sample Ẍ or population mean, then dividing the result by the standard deviation.
The resulting value gives the z-score.
The z-score indicates the distance between a given measurement X and the mean expressed in
standard deviations. It locates either within a sample or within a population. But for now, a
readily available z-Table is made to obtain the corresponding area given a z-score.
Steps in finding the z-score given the mean (µ), standard deviation ( σ ) and the measurement
(X)
2. Check the given values: mean (µ) , standard deviation (σ) and the measurement (X)
3. Substitute the given values in the formula
4. Compute for the z-value
Example 1: In Mrs. Shirley’s science class, a student gained a score of 46. What is
the z-value of his score if test result has population mean 45 with standard
deviation of 2?
Steps Solution
a. Use the formula for finding the z- 𝑥−𝜇
𝑧=
scores of population data 𝜎
b. Check the given values µ = 45 σ=2 X = 46
c. Substitute the given values in the 46 − 45
Formula 𝑧=
2
1
d. Compute the z- value 𝑧 = = 0.5
2
15 | P a g e
Statistics and Probability
Quarter IV
LESSONS 3 – 4
1. 𝑋 = 98 2. 𝑋 = 65 3. 𝑋 = 120
XIV: The learner computes probabilities and percentiles using the standard normal table.
--- M11/12SP-IIIc-d-1
GENERALIZATION:
A percentile is a measure used in statistics indicating the value below which a given
percentage of observations in a group of observations fall. It is a measure of relative standing
as it measures the relationship of a measurement of the rest of the data.
For instance, you were told by your teacher that your Mathematics grade for the quarter
is 92 and your grade is at the 88th percentile. This means 88% of your classmates scored lower
than 92 and 21% scored more than 92 in the subject. This explains that 88th percentile is located
at a point such that 88% of the total population lies below that point and 12% lies above.
At this point, we are given with the area or the proportion of the area of the region under
the normal curve. We wish to identify the corresponding z-value of the given area. With this, we
need the z-Table to find the z-value required corresponding to the given area and if not found, we
16 | P a g e
Statistics and Probability
Quarter IV
LESSONS 3 – 4
take the nearest value. The number that corresponds to the first column and first row of the
intersection attained from the locating the given area is the required z-value.
By definition, the P86 means locating an area before or below the point. We want to
know what the z-value is at this point.
Steps Solutions
g. Locate 1.08 on the curve and draw a Therefore, the z-score by which the
line through under the curve relative comparison starts at 1.08.
h. Shade the region to the left of 1.08.
By definition, the P32 means locating an area before or below the point. We want to
know what the z-value is at this point.
Steps Solutions
17 | P a g e
Statistics and Probability
Quarter IV
LESSONS 3 – 4
d. Locate the nearest value of 0.3200 from The nearest value of 0.3200 is 0.3212 which
the values 0.3186 and 0.3212 corresponds to the z-value of 0.92.
Therefore, the z-score by which the relative
comparison starts at 0.92. 32% is located at the
e. Locate the z-value 0.92 and draw a line left side of the curve since the value does not
through under the Curve reach 50% (half of the curve) or above. Thus,
the z-value 0.92 should be located at the left
side of the curve.
f. Shade the region to the left of 0.92
Since 0.92 must be located at the left side of
the curve, it must contain a negative sign
indicating to where it should be located. Thus,
g. Describe the shaded region
the shaded region is the 32% percentile of the
distribution. Meaning, 32% of the distribution
lies below the z-value -0.92.
Exercise: Solve for the percentile of the following numbers. Then sketch its graph.
1. 30th
2. 52nd
3. P15
4. P88
Answers to Exercises:
XI:
XII:
XIII:
XIV:
18 | P a g e