Hypothesis Testing in Six Sigma
[Figure: Country A vs. Country B; horizontal axis 60.0 to 80.0 inch]
Concepts Of Hypothesis Testing
1. All processes have variation.
2. Samples from one given process may vary.
Continuous data:
- Differences in averages
- Differences in variation
- Differences in distribution “shape” of values
Discrete data:
- Differences in proportions
Hypothesis Testing
Guilty vs. Innocent Example
The American justice system can be used to illustrate the
concept of hypothesis testing.
Write the null and alternative hypothesis statements for each scenario below:
Scenario 1: You have collected delivery times for supplier A and supplier B. You wish to test whether
there is a difference in delivery time between supplier A and supplier B.
Scenario 2: You suspect that there is a difference in the cycle time to process purchase orders at site 1 of
your company compared to site 2. You are going to perform a hypothesis test to verify your hypothesis.
Scenario 3: You have implemented process improvements to reduce the cycle time to process purchase
orders in your company. You have collected cycle times before and after the process improvements
were implemented. You are going to perform a hypothesis test to verify that the
process improvements have resulted in a reduction in cycle time.
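As a reminder of the general shape such statements take, here is a generic sketch for comparing two group means. The symbols \mu_1 and \mu_2 (the population means of group 1 and group 2) are assumptions introduced here for illustration and do not appear on the slides. Whether a two-sided or a one-sided alternative applies depends on the wording of each scenario.

```latex
% Two-sided: is there any difference between the two means?
H_0 : \mu_1 = \mu_2 \qquad H_a : \mu_1 \neq \mu_2

% One-sided: is the mean of group 2 lower than the mean of group 1?
H_0 : \mu_1 = \mu_2 \qquad H_a : \mu_2 < \mu_1
```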
                      Ho true: Innocent               Ha true: Guilty
Verdict: ACCEPT Ho    Innocent, Set Free              Guilty, Incorrectly Set Free
(Set Free)                                            Type II error (β)
Verdict: REJECT Ho    Innocent, Incorrectly Jailed    Guilty, Jailed
(Jailed)              Type I error (α)
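The two error types in the verdict table can be written as conditional probabilities. This is the standard textbook definition, given here as a sketch in the slide's own terms ("accept" and "reject"), not as a formula taken from the slides.

```latex
\alpha = P(\text{reject } H_0 \mid H_0 \text{ is true})
  \quad \text{(Type I: an innocent person is jailed)}

\beta = P(\text{accept } H_0 \mid H_a \text{ is true})
  \quad \text{(Type II: a guilty person is set free)}
```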
Hypothesis Testing
P > α : Accept Ho
P ≤ α : Reject Ho
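The decision rule above can be sketched as a small function. This is a minimal illustration, not part of the original material: the function name `decide` and the default significance level of 0.05 are assumptions, and treating the boundary case P = α as a rejection is also an assumption.

```python
def decide(p_value: float, alpha: float = 0.05) -> str:
    """Apply the rule: P > alpha accepts Ho, otherwise Ho is rejected.

    alpha = 0.05 is an assumed default; the slides do not fix a value.
    """
    if not 0.0 <= p_value <= 1.0:
        raise ValueError("p_value must be between 0 and 1")
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must be strictly between 0 and 1")
    # Assumption: a P-value exactly equal to alpha counts as a rejection.
    return "Accept Ho" if p_value > alpha else "Reject Ho"


print(decide(0.20))  # P-value above alpha
print(decide(0.01))  # P-value below alpha
```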
Statistical Tests In Minitab
Some basic statistical tests are shown below with the command for running each test
in Minitab.
What The Tool Tests                     Statistical Test                        Graphical Test

Mean of population data is different    1-Sample t-test                         Histogram
from an established target.             Stat > Basic Statistics > 1-Sample t

Mean of population 1 is different       2-Sample t-test                         Histogram
from mean of population 2.              Stat > Basic Statistics > 2-Sample t

Output counts from two or more          Chi-Square Test of Independence         Pareto
subgroups differ.                       Stat > Tables > Cross Tabulation        (Frequency by Category)
                                        OR Chi-Square Test

Data is normally distributed.           Normality Test
                                        Stat > Basic Statistics
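To show what the two t-tests listed above compute, here is a minimal Python sketch of the test statistics using only the standard library. It is an illustration, not Minitab's implementation: the function names and the sample values are hypothetical, and the 2-sample version assumes unequal variances. Turning a t statistic into a P-value requires the t distribution, which is left to the statistics package.

```python
import math
import statistics


def one_sample_t(sample, target):
    """Return (t, degrees of freedom) for a 1-sample t-test against a target."""
    n = len(sample)
    if n < 2:
        raise ValueError("need at least two observations")
    sd = statistics.stdev(sample)  # sample standard deviation (n - 1 divisor)
    if sd == 0:
        raise ValueError("sample has no variation; t is undefined")
    t = (statistics.mean(sample) - target) / (sd / math.sqrt(n))
    return t, n - 1


def two_sample_t(sample_1, sample_2):
    """Return the 2-sample t statistic without assuming equal variances."""
    n1, n2 = len(sample_1), len(sample_2)
    if n1 < 2 or n2 < 2:
        raise ValueError("each sample needs at least two observations")
    # Standard error of the difference between the two sample means.
    se = math.sqrt(statistics.variance(sample_1) / n1
                   + statistics.variance(sample_2) / n2)
    if se == 0:
        raise ValueError("samples have no variation; t is undefined")
    return (statistics.mean(sample_1) - statistics.mean(sample_2)) / se


# Hypothetical measurements, invented for illustration only.
group_a = [68.0, 70.0, 72.0, 70.0, 70.0]
group_b = [66.0, 68.0, 70.0, 68.0, 68.0]

t_one, df = one_sample_t(group_a, target=68.0)
t_two = two_sample_t(group_a, group_b)
print(f"1-sample t = {t_one:.3f} (df = {df})")
print(f"2-sample t = {t_two:.3f}")
```

A larger absolute t value means the observed difference is large relative to the sampling variation, which corresponds to a smaller P-value.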
Select A Statistical Test
[Flow chart; the first decision box is not legible]
- NO: Logistic Regression
- YES: Comparing 2 or fewer Groups?
  - NO (Multiple Groups): *HOV (test spread), *ANOVA (test center)
  - YES: Can I Match X's With X's?
    - YES (Pre and Post Improvement): Paired t (center)
    - NO: Are we comparing the mean to a Standard?
      - YES: 1 Sample t
      - NO: *HOV (spread), *2 Sample t-Test (center)
        (Note: Do HOV first and use results to refine 2 Sample t)
* Instructions for these tests are on the following pages
Choosing the Appropriate Test
There are four items that we need to consider before we select the
right statistical test:
1. Is the Y Continuous or Discrete?
2. Is (are) the X(s) Continuous or Discrete?
3. Are we trying to compare the Variation or Centering?
4. Is Y Normal or non-Normal?
Note: Not all four questions are used for the selection of
the proper test...
Statistical Test Flow Chart
Is Y Continuous or Discrete?
- Y Continuous
  - X Continuous: Regression
  - X Discrete: Variation or Centering?
    - Variation: Homogeneity of Variance (Bartlett's, Levene's, F-test)
    - Centering:
      - Comparing Relative to a Target? Yes: 1 Sample t-Test
      - No: Comparing only Two Groups? Yes: 2 Sample t-Test. No: ANOVA
      - Non-Parametric Tests: Mood's Median, Mann Whitney
- Y Discrete
  - X Continuous: Logistic Regression
  - X Discrete: Chi-Square
Note: Even though the tests are broken down by whether the dependent
variable (Y) is normal or not, you may still perform the test as long
as you know the limitations of the test.
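The flow chart above can be sketched as a small lookup function. This is one reading of the chart under stated assumptions: the function name `select_test` and its parameters are hypothetical, the pairing of Y and X data types with Regression, Logistic Regression and Chi-Square follows the usual convention, and the non-parametric branch (Mood's Median, Mann Whitney) is left out for brevity.

```python
def select_test(y_type, x_type, compare=None, has_target=False, n_groups=2):
    """Suggest a test from the data types of Y and X.

    y_type, x_type: "continuous" or "discrete"
    compare:        "variation" or "centering" (continuous Y, discrete X only)
    has_target:     True when the mean is compared to a target value
    n_groups:       number of groups being compared
    """
    for name, value in (("y_type", y_type), ("x_type", x_type)):
        if value not in ("continuous", "discrete"):
            raise ValueError(f"{name} must be 'continuous' or 'discrete'")

    if y_type == "discrete":
        return "Logistic Regression" if x_type == "continuous" else "Chi-Square"
    if x_type == "continuous":
        return "Regression"

    # Continuous Y with discrete X: choose between spread and center.
    if compare == "variation":
        return "Homogeneity of Variance (Bartlett's, Levene's, or F-test)"
    if compare == "centering":
        if has_target:
            return "1 Sample t-Test"
        return "2 Sample t-Test" if n_groups == 2 else "ANOVA"
    raise ValueError("compare must be 'variation' or 'centering'")


print(select_test("continuous", "discrete", compare="centering", n_groups=4))
print(select_test("discrete", "discrete"))
```

The function only suggests a starting point. Checking normality of Y and the limitations of each test remains a separate step.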
Which Hypothesis Testing Tool Would You Use?
For each scenario described below, which hypothesis testing tool would
you use? Assume a normal distribution where appropriate.
1. A Six Sigma project is being conducted in the field to improve the cycle time for warranty
repair returns. The warranty return cycle time was measured for a period of 6 weeks for 4
regions. The Green Belt suspects that there is a difference in average warranty repair cycle
time among each of the regions. How would you test whether there is a statistically
significant difference in mean cycle time for the different regions?
2. Tungsten steel erosion shields are fitted to the low pressure blading in steam turbines.
The most important feature of a shield is its resistance to wear. Resistance to wear can be
measured by abrasion loss, which is thought to be associated with the hardness of steel.
How would you test whether there is a statistically significant relationship between
resistance to wear and abrasion hardness of steel?
3. Your business purchases sheet stock from two different suppliers. It has found an
unacceptably large number of defects being caused by thickness beyond tolerance levels.
Overall mean thickness data was analyzed and found to be on target. Data was then
collected to identify a potential difference in the variation of the thickness of the
material by supplier.
4. Checks Are Us is a payroll processing firm. Timecard errors are routinely monitored and
recorded. A Black Belt investigating the errors wishes to determine if there are any
differences in the number of errors among five of its major customers. The number of
errors contained in a sample of 150 employees was recorded for five weeks. How would you
test if there is a statistically significant difference in the number of errors among the
customers?