Statistics Formulas: Parameters
Statistics Formulas: Parameters
Statistics Formulas: Parameters
Parameters
Population mean = μ = (Σ Xi) / N
Population standard deviation = σ = sqrt [ Σ (Xi - μ )2 / N]
Population variance = σ2 = Σ (Xi - μ )2 / N
Variance of population proportion = σP2 = PQ / n
Standardized score = Z = (X - μ) / σ
Population correlation coefficient = ρ = [ 1 / N] * Σ {[ (Xi - μX) / σx] * [ (Yi - μY) / σy]}
Statistics
Unless otherwise noted, these formulas assume simple random sampling.
Correlation
Pearson product-moment correlation = r = Σ (xy) / sqrt [ (Σ x2) * (Σ y2)]
Linear correlation (sample data) = r = [ 1 / (n - 1)] * Σ {[ (xi - x) / sx] * [ (yi - y) / sy]}
Linear correlation (population data) = ρ = [ 1 / N] * Σ {[ (Xi - μX) / σx] * [ (Yi - μY) / σy]}
Counting
n factorial: n! = n * (n-1) * (n - 2) * . . . * 3 * 2 * 1. By convention, 0! = 1.
Permutations of n things, taken r at a time: nPr = n! / (n - r)!
Combinations of n things, taken r at a time: nCr = n! / r! (n - r)! = nPr / r!
Probability
Rule of addition: P (A ∪ B) = P(A) + P(B) - P (A ∩ B)
Rule of multiplication: P (A ∩ B) = P(A) P(B|A)
Rule of subtraction: P(A') = 1 - P(A)
Random Variables
In the following formulas, X and Y are random variables, and a and b are constants.
Standard Error
Standard error of proportion = SEp = sp = sqrt [ p * (1 - p)/n] = sqrt (pq / n)
Standard error of difference for proportions = SEp = sp = sqrt {p * (1 - p) * [ (1/n1) + (1/n2)]}
Standard error of the mean = SEx = sx = s/sqrt(n)
Standard error of difference of sample means = SEd = sd = sqrt [ (s12 / n1) + (s22 / n2)]
Standard error of difference of paired sample means = SEd = sd = {sqrt [ (Σ (di - d)2 / (n - 1)]} / sqrt(n)
Pooled sample standard error = spooled = sqrt [ (n1 - 1) * s12 + (n2 - 1) * s22] / (n1 + n2 - 2)]
Standard error of difference of sample proportions = sd = sqrt {[p1(1 - p1) / n1] + [p2(1 - p2) / n2]}
Linear Transformations
For the following formulas, assume that Y is a linear transformation of the random variable X, defined by the
equation: Y = aX + b.
Estimation
Confidence interval: Sample statistic + Critical value * Standard error of statistic
Margin of error = (Critical value) * (Standard deviation of statistic)
Margin of error = (Critical value) * (Standard error of statistic)
Hypothesis Testing
Standardized test statistic = (Statistic - Parameter) / (Standard deviation of statistic)
One-sample z-test for proportions: z-score = z = (p - P0) / sqrt (p * q / n)
Two-sample z-test for proportions: z-score = z = z = [ (p1 - p2) - d] / SE
One-sample t-test for means: t statistic = t = (x - μ) / SE
Two-sample t-test for means: t statistic = t = [ (x1 - x2) - d] / SE
Matched-sample t-test for means: t statistic = t = [ (x1 - x2) - D] / SE = (d - D) / SE
Chi-square test statistic = Χ2 = Σ [ (Observed - Expected)2 / Expected]
Degrees of Freedom
The correct formula for degrees of freedom (DF) depends on the situation (the nature of the test statistic, the number
of samples, underlying assumptions, etc.).
One-sample t-test: DF = n - 1
Two-sample t-test: DF = (s12/n1 + s22/n2)2 / {[ (s12 / n1)2 / (n1 - 1)] + [ (s22 / n2)2 / (n2 - 1)]}
Two-sample t-test, pooled standard error: DF = n1 + n2 - 2
Simple linear regression, test slope: DF = n - 2
Chi-square goodness of fit test: DF = k - 1
Chi-square test for homogeneity: DF = (r - 1) * (c - 1)
Chi-square test for independence: DF = (r - 1) * (c - 1)
Sample Size
Below, the first two formulas find the smallest sample sizes required to achieve a fixed margin of error, using simple
random sampling. The third formula assigns sample to strata, based on a proportionate design. The fourth formula,
Neyman allocation, uses stratified sampling to minimize variance, given a fixed sample size. And the last formula,
optimum allocation, uses stratified sampling to minimize variance, given a fixed budget.