Factor Analysis 2023
Factor Analysis 2023
• Interdependence technique
- No distinction DV/IV: all variables considered simultaneously
- Search for structure/patterns in data
AY 2023-2024 • Highly correlated variables (= variables that share variance) are grouped into distinct sets (i.e. “factors”)
Laura De Kerpel
1 2
A supermarket chain asked 500 of its customers to fill in a questionnaire which 1. Data suitability
contained 12 questions about shopping behaviour, all on a 7 point Likert scale 2. Sample requirements
3. Usefulness of Factor Analysis
2 underlying dimensions found: 4. Number of factors
ü “Pleasure”
5. Interpretation
ü “Planning”
6. Data reduction
3 4
3 4
1
06/10/2023
STEP 1 STEP 2
ü Suitability of data ü Sample requirements
• Correlation matrix: Interval or ratio scaled variables
• Min 50 (better: min 100)
(Avoid non-metric variables)
• 5:1 ratio (better 10:1 or even 20:1)
• Item scales: Likert scales, semantic differential…
• High case-to-variable ratio à to avoid overfitting!
à Same level of measurement (standardize)
• Underlying structure exists? Factor analysis will always produce factors!
• Homogeneous sample?
“garbage in, garbage out”
E.g. Men vs. women and shoe characteristics
Men: 3 dimensions à appearance-situation-comfort
Women: 5 dimensions à style-material-colour-situation-
comfort
5 6
5 6
Visual inspection: high (>.30) and not equal (because some structure must exist)
①Correlation matrix
②Partial correlation/ Anti-image correlation
matrix
③Bartlett’s test of sphericity
④KMO
7 8
7 8
2
06/10/2023
9 10
9 10
①Correlation matrix • Anti-image correlation matrix: negative values of partial correlation should all be
low
②Partial correlation/ Anti-image correlation
matrix
③Bartlett’s test of sphericity
④KMO
11 12
11 12
3
06/10/2023
STATISTICAL ASSUMPTIONS IN FA
STATISTICAL ASSUMPTIONS IN FA
Analyze > Dimension Reduction > Factor > Descriptives
13 14
13 14
matrix
③Bartlett’s test of sphericity
④KMO
15 * Kaiser-Meyer-Olkin Measure of Sampling Adequacy 16
15 16
4
06/10/2023
17 18
17 18
1. Default in SPSS
2. Data reduction: account for max total variance in min factors
3. No factor indeterminacy problem (>< Common FA)
4. Invalid calculation of communalities
19 20
19 20
5
06/10/2023
21 22
21 22
STEP 4 & 5
ü Number of factors & interpretation COMPONENT MATRIX
TOTAL VARIANCE EXPLAINED Only contains values for the 3 relevant factors
• Initial Eigenvalues = explained variance by each factor
• Extraction Sums of Squared Loadings : the factors that passed the Kaiser Criterion (= Latent root criterion) Factor loadings: correlation between a set of factor
(i.e. Eigenvalue > 1) scores and a set of scores for an original variable (only
• Total variance explained: social sciences > 60% (rule of thumb) if orthogonal!)
Total variance = 12
Stop extracting
23 24
23 24
6
06/10/2023
26
25 26
• All factors contain some unique variance, the proportion of unique variance increases in
later factors
• Extract factors before unique variance > common variance 2
Iterative
Scree Plot
• Where curve straightens (“elbow”) 4. 5
4 process
3. 5
3
Eigenvalue
2. 5
1. 5
0. 5
0
1 2 3 4 5 6 7 8 9 10 11 12
C om ponent N um ber
Extract 3 factors 27 28
27 28
7
06/10/2023
29 30
29 30
STEP 6 STEP 6
üData reduction üData reduction
Reversing scores:
CREATING COMPOSITE MEASURES
= combining several variables that measure the same concept into a single variable
Analyze > Data Reduction > Factor > Scores > Save as variables > Regression
31 32
31 32
8
06/10/2023
33 34
RELIABILITY RELIABILITY
Always calculate the Cronbach’s Alpha
Rule of thumb
If the increase is very
small, just keep as the α ≥ 0.9: excellent
The higher the better analysis 0.8 ≤ α < 0.9: very good
If it’s the big increase, 0.7 ≤ α < 0.8: good
it’s worth 0.6 ≤ α < 0.7: acceptable
α < 0.6: unacceptable
Warning!
• Items should logically match (garbage in, garbage out)
• Min. 3 items (preferably) à If only 2 items, calculate Pearson’s r (Analyze > Correlate > Bivariate)
35 36
9
06/10/2023
When the value is zero, there is no correlation. When the value is (near) +1 or -1, there is a
perfect correlation.
37 38
37 38
39 40
39 40
10
06/10/2023
3) EXTRAVERSION/INTROVERSION 1. Run a FA on the data to see whether the Big Five structure is supported.
Social, gregarious vs. Solitary, reserved Remove variables that do not load satisfactory
5) NEUROTICISM
Nervous, mentally unstable vs. Stable, resilient
41 42
41 42
11