How To Run Cluster Analysis in Excel

The document outlines the steps for performing K-means clustering analysis, starting from data preparation to calculating cluster means and repeating the process until minimal improvement in SSE is achieved. It includes specific case data and calculations for cluster assignments and distances. The final output presents the means for three segments along with the number of respondents and their respective SSE values.

Uploaded by

Yazel Faith Poblador

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as XLSX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

81 views9 pages

How To Run Cluster Analysis in Excel

Uploaded by

Yazel Faith Poblador

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as XLSX, PDF, TXT or read online on Scribd

You are on page 1/ 9

How to run cluster an

K-means cluste
Geoff Fripp
Marketing Lecturer, The Universi

STEP ONE - Start with your data set STEP TWO - If two var

Case X Y Z
1 4.40 4.57 2.29
2 3.25 3.92 2.17
3 3.10 4.25 2.40
4 4.83 4.31 2.16
5 3.63 3.60 1.67
Start1 6 3.26 1.64 1.48
7 4.89 1.33 1.04
8 4.50 2.01 1.28
Start2 9 4.99 2.47 2.60
10 4.12 2.12 1.70
11 2.21 2.51 4.17
12 2.97 4.10 3.92
13 2.40 2.45 4.46
14 2.10 3.30 4.90
Start3 15 1.13 2.05 3.28

Min 1.13 1.33 1.04

Max 4.99 4.57 4.90
Median 3.26 2.51 2.29
STEP THREE - Work out sum of squares distance and first allocation
1 2 3
Case Start1 Start2 Start3 Min Initial Choice
1 10.54 4.84 18.07 4.84 2
2 5.64 5.30 9.19 5.30 2
3 7.65 6.75 9.50 6.75 2
4 10.01 3.58 20.05 3.58 2
5 4.00 3.97 11.25 3.97 2
6 - 4.91 7.95 - 1
7 2.94 3.75 19.67 2.94 1
8 1.72 2.18 15.40 1.72 1
9 4.91 - 15.54 - 2
10 1.02 1.67 11.47 1.02 1
11 9.09 10.19 2.18 2.18 3
12 12.10 8.50 8.02 8.02 3
13 10.28 10.19 3.17 3.17 3
14 15.79 14.31 5.15 5.15 3
15 7.95 15.54 - - 3

SSE 48.64

STEP FOUR - Calculate means of each current cluster

Cluster 1 Cluster 2
Current Case X Y Z X Y Z
2 1 4.40 4.57 2.29
2 2 3.25 3.92 2.17
2 3 3.10 4.25 2.40
2 4 4.83 4.31 2.16
2 5 3.63 3.60 1.67
1 6 3.26 1.64 1.48
1 7 4.89 1.33 1.04
1 8 4.50 2.01 1.28
2 9 4.99 2.47 2.60
1 10 4.12 2.12 1.70
3 11
3 12
3 13
3 14
3 15

Mean 4.19 1.77 1.37 4.03 3.85 2.21

Set as named new ranges

STEP FIVE - Repeat step three - with new cluster means above
1 2 3

Case Cluster 1 Cluster 2 Cluster 3 Min Revised Choice

1 8.71 0.66 11.33 0.66 2
2 6.12 0.63 6.14 0.63 2
3 8.35 1.06 5.82 1.06 2
4 7.42 0.84 13.11 0.84 2
5 3.73 0.53 8.82 0.53 2
6 0.90 6.01 9.85 0.90 1
7 0.79 8.49 19.52 0.79 1
8 0.16 4.50 14.48 0.16 1
9 2.61 2.96 10.56 2.61 1
10 0.23 3.26 10.40 0.23 1
11 12.29 8.96 0.14 0.14 3
12 13.42 4.12 2.19 2.19 3
13 13.21 9.70 0.34 0.34 3
14 19.14 11.26 0.75 0.75 3
15 13.10 12.82 2.52 2.52 3

SSE 14.35

STEP SIX+ - Repeat steps four and five - until SSE only has minor impro
Mean/Centroid

Segment 1
Segment 2
Segment 3
AVERAGE
Respondents
Segment 1
Segment 2
Segment 3
TOTAL
uster analysis in Excel
eans clustering
Geoff Fripp
cturer, The University of Sydney

STEP TWO - If two variables, run a visual check with a scatter graph
e and first allocation

Use =SUMXMY2

Case X Y Z
1 4.40 4.57 2.29
Start1 6 3.26 1.64 1.48
Difference 1.14 2.93 0.81
Sqaured 1.31 8.58 0.66 10.54

Case 1 =IF(G45=D45,D$43,IF(G45=E45,E$43,IF(G45=F45,F$43,"")))

Check
Cluster 1 4
Cluster 2 6
Cluster 3 5
TOTAL 15

Cluster 3
X Y Z
Case 1
Cluster 1 =IF($B66=1,D22,"")
Cluster 2 =IF($B66=2,D22,"")
Cluster 3 =IF($B66=3,D22,"")

2.21 2.51 4.17

2.97 4.10 3.92
2.40 2.45 4.46
2.10 3.30 4.90
1.13 2.05 3.28

2.16 2.88 4.15

r means above
Use =SUMXMY2

Case X Y Z
1 4.40 4.57 2.29
Cluster 1 6 4.19 1.77 1.37
Difference 0.21 2.80 0.92
Sqaured 0.04 7.82 0.84 8.71

Case 1 =IF(G45=D45,D$43,IF(G45=E45,E$43,IF(G45=F45,F$43,"")))

Check
Cluster 1 5
Cluster 2 5
Cluster 3 5
TOTAL 15

E only has minor improvement

Output for THREE Clusters/Segments
Mean/Centroid X Y Z Variable 4 Variable 5

Segment 1 4.35 1.91 1.62

Segment 2 3.84 4.13 2.14
Segment 3 2.16 2.88 4.15
AVERAGE 3.45 2.97 2.63
Respondents Number % SSE/Segment
Segment 1 5 33.3% 4.2
Segment 2 5 33.3% 3.1 SSE Total 13.2
Segment 3 5 33.3% 5.9
TOTAL 15 100.0%
ters/Segments
Variable 5 Variable 6 Variable 7 Variable 8

otal 13.2

The Sas System
No ratings yet
The Sas System
16 pages
Customer Waiting Time Analysis
No ratings yet
Customer Waiting Time Analysis
8 pages
Time Series Final
No ratings yet
Time Series Final
10 pages
CAPE Formula Sheet
No ratings yet
CAPE Formula Sheet
12 pages
CAPE Math Formula Booklet REVISED 2022
0% (1)
CAPE Math Formula Booklet REVISED 2022
11 pages
Pure and Applied Mathematics Formula Sheet-4E745
No ratings yet
Pure and Applied Mathematics Formula Sheet-4E745
11 pages
QA Formula & Stats Table
No ratings yet
QA Formula & Stats Table
4 pages
BS SRR-3
No ratings yet
BS SRR-3
20 pages
Table of F
No ratings yet
Table of F
4 pages
Standard Normal Distribution
No ratings yet
Standard Normal Distribution
3 pages
4 Primer
No ratings yet
4 Primer
4 pages
Biostatistics-Haramaya University Full - Aug 25 2008
No ratings yet
Biostatistics-Haramaya University Full - Aug 25 2008
88 pages
Testing R
No ratings yet
Testing R
2 pages
Karisma 23011101119 Eda Rec
No ratings yet
Karisma 23011101119 Eda Rec
88 pages
Jawaban Case 1-8
No ratings yet
Jawaban Case 1-8
14 pages
Data Science Formula - Very Imp
No ratings yet
Data Science Formula - Very Imp
6 pages
M0388
No ratings yet
M0388
1 page
Project Cost Overrun Scenarios
No ratings yet
Project Cost Overrun Scenarios
318 pages
Summary
No ratings yet
Summary
4 pages
Publicación Notas Finanzas Corporativas F
No ratings yet
Publicación Notas Finanzas Corporativas F
6 pages
04 - Statistic With Computer Application - T-Table
No ratings yet
04 - Statistic With Computer Application - T-Table
1 page
Statistical Tables and Formulae
No ratings yet
Statistical Tables and Formulae
8 pages
M0391
No ratings yet
M0391
1 page
T-Statistics P 0.01 P 0.001 Chi-Square Statistics
No ratings yet
T-Statistics P 0.01 P 0.001 Chi-Square Statistics
1 page
REsearch Assignment II
No ratings yet
REsearch Assignment II
8 pages
F Distribution Table
No ratings yet
F Distribution Table
5 pages
Table of F
No ratings yet
Table of F
2 pages
Table of Critical F Values For Alpha 0.025
No ratings yet
Table of Critical F Values For Alpha 0.025
1 page
Table F Values A 0.025 PDF
No ratings yet
Table F Values A 0.025 PDF
1 page
Tutorials2016s1 Week7 Answers-3
No ratings yet
Tutorials2016s1 Week7 Answers-3
5 pages
Slope Table From % To Degree
No ratings yet
Slope Table From % To Degree
5 pages
Grubbs' Outlier Test
No ratings yet
Grubbs' Outlier Test
2 pages
Statistical Tables
No ratings yet
Statistical Tables
10 pages
Econometric Formulae Statistical Tables
No ratings yet
Econometric Formulae Statistical Tables
4 pages
Multiple Linear Regression and Checking For Collinearity Using SAS
0% (1)
Multiple Linear Regression and Checking For Collinearity Using SAS
18 pages
Statistical Tables All in One
No ratings yet
Statistical Tables All in One
6 pages
Experiment 1
No ratings yet
Experiment 1
26 pages
Control Chart For Mean and Range: Quality Characteristic
No ratings yet
Control Chart For Mean and Range: Quality Characteristic
2 pages
Survival Models in SAS Part 7: PROC PHREG - Part 2: May 21, 2008 Charlie Hallahan
No ratings yet
Survival Models in SAS Part 7: PROC PHREG - Part 2: May 21, 2008 Charlie Hallahan
30 pages
M0393
No ratings yet
M0393
1 page
Distribusi F: WWW - Smartstat.info
No ratings yet
Distribusi F: WWW - Smartstat.info
4 pages
Statistics Reference Tables
No ratings yet
Statistics Reference Tables
4 pages
F-Table 0 of Statistical
No ratings yet
F-Table 0 of Statistical
1 page
Statistical Formulae and Tables
No ratings yet
Statistical Formulae and Tables
12 pages
Tablas Estadisticas
No ratings yet
Tablas Estadisticas
5 pages
1-Sample T-Test: The Steps For Calculating A Single-Sample T-Test "From Scratch" Are
No ratings yet
1-Sample T-Test: The Steps For Calculating A Single-Sample T-Test "From Scratch" Are
3 pages
Consolidation Data For Groups
No ratings yet
Consolidation Data For Groups
1 page
ACTIVITY No 3 - Statistics 2020
No ratings yet
ACTIVITY No 3 - Statistics 2020
8 pages
Exercise Lecture 4 - Summary Measure
No ratings yet
Exercise Lecture 4 - Summary Measure
6 pages
004 Notas - Grupo 1 Noviembre - 18 Moodle
No ratings yet
004 Notas - Grupo 1 Noviembre - 18 Moodle
6 pages
Statistial Tables
No ratings yet
Statistial Tables
2 pages
Cubesand Cube Roots: Chapter One
No ratings yet
Cubesand Cube Roots: Chapter One
3 pages
Session 2
No ratings yet
Session 2
14 pages
Name and Formula: Natl. Bur. Stand. (U.S.) Monogr. 25, 18, 59, (1981)
No ratings yet
Name and Formula: Natl. Bur. Stand. (U.S.) Monogr. 25, 18, 59, (1981)
3 pages
003 Notas - Grupo 1 Noviembre Moodle
No ratings yet
003 Notas - Grupo 1 Noviembre Moodle
6 pages
Research Paper - FINAL 1
No ratings yet
Research Paper - FINAL 1
76 pages
ABAC Policy
No ratings yet
ABAC Policy
33 pages
Non Disclosure Agreement 2
No ratings yet
Non Disclosure Agreement 2
1 page
Broken Boy
No ratings yet
Broken Boy
3 pages
IMManaging Volatility Risks in Cryptocurrency Market
No ratings yet
IMManaging Volatility Risks in Cryptocurrency Market
16 pages
Mechanical Drawing TS34-1 PDF
100% (2)
Mechanical Drawing TS34-1 PDF
88 pages
Media Queries Cheat Sheet (Hoja de Trampa) - I FuckingLoveCoding
No ratings yet
Media Queries Cheat Sheet (Hoja de Trampa) - I FuckingLoveCoding
14 pages
Bubble Test Direct Pressure Inservice
No ratings yet
Bubble Test Direct Pressure Inservice
3 pages
WFS150-5C Software Manual
No ratings yet
WFS150-5C Software Manual
10 pages
Laplace Transform for Differential Equations
No ratings yet
Laplace Transform for Differential Equations
3 pages
Tib Amx BPM Install
No ratings yet
Tib Amx BPM Install
66 pages
Solar Inverters for Global Markets
No ratings yet
Solar Inverters for Global Markets
21 pages
Capital Works Management Framework
No ratings yet
Capital Works Management Framework
27 pages
Plan 12T
No ratings yet
Plan 12T
2 pages
Medical Gas Pipeline System
No ratings yet
Medical Gas Pipeline System
16 pages
Reynald Maprangala 2019
No ratings yet
Reynald Maprangala 2019
4 pages
Industrial Networks Connecting Controllers Via OPC: Master's Thesis
No ratings yet
Industrial Networks Connecting Controllers Via OPC: Master's Thesis
93 pages
Saudi Sensing Solutions Profile
No ratings yet
Saudi Sensing Solutions Profile
25 pages
Industrial Control Solutions
No ratings yet
Industrial Control Solutions
8 pages
Đầu Nối & Ống Dẫn Fuji-V English (Đúc)
No ratings yet
Đầu Nối & Ống Dẫn Fuji-V English (Đúc)
2 pages
59318A - FASTFACTS - Sulfinert Coatings For Sampling, Transfer, and Analysis OfSulfur Compounds To Less Than 20ppb
No ratings yet
59318A - FASTFACTS - Sulfinert Coatings For Sampling, Transfer, and Analysis OfSulfur Compounds To Less Than 20ppb
2 pages
WindBack Seal
No ratings yet
WindBack Seal
2 pages
Marine Anti-Foulant Solution
No ratings yet
Marine Anti-Foulant Solution
3 pages
2014 Standard Catalog of World Coins 2001 Date Eighth Edition George S. Cuhaj Online Version
No ratings yet
2014 Standard Catalog of World Coins 2001 Date Eighth Edition George S. Cuhaj Online Version
115 pages
Sliding Wear Behaviour of HVOF and HVAF Sprayed Cr3C2-Based Coatings
No ratings yet
Sliding Wear Behaviour of HVOF and HVAF Sprayed Cr3C2-Based Coatings
24 pages
Arch Resume
No ratings yet
Arch Resume
2 pages
Heidelberg CP 2000 Computer 113204
No ratings yet
Heidelberg CP 2000 Computer 113204
1 page
Presentation For Dept. of Tourism's Wow Philippines Campaign
No ratings yet
Presentation For Dept. of Tourism's Wow Philippines Campaign
25 pages
VelPAK Tutorial
No ratings yet
VelPAK Tutorial
95 pages
QA450 QE440 Fault Code SPN FMI
No ratings yet
QA450 QE440 Fault Code SPN FMI
6 pages
Ghanaian Entrepreneur's Journey
No ratings yet
Ghanaian Entrepreneur's Journey
14 pages
MS-Access 2007 Pre-Test
No ratings yet
MS-Access 2007 Pre-Test
2 pages
Places and Directions Vocabulary - German - 4th Grade by Slidesgo
No ratings yet
Places and Directions Vocabulary - German - 4th Grade by Slidesgo
40 pages
Business Research Design
50% (2)
Business Research Design
26 pages