Tutorial Hierarchical Cluster - 1

TUTORIAL

Hierarchical Cluster Analysis

Hierarchical Cluster Analysis Proximity Matrix

This table shows the matrix of proximities between cases or variables.

These values represent the similarity or dissimilarity between each pair of items.

In this example, we use Squared Euclidean Distance, which is a measure of dissimilarity.

For dissimilarities, larger values indicate items which are very different.

Smaller values indicate items which are very similar.

This relationship is reversed if a similarity measure is used.
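As a rough illustration outside SPSS, the same kind of proximity matrix can be computed in Python with SciPy. This is a minimal sketch: the seven cases below are hypothetical data invented for illustration, not the data used in this tutorial.

```python
import numpy as np
from scipy.spatial.distance import pdist, squareform

# Hypothetical data (not from the tutorial): 7 cases measured on 2 variables.
cases = np.array([
    [0.0, 0.0], [1.0, 0.0], [0.0, 2.0],   # cases 1-3
    [10.0, 0.0], [11.0, 1.0],             # cases 4-5
    [5.0, 9.0], [6.0, 11.0],              # cases 6-7
])

# pdist returns the pairwise squared Euclidean distances in condensed form;
# squareform expands them into a symmetric matrix with zeros on the diagonal.
proximity = squareform(pdist(cases, metric="sqeuclidean"))

print(np.round(proximity, 1))
```

Because squared Euclidean distance is a dissimilarity, the small entries (such as the one for cases 1 and 2) mark similar cases and the large entries mark very different ones.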

Hierarchical Cluster Analysis Agglomeration Schedule

This table shows how the cases are clustered together at each stage of the cluster analysis.

Clusters are formed by merging cases and clusters a step at a time, until all cases are joined in one big cluster.

At each stage, one case or cluster is joined with another case or cluster.

For instance, in this example, cases 4 and 11 are joined at stage 3. This is shown in the Clusters Combined columns.

When clusters or cases are joined, they are subsequently labeled with the smaller of the two cluster numbers.

The Coefficients column indicates the distance between the two clusters (or cases) joined at each stage.

The values here depend on the proximity measure and linkage method used in the analysis.
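For readers who want to reproduce this kind of table outside SPSS, SciPy's `linkage` function returns a matrix that plays a similar role to the agglomeration schedule. The sketch below assumes average linkage on squared Euclidean distances (the linkage method is an assumption made for illustration) and uses hypothetical data. SciPy numbers things differently from SPSS: cases are counted from 0, and each merged cluster receives a new number (n, n+1, and so on) rather than the smaller of the two cluster numbers.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage

# Hypothetical data (not from the tutorial): 7 cases measured on 2 variables.
cases = np.array([
    [0.0, 0.0], [1.0, 0.0], [0.0, 2.0],
    [10.0, 0.0], [11.0, 1.0],
    [5.0, 9.0], [6.0, 11.0],
])

# Each row of Z describes one stage: the two clusters combined,
# the coefficient (distance between them), and the size of the new cluster.
Z = linkage(cases, method="average", metric="sqeuclidean")

print("stage  cluster_a  cluster_b  coefficient  size")
for stage, (a, b, coeff, size) in enumerate(Z, start=1):
    print(f"{stage:5d}  {int(a):9d}  {int(b):9d}  {coeff:11.2f}  {int(size):4d}")
```

Changing `method` or `metric` changes the coefficients, which mirrors the dependence on linkage method and proximity measure described above.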

For a good cluster solution, you will see a sudden jump in the distance coefficient (or a sudden drop in the similarity coefficient) as you read down the table.

The stage before the sudden change indicates the optimal stopping point for merging clusters.

For this example, we should consider using a 4-cluster solution.
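The "sudden jump" rule can be sketched in code. The helper below is hypothetical (`suggest_n_clusters` is not an SPSS or SciPy function), the data are invented, and average linkage is an assumption. It simply finds the largest increase between consecutive coefficients and reports how many clusters exist at the stage just before it. Treat the result as a starting point for inspection, not a definitive answer.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage

def suggest_n_clusters(Z):
    """Number of clusters remaining just before the largest jump in coefficients."""
    coefficients = Z[:, 2]
    jumps = np.diff(coefficients)               # increase from each stage to the next
    stage_before_jump = int(np.argmax(jumps))   # 0-based index of the stage before the jump
    n_cases = Z.shape[0] + 1
    # After (stage_before_jump + 1) merges, this many clusters remain.
    return n_cases - (stage_before_jump + 1)

# Hypothetical data (not from the tutorial): 7 cases measured on 2 variables.
cases = np.array([
    [0.0, 0.0], [1.0, 0.0], [0.0, 2.0],
    [10.0, 0.0], [11.0, 1.0],
    [5.0, 9.0], [6.0, 11.0],
])
Z = linkage(cases, method="average", metric="sqeuclidean")

n_clusters = suggest_n_clusters(Z)
print(n_clusters)
```

With these invented cases the coefficients stay small for the first four stages and then rise sharply, so the helper stops before the fifth merge.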

The next part of the table shows the stage at which each cluster first appears.

Single cases existed before we started the analysis, so they are indicated by zeroes here.

In stage 9, cluster 1 is the cluster that was formed in stage 6, and cluster 5 is the cluster formed in stage 8.

The last column shows the subsequent stage at which the newly merged cluster is combined with yet another cluster.

For example, the cluster formed in stage 2 next appears in stage 10, where it is merged with cluster 1.
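The "stage cluster first appears" and "next stage" columns can be derived from a SciPy linkage matrix. The sketch below is one possible way to do it, not SPSS's own procedure: `stage_columns` is a hypothetical helper, the data are invented, and average linkage is an assumption. It relies on SciPy's convention that the cluster formed at stage i (counting from 1) is numbered n + i - 1, where n is the number of cases.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage

def stage_columns(Z):
    """Return (stage, first_appears_a, first_appears_b, next_stage) per stage.

    Stages count from 1. A first-appears value of 0 marks a single case;
    a next-stage value of 0 marks the final merge.
    """
    n_stages = Z.shape[0]
    n_cases = n_stages + 1
    rows = []
    for i in range(n_stages):
        a, b = int(Z[i, 0]), int(Z[i, 1])
        # Numbers below n_cases are original cases; others were formed earlier.
        first_a = 0 if a < n_cases else a - n_cases + 1
        first_b = 0 if b < n_cases else b - n_cases + 1
        new_id = n_cases + i
        next_stage = 0
        for j in range(i + 1, n_stages):
            if new_id in (int(Z[j, 0]), int(Z[j, 1])):
                next_stage = j + 1
                break
        rows.append((i + 1, first_a, first_b, next_stage))
    return rows

# Hypothetical data (not from the tutorial): 7 cases measured on 2 variables.
cases = np.array([
    [0.0, 0.0], [1.0, 0.0], [0.0, 2.0],
    [10.0, 0.0], [11.0, 1.0],
    [5.0, 9.0], [6.0, 11.0],
])
Z = linkage(cases, method="average", metric="sqeuclidean")

rows = stage_columns(Z)
for row in rows:
    print(row)
```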

Hierarchical Cluster Analysis Cluster Membership

This table shows cluster membership for each case, according to the number of clusters you requested.

You can attempt to interpret the clusters by observing which cases are grouped together.

If you've requested a range of solutions, you'll see a column for each solution.
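A comparable membership table for a range of solutions can be sketched with SciPy's `fcluster`. The data are hypothetical, average linkage is an assumption, and the cluster labels SciPy assigns are arbitrary integers, so they need not match the numbering SPSS would use.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage

# Hypothetical data (not from the tutorial): 7 cases measured on 2 variables.
cases = np.array([
    [0.0, 0.0], [1.0, 0.0], [0.0, 2.0],
    [10.0, 0.0], [11.0, 1.0],
    [5.0, 9.0], [6.0, 11.0],
])
Z = linkage(cases, method="average", metric="sqeuclidean")

# One column of labels per requested solution (2, 3 and 4 clusters).
solutions = (2, 3, 4)
membership = {k: fcluster(Z, t=k, criterion="maxclust") for k in solutions}

print("case  " + "  ".join(f"{k} clusters" for k in solutions))
for i in range(len(cases)):
    print(f"{i + 1:4d}  " + "  ".join(f"{membership[k][i]:10d}" for k in solutions))
```

Reading across a row shows how a single case moves between clusters as the requested number of clusters changes.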

Hierarchical Cluster Analysis Icicle Plot

This plot gives a graphic representation of how the cases are joined at each stage of the analysis.

Each white bar represents a boundary between clusters.

At each stage, two clusters are joined, and so the white bar separating the joined clusters ends.

Within a row, each contiguous black band indicates cases grouped as a cluster.

Formatting Icicle Plots

The default output for icicle plots displays columns of X's instead of bars.

If you find it easier to see the pattern in the plot with bars, you can set your options to automatically reformat future icicle plots as follows:

Choose Edit->Options, and select the Scripts tab.

Now make sure the Enable Autoscripting option is activated.

Finally, make sure the Cluster_Table_Icicle_Create autoscript is checked.

Future icicle plots will be generated in the new bar format (but previously generated plots will not be altered).

Hierarchical Cluster Analysis Dendrogram

The dendrogram (or "tree diagram") shows relative similarities between cases.

Notice how the "branches" merge together as you look from left to right in the dendrogram.

Cases or clusters that are joined by lines "further down" the tree (near the left side of the dendrogram) are very similar.

Cases or clusters that are joined by lines "further up" the tree (near the right side) are dissimilar.

Cluster distances are rescaled so that they range from 0 to 25 in this plot.

It can help to see different cluster solutions by imagining a vertical line through the dendrogram.

For instance, in this example, we might draw a line at about 3 rescaled distance units.

This would identify 4 clusters, one for each point where a branch intersects our line.

By considering different cut points for our line, we can get solutions with different numbers of clusters.
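Cutting the tree at a chosen height can be sketched with SciPy's `fcluster` and its `"distance"` criterion. The rescaling here is an assumption: it maps the largest coefficient to 25 by simple proportion, which may not match the exact rescaling SPSS applies. `cut_at_rescaled` is a hypothetical helper, and the data are invented, so the cluster counts differ from the tutorial's example.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage

def cut_at_rescaled(Z, rescaled_cut, scale_max=25.0):
    """Cut the tree at a height given on an assumed 0-to-scale_max scale."""
    # Convert the rescaled cut point back to the original coefficient units.
    threshold = rescaled_cut / scale_max * Z[:, 2].max()
    # Cases stay together only if they were merged at or below the threshold.
    return fcluster(Z, t=threshold, criterion="distance")

# Hypothetical data (not from the tutorial): 7 cases measured on 2 variables.
cases = np.array([
    [0.0, 0.0], [1.0, 0.0], [0.0, 2.0],
    [10.0, 0.0], [11.0, 1.0],
    [5.0, 9.0], [6.0, 11.0],
])
Z = linkage(cases, method="average", metric="sqeuclidean")

for cut in (0.5, 3.0, 23.0):
    labels = cut_at_rescaled(Z, cut)
    print(f"cut at {cut:4.1f} rescaled units -> {len(set(labels))} clusters")
```

Moving the cut point to the right merges more branches and therefore yields fewer clusters.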

A good cluster solution is one with small within-cluster distances, but large between-cluster distances.
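One simple way to check this criterion numerically is to compare the average distance between cases in the same cluster with the average distance between cases in different clusters. The sketch below is only one of several possible summaries; `within_between` is a hypothetical helper, the data are invented, and average linkage is an assumption.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage
from scipy.spatial.distance import pdist, squareform

def within_between(cases, labels):
    """Mean squared Euclidean distance within clusters and between clusters."""
    distances = squareform(pdist(cases, metric="sqeuclidean"))
    same_cluster = labels[:, None] == labels[None, :]
    # Keep each pair of cases once (upper triangle, excluding the diagonal).
    upper = np.triu(np.ones(distances.shape, dtype=bool), k=1)
    within = distances[same_cluster & upper]
    between = distances[~same_cluster & upper]
    return within.mean(), between.mean()

# Hypothetical data (not from the tutorial): 7 cases measured on 2 variables.
cases = np.array([
    [0.0, 0.0], [1.0, 0.0], [0.0, 2.0],
    [10.0, 0.0], [11.0, 1.0],
    [5.0, 9.0], [6.0, 11.0],
])
Z = linkage(cases, method="average", metric="sqeuclidean")
labels = fcluster(Z, t=3, criterion="maxclust")

within_mean, between_mean = within_between(cases, labels)
print(f"within: {within_mean:.2f}, between: {between_mean:.2f}")
```

A within-cluster mean that is much smaller than the between-cluster mean is consistent with a good solution. The helper assumes at least one cluster contains two or more cases; otherwise there are no within-cluster pairs to average.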
