A machine learning method that builds a hierarchy of clusters by iteratively grouping data points or clusters based on their similarities.

Hierarchical Clustering

Hierarchical clustering is a fundamental clustering technique that organizes data into a tree-like nested structure of clusters, providing multiple levels of granularity for data analysis.

Core Principles

The method operates through two main approaches:

Agglomerative (Bottom-up)
- Starts with individual data points as singleton clusters
- Iteratively merges the closest clusters
- Forms a dendrogram representing the merging history
Divisive (Top-down)
- Begins with all points in a single cluster
- Recursively splits clusters into smaller groups
- Less common but useful for certain applications

Distance Metrics

The choice of distance metrics is crucial and includes:

Linkage Criteria

Cluster proximity is determined through linkage methods:

Single linkage: Minimum distance between clusters
Complete linkage: Maximum distance between clusters
Average linkage: Mean distance between clusters
Ward's method: Minimizes variance within clusters

Applications

Hierarchical clustering finds applications in:

Phylogenetic tree construction in biology
Document classification in text analysis
Customer segmentation in marketing
Taxonomy creation in various domains

Advantages and Limitations

Advantages

No need to specify number of clusters beforehand
Produces an interpretable hierarchy
Flexible level of granularity

Limitations

Computational complexity of O(n²) or higher
Cannot undo previous steps
Sensitive to noise and outliers

Visualization

The results are typically visualized using:

Dendrograms showing the merging/splitting history
Heat maps combined with dendrograms
Cluster visualization techniques

Implementation

Common implementations use libraries such as:

scikit-learn for Python
hierarchical clustering packages in R
Custom implementations for specific needs

The method continues to evolve with new variations and applications in machine learning and data mining, particularly in areas requiring hierarchical structure discovery or multi-level clustering analysis.