Unsupervised Learning

Clustering, dimensionality reduction, and anomaly detection.

Identifying data points that deviate significantly from the norm – isolation forests, autoencoders, and statistical approaches.

Discovering frequent itemsets and co-occurrence patterns in transactional data – the Apriori algorithm and market basket analysis.
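As a rough illustration of the level-wise search behind Apriori, the sketch below grows frequent itemsets one item at a time and prunes any candidate that has an infrequent subset. The function name `frequent_itemsets`, the `min_support` threshold, and the toy baskets are assumptions made for this example; they do not come from any particular library.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Return {itemset: support} for every itemset with support >= min_support.

    Support is the fraction of transactions that contain the itemset.
    """
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        return sum(itemset <= t for t in transactions) / n

    # Level 1: frequent single items.
    items = {item for t in transactions for item in t}
    current = {frozenset([i]) for i in items}
    current = {s for s in current if support(s) >= min_support}

    result = {}
    k = 1
    while current:
        for s in current:
            result[s] = support(s)
        k += 1
        # Candidate generation: join frequent (k-1)-itemsets into k-itemsets.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Apriori pruning: every (k-1)-subset of a candidate must be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(sub) in current for sub in combinations(c, k - 1))
        }
        current = {c for c in candidates if support(c) >= min_support}
    return result


# Hypothetical market baskets, used only for illustration.
baskets = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]
itemsets = frequent_itemsets(baskets, min_support=0.6)
```

With these baskets, `{"diapers", "beer"}` appears in three of five transactions and survives the 0.6 threshold, while `{"bread", "beer"}` appears in only two and is dropped.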

Discovering arbitrarily shaped clusters based on point density – no need to specify the number of clusters in advance, and points in sparse regions are flagged as outliers.

Soft clustering via a weighted sum of Gaussians fitted with EM – probabilistic assignment captures cluster uncertainty.

Building a tree of nested clusters via agglomerative merging or divisive splitting – revealing multi-scale data structure.

Partitioning data into K groups by iteratively assigning points to the nearest centroid and recomputing each centroid as the mean of its points – simple and fast, though K must be chosen up front.
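The assign-then-update loop can be sketched in a few lines of plain Python. This is a minimal illustration, not a production implementation: the names `kmeans` and `sq_dist` are made up for this example, and seeding the centroids with the first K points is a simplifying assumption (practical implementations usually use random or k-means++ initialization).

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, max_iter=100):
    # Simplifying assumption: seed the centroids with the first k points.
    centroids = [tuple(p) for p in points[:k]]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: sq_dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the loop has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its assigned points.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centroids, labels


# Two well-separated toy groups, used only for illustration.
data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centroids, labels = kmeans(data, k=2)
```

On this toy data the first pass splits the points incorrectly, because both starting centroids sit in the same group, and the second pass corrects the split, which shows why the two steps are repeated until the assignments stabilize.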

Projecting data onto orthogonal directions of maximum variance – the foundational dimensionality reduction technique.
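To make "direction of maximum variance" concrete, the sketch below centers the data, builds the sample covariance matrix, and finds its leading eigenvector. It uses power iteration as a stand-in for the full eigendecomposition or SVD that PCA implementations normally rely on, and it recovers only the first direction. The function name `first_principal_component` and the toy data are assumptions for this example.

```python
import math


def first_principal_component(rows, iters=200):
    """Return (direction, variance, scores) for the first principal component.

    Assumes the data is not constant, so the covariance matrix is nonzero.
    """
    n, d = len(rows), len(rows[0])
    # Center each feature at zero mean.
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    # Sample covariance matrix (d x d).
    cov = [
        [sum(r[a] * r[b] for r in centered) / (n - 1) for b in range(d)]
        for a in range(d)
    ]
    # Power iteration: repeatedly apply the covariance matrix and renormalize.
    # Assumes the start vector is not orthogonal to the leading eigenvector.
    v = [1.0] * d
    for _ in range(iters):
        w = [sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Variance captured along v (the Rayleigh quotient of a unit vector).
    variance = sum(
        v[a] * sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)
    )
    # One-dimensional coordinates of each centered point along v.
    scores = [sum(r[j] * v[j] for j in range(d)) for r in centered]
    return v, variance, scores


# Toy points lying exactly on the line y = 2x, used only for illustration.
data = [(1, 2), (2, 4), (3, 6), (4, 8), (5, 10)]
direction, variance, scores = first_principal_component(data)
```

Because the toy points are perfectly collinear, the recovered direction is proportional to (1, 2) and carries all of the variance. Further orthogonal directions could be obtained by removing the found component from the covariance matrix and repeating the iteration.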

Nonlinear dimensionality reduction for visualization – preserving local neighborhood structure in 2D/3D plots.