Summary Data Science- Clustering
The document is organized as a lecture-style summary or set of notes intended to explain clustering algorithms, their mechanics, use cases, strengths, and weaknesses. It is divided into clear sections, each building upon the previous, and uses a memorable party/popular kid analogy to anchor the abstract concepts of clustering in a relatable narrative. Key characteristics of the document: · Pedagogical Approach: Uses strong analogies (e.g., party groups, popular kids) to explain complex algorithms. · Visual and Textual Layering: Contains diagrams, formulas, and bullet lists to cater to different learning styles. · Comparative Analysis: Directly compares algorithms like K-Means, K-Medoids, Hierarchical Clustering, and DBSCAN. · Practical Focus: Emphasizes real-world application, parameter selection, and evaluation metrics.
Written for
- Institution
- Stockholm University
- Course
- Data Science -Clustering
Document information
- Uploaded on
- January 12, 2026
- Number of pages
- 9
- Written in
- 2025/2026
- Type
- SUMMARY
Subjects
-
clustering
-
unsupervised learning
-
k means
-
k medoids
-
dbscan
-
hierarchical clustering
-
euclidean distance
-
manhattan distance
-
similarity mesure
-
wcss
-
elbow method
-
outliers
-
silhouette score