Clustering Life Expectancy
Data Science & Analyst
About The Project
This project performs exploration and clustering of life expectancy data from various countries using the DBSCAN algorithm (Density-Based Spatial Clustering of Applications with Noise). The main objective is to identify hidden patterns and group countries with similar health characteristics based on life expectancy indicators.
Tools & Libraries : Python, Google Colab, Pandas, Numpy, Seaborn, Matplotlib, Scikit-Learn, Kneed.
Achievements
Successfully uncovered distinct global life expectancy patterns by clustering 200 countries using DBSCAN, providing insights into gender-based longevity disparities.
Achieved a Silhouette Score of 0.496, indicating a moderately well-separated clustering despite the unsupervised nature and complexity of the dataset.
Identified Nigeria as a unique outlier, demonstrating the model's ability to detect noise and potential anomalies in public health data.








