Skip to content

Wrangling and cleaning a dataset, visualizing and applying statistical measures, and developing Supervised and Unsupervised ML models

Notifications You must be signed in to change notification settings

hazem-antar/Machine-Learning-Course-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Data-Science

This project discusses the analysis, visualization, and development of different types of machine learning algorithms on a small-sized dataset of Penguins. In the first part, the used dataset will be wrangled and cleaned from null, duplicated, and outliers values. Then, every feature in the dataset will be explored thoroughly and the correlation between different features will help unfold even the tiniest details. Using the current knowledge, some features might need to be disposed of before moving on to the next phase. In the second part, two different clustering Machine Learning (ML) algorithms will be developed to discover the building structure of the dataset (different penguins' species, gender, and where they live) only by analyzing measurable features. Lastly, in the third part, three different classification machine learning algorithms will be trained to preciously predict the type of penguin species given all other features in the dataset.

About

Wrangling and cleaning a dataset, visualizing and applying statistical measures, and developing Supervised and Unsupervised ML models

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published