IMDb Data Analysis and Top 1% Movies(Team-Strangecues)

Dataset

For this project I have used public dataset from kaggle

The dataset contains IMDb's extensive database updated till 2020. The size of the dataset is around 1.44 GB.

OS Scikit-learn Matplotlib Pandas Numpy

Firstly, you will need to download/upload the dataset to the colab and extract it in a folder
copy the path of that folder and paste it into the PATH
Run "project-strangecues.ipynb" on google colab
It will take time to run
One thing that should be noted is that the starting year of the data we used is 1960s.
Now, you can see the top 1% moves in the dataframe named "Classic"

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Sakshi Rathi		Sakshi Rathi
README.md		README.md
project_strangecues.ipynb		project_strangecues.ipynb