Skip to content

Shivanibhawsar/DL2-T8-Projects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

IMDb Data Analysis and Top 1% Movies(Team-Strangecues)

Table of contents

Dataset

For this project I have used public dataset from kaggle

https://www.kaggle.com/ashirwadsangwan/imdb-dataset

The dataset contains IMDb's extensive database updated till 2020. The size of the dataset is around 1.44 GB.

Prerequisites

OS Scikit-learn Matplotlib Pandas Numpy

Steps

  • Firstly, you will need to download/upload the dataset to the colab and extract it in a folder
  • copy the path of that folder and paste it into the PATH
  • Run "project-strangecues.ipynb" on google colab
  • It will take time to run
  • One thing that should be noted is that the starting year of the data we used is 1960s.
  • Now, you can see the top 1% moves in the dataframe named "Classic"

Result

  • We saw that the ratio of the top 1% movies with total movies is 0.0100026877229

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •