This work is part of the Code First Girls NanoDegree Data Stream.
-------------------- --------- Code First Girls NanoDegree Data Stream Group Project --------- --------------------
*Group members: Sue Scott, Flavia Fonseca, Katie Papworth, Georgia Grant, Laura Fernandes*
“INVESTIGATION INTO WHETHER BABY NAMES IN THE UK ARE INFLUENCED BY MOVIE RELEASES”
README.md 📚
📄
├── Project Report - NanoDegree Final Project Sue,Flavia,Georgia,Katie,Laura.pdf
└── 0. Appendix
📊
├── Jupyter Notebooks
│   ├── 1. Data_Cleaning_and_Exploration_ONS_data.ipynb
│   ├── 2. Movie_API_Genre_Family.ipynb
│   ├── 3. analysis_Movies&BabyNames_Subplots.ipynb
│   ├── 4. Graphs_for_Baby_names_and_our_names.ipynb
│   ├── 5. Key_statistics_and_SQLconnection.ipynb
│   ├── 6. Exploratory_Analysis.ipynb
│   └── 7. Hero_Villains_Hypothesis_Testing.ipynb
📔
├── CSV and excel files
│ └── Movie API
│ ├── family_movies_2009_top3.csv
│ ├── family_movies_2011_top3.csv
│ ├── family_movies_2013_top3.csv
│ ├── family_movies_2015_top3.csv
│ ├── family_movies_2017_top3.csv
│ └── family_movies_data2000to2020.csv
│
└── ONS data
│ ├── Clean data files
│ │ ├── boys_count_clean.csv
│ │ ├── boys_rank_clean.csv
│ │ ├── girls_count_clean.csv
│ │ └── girls_rank_clean.csv
│ │
│ └── Raw data
│     └── babyanmes1996to2020.xls
│
└── statistics
│
└── Key Statistics Heroes Villains.xlsx
📁
├── SQL database code
└── SQL databse for characters names.sql
🖼️
├── Images
├── godmothered.png
├── onward.jpeg
├── thewilloughbys.png
├── 15_subplots_movie_names.png
├── 2021 Baby Name Predications.png
├── CodeFirstGirlsLogo.png
├── Hero Vs Villains.png
├── Influential Movies.png
├── Libraries_tools_tasks.png
├── Movie Data Diagram.png
├── movie_ranks.png
└── SWOT Analysis.png
To get started, clone the git repository and pip install the following packages:
- Pandas
- NumPy
- SciPy
- scikit-learn
- Matplotlib
- Seaborn
- Openpyxl
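
Before opening the notebooks, it can help to confirm that the packages above are available in your environment. The snippet below is a minimal sketch, not part of the project itself: the helper `missing_modules` and the `REQUIRED_MODULES` list are illustrative names, and the list assumes the usual import names (scikit-learn is imported as `sklearn`).

```python
from importlib.util import find_spec

# Import names for the packages listed above (assumed, not taken from the
# notebooks). scikit-learn is imported as "sklearn"; the other import names
# match the lowercase package names.
REQUIRED_MODULES = [
    "pandas",
    "numpy",
    "scipy",
    "sklearn",
    "matplotlib",
    "seaborn",
    "openpyxl",
]


def missing_modules(module_names):
    """Return the top-level module names that cannot be found."""
    return [name for name in module_names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are installed.")
```

Any module reported as missing can then be installed with pip under its package name (for example `scikit-learn` for `sklearn`).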
Run through notebooks 1 to 7 in order, following the report and using the relevant data files for each notebook.
Enjoy! :nerd_face: