This work is part of the CodeFirstGirls NanoDegree Data Stream and is the final group project

Fernandes2692/CFG_DataStream_GroupProject

CFG_DataStream_GroupProject

                          This work is part of the CodeFirstGirls NanoDegree Data Stream 

-------------------- --------- Code First Girls NanoDegree Data Stream Group Project --------- --------------------

                *Group members: Sue Scott, Flavia Fonseca, Katie Papworth, Georgia Grant, Laura Fernandes* 


This repository contains the script needed to perform the data analysis:

                  “INVESTIGATION INTO WHETHER BABY NAMES IN THE UK ARE INFLUENCED BY MOVIE RELEASES” 

README.md 📚

📄

├── Project Report - NanoDegree Final Project Sue,Flavia,Georgia,Katie,Laura.pdf
   └── 0. Appendix

📊

├── Jupyter Notebooks 
  └── 1. Data_Cleaning_and_Exploration_ONS_data.ipynb 
  └── 2. Movie_API_Genre_Family.ipynb 
  └── 3. analysis_Movies&BabyNames_Subplots.ipynb
  └── 4. Graphs_for_Baby_names_and_our_names.ipynb
  └── 5. Key_statistics_and_SQLconnection.ipynb
  └── 6. Exploratory_Analysis.ipynb 
  └── 7. Hero_Villains_Hypothesis_Testing.ipynb 

📔

├── CSV and excel files
    │ └── Movie API
    │  ├── family_movies_2009_top3.csv 
    │  ├── family_movies_2011_top3.csv 
    │  ├── family_movies_2013_top3.csv 
    │  ├── family_movies_2015_top3.csv 
    │  ├── family_movies_2017_top3.csv
    │  └── family_movies_data2000to2020.csv
    │  
    └── ONS data 
         │       ├── Clean data files 
         │       │   ├──boys_count_clean.csv
         │       │   ├──boys_rank_clean.csv
         │       │   ├──girls_count_clean.csv
         │       │   └──girls_rank_clean.csv
         │       │
         │       │        
         │       ├── Raw data
         │           └──babyanmes1996to2020.xls
         │
         └── statistics 
                │
                └── Key Statistics Heroes Villains.xlsx 

📁

├── SQL database code
    └── SQL databse for characters names.sql

🖼️

├── Images
    ├── godmothered.png
    ├── onward.jpeg
    ├── thewilloughbys.png
    ├── 15_subplots_movie_names.png
    ├── 2021 Baby Name Predications.png
    ├── CodeFirstGirlsLogo.png
    ├── Hero Vs Villains.png
    ├── Influential Movies.png
    ├── Libraries_tools_tasks.png
    ├── Movie Data Diagram.png
    ├── movie_ranks.png
    └── SWOT Analysis.png

To get started, clone the git repository and pip install the following packages:

  • Pandas
  • NumPy
  • SciPy
  • scikit-learn
  • Matplotlib
  • Seaborn
  • Openpyxl
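
Before opening the notebooks, it can help to confirm that the packages above are installed. The snippet below is a minimal sketch, not part of the original repository: the helper name `missing_packages` and the `REQUIRED` mapping are illustrative assumptions. It only checks whether each package can be found in the current Python environment.

```python
import importlib.util

# Assumed mapping of pip package name -> import name for the list above.
# Note that scikit-learn is imported as "sklearn".
REQUIRED = {
    "pandas": "pandas",
    "numpy": "numpy",
    "scipy": "scipy",
    "scikit-learn": "sklearn",
    "matplotlib": "matplotlib",
    "seaborn": "seaborn",
    "openpyxl": "openpyxl",
}


def missing_packages(required=REQUIRED):
    """Return the pip names of packages that cannot be found in this environment."""
    return [
        pip_name
        for pip_name, module_name in required.items()
        if importlib.util.find_spec(module_name) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        # Suggest a single pip command covering everything that is absent.
        print("Missing packages. Install with: pip install " + " ".join(missing))
    else:
        print("All required packages are installed.")
```

If anything is reported as missing, the printed `pip install` command can be run in a terminal before starting notebook 1.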

Run through the .ipynb notebooks 1-7, following the report and using the relevant files.

Enjoy! 🤓
