Wrangling and Analyzing WeRateDogs Twitter Data

Ahmed Unshur

Real-world data is often dirty and messy, which is why it is important to build skills in handling and cleaning it.

This project was conducted to wrangle and analyze a dataset from the Twitter account @dog_rates, also known as WeRateDogs. The project was completed as part of Udacity's Data Analyst Nanodegree program.

We followed the wrangling process of gathering, assessing, and cleaning data. We gathered three datasets using three different methods, then assessed the data and identified 9 quality issues and 2 tidiness issues. We cleaned each issue using the define, code, and test framework, and combined the cleaned data into a master dataset. Finally, we analyzed the master dataset to uncover insights.
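As a rough illustration of the gather, clean, and test steps, here is a minimal sketch that uses only the Python standard library. It is not the notebook's code, which relies on Pandas. The in-memory sample data, the column name `tweet_id`, the one-JSON-object-per-line layout, and the helper names `load_archive`, `load_tweet_counts`, and `build_master` are all assumptions made for the example.

```python
import csv
import io
import json

# Hypothetical sample data standing in for the real files (assumed layout).
ARCHIVE_CSV = "tweet_id,text\n1001,Good dog 13/10\n1002,Fluffy pupper 12/10\n"
TWEET_JSON_LINES = (
    '{"id": 1001, "retweet_count": 5, "favorite_count": 20}\n'
    '{"id": 1003, "retweet_count": 1, "favorite_count": 2}\n'
)


def load_archive(text):
    """Gather: parse archive rows from CSV text, keyed by integer tweet id."""
    reader = csv.DictReader(io.StringIO(text))
    return {int(row["tweet_id"]): row for row in reader}


def load_tweet_counts(text):
    """Gather: parse one JSON object per line into a dict keyed by tweet id."""
    counts = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        tweet = json.loads(line)
        counts[tweet["id"]] = {
            "retweet_count": tweet["retweet_count"],
            "favorite_count": tweet["favorite_count"],
        }
    return counts


def build_master(archive, counts):
    """Clean and combine: keep only tweets present in both sources."""
    master = []
    for tweet_id in sorted(archive.keys() & counts.keys()):
        row = {"tweet_id": tweet_id, "text": archive[tweet_id]["text"]}
        row.update(counts[tweet_id])
        master.append(row)
    return master


master = build_master(load_archive(ARCHIVE_CSV), load_tweet_counts(TWEET_JSON_LINES))

# Test: every row in the master carries the engagement counts.
assert all("retweet_count" in row for row in master)
print(master)
```

The final assertion plays the role of the "test" step: after each cleaning operation, a quick check confirms that the code did what the definition said it should.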

To complete this project, we used Anaconda, Python and several of its packages and libraries (NumPy, Pandas, Matplotlib, Seaborn, Requests, Tweepy, and JSON), Jupyter Notebook, Sublime Text, and Microsoft Word.

Files in this repository:

README.md
act_report.pdf
image-predictions.tsv
libraries.py
tweet-json.txt
tweets_data.csv
twitter-archive-enhanced.csv
twitter_archive_master.csv
wrangle_act.ipynb
wrangle_report.pdf
