Twitter Sentiment Analysis

Overview

This project focuses on the analysis of sentiments expressed in tweets. Utilizing advanced machine learning techniques, it categorizes tweets as either positive or negative. This analysis provides valuable insights into public opinion and sentiment on various topics.

Objective

The primary goal of this project is to accurately determine the sentiment of a tweet. This can be applied in numerous domains such as marketing, political campaigns, public relations, and customer service to understand and respond to public sentiment effectively.

Dataset

The analysis is based on the Sentiment140 dataset, which contains 1,600,000 tweets extracted using the Twitter API. Each tweet in the dataset is labeled as either positive or negative.

Methodology

Data Preprocessing: The dataset underwent a series of preprocessing steps including text normalization, removing URLs, emojis, and non-alphabetic characters, and tokenization.
Feature Extraction: We used TF-IDF Vectorization to convert the preprocessed tweets into a format suitable for model training.
Model Training and Evaluation: Several machine learning models were trained and compared, including Bernoulli Naive Bayes, Linear Support Vector Classification, and Logistic Regression. The models were evaluated based on their accuracy, with a particular focus on understanding the trade-offs between model complexity and performance.

Key Findings

Logistic Regression provided the best balance between accuracy and computational efficiency.
The project highlights the importance of comprehensive data preprocessing in NLP tasks.
The analysis revealed common patterns and themes in sentiment expression on Twitter.

Tools & Technologies

Python
Pandas, NumPy
Scikit-learn
NLTK
Matplotlib, Seaborn

How to Run the Project

Clone the repository to your local machine.
Install the required dependencies: pip install -r requirements.txt.
Run the "twitter-sentiment-analysis.ipynb " Jupyter notebook.

Contributing

Feel free to fork this project, submit pull requests, or send us your feedback and suggestions!

License

MIT License

Contact Information

For any additional questions or comments, please contact me at [[email protected]].

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
src		src
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Twitter Sentiment Analysis

Overview

Objective

Dataset

Methodology

Key Findings

Tools & Technologies

How to Run the Project

Contributing

License

Contact Information

About

Releases

Packages

Languages

CobosDS/twitter-sentiment-analysis

Folders and files

Latest commit

History

Repository files navigation

Twitter Sentiment Analysis

Overview

Objective

Dataset

Methodology

Key Findings

Tools & Technologies

How to Run the Project

Contributing

License

Contact Information

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages