- setting twitter developer account
- decide the twitter topic to analyze - Starbucks
- setting docker - spark streaming might need it soon
- kafka - try producer and consumer
- kafka - twitter - data ingestion
# create a virtual environment
python -m <virtual environment name>
# activate this virtual environment
source venv/bin/activate
# Installing list of packages in requirements.txt
pip install -r requirements.txt
Docker Setup for Kafka is explained in HERE with details.
3-1. Check the link for TwitterAPI for Developer
Check the file src/credential.json
Check the files src/producer.py
and src/consumer.py
.
Make sure to modify the kafka topic_name
and what to track
on twitter.
Prepare two separate terminals, and run python conumser.py
and python producer.py
python src/consumer.py
python src/producer.py