A corpus of Bangla sentences. The full dataset contains 500,000 sentences; currently, only 300,000 of them are available in this repository. The sentences were collected from social media sites, blogs, and news portals. The corpus can be used to train sentiment analysis systems and unsupervised learning algorithms.
The corpus is released in Excel and CSV formats.
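As a rough illustration of reading the CSV release, here is a minimal Python sketch. The function name `load_sentences`, the file name `corpus.csv`, and the assumption that the sentences sit in the first column are all hypothetical; check the actual file in this repository and adjust the path, column index, and header handling to match.

```python
import csv
from pathlib import Path


def load_sentences(csv_path, column=0, encoding="utf-8-sig"):
    """Return the non-empty cells of one CSV column as a list of strings.

    Assumption: one sentence per row, stored in `column` (0 = first column).
    If the file has a header row, it is returned as the first item.
    """
    sentences = []
    # "utf-8-sig" reads plain UTF-8 and also drops a byte-order mark if present.
    # newline="" lets the csv module handle line breaks inside quoted fields.
    with Path(csv_path).open(newline="", encoding=encoding) as handle:
        for row in csv.reader(handle):
            # Skip blank rows and rows too short to contain the column.
            if len(row) > column and row[column].strip():
                sentences.append(row[column].strip())
    return sentences
```

For example, `sentences = load_sentences("corpus.csv")` would return a list of strings, where `corpus.csv` stands in for whatever the CSV file is actually called.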
If you need the full version, we can arrange to send it to you. Please email us at [email protected]
The corpus is licensed under the GNU GPLv3, so anyone can use, modify, and redistribute the data under the terms of that license.