Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/hot topic of the day #64

Open
wants to merge 9 commits into
base: master
Choose a base branch
from

Conversation

fccoelho
Copy link
Member

This feature is a script to calculate the hottest tokens of the day. I works by adding up the freqdists of every article published on a given day. It's different than whats is shown in the Media cloud dashboard (number of articles mentioning the word), here we calculate number of times the word has been printed on a given day.

It's not done yet, but I'd appreciate some input. The Idea is that we create a collection with these global freqdists in Mongo so that we can easily extract time series for analyses.

==== THIS REQUIRES THAT THE ARTICLE COLLECTION HAS BEEN SENT TO PYPLN ====

Added script to extract freqdist from index.
Added script to extract freqdist from index.
Added script to extract freqdist from index.
modified generate_freqdist to optionally return the freqdist as a dataframe
…loud_backend into feature/hot-topic-of-the-day
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant