Diachronic features #99

Open
nicolasdugue opened this issue Jul 19, 2024 · 2 comments · Fixed by #100
Labels: enhancement (New feature or request)

Comments

@nicolasdugue
Member

Feature
Suppose one has a textual corpus split into distinct time periods. One may want to analyze how word embeddings change over time.

Describe the solution you'd like
We suggest the following approach:

  • train a SINr model on the whole corpus
  • train several SINr models, one for each of the corpus slices, using the communities detected on the whole corpus
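A minimal sketch of why reusing the communities matters, in plain NumPy rather than the SINr API. It assumes each slice is given as a weighted co-occurrence adjacency matrix over a shared vocabulary, and that `membership` holds the community labels already detected on the whole corpus. The function name `community_embeddings` and the toy matrices are hypothetical, and the measure (share of a word's edge weight going to each community) is a simplified stand-in for SINr's actual node measures.

```python
import numpy as np

def community_embeddings(adjacency, membership, n_communities):
    """Row u, column c: share of word u's edge weight that goes to community c."""
    adjacency = np.asarray(adjacency, dtype=float)
    n_words = adjacency.shape[0]
    # One-hot indicator matrix (words x communities) built from the shared labels.
    indicator = np.zeros((n_words, n_communities))
    indicator[np.arange(n_words), membership] = 1.0
    weight_to_community = adjacency @ indicator
    degree = adjacency.sum(axis=1, keepdims=True)
    # Words with no edges in a slice keep an all-zero vector.
    return np.divide(weight_to_community, degree,
                     out=np.zeros_like(weight_to_community), where=degree > 0)

# Hypothetical community labels, assumed to come from the whole-corpus graph.
membership = np.array([0, 0, 1, 1])

# Hypothetical co-occurrence graphs for two time slices (same vocabulary order).
slice_a = [[0, 2, 0, 0], [2, 0, 1, 0], [0, 1, 0, 3], [0, 0, 3, 0]]
slice_b = [[0, 1, 2, 0], [1, 0, 0, 0], [2, 0, 0, 1], [0, 0, 1, 0]]

emb_a = community_embeddings(slice_a, membership, n_communities=2)
emb_b = community_embeddings(slice_b, membership, n_communities=2)
```

Because both slices are projected onto the same communities, column `c` means the same thing in `emb_a` and `emb_b`, so the vectors can be compared directly across periods.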
@nicolasdugue
Member Author

Adding one feature to explore the model:

  • variation of the most stereotypical words

Based on nearest-neighbor variation, but adapted to evaluate how the stereotypes of a dimension have changed between two models.
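One possible reading of this, sketched in plain NumPy and not taken from the SINr code: treat the stereotypes of a dimension as the `k` words with the highest value on it, and measure the change between two models as one minus the Jaccard overlap of those two word sets. The function names, the choice of Jaccard, and the toy matrices are all assumptions made for illustration.

```python
import numpy as np

def top_k_words(embeddings, dimension, k):
    """Indices of the k words with the highest value on the given dimension."""
    order = np.argsort(-embeddings[:, dimension], kind="stable")
    return set(order[:k].tolist())

def stereotype_variation(emb_a, emb_b, dimension, k):
    """0.0 if the top-k words are identical in both models, 1.0 if disjoint."""
    words_a = top_k_words(emb_a, dimension, k)
    words_b = top_k_words(emb_b, dimension, k)
    return 1.0 - len(words_a & words_b) / len(words_a | words_b)

# Hypothetical embeddings of the same 4 words on the same 2 dimensions.
emb_a = np.array([[1.0, 0.0], [0.6, 0.4], [0.2, 0.8], [0.0, 1.0]])
emb_b = np.array([[0.3, 0.7], [1.0, 0.0], [0.7, 0.3], [0.0, 1.0]])

variation_dim0 = stereotype_variation(emb_a, emb_b, dimension=0, k=2)
```

This only makes sense when both models share their dimensions, which is what training the slice models on the whole-corpus communities provides.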

@nicolasdugue nicolasdugue linked a pull request Jul 22, 2024 that will close this issue
@SimonGuillot
Collaborator

Adding a diachronic tool:

  • the difference vector between a word's embeddings in two sub-corpora, both expressed in the same reference model
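A rough sketch of that tool in plain NumPy, assuming the two sub-corpus models are available as matrices with the same word order and the same dimensions. The function names and the toy values are hypothetical and do not reflect the SINr API.

```python
import numpy as np

def difference_vector(emb_a, emb_b, word_index):
    """Per-dimension change of one word from sub-corpus A to sub-corpus B."""
    return emb_b[word_index] - emb_a[word_index]

def most_shifted_dimensions(diff, k):
    """Indices of the k dimensions with the largest absolute change."""
    return np.argsort(-np.abs(diff), kind="stable")[:k].tolist()

# Hypothetical embeddings of 2 words on 3 shared dimensions.
emb_a = np.array([[1.0, 0.0, 0.0], [0.0, 0.5, 0.5]])
emb_b = np.array([[0.25, 0.5, 0.25], [0.0, 0.5, 0.5]])

diff = difference_vector(emb_a, emb_b, word_index=0)
shifted = most_shifted_dimensions(diff, k=2)
```

Since the dimensions are interpretable communities, the largest entries of the difference vector point to the communities a word moved toward or away from between the two periods.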
