Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

need to be able to retrieve short defs after disambiguation #617

Open
balmas opened this issue Jan 18, 2021 · 0 comments
Open

need to be able to retrieve short defs after disambiguation #617

balmas opened this issue Jan 18, 2021 · 0 comments
Labels
bug Something isn't working treebank

Comments

@balmas
Copy link
Member

balmas commented Jan 18, 2021

With Greek, because we get the short definitions separately from the morphological parse, we are able to retrieve definitions for the lemmas identified by the treebank because we retrieve short definitions separately from the morphological parse. With Latin, we get our short definitions from Whitaker, in the same step as the morphological parse, and so if the treebank identifies a lemma that was missing from the Whitaker parser result, we don't get any definition, because we don't have a separate short definitions index for Latin. This is not good and we need to fix it.

Examples of words which cause a problem

nos, and mihi because Whitaker doesn't report the lemma ego as a possible parse
quaeque which when lemmatized as quisque

@balmas balmas added treebank bug Something isn't working labels Jan 18, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working treebank
Projects
None yet
Development

No branches or pull requests

1 participant