Support CoNLL-U Plus as input and output format #16

Open
chiarcos opened this issue Jul 29, 2019 · 1 comment

@chiarcos
Contributor

Support for a recently suggested extension of the CoNLL-U format: https://universaldependencies.org/ext-format.html

Note that this format allows TSV data to be read without explicitly provided column names, because the names are stored in the data itself. It therefore requires a specialized reader module.
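
To illustrate what such a reader has to do, here is a minimal Python sketch. It is not the project's implementation; the function name `read_conllu_plus` is hypothetical, and it assumes the column names are declared in a `# global.columns = ...` comment line before the first token line, as described in the linked format page.

```python
def read_conllu_plus(lines):
    """Yield each sentence as a list of {column_name: value} dicts.

    Hypothetical helper: column names are taken from the
    '# global.columns = ...' header instead of being passed in by the caller.
    """
    columns = None
    sentence = []
    for raw in lines:
        line = raw.rstrip("\n")
        if line.startswith("# global.columns"):
            # Column names follow the '=' and are separated by whitespace.
            _, _, names = line.partition("=")
            columns = names.split()
        elif line.startswith("#"):
            # Other comments are ignored in this sketch.
            continue
        elif not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
        else:
            if columns is None:
                raise ValueError("missing '# global.columns' header")
            sentence.append(dict(zip(columns, line.split("\t"))))
    if sentence:
        yield sentence


sample = [
    "# global.columns = ID FORM UPOS\n",
    "# sent_id = 1\n",
    "1\tHello\tINTJ\n",
    "2\tworld\tNOUN\n",
    "\n",
]
sentences = list(read_conllu_plus(sample))
```

Each token row is keyed by the names from the header, so a caller does not need to supply column names on the command line unless it wants to override them.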

@cfaeth
Collaborator

cfaeth commented Oct 29, 2019

This has been handled in #17.
All comments are now stored at sentence level. global.columns is used as the default for naming the conll:COL properties. (This can still be overridden by command-line parameters; cf. README.md.)

Still open: handling of other relevant fields:
sent_id
text
text_{lang}
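
As a rough sketch of what handling these fields could look like, the Python snippet below extracts them from a sentence's comment lines. It is only an illustration, not code from this project: the name `sentence_metadata` and the regular expression are assumptions, and it presumes the usual `# key = value` comment shape.

```python
import re

# Assumed comment shape: '# sent_id = ...', '# text = ...', '# text_en = ...'
COMMENT_RE = re.compile(r"^#\s*(sent_id|text(?:_[A-Za-z-]+)?)\s*=\s*(.*)$")


def sentence_metadata(comment_lines):
    """Collect sent_id, text and text_{lang} values from comment lines.

    Hypothetical helper: comments with other keys are skipped.
    """
    meta = {}
    for line in comment_lines:
        match = COMMENT_RE.match(line.strip())
        if match:
            meta[match.group(1)] = match.group(2)
    return meta


meta = sentence_metadata([
    "# sent_id = s1",
    "# text = Hello world",
    "# text_en = Hello world",
    "# newdoc id = d1",
])
```

Here `meta` holds only the three fields listed above; the `newdoc` comment is left out because its key does not match.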
