names of the profiles in the last ERG treebanks #5
Comments
I ended up with the following list
|
In https://arxiv.org/pdf/1904.11564.pdf, you wrote
Once the script had finished, I counted the graphs with:
So I am missing The profiles sum up ..
|
These were taken from the
Regarding the new distribution of the Redwoods 2020 data, I don't really know what changed or why, so I cannot comment on your proposed list. Regarding the counts, a few things:
These may account for the discrepancies you saw. |
Sorry, I was reading the profile inputs, but I should have read the results:
|
I already count the cases of possibly invalid MRS; that is my 2297 above. |
Ah, yes, the result file is better because of course some items won't get a parse. Good catch. |
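To make the item/result distinction above concrete, here is a minimal sketch for counting both in one profile. It assumes the usual [incr tsdb()] layout, where each relation (`item`, `result`) is a text file with one record per line, possibly gzipped. The helper names `count_records` and `compare_counts` are made up for illustration and are not part of any DELPH-IN tool. Note that the number of `result` rows equals the number of graphs only if each parsed item has exactly one stored result.

```python
import gzip
from pathlib import Path


def count_records(profile_dir, relation):
    """Count non-empty lines (one record per line) in a relation file.

    Looks for `<profile_dir>/<relation>` first, then the gzipped
    `<relation>.gz`; returns 0 if neither exists.
    """
    plain = Path(profile_dir) / relation
    gzipped = Path(profile_dir) / (relation + ".gz")
    if plain.exists():
        opener, path = open, plain
    elif gzipped.exists():
        opener, path = gzip.open, gzipped
    else:
        return 0
    with opener(path, "rt", encoding="utf-8") as f:
        return sum(1 for line in f if line.strip())


def compare_counts(profile_dir):
    """Return record counts for the `item` and `result` relations."""
    return {rel: count_records(profile_dir, rel) for rel in ("item", "result")}
```

Calling `compare_counts` on each profile directory and summing the `result` counts gives a number that already excludes items without a parse, which the `item` count does not.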
This is related to delph-in/docs#40, and maybe @olzama and @danflick can add something.
The names of the ERG gold profiles in the `tsdb/gold` directory changed. The spreadsheet at http://svn.delph-in.net/erg/tags/2020/etc/redwoods.xls didn't preserve the old names, which is pretty confusing. So, for example, `wsj06c` is now only `wsj06`, right?
How were the dev, test, and train sets defined for https://github.com/goodmami/mrs-to-penman/blob/master/convert-redwoods.sh#L8-L187? Can the new names impact the dev/test/train sets?
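One way to see whether the renaming affects the splits is to check every profile name listed in a split against the names actually present in `tsdb/gold`. The sketch below is only an illustration: `check_splits` and `suggest_rename` are hypothetical helpers, the split contents have to be supplied by hand, and the prefix heuristic (an old name such as `wsj06c` maps to the longest current name that is a prefix of it) is an assumption based on the single `wsj06c`/`wsj06` example, not a documented renaming rule.

```python
def check_splits(splits, available):
    """For each split, list the profile names that are not in `available`.

    `splits` maps a split name (e.g. "train") to a list of profile names;
    `available` is an iterable of the profile names that currently exist.
    """
    available = set(available)
    return {
        split: [name for name in names if name not in available]
        for split, names in splits.items()
    }


def suggest_rename(old_name, available):
    """Guess the new name for `old_name` (assumed heuristic).

    Returns the longest available name that is a prefix of `old_name`,
    or None if there is no such name.
    """
    candidates = [name for name in available if old_name.startswith(name)]
    return max(candidates, key=len) if candidates else None
```

Any name reported by `check_splits` would need to be resolved manually before reusing the old dev/test/train definitions; `suggest_rename` only proposes a candidate to look at.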