Normalize entities and relations from annotated data with Wikidata #3

apiad · 2020-06-14T09:31:28Z

In the data/output folder, two files have been created:

entities.tsv contains all keyphrases annotated with their corresponding label.
relations.tsv contains all relation triplets.

Both files are TSV (tab-separated values), so there should no problems with ,, ", etc. Simply opening each file and splitting by \t should do.

The idea would be trying to normalize these mentions with their appearances in Wikidata. For that I would propose creating another two files (data/output/(entities|relations)-normalization.tsv for which all the matches found are logged together with their corresponding Wikidata metadata (i.e., IDs, etc.)

The text was updated successfully, but these errors were encountered:

csisc · 2020-07-31T16:41:07Z

I developed Python codes to solve this issue:

apiad changed the title ~~Extract entities and relations from annotated data~~ Normalize entities and relations from annotated data with Wikidata Jun 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Normalize entities and relations from annotated data with Wikidata #3

Normalize entities and relations from annotated data with Wikidata #3

apiad commented Jun 14, 2020 •

edited

Loading

csisc commented Jul 31, 2020

Normalize entities and relations from annotated data with Wikidata #3

Normalize entities and relations from annotated data with Wikidata #3

Comments

apiad commented Jun 14, 2020 • edited Loading

csisc commented Jul 31, 2020

apiad commented Jun 14, 2020 •

edited

Loading