Skip to content

Commit

Permalink
Add files via upload
Browse files Browse the repository at this point in the history
  • Loading branch information
jakoble authored Oct 28, 2024
1 parent 3595f90 commit 3185bff
Show file tree
Hide file tree
Showing 4 changed files with 60 additions and 0 deletions.
15 changes: 15 additions & 0 deletions corpora/newspaper-corpora/corpus-vu-dnc.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
{
"Name": "Corpus VU-DNC",
"URL": "http://hdl.handle.net/10032/tm-a2-g4",
"Family": "Newspaper corpora",
"Description": "This corpus consists of data from five newspapers, covering 3 separate years (1950, 1951, and 2002).\nThe corpus is available from the Dutch Language Institute.",
"Language": ["nld"],
"Licence": "",
"Size": [""],
"Annotation": [""],
"Infrastructure": "CLARIN",
"Access": {
"Concordancer": "https://ivdnt.org/wp-content/apps/vu-dnc/index.html"
},
"Publication":""
}
15 changes: 15 additions & 0 deletions corpora/newspaper-corpora/couranten-corpus.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
{
"Name": "Couranten Corpus",
"URL": "http://hdl.handle.net/10032/tm-a2-u9 ",
"Family": "Newspaper corpora",
"Description": "This corpus contains thirteen seventeenth-century Dutch newspapers (altogether 109,532 articles) published between 1619 and 1700.\nThe corpus is available from the Dutch Language Institute.",
"Language": ["nld"],
"Licence": "",
"Size": ["18.9 million words"],
"Annotation": ["", ""],
"Infrastructure": "CLARIN",
"Access": {
"Concordancer": "https://couranten.ivdnt.org/corpus-frontend/couranten/search/"
},
"Publication":""
}
15 changes: 15 additions & 0 deletions corpora/newspaper-corpora/wablieft-corpus.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
{
"Name": "Wablieft Corpus",
"URL": "https://hdl.handle.net/10032/tm-a2-q6",
"Family": "Newspaper corpora",
"Description": "This corpus contains the digital archive of the <a href=\"https://www.wablieft.be/nl\">Wablieft newspaper</a> from 2011 to 2017.\nThe corpus is available from the Dutch Language Institute.",
"Language": ["nld"],
"Licence": "CC BY",
"Size": ["2 million words"],
"Annotation": ["PoS-tagged", "lemmatised", "named entities", "syntactic dependencies"],
"Infrastructure": "CLARIN",
"Access": {
"Download": "https://hdl.handle.net/10032/tm-a2-q6"
},
"Publication":""
}
15 changes: 15 additions & 0 deletions corpora/newspaper-corpora/wai-not-corpus.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
{
"Name": "WAI-NOT Corpus",
"URL": "http://hdl.handle.net/10032/tm-a2-t9",
"Family": "Newspaper corpora",
"Description": "This corpus contains the digital archive of the <a href=\"https://www.wai-not.be/\">WAI-NOT newspaper</a> between 2009 and 2021.\nThe corpus is available from the Dutch Language Institute.",
"Language": ["nld"],
"Licence": "",
"Size": [""],
"Annotation": ["", ""],
"Infrastructure": "CLARIN",
"Access": {
"Download": "http://hdl.handle.net/10032/tm-a2-t9 "
},
"Publication":""
}

0 comments on commit 3185bff

Please sign in to comment.