Skip to content

Commit

Permalink
Add files via upload
Browse files Browse the repository at this point in the history
  • Loading branch information
jakoble authored Oct 28, 2024
1 parent 7c66dc5 commit 8c6ac8e
Show file tree
Hide file tree
Showing 3 changed files with 45 additions and 0 deletions.
15 changes: 15 additions & 0 deletions corpora/spoken-corpora/c-oral.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
{
"Name": "C-ORAL-ROM_EXM",
"URL": "https://hdl.handle.net/21.11129/0000-000B-D4FF-7",
"Family": "Spoken corpora",
"Description": "This is a corpus of formal and informal speech.\nThe corpus is available from PORTULAN.",
"Language": ["por"],
"Licence": "The MIT licence",
"Size": ["300,000 words"],
"Annotation": ["Orthographically aligned", "Phonemically alligned", "PoS tagged"],
"Infrastructure": "CLARIN",
"Access": {
"Download": "https://hdl.handle.net/21.11129/0000-000B-D4FF-7"
},
"Publication":""
}
15 changes: 15 additions & 0 deletions corpora/spoken-corpora/perfil.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
{
"Name": "Perfil Sociolinguístico da Fala Bracarense",
"URL": "https://hdl.handle.net/21.11129/0000-000D-F928-E",
"Family": "90 hours",
"Description": "The corpus is composed by 1 hour interviews with speakers of the same area (around Braga, Portugal).\nThe interviews are stratified according to gender, age and level of education; the transcriptions are aligned with <a href=\"https://exmaralda.org/en/\"EXMARaLDA</a>.\nThe corpus is available from PORTULAN.",
"Language": ["por"],
"Licence": "CC BY-NC-ND",
"Size": ["90 hours"],
"Annotation": ["transcriptions aligned"],
"Infrastructure": "CLARIN",
"Access": {
"Download": "https://hdl.handle.net/21.11129/0000-000D-F928-E"
},
"Publication":""
}
15 changes: 15 additions & 0 deletions corpora/spoken-corpora/spoken-dutch-corpus.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
{
"Name": "Spoken Dutch Corpus",
"URL": "https://hdl.handle.net/10032/tm-a2-k6",
"Family": "Spoken corpora",
"Description": "This is a corpus of standard Dutch spoken in Flanders and the Netherlands.",
"Language": ["nld"],
"Licence": "",
"Size": ["900 hours"],
"Annotation": ["PoS-tagged", "syntactically parsed", "phonetically transcribed", "phonemically transcribed"],
"Infrastructure": "CLARIN",
"Access": {
"Download": "https://hdl.handle.net/10032/tm-a2-k6"
},
"Publication":""
}

0 comments on commit 8c6ac8e

Please sign in to comment.