Skip to content

Commit

Permalink
update
Browse files Browse the repository at this point in the history
  • Loading branch information
agricolamz committed Dec 3, 2024
1 parent abc007b commit 19f91a9
Show file tree
Hide file tree
Showing 2 changed files with 44 additions and 0 deletions.
22 changes: 22 additions & 0 deletions result_en.html
Original file line number Diff line number Diff line change
Expand Up @@ -53,6 +53,28 @@ <h2 class="anchored" data-anchor-id="seminars-of-the-linguistic-convergence-labo
<h3 class="anchored" data-anchor-id="seminar-schedule-2024">Seminar schedule 2024</h3>
<div class="columns">
<div class="column" style="width:15%;">
<p><strong>10 December</strong></p>
</div><div class="column" style="width:85%;">
<p><em>Martin Haspelmath (Max Planck Institute for Evolutionary Anthropology)</em></p>
<p><strong>Language parameters and construction parameters in the CrossGram database collection</strong></p>
<details>
<summary>
Abstract
</summary>
Replicability of research results minimally relies on data accessibility, but the data should ideally be FAIR: Findable, Accessible, Interoperable, and Reusable. For technical interoperability, the CLDF standard (Forkel et al.&nbsp;2018) could be used by typologists, though uptake seems to have been slow so far. In this presentation, I discuss the design of the CrossGram database collection, which is specifically designed for typological datasets (it has been public since the summer of 2024: https://crossgram.clld.org/), Here I describe how CrossGram enhances findability and reusability, and I highlight the two different data types that it supports (language parameters and construction parameters). CrossGram makes typological data more findable in that it “brings to light” what is often “hidden away” in supplementary spreadsheet files (or even tables in PDF files, though this is becoming rare). Research papers typically limit themselves to summary tables or graphs and a few small maps, but ideally we want to access all typological datasets with the amenities known from CLLD applications such as WALS Online (Dryer &amp; Haspelmath 2013, wals.info) or Grambank (Skirgård et al.&nbsp;2023, grambank.clld.org). These provide not only easy exporting in CLDF format, but also easy searching and sorting in data tables, as well as map visualization, and links to references and Glottolog language information. In addition, CrossGram provides glossed example sentences in tabular form, similar to the thousands of examples in the APiCS database (Michaelis et al.&nbsp;2013). These are a particularly striking case of increased transparency, because it is not uncommon for example sentences to be hidden in PDF supplements (for example, Bugaeva 2022 has a supplement of 80 pages of annotated examples). Interlinear glossed text has a range of applications even independently of the typological claims that the examples illustrate, so this is another obvious improvement in reusability. CrossGram supports two types of typological data: Language parameters that classify entire languages (i.e.&nbsp;parameters of the type known from the maps of WALS and Grambank), and construction parameters that classify constructions. There are many grammatical meanings or functions that can be rendered by multiple constructions in a given language, and if we only consider language parameters, the language must be classified as “mixed” (or a minor construction must be ignored). For example, Kashmiri has both correlative relative clauses and postnominal relative clauses, so both of these strategies could be included in the database and their properties recorded. Stereotypically, typology consists in classifying languages into types, but in reality, languages often have multiple types coexisting with each other, so the addition of constructions and construction types as a data type makes typological databases more fine-grained. Finally, CrossGram parameters (both language parameters and construction parameters) are not only explained succinctly and clearly, but there is also a sophisticated keyword annotation that allows users to easily find grammatical information on a wide range of topics, and for the future, integration with the envisaged “Grammaticon” reference catalogue is planned (see Haspelmath 2022). This will enhance findability and accessibility even further, and it will facilitate replication and (more generally) cumulative science.
</details>
</div><div class="column" style="width:15%;">
<p><strong>3 December</strong></p>
</div><div class="column" style="width:85%;">
<p><em>Daria Ryzhova (HSE University), Polina Padalka (HSE University)</em></p>
<p><strong>YES and NO answers and their synonyms in Shughni</strong></p>
<details>
<summary>
Abstract
</summary>
Shughni response particles ůn ‘yes’ and nāy ‘no’ may be used in various contexts, including, besides answers to polarity questions, reactions to requests, suggestions, opinions and other types of speech acts. In addition to their usage in a dialogue setting, these particles can function as discourse markers in narratives. In this talk, we will outline the range of their functions and present their synonyms: other response particles (e.g.&nbsp;en ‘yes’, an ‘yes’, nāyo ‘no’, na-a ‘no’) and multiword expressions (discourse formulae). We will show that synonymous items tend to distribute across different discourse functions. For some items, we will trace their presumable pragmaticalization paths.
</details>
</div><div class="column" style="width:15%;">
<p><strong>26 November</strong></p>
</div><div class="column" style="width:85%;">
<p><em>Leah Finkelberg (HSE University)</em></p>
Expand Down
22 changes: 22 additions & 0 deletions result_ru.html
Original file line number Diff line number Diff line change
Expand Up @@ -53,6 +53,28 @@ <h2 class="anchored" data-anchor-id="семинары-международной
<h3 class="anchored" data-anchor-id="расписание-семинаров-в-2024-году">Расписание семинаров в 2024 году</h3>
<div class="columns">
<div class="column" style="width:15%;">
<p><strong>10 декабря</strong></p>
</div><div class="column" style="width:85%;">
<p><em>Martin Haspelmath (Max Planck Institute for Evolutionary Anthropology)</em></p>
<p><strong>Language parameters and construction parameters in the CrossGram database collection</strong></p>
<details>
<summary>
Аннотация
</summary>
Replicability of research results minimally relies on data accessibility, but the data should ideally be FAIR: Findable, Accessible, Interoperable, and Reusable. For technical interoperability, the CLDF standard (Forkel et al.&nbsp;2018) could be used by typologists, though uptake seems to have been slow so far. In this presentation, I discuss the design of the CrossGram database collection, which is specifically designed for typological datasets (it has been public since the summer of 2024: https://crossgram.clld.org/), Here I describe how CrossGram enhances findability and reusability, and I highlight the two different data types that it supports (language parameters and construction parameters). CrossGram makes typological data more findable in that it “brings to light” what is often “hidden away” in supplementary spreadsheet files (or even tables in PDF files, though this is becoming rare). Research papers typically limit themselves to summary tables or graphs and a few small maps, but ideally we want to access all typological datasets with the amenities known from CLLD applications such as WALS Online (Dryer &amp; Haspelmath 2013, wals.info) or Grambank (Skirgård et al.&nbsp;2023, grambank.clld.org). These provide not only easy exporting in CLDF format, but also easy searching and sorting in data tables, as well as map visualization, and links to references and Glottolog language information. In addition, CrossGram provides glossed example sentences in tabular form, similar to the thousands of examples in the APiCS database (Michaelis et al.&nbsp;2013). These are a particularly striking case of increased transparency, because it is not uncommon for example sentences to be hidden in PDF supplements (for example, Bugaeva 2022 has a supplement of 80 pages of annotated examples). Interlinear glossed text has a range of applications even independently of the typological claims that the examples illustrate, so this is another obvious improvement in reusability. CrossGram supports two types of typological data: Language parameters that classify entire languages (i.e.&nbsp;parameters of the type known from the maps of WALS and Grambank), and construction parameters that classify constructions. There are many grammatical meanings or functions that can be rendered by multiple constructions in a given language, and if we only consider language parameters, the language must be classified as “mixed” (or a minor construction must be ignored). For example, Kashmiri has both correlative relative clauses and postnominal relative clauses, so both of these strategies could be included in the database and their properties recorded. Stereotypically, typology consists in classifying languages into types, but in reality, languages often have multiple types coexisting with each other, so the addition of constructions and construction types as a data type makes typological databases more fine-grained. Finally, CrossGram parameters (both language parameters and construction parameters) are not only explained succinctly and clearly, but there is also a sophisticated keyword annotation that allows users to easily find grammatical information on a wide range of topics, and for the future, integration with the envisaged “Grammaticon” reference catalogue is planned (see Haspelmath 2022). This will enhance findability and accessibility even further, and it will facilitate replication and (more generally) cumulative science.
</details>
</div><div class="column" style="width:15%;">
<p><strong>3 декабря</strong></p>
</div><div class="column" style="width:85%;">
<p><em>Daria Ryzhova (HSE University), Polina Padalka (HSE University)</em></p>
<p><strong>YES and NO answers and their synonyms in Shughni</strong></p>
<details>
<summary>
Аннотация
</summary>
Shughni response particles ůn ‘yes’ and nāy ‘no’ may be used in various contexts, including, besides answers to polarity questions, reactions to requests, suggestions, opinions and other types of speech acts. In addition to their usage in a dialogue setting, these particles can function as discourse markers in narratives. In this talk, we will outline the range of their functions and present their synonyms: other response particles (e.g.&nbsp;en ‘yes’, an ‘yes’, nāyo ‘no’, na-a ‘no’) and multiword expressions (discourse formulae). We will show that synonymous items tend to distribute across different discourse functions. For some items, we will trace their presumable pragmaticalization paths.
</details>
</div><div class="column" style="width:15%;">
<p><strong>26 ноября</strong></p>
</div><div class="column" style="width:85%;">
<p><em>Leah Finkelberg (HSE University)</em></p>
Expand Down

0 comments on commit 19f91a9

Please sign in to comment.