This project is devoted to the Siberian Ingrian Finnish language. Siberian Ingrian Finnish – is a language (dialect) used by the descendants of the settlers who spoke Lower Luga Ingrian Finnish varieties and Lower Luga Ingrian (Izhorian) who have been living in Omsk oblast (previously they lived also in other regions of the Siberia) for more than 200 years. The ancestors of the speakers of Siberian Ingrian Finnish came from the Lower Luga area in the early 19th century. They came from the Rosona river area, to be exact. This region is also called Estonian Ingria. Siberian Ingrian Finnish (Russian: Сибирский ингерманландский идиом) is the term introduced by D. V. Sidorkevich.
References:
- Сидоркевич, Д. В. (Sidorkevich, Daria) (2014). Язык ингерманландских переселенцев в Сибири: структура, диалектные особенности, контактные явления (Doctoral dissertation, Ин-т лингвист. исслед. РАН (СПб)). https://iling.spb.ru/theses/1999 (In Russian)
- Sidorkevich, Daria (2011). On domains of adessive-allative in Siberian Ingrian Finnish. Acta Linguistica Petropolitana, 7(3). https://cyberleninka.ru/article/n/on-domains-of-adessive-allative-in-siberian-ingrian-finnish/viewer
- Kuznetsova, Natalia (2016). Evolution of the non-initial vocalic length contrast across the Finnic varieties of Ingria and adjacent areas. Linguistica Uralica, 52(1), 1-25. https://publicatt.unicatt.it/retrieve/handle/10807/143760/240945/ling-2016-1-1-25%28uus%29.pdf The part of this paper about Siberian Ingrian Finnish (mixed Siberian Ingrian/Finnish dialect).
- Ubaleht, Ivan (2020). The Creation of Siberian Ingrian Finnish and Siberian Tatar Speech Corpora. Workshop on RESOURCEs and representations For Under-resourced Languages and domains (RESOURCEFUL-2020) at SLTC, Gothenburg, Sweden, 25th November 2020. https://gu-clasp.github.io/resourceful-2020/papers/RESOURCEFUL-2020_paper_5.pdf
- Ubaleht, I. (2021, March). Lexeme: the Concept of System and the Creation of Speech Corpora for Two Endangered Languages. In Proceedings of the Workshop on Computational Methods for Endangered Languages (Vol. 2, pp. 20-23). https://journals.colorado.edu/index.php/computel/article/view/981 https://computel-workshop.org/wp-content/uploads/2021/03/2021.computel-2.5.pdf
- Ivan Ubaleht and Taisto-Kalevi Raudalainen. 2022. Development of the Siberian Ingrian Finnish Speech Corpus. In Proceedings of the Fifth Workshop on the Use of Computational Methods in the Study of Endangered Languages, pages 1–4, Dublin, Ireland. Association for Computational Linguistics. DOI: 10.18653/v1/2022.computel-1.1 https://aclanthology.org/2022.computel-1.1.pdf
- Злобина, Виено (Zlobina, Vieno) (1971). "Кто такие корлаки?" [Who are Korlaks?]. Советское финно-угроведение, 2, pp. 87–91. (In Russian)
- Zlobina, Vieno (1972). "Mitä alkujuurta Siperian suomalaiset ja korlakat ovat". Kotiseutu, 2 (3), pp. 86–92. (In Finnish)
- Nirvi, Ruben (1972). "Siperian inkeriläisten murteesta ja alkuperästä". Kotiseutu, 2 (3), pp. 92–95. (In Finnish)
You can download the primary speech data for the Siberian Ingrian Finnish corpus here: https://drive.google.com/drive/folders/1csw-_n2TzQa_AQObGBJP8x-S8ZH_h9E9
Stay tuned for more updates...
Speaker JuMS-28: https://www.youtube.com/watch?v=YqwrK6sItHI
All data of Siberian Ingrian Finnish in this repository are licensed under the CC BY 4.0: https://creativecommons.org/licenses/by/4.0/
If you use these materials, please publish reference to paper: Ivan Ubaleht and Taisto-Kalevi Raudalainen. 2022. Development of the Siberian Ingrian Finnish Speech Corpus. In Proceedings of the Fifth Workshop on the Use of Computational Methods in the Study of Endangered Languages, pages 1–4, Dublin, Ireland. Association for Computational Linguistics. DOI: 10.18653/v1/2022.computel-1.1
or reference to the corpus' repository: https://github.com/ubaleht/SiberianIngrianFinnish
- The source code of the parser for ELAN (*.eaf) files: https://github.com/ubaleht/LexemeELAN
- The source code of the web application of the corpus : https://github.com/ubaleht/Lexeme
Code of the Speaker and Gender | Year of Birth | Current Place of Residence | Place of the Birth | Birthplace of parents | Speech Data (Duration) |
---|---|---|---|---|---|
AAK-47 (M) | 1947 | Ryzhkovo | Syade | mother: Ryzhkovo, father: no data | 40 min 57 s |
IAI-33 (F) | 1933 | Oglukhino | Ryzhkovo | both parents: Ryzhkovo | 33 min 14 s |
JuMS-28 (M) | 1928 | Ryzhkovo | Ryzhkovo | both parents: Ryzhkovo | 77 min 53 s |
KKM-34 (M) | 1934 | Ryzhkovo | Ryzhkovo | both parents: Ryzhkovo | 31 min 29 s |
MAP-49 (F) | 1949 | Ryzhkovo | Ryzhkovo | both parents: Ryzhkovo | 30 min 36 s |
MMM-39 (M) | 1939 | Ryzhkovo | Ryzhkovo | both parents: Ryzhkovo | 62 min 20 s |
PGM-56 (F) | 1956 | Omsk | Finy | both parents: Finy | 8 min 20 s |
SVM-29 (M) | 1929 | Mikhailovka | Larionovka | both parents: Yamburgsky Uyezd, Saint Petersburg Governorate | 10 min 36 s |
KZM-51 (F) | 1951 | Ryzhkovo | Ryzhkovo | mother: Ryzhkovo, father: no data | 3 min 35 s |