

Olli Kuparinen
About me
Academy Research Fellow at the Languages Unit from 2024 to 2028. Description of the project: https://tiedejatutkimus.fi/en/results/funding/81468
From October 2023 to August 2024, I worked in the LANGAWARE project, funded by the Kone Foundation.
From 2021 to 2023, I was a postdoctoral researcher in the language technology group at the University of Helsinki.
From 2017 to 2021, I worked as a PhD researcher in a Kone Foundation funded project "Kielelliset populaatiot Helsingissä". I defended my PhD in 2021.
Fields of expertise
sociolinguistics, dialectology, language technology, computational methods
Top achievements
An up-to-date CV can be found on my personal website: https://okuparinen.github.io/cv.html
Awards:
- Thesis award of August Ahlqvist, Yrjö Wichmann, Kai Donner and Artturi Kannisto's fund for "Muutoksen mekanismit. Kolmen aikapisteen reaaliaikatutkimus Helsingin puhekielestä". Granted by Kotikielen Seura 14.3.2022.
- Thesis award of The University of Tampere Foundation for "Muutoksen mekanismit. Kolmen aikapisteen reaaliaikatutkimus Helsingin puhekielestä". Granted 15.9.2022.
- Article award of E.A. Saarimaa's fund for ”Infinitiivien variaatio ja muutos Helsingissä” (Virittäjä 122). Granted by Kotikielen Seura 14.3.2019.
Research topics
language variation and change, language technology
Research unit
Funding
Selected publications
An up-to-date list of publications can be found on my website: https://okuparinen.github.io/publications.html
Latest publications
Automatic Dialectal Transcription: An Evaluation on Finnish and Norwegian
Kuparinen, O., 2025, Interspeech 2025: Proceedings of the international conference on spoken language processing. International Speech Communication Association (ISCA), p. 2390-2394 5 p. (Interspeech).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review
Interactive maps for corpus-based dialectology
Scherrer, Y. & Kuparinen, O., 3 Mar 2025, Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025). Johansson, R. & Stymne, S. (eds.). Tartu, Estonia: University of Tartu, p. 634-638 5 p. (NEALT Proceedings Series; vol. 57).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review
Murteesta ja identiteetistä 2020-luvulla: Piirrelähtöiseen itseraportointimenetelmään perustuva kyselytutkimuksen analyysi
Kuparinen, O. & Vaattovaara, J., 14 Mar 2025, In: Virittäjä. 129, 1, p. 63–90Research output: Contribution to journal › Article › Scientific › peer-review
Corpus-based dialectometry with topic models
Kuparinen, O. & Scherrer, Y., 2024, In: Journal of Linguistic Geography. 12, 1, p. 1-12 12 p.Research output: Contribution to journal › Article › Scientific › peer-review
Murre24: Dialect Identification of Finnish Internet Forum Messages
Kuparinen, O., May 2024, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). Calzolari, N., Kan, M.-Y., Hoste, V., Lenci, A., Sakti, S. & Xue, N. (eds.). Torino, Italy: European Language Resources Association (ELRA), p. 12003-12015 13 p. (LREC proceedings)(International Conference on Computational Linguistics).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review
Dialect-to-Standard Normalization: A Large-Scale Multilingual Evaluation
Kuparinen, O., Miletić, A. & Scherrer, Y., 1 Dec 2023, Findings of the Association for Computational Linguistics: EMNLP 2023. Bouamor, H., Pino, J. & Bali, K. (eds.). Singapore: ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, p. 13814-13828 15 p.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review
The Helsinki-NLP Submissions at NADI 2023 Shared Task: Walking the Baseline
Scherrer, Y., Miletić, A. & Kuparinen, O., 1 Dec 2023, Proceedings of ArabicNLP 2023. Sawaf, H., El-Beltagy, S., Zaghouani, W., Magdy, W., Abdelali, A., Tomeh, N., Abu Farha, I., Habash, N., Khalifa, S., Keleg, A., Haddad, H., Zitouni, I., Mrini, K. & Almatham, R. (eds.). Singapore (Hybrid): ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, p. 670-677 8 p.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review
Katomuotojen eteneminen hd-yhtymässä Helsingin puhekielessä
Kuparinen, O., Santaharju, J., Leino, U., Mustanoja, L. & Peltonen, J., 2022, In: Virittäjä. 126, 3Research output: Contribution to journal › Article › Scientific › peer-review
Miksi kato leviää? hd-yhtymän katovariantin diffuusion syyt Helsingin puhekielessä
Knuutila, S., Kuparinen, O., Santaharju, J., Mustanoja, L., Leino, U. & Peltonen, J., 2022, In: SANANJALKA. 64, p. 67-85Research output: Contribution to journal › Article › Scientific › peer-review
Helsingin puhekielen muutos ja muutosten teoretisointi
Kuparinen, O., 2021, In: Virittäjä. 125, 4, 6 p.Research output: Contribution to journal › Article › Scientific