Abstract
In order to draw generalizable conclusions about the performance of multilingual models across languages, it is important to evaluate on a set of languages that captures linguistic diversity. Linguistic typology is increasingly used to justify language selection, inspired by language sampling in linguistics. However, justifications for ‘typological diversity’ exhibit great variation, as there seems to be no set definition, methodology or consistent link to linguistic typology. In this work, we provide a systematic insight into how previous work in the ACL Anthology uses the term ‘typological diversity’. Our two main findings are: 1) what is meant by typologically diverse language selection is not consistent and 2) the actual typological diversity of the language sets in these papers varies greatly. We argue that, when making claims about ‘typological diversity’, an operationalization of this should be included. A systematic approach that quantifies this claim, also with respect to the number of languages used, would be even better.
Original language | English |
---|---|
Title | Proceedings of the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP |
Publisher | Association for Computational Linguistics |
Publication date | 17 Mar 2024 |
ISBN (Print) | 979-8-89176-071-4 |
Status | Published - 17 Mar 2024 |
Event | The 18th Conference of the European Chapter of the Association for Computational Linguistics - Radisson Blu, St. Julian's, Malta. Duration: 17 Mar 2024 → 22 Mar 2024. https://2024.eacl.org/ |
Conference
Conference | The 18th Conference of the European Chapter of the Association for Computational Linguistics |
---|---|
Location | Radisson Blu |
Country/Territory | Malta |
City | St. Julian's |
Period | 17/03/2024 → 22/03/2024 |
Internet address | https://2024.eacl.org/ |
Projects
- 1 Ongoing
- Multilingual Modelling for Resource-Poor Languages
Bjerva, J., Lent, H. C., Chen, Y., Ploeger, E., Fekete, M. R. & Lavrinovics, E.
01/09/2022 → 31/08/2025
Projects: Project › Research
Activities
- 1 Conference presentation
- A Call for Consistency in Reporting Typological Diversity
Esther Ploeger (Speaker)
22 Mar 2024
Activity: Talks and presentations › Conference presentation
Publication
- 1 Preprint
- What is 'Typological Diversity' in NLP?
Ploeger, E., Poelman, W., de Lhoneux, M. & Bjerva, J., 6 Feb 2024, (Submitted).
Publication: Working paper/Preprint › Preprint