[en] Towards standardized inflected lexicons for the Finnic languages

Document type

Author(s)

Title of the work

Proceedings of the 9th International Workshop on Computational Linguistics for Uralic Languages

Instance

UNIV-PARIS

Is part of

Meeting

9th International Workshop on Computational Linguistics for Uralic Languages - 2024-11-28 / 2024-11-29 - Helsinki - Finland

Keywords (en)

Humanities and Social Sciences/Linguistics
Morphology
Paradigms
Lexicons
Finno-Ugric
Finnic

Publication date

Document language

English

Abstract

[en] We introduce three richly annotated lexicons of nouns for Livonian, standard Finnish and Livvi Karelian. Our datasets are distributed in the machine-readable Paralex standard, which consists of linked CSV tables described in a JSON metadata file. We built on the morphological dictionary of Livonian, the VepKar database and the Omorfi software to provide inflected forms. All noun forms were transcribed with grapheme-to-phoneme conversion rules and the paradigms annotated for both overabundance and defectivity. The resulting datasets are usable for quantitative studies of morphological systems and for qualitative investigations. They are linked to the original resources and can be easily updated.

Provenance

Proceedings of the 9th International Workshop on Computational Linguistics for Uralic Languages

Collection

Source

HAL

Resource type

Full text

License

Attribution

Bibliographic citation

Jules Bouton. Towards standardized inflected lexicons for the Finnic languages. 9th International Workshop on Computational Linguistics for Uralic Languages, Association for Computational Linguistics, Nov 2024, Helsinki, Finland. pp. 59-66. [hal-04822038]

Cite this resource

[en] Towards standardized inflected lexicons for the Finnic languages, in Études nordiques, accessed 5 July 2025, https://etudes-nordiques.cnrs.fr/s/numenord/item/17428