Language resources
Datasets
During my thesis, I created or contributed to various morphological datasets, mostly about Finnic languages. These datasets follow the paralex standard or the frictionless DataPackage specification.
FinCog
A concordance between 4 paralex lexicons for Finnic languages, based on cognacy information.
DOI: 10.5281/zenodo.15133173
ParaFin: Finnish
An inflected lexicon for Finnish, in phonemic transcription. It follows the paralex standard.
URL: parafin.finug.eu
ParaKar: Olonets Karelian
An inflected lexicon for Olonets Karelian, in phonemic transcription. It follows the paralex standard.
URL: parafin.finug.eu
ParaLiv: Livonian
An inflected lexicon for Livonian, in phonemic transcription. It follows the paralex standard. Livonian is a minority language in Latvia. With V. Ernštreits and T. Tuisk.
URL: paraliv.finug.eu
Eesthetic
[S. Beniamine & al.] An inflected lexicon for Estonian, in phonemic transcription. It follows the paralex standard.
DOI: 10.5281/zenodo.15133173
Code & Software
Contributions to projects are either required by my current research, or driven by my personal interest in language.
Qumín
A python package developped by Sacha Beniamine and myself for the quantitative study of inflection systems.
DOI: 10.5281/zenodo.15008373
Paralex
Paralex is a collective projet led by Sacha Beniamine to standardize morphological datasets. I contributed the python package and partly to the standard.
URL: paralex-standard.org
Korpef
The Korpef is a new interface for the French-Estonian Parallel Corpus, using cwb as a backend. It is still WIP.
URL: https://codeberg.org/copef
Khotanese.finug.eu
This is a small website that I developped to help a friend to publish a database from her MA thesis.
URL: khotanese.finug.eu
Adefo.org
The website of the Association pour le développement des études finno-ougriennes. I often create lightweight static websites for small projects.
URL: adefo.org