Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

added 4 new languages and improved Hungarian #104

Merged
merged 5 commits into from
Dec 28, 2021
Merged

added 4 new languages and improved Hungarian #104

merged 5 commits into from
Dec 28, 2021

Conversation

martino-vic
Copy link
Contributor

I have created and used these files for the data I am working with in my doctoral thesis and thought it might be worth sharing them publicly as well. See detailed description including sources in the commit-message.

- hun-Latn.csv: adding geminates to Hungarian (based on own knowledge of Hungarian orthography)
- mri-Latn.csv: Maori (src: https://en.wikipedia.org/wiki/M%C4%81ori_phonology)
- got-Latn.csv: Transcribing Gothic (Latin script) to IPA, based on Gotische Grammatik by Wilhelm Braune (2004 edition)
- Goth2Latn.csv and Latn2Goth.csv: - Transliterating between the Gothic and Latin script, based on Wikipedia: https://en.wikipedia.org/wiki/Gothic_alphabet
- pii-latn_Holopainen2019.csv,  pii-latn_Wiktionary.csv: Transcribing Proto-Indo-Iranian to IPA, for characters from Sampsa Holopainen’s dissertation (https://helda.helsinki.fi/handle/10138/307582) and for Wiktionary, respecitvely. Transcriptions based on: Kümmel, Martin Joachim. "111. The morphology of Indo-Iranian". Volume 3 Handbook of Comparative and Historical Indo-European Linguistics, edited by Jared Klein, Brian Joseph and Matthias Fritz, Berlin, Boston: De Gruyter Mouton, 2018, pp. 1888-1924. https://doi.org/10.1515/9783110542431-032
- uew.csv: Proto-Uralic and descendant proto-languages to IPA, for characters from http://uralonet.nytud.hu/, which in turn is based on the UEW (Uralic Etymological Dictionary by Károly Rédei). Transcriptions are based on http://uralonet.nytud.hu/help.cgi
Hungarian, Maori, Gothic, PII, UEW
restoring certain voiced plosives and marking syllabic consonants for Gothic, based on [Gotische Grammatik by Wilhelm Braune (2004 edition)](https://www.degruyter.com/document/doi/10.1515/9783110945089/html)
@dmort27 dmort27 merged commit 113c07e into dmort27:master Dec 28, 2021
martino-vic added a commit to martino-vic/epitran that referenced this pull request Feb 21, 2022
Somehow I forgot to include the corrected orthography profile for Hungarian in pull request dmort27#104, so here it goes. This one transcribes the geminates properly.
dmort27 added a commit that referenced this pull request Feb 21, 2022
in #104 I forgot to include the file for Hungarian
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants