Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.
-
Updated
Aug 2, 2018 - Python
Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.
An Amharic News Text classification Dataset
A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover , Lexical analyzer, Corpus indexer and Term weighter.
Amharic Spelling Corrector based on SymSpell - Spelling corrector which is 1 million times faster through Symmetric Delete spelling correction algorithm
simple bs4 based web crawl for a corpus in need of statistical machine translation
The set of files used for the development of the Amharic Corpus.
k`wat is collection of Amharic datasets includes peoples names, postcodes, tweets
Add a description, image, and links to the amharic-corpus topic page so that developers can more easily learn about it.
To associate your repository with the amharic-corpus topic, visit your repo's landing page and select "manage topics."