raphaelmerx

Organizations

@catalpainternational @wevote


Inference slice of Marian for Bergamot's tiny11 models. Faster to compile and easier to use, with fewer model architectures than bergamot-translator.

C++ 8 2 Updated Oct 24, 2024

Integrate cutting-edge LLM technology quickly and easily into your apps

C# 21,887 3,260 Updated Nov 8, 2024
Python 3 Updated Jun 1, 2024

CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.

Python 753 128 Updated Nov 8, 2024

NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented and extremely low-resource Indonesian local languages.

Jupyter Notebook 24 2 Updated Sep 27, 2024

The central repo for Creole-based NLU and NLG work

HTML 14 3 Updated May 28, 2024

This add-on implements a speech synthesizer driver for NVDA using neural TTS models. It supports Piper

Python 50 11 Updated Jul 22, 2024
Java 1 Updated Dec 18, 2019

Code to create a database with cleaned up Wiktionary data and then to create ebook dictionaries based on this data.

Python 17 2 Updated Aug 16, 2023

Java JNI wrapper for SentencePiece: unsupervised text tokenizer for Neural Network-based text generation.

C++ 35 14 Updated Jan 16, 2023

Build a chatbot or Q&A bot of your website's content

Python 522 55 Updated Jan 28, 2024

QGIS on Apple Silicon

Shell 19 Updated Dec 9, 2022

Spam Numbero and make your city the most dangerous place on Earth

Python 84 12 Updated Sep 27, 2022

🙊 software for creating speech recognition models.

Python 152 33 Updated Jun 2, 2024

A collaborative project to collect datasets in Indonesian languages.

Jupyter Notebook 262 62 Updated Jun 2, 2024

High-quality parallel resource on sentiment analysis for 10 low-resource Indonesian languages, English, and Indonesian (Outstanding Paper at EACL 2023)

Jupyter Notebook 89 10 Updated May 8, 2023

Machine Translation for Africa

Lua 277 206 Updated Jun 14, 2022

dQuery documentation and recipes

4 Updated Mar 7, 2024
Python 2 1 Updated Dec 16, 2021

PanLex Vocabulary interface

Python 1 1 Updated Dec 8, 2022
TypeScript 1 Updated May 28, 2023

🕷️ The pipeline for the OSCAR corpus

Rust 162 14 Updated Dec 18, 2023

Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".

Python 35 3 Updated Mar 16, 2022

This is the repository of the EMNLP 2021 paper "BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation".

Python 31 6 Updated Nov 28, 2022

Multilingual Speech Recognition for Indonesian Languages

Python 54 3 Updated Oct 5, 2022

AUSTENDER OCDS Search API. This portal will provide users of AusTender data with documentation, code examples, bug notifications and feature requests.

9 4 Updated Feb 12, 2024

A platform for creating interactive data visualizations

TypeScript 1,389 229 Updated Nov 8, 2024

Sentence aligner

C++ 108 38 Updated May 21, 2021

The SQL to IATI Database repository contains all of the SQL scripts that are required to build DFID’s IATIv203 database, which is used by DFID to transform their internal data into IATI v2.03 stand…

TSQL 10 2 Updated Aug 5, 2020

Web application showcasing data from the sireneLD graph

CSS 4 Updated Dec 9, 2022