Bram Vanroy (BramVanroy)

👋 My name is Bram and I work on natural language processing and machine translation (evaluation), but I also spend a lot of time in this open-source world 🌍

128 followers · 25 following

Stars

INL / galahad

"Galahad". Goal: enable linguists to experiment with different taggers and use the result in other INT products

Kotlin 1 Updated Sep 24, 2024

Lightning-AI / LitServe

Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.

Python 2,204 134 Updated Oct 4, 2024

wjbmattingly / hobbit-spacy

Jupyter Notebook 23 Updated Aug 13, 2023

mlabonne / llm-datasets

High-quality datasets, tools, and concepts for LLM fine-tuning.

1,752 166 Updated Aug 18, 2024

NVIDIA / NeMo-Curator

Scalable data preprocessing and curation toolkit for LLMs

Jupyter Notebook 487 60 Updated Oct 3, 2024

weaviate / Verba

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

TypeScript 6,090 651 Updated Sep 23, 2024

jondurbin / bagel

A bagel, with everything.

Python 307 31 Updated Apr 11, 2024

ScandEval / ScandEval

Evaluation of language models on mono- or multilingual tasks.

Python 71 13 Updated Aug 9, 2024

huggingface / cosmopedia

Python 432 43 Updated Oct 1, 2024

AnswerDotAI / fsdp_qlora

Training LLMs with QLoRA + FSDP

Jupyter Notebook 1,394 187 Updated Sep 23, 2024

davanstrien / haiku-dpo

Using open source LLMs to build synthetic datasets for direct preference optimization

Jupyter Notebook 35 5 Updated Feb 29, 2024

DaneelTrevize / TABSAT

They Are Billions Save Automation Tool

C# 19 4 Updated Jun 28, 2023

facebookresearch / nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 8,831 561 Updated Apr 16, 2024

andreasvc / udstyle

Compute complexity metrics from Universal Dependencies

Python 2 Updated Mar 7, 2022

andreasvc / readability

Forked from mmautner/readability

Measure the readability of a given text using surface characteristics

Python 71 17 Updated Dec 15, 2022

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 1,465 113 Updated Oct 4, 2024