Curated collection of papers for the NLP practitioner 📖👩‍🔬
Updated Aug 5, 2020
multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.
Chinese NLP corpus of Chinese science fiction: about 4,675 Chinese science fiction novels, usable as a natural language processing corpus, text corpus, or text database.
Open linguistic datasets: the Russian sentiment lexicon КартаСловСент (KartaSlovSent), a semantics dataset, an associative graph, and a dataset of spelling errors and typos.
Chinese and English NER datasets, an English-Chinese machine translation dataset, and a Chinese word segmentation dataset.
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
a Fine-tuned LLaMA that is Good at Arithmetic Tasks
TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)
Implementation of Very Deep Convolutional Neural Network for Text Classification
A Constrained Text Generation Challenge Towards Generative Commonsense Reasoning
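Manually curated corpus of medical-industry vocabulary and terminology. It can be used to train speech recognition, dialogue systems, and other NLP models.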
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents", ACL'24 Best Resource Paper.
Chinese NLP corpus of Chinese science fiction: archive of the Ark Plan of the Ula Science Fiction Website (乌拉科幻小说网方舟计划存档), usable as a natural language processing corpus, text corpus, or text database.
Chinese character dataset, including information about each character such as stroke count, radical, pinyin, and English definitions/synonyms.
Yorùbá language training text for NLP, ASR and TTS tasks
The release of the FreebaseQA data set (NAACL 2019).
Code and data for "Summarising Historical Text in Modern Languages" (EACL 2021)
A collection of datasets for Ukrainian language
Extracts Transcript and Summary (Abstractive and Extractive) from the AMI Meeting Corpus