A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
-
Updated
Feb 17, 2023
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
A list of Indonesian NLP resources.
Data Science Learning Path - A complete guide to learn data science for beginners
data resource untuk NLP bahasa indonesia
Repositori ini merupakan kumpulan dataset terkait analisis sentimen Berbahasa Indonesia. Apabila Anda menggunakan dataset-dataset yang ada pada repositori ini untuk penelitian, maka cantumkanlah/kutiplah jurnal artikel terkait dataset tersebut. Dataset yang tersedia telah diimplementasikan dalam beberapa penelitian dan hasilnya telah dipublikasi…
IndoLEM is a comprehensive Indonesian NLU benchmark, comprising three pillars NLP task: morpho-syntax, semantic, and discourse. Presented in COLING 2020.
Pujangga - Indonesian Natural Language Processing Tool with REST API, an Interface for InaNLP and Deeplearning4j's Word2Vec
Database kamus kumpulan kata dalam bahasa Indonesia sesuai KBBI (Indonesian word list database based on KBBI)
A Python module that fetches a page of a word/phrase from the Online Indonesian Dictionary (https://kbbi.kemdikbud.go.id).
A benchmark dataset for Indonesian text summarization.
Convert numbers into words in Indonesian language
A curated list of natural language processing courses, video lectures, books, library and many more.
Json hari libur indonesia yang slalu update.
IndoBERTweet is the first large-scale pretrained model for Indonesian Twitter. Published at EMNLP 2021 (main conference)
Introduction to Node.js and Example Applications (Bahasa Indonesia)
Baik Language Next Release
The first large-scale summarization corpus for the Indonesian language. AACL 2020.
Indonesian Grapheme-to-Phoneme (IPA notation)
Add a description, image, and links to the indonesian-language topic page so that developers can more easily learn about it.
To associate your repository with the indonesian-language topic, visit your repo's landing page and select "manage topics."