-
python-ftfy Public
Fixes mojibake and other glitches in Unicode text, after the fact.
-
countmerge Public
A command-line tool that adds counts for sorted keys.
-
wordfreq Public
Access a database of word frequencies, in various natural languages.
-
ordered-set Public
A mutable set that remembers the order of its entries. One of Python's missing data types.
-
solvertools Public
Mystery Hunt solving tools for Metropolitan Rage Warehouse. Or anyone really.
-
scholar.hasfailed.us Public
Google Scholar is a trans-exclusionary site. Don't use it. Help us demand change.
-
langcodes Public
A Python library for working with and comparing language codes.
-
language_data Public
An optional supplement to `langcodes` that stores names and statistics of languages.
-
wikiparsec Public
An LL parser for extracting information from Wiki text, particularly Wiktionary.
-
spaCy Public
Forked from explosion/spaCy💫 Industrial-strength Natural Language Processing (NLP) in Python
-
staged-recipes Public
Forked from conda-forge/staged-recipesA place to submit conda recipes before they become fully fledged conda-forge feedstocks
-
projects Public
Forked from explosion/projects🪐 End-to-end NLP workflows from prototype to production
-
spacious_corpus Public
A corpus build process for use with SpaCy projects
-
spacy-legacy Public
Forked from explosion/spacy-legacy🕸️ Legacy architectures and other registered spaCy v3.x functions for backwards-compatibility
-
dominiate Public
A simulator for Dominion card game strategies
-
rms-open-letter.github.io Public
Forked from rms-open-letter/rms-open-letter.github.io1 UpdatedMar 24, 2021 -
jetsoftime Public
Forked from Anskiy/jetsoftime -
-
dear-github-2.0 Public
Forked from drop-ice/dear-github-2.0📨 An open letter to GitHub from the maintainers of open source projects
1 UpdatedDec 5, 2019 -
IoGR Public
Forked from DontBaguMe/IoGRIllusion of Gaia Randomizer
-
ftfy-web Public
Forked from simonw/ftfy-webPaste in some broken unicode text and FTFY will tell you how to fix it!
-
-
acl-anthology Public
Forked from acl-org/acl-anthologyThe website of the ACL Anthology. Originally forked from zamakkat/acl, but this is now the main repository. For software for the legacy anthology, see WING-NUS/ACL-Anthology-Codebase .
-
wiki2text Public
Extract a plain text corpus from MediaWiki XML dumps, such as Wikipedia.
-
kytea Public
Forked from neubig/kyteaThe Kyoto Text Analysis Toolkit for word segmentation and pronunciation estimation, etc.
-
barnes-hut-tsne Public
Forked from alexisbcook/tsneA python wrapper for Barnes-Hut T-SNE on Python >= 3.5
-
marisa-trie Public
Forked from pytries/marisa-trieStatic memory-efficient Trie-like structures for Python (2.x and 3.x) based on marisa-trie C++ library.
-
-
rust Public
Forked from rust-lang/rustA safe, concurrent, practical language.
-
rust-caseless Public
Forked from unicode-rs/rust-caselessUnicode caseless matching