Note
Dear user, to provide an one stop solution with robust functionalities, the content of
the repository is migrated under PyPI.
Please find the End of Life (EoL) details at #1
.
Migrations guidelines: #5
is available for your reference.
NLPurify is a text cleaning and extraction engine was developed using a combination of traditional techniques like Unicode translations, cleaning using regular expressions, and modern tools like "natural language processing" and "large language models" to detect and clean long texts and create word vectors.
List of active and deprecated projects that I'm currently working on is available here.