A string type for Rust that is not required to be valid UTF-8.
-
Updated
Dec 11, 2024 - Rust
A string type for Rust that is not required to be valid UTF-8.
An implementation of the Unicode algorithm for breaking code point sequences into extended grapheme clusters as specified in UAX #29. This library supports version 16.0 of the Unicode Standard.
A lightweight, intuitive wrapper around Intl.Segmenter for seamless segment-aware string operations in TypeScript and JavaScript
Check Unicode strings for possible confusing graphemes
terminal width of Unicode 16.0+Emoji strings in nanoseconds
A tool for generating sub-word (phone or grapheme) level embeddings from an HTK-style MLF ASR corpus
Working backwards through a deep convolutional network, to recreate the input image - and identify areas for improvement.
Add a description, image, and links to the graphemes topic page so that developers can more easily learn about it.
To associate your repository with the graphemes topic, visit your repo's landing page and select "manage topics."