Switch to fewer allocations in UnicodeSet parsing #3684

Open
@skius

Description

icu_unicodeset_parser uses several heap-allocating types internally to support:

  • arbitrary-length escapes (\x{61 62 63 64...}) use Vec<char>
  • arbitrary-length strings ({abcd...}) use String

These could, and probably should, be swapped out for types with "small lives on the stack, big lives on the heap" semantics, so that short escapes and short strings avoid a heap allocation entirely.
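
To make the "small on stack, big on heap" idea concrete, here is a minimal std-only sketch for the `Vec<char>` case. `SmallChars`, `INLINE_CAP`, and all method names are hypothetical and not taken from the icu_unicodeset_parser code; in practice an existing crate such as `smallvec` would likely be a better fit than a hand-rolled type.

```rust
/// Hypothetical inline capacity; the right value would need measuring.
const INLINE_CAP: usize = 4;

/// Hypothetical small-buffer container for the code points of one escape.
#[derive(Debug)]
enum SmallChars {
    /// Up to INLINE_CAP chars stored inside the value itself (no allocation).
    Inline { buf: [char; INLINE_CAP], len: usize },
    /// Contents moved to the heap once the inline buffer overflowed.
    Heap(Vec<char>),
}

impl SmallChars {
    fn new() -> Self {
        SmallChars::Inline { buf: ['\0'; INLINE_CAP], len: 0 }
    }

    fn push(&mut self, c: char) {
        let spilled = match self {
            SmallChars::Inline { buf, len } => {
                if *len < INLINE_CAP {
                    // Still room inline: store the char and stop here.
                    buf[*len] = c;
                    *len += 1;
                    return;
                }
                // Inline buffer is full: copy everything into a Vec.
                let mut v = Vec::with_capacity(INLINE_CAP * 2);
                v.extend_from_slice(&buf[..]);
                v.push(c);
                v
            }
            SmallChars::Heap(v) => {
                v.push(c);
                return;
            }
        };
        *self = SmallChars::Heap(spilled);
    }

    fn as_slice(&self) -> &[char] {
        match self {
            SmallChars::Inline { buf, len } => &buf[..*len],
            SmallChars::Heap(v) => v.as_slice(),
        }
    }

    /// True while no heap allocation has happened.
    fn is_inline(&self) -> bool {
        matches!(self, SmallChars::Inline { .. })
    }
}
```

With this shape, an escape like `\x{61 62 63 64}` would stay entirely on the stack, and only longer escapes would pay for an allocation. The same pattern would apply to the `String` case with an inline byte buffer.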

Linking PR: #3670

Discuss/decide: Priority of UnicodeSet parsing efficiency

Labels

  • C-unicode (Component: Props, sets, tries)
  • S-small (Size: One afternoon (small bug fix or enhancement))
  • T-techdebt (Type: ICU4X code health and tech debt)
  • good first issue (Good for newcomers)