Add Indic numerals and missing punctuation to Arabic #131
Open
Description
Previously: #71 and tesseract-ocr/tessdata_best#11 (also contains a pertinent discussion on how well the different traineddata deal with these characters).
• Indic numerals: (٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩)
• Punctuation: (؛ ، ﴿﴾)
• Also, a ligature very commonly found in Arabic texts: ﷺ
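For reference, here is a minimal Python sketch that spells out the characters listed above as Unicode code points and checks whether a given piece of text contains them. The names `REQUESTED`, `missing_from`, and `describe` are made up for illustration and are not part of Tesseract or its training tools. The code points are my reading of the characters in this issue (U+0660 to U+0669, U+061B, U+060C, U+FD3F, U+FD3E, U+FDFA).

```python
import unicodedata

# Characters requested in this issue, written as code points so that
# bidirectional rendering cannot scramble them. (Hypothetical helper names.)
REQUESTED = (
    [chr(cp) for cp in range(0x0660, 0x066A)]  # digits U+0660..U+0669
    + ["\u061B", "\u060C"]                      # Arabic semicolon, Arabic comma
    + ["\uFD3F", "\uFD3E"]                      # ornate parentheses
    + ["\uFDFA"]                                # the ligature from the last bullet
)


def missing_from(text):
    """Return the requested characters that do not occur anywhere in `text`."""
    present = set(text)
    return [ch for ch in REQUESTED if ch not in present]


def describe(chars):
    """Format each character as 'U+XXXX NAME' using the Unicode database."""
    return [f"U+{ord(ch):04X} {unicodedata.name(ch, 'UNKNOWN')}" for ch in chars]


if __name__ == "__main__":
    # Print the full list of requested characters with their Unicode names.
    for line in describe(REQUESTED):
        print(line)
```

One possible use, assuming the character set of a traineddata file has been dumped to a text file by some other means, is to pass that file's contents to `missing_from` and see which of these characters a model does not cover.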
If I can do this myself, please point me in the right direction.
CC @Shreeshrii