Starred repositories
A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
支持中文和拼音的 SQLite fts5 全文搜索扩展 | A SQLite3 fts5 tokenizer which supports Chinese and PinYin
A maroto way to create PDFs. Maroto is inspired in Bootstrap and uses gofpdf. Fast and simple.
Noto fonts go universal! Download pan-Unicode, merged Noto fonts according to time of usage (current, ancient) or geographical region (South Asia, SE Asia, Africa-MiddleEast, Europe-Americas).
A basic boilerplate template for starting a Flutter GetX project. GetX, Dio, MVVM, get CLI, Localization, Pagination etc are implemented.
DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction
Simple package to extract text with coordinates from programmatic PDFs
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Kubernetes based Cloud Development Environments for Enterprise Teams
Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tab…
TabularOCR is a Python library that provides an easy-to-use Optical Character Recognition (OCR) solution for extracting tables from images and PDFs. It offers flexible output options, allowing you…
本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
使用FastAPI构建发票识别系统后端服务,支持并发。使用ERFNet模型训练发票轮廓检测,进行畸变矫正,OCR识别,模板匹配,支持倾斜发票识别。准确率99.9%。
Code for CVPR23 Highlight "I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification" and NeurIPS2022 "I2DFormer: Learning Image to Document Atte…
🐦 Flutter 3 心情记录 样例工程 - 国际化 i18n、uni 小程序、深色模式、多主题、本地数据管理、路由管理、状态管理、无障碍(Semantics)、异步 FFI、集成测试、图表统计、Excel 导入导出、游戏…
Cinder is Meta's internal performance-oriented production version of CPython.
Fast, reliable, and free document scanner app for iPhone
Official PyTorch implementation of SegFormer
React (version 17) project with MathJax (configured with mchem, the package for chemistry)