Starred repositories of fireae

Python 483 39 Updated Jun 7, 2024

A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling

40 1 Updated Oct 12, 2024

A SQLite3 fts5 tokenizer (full-text search extension) that supports Chinese and Pinyin

C++ 579 81 Updated Oct 6, 2024

NDL-DocL dataset (document image layout dataset)

Java 25 3 Updated Mar 2, 2023

A maroto way to create PDFs. Maroto is inspired by Bootstrap and uses gofpdf. Fast and simple.

Go 1,938 196 Updated Oct 11, 2024

Vision model based PDF chunking.

Python 938 36 Updated Oct 12, 2024

Seal (stamp) detection and seal text recognition

Python 7 1 Updated Mar 29, 2024

Noto fonts go universal! Download pan-Unicode, merged Noto fonts according to time of usage (current, ancient) or geographical region (South Asia, SE Asia, Africa-MiddleEast, Europe-Americas).

Shell 163 21 Updated Aug 1, 2023

A basic boilerplate template for starting a Flutter GetX project. GetX, Dio, MVVM, get CLI, localization, pagination, etc. are implemented.

Dart 402 146 Updated Jan 11, 2024

DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction

13 1 Updated Jun 28, 2023

Simple package to extract text with coordinates from programmatic PDFs

C++ 11 3 Updated Oct 11, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 5 Updated Oct 4, 2024

Kubernetes based Cloud Development Environments for Enterprise Teams

TypeScript 6,986 1,186 Updated Oct 10, 2024

table structure recognition

Python 270 94 Updated Nov 22, 2022

Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tab…

Python 147 7 Updated Sep 27, 2024

Detect file content types with deep learning

Rust 7,774 412 Updated Oct 11, 2024

Render LaTeX in React apps

TypeScript 88 17 Updated Apr 24, 2024

TabularOCR is a Python library that provides an easy-to-use Optical Character Recognition (OCR) solution for extracting tables from images and PDFs. It offers flexible output options, allowing you…

Python 4 Updated Mar 26, 2024

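This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into instruction-tuning format, and fine-tune LLMs on it to strengthen their understanding of tabular data, ultimately building large language models specialized for table intelligence tasks.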

435 34 Updated Apr 22, 2024

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Python 621 95 Updated May 6, 2024

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python 5,143 435 Updated Sep 23, 2024

Zero-shot PDF OCR with gpt-4o-mini

Python 1,659 58 Updated Oct 10, 2024

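Backend service for an invoice recognition system built with FastAPI, with concurrency support. An ERFNet model is trained for invoice contour detection, followed by distortion correction, OCR, and template matching; skewed invoices are supported. Reported accuracy: 99.9%.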

Python 4 1 Updated May 23, 2024

Code for CVPR23 Highlight "I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification" and NeurIPS2022 "I2DFormer: Learning Image to Document Atte…

Python 18 2 Updated Aug 1, 2023

Flutter_learn_demo: a Flutter learning journey

Dart 229 40 Updated Sep 30, 2024

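🐦 Flutter 3 mood-tracking sample project - internationalization (i18n), uni mini program, dark mode, multiple themes, local data management, routing, state management, accessibility (Semantics), async FFI, integration tests, chart statistics, Excel import/export, games…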

Dart 474 72 Updated Sep 24, 2024

Cinder is Meta's internal performance-oriented production version of CPython.

Python 3,492 121 Updated Oct 8, 2024

Fast, reliable, and free document scanner app for iPhone

Swift 790 37 Updated Sep 17, 2024

Official PyTorch implementation of SegFormer

Python 2,516 350 Updated Aug 2, 2024

React (version 17) project with MathJax (configured with mchem, the package for chemistry)

HTML 1 Updated Jun 15, 2023