Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue with LlamaParse ... Image/PDF parsing was not right - OCR issues #527

Open
rkrishnan-mn opened this issue Dec 5, 2024 · 1 comment
Assignees
Labels
enhancement New feature or request ocr

Comments

@rkrishnan-mn
Copy link

Describe the bug
a) Look at the Time Stamp on the parsed output and the actual pdf image . It is not 18PM it is 12:18 PM
b) It has two line reciept entry per lint item , the output is all messed up with wrong price info

Files
refer job id 4ab0d236-9f3d-450a-a995-b852aba0a468 has the image/pdf i used to parse

Job ID
4ab0d236-9f3d-450a-a995-b852aba0a468

Client:
API, was calling from my local Python API call

Additional context
Looks like OCR is not right

@rkrishnan-mn rkrishnan-mn added the bug Something isn't working label Dec 5, 2024
@rkrishnan-mn rkrishnan-mn changed the title Issue with LlamaParse ... Image in PDF parsing Issue with LlamaParse ... Image/PDF parsing was not right - OCR issues Dec 5, 2024
@BinaryBrain BinaryBrain self-assigned this Dec 5, 2024
@BinaryBrain BinaryBrain added ocr enhancement New feature or request and removed bug Something isn't working labels Dec 5, 2024
@BinaryBrain
Copy link
Member

Hi, our OCR is working best on images that are not photos.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request ocr
Projects
None yet
Development

No branches or pull requests

2 participants