Page content selected improperly when running deskew or select content #203

Open
rnmerchant opened this issue Dec 15, 2023 · 1 comment

@rnmerchant

Apologies if this has been covered or addressed previously. Newbie user.
I have a simple PDF scan of a journal article, exported with Acrobat to JPEGs. Each image is a standard full page (~8x10") with very consistent content: 3 columns of text and some images.
When I run some of the functions, say deskew or select content, the content is selected properly on the first three of 8 pages, then improperly on the remainder: the sides are cut off (on either side) or, on one page, only an odd part of one column is selected. I tried setting 'select content' to manual, but this doesn't change the issue.

Screenshot is attached.

Richard
[Attached image: Screen Shot 12-15-23 at 10 23 AM]

@majkaz

majkaz commented Dec 19, 2023

What I am seeing is the result of an incorrect "Split pages" stage. The best way to avoid it is to set the split manually for all pages (Manual + Apply cut).

You cannot simply skip the first stages. Scantailor won't complain, but it will work differently than you expect: it will run these steps automatically and often gets them quite wrong, especially if the text is in columns or if there are "column-like" pictures or tables.
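
To illustrate why column layouts are a problem for automatic splitting, here is a minimal Python sketch. It is not Scantailor's actual algorithm; `naive_split` and the toy page representation are hypothetical, assuming a splitter that simply looks for a blank vertical band inside the page. On a three-column page, the gap between columns looks exactly like the gap between two facing pages, so the cut lands inside the content.

```python
def column_ink(page):
    """Count ink marks ('#') in each pixel column of a page given as equal-length strings."""
    return [sum(row[x] == "#" for row in page) for x in range(len(page[0]))]


def interior_gaps(page):
    """Return (start, end) spans of fully blank columns that do not touch the page edges."""
    ink = column_ink(page)
    gaps, start = [], None
    for x, count in enumerate(ink):
        if count == 0 and start is None:
            start = x
        elif count != 0 and start is not None:
            gaps.append((start, x))
            start = None
    # A run still open at the end touches the right edge (a margin) and is never
    # appended; a run starting at x == 0 is the left margin and is dropped here.
    return [g for g in gaps if g[0] > 0]


def naive_split(page):
    """Hypothetical heuristic: cut at the centre of the widest interior gap, or None."""
    gaps = interior_gaps(page)
    if not gaps:
        return None
    start, end = max(gaps, key=lambda g: g[1] - g[0])
    return (start + end) // 2


# Toy single pages: '.' is blank paper, '#' is ink.
three_columns = [".####..####..####."] * 4
single_column = [".################."] * 4

print(interior_gaps(three_columns))  # two column gutters, each a candidate "spine"
print(naive_split(three_columns))    # a cut position inside the page
print(naive_split(single_column))    # no interior gap, so no cut
```

With a heuristic like this, a single page with three columns gets cut at a column gutter, and everything on one side of the cut is lost before deskew or select content ever runs. Setting the split manually, as described above, takes that guess out of the pipeline.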
