Issue #1271 added LZWDecode compression #1286

opposss · 2024-10-16T00:13:55Z

Added support for image compression using LZWDecode.

Checklist:

The GitHub pipeline is OK (green),
meaning that both pylint (static code analyzer) and black (code formatter) are happy with the changes of this PR.
A unit test is covering the code added / modified by this PR
This PR is ready to be merged
In case of a new feature, docstrings have been added, with also some documentation in the docs/ folder
A mention of the change is present in CHANGELOG.md

By submitting this pull request, I confirm that my contribution is made under the terms of the GNU LGPL 3.0 license.

opposss · 2024-10-16T00:22:19Z

Hi @Lucas-C I added support for compression using LZWDecode, and I would like you to look at my code and give your opinion. I tested this filter manually by adding images compressed with LZWDecode to pdf files, and it seems to work correctly. Local check of pylint and black showed no errors. After the review, if the code is ok, I could write unit tests.

Lucas-C · 2024-10-16T07:08:56Z

Hi @Lucas-C I added support for compression using LZWDecode, and I would like you to look at my code and give your opinion. I tested this filter manually by adding images compressed with LZWDecode to pdf files, and it seems to work correctly. Local check of pylint and black showed no errors. After the review, if the code is ok, I could write unit tests.

Good job 👍

I'll try to review this PR very soon, today or tomorrow.

fpdf/image_parsing.py

Lucas-C

Hi @opposss

God job overall 👍

You placed the code at the right place, and it's clear.

I'll finish the code review once unit tests have been added, but it's a promising start!

fpdf/image_parsing.py

Lucas-C · 2024-10-22T06:28:18Z

OK so there is one thing currently blocking in the GItHub Actions pipeline:

6.1.10-1 (1): test/image/image_types/image_types_insert_jpg_lzwdecode.pdf

This is the relevant VeraPDF rule: https://github.com/veraPDF/veraPDF-validation-profiles/wiki/PDFA-Part-1-rules/#rule-6110-1

I think the best fix is to add this rule (6.1.10-1) into verapdf-ignore.json, with reason: fpdf2 wants to support LZWDecode filter`

Lucas-C · 2024-10-23T08:01:50Z

Thank you for your contribution @opposss 👍

@allcontributors please add @opposss for code

allcontributors · 2024-10-23T08:02:00Z

@Lucas-C

I've put up a pull request to add @opposss! 🎉

Lucas-C · 2024-11-21T12:42:44Z

I noticed today that the unit test test_insert_jpg_lzwdecode is quite slow to execute: ~78s on my computer.

And 90% of this execution is spent in pack_codes_into_bytes() based on this quick test:

pip install pytest-profiling
pytest test/image/image_types/test_insert_images.py -k lzwdecode --profile

I wonder if this could be improved...

opposss · 2024-11-21T18:44:00Z

@Lucas-C
Apparently there is a problem with encoding large amounts of data in the case of JPEG, in case of PNG and other formats everything seems to work fine.

I changed the implementation pack_codes_into_bytes() a bit, and so far I've only managed to reduce the time from 80s to ~40s.
I will try to improve the performance even more in the coming days.

Issue py-pdf#1271 added LZWDecode compression

22709f8

opposss requested a review from gmischler as a code owner October 16, 2024 00:13

Lucas-C reviewed Oct 16, 2024

View reviewed changes

fpdf/image_parsing.py Outdated Show resolved Hide resolved

Lucas-C reviewed Oct 16, 2024

View reviewed changes

fpdf/image_parsing.py Outdated Show resolved Hide resolved

Lucas-C reviewed Oct 16, 2024

View reviewed changes

fpdf/image_parsing.py Outdated Show resolved Hide resolved

Lucas-C requested changes Oct 16, 2024

View reviewed changes

Issue py-pdf#1271 added Unit tests

02fb6a9

Lucas-C reviewed Oct 17, 2024

View reviewed changes

fpdf/image_parsing.py Outdated Show resolved Hide resolved

Issue py-pdf#1271 fixies

9a1551e

opposss requested a review from Lucas-C October 19, 2024 15:23

opposss added 3 commits October 21, 2024 22:45

Issue py-pdf#1271 build fixing

38974ab

Issue py-pdf#1271 build fixing

c078b3c

Issue py-pdf#1271 build fixing

54c7458

opposss and others added 3 commits October 22, 2024 18:35

Issue py-pdf#1271 build fixing

3ec9294

Issue py-pdf#1271 padding rework and CHANGELOG

88e75c5

Merge branch 'master' into Issue-1271-LZWDecode-compression-support

dd9c364

Lucas-C approved these changes Oct 23, 2024

View reviewed changes

Lucas-C merged commit 8d7cbf1 into py-pdf:master Oct 23, 2024
11 checks passed

Lucas-C added the hacktoberfest-accepted label Oct 23, 2024

allcontributors bot mentioned this pull request Oct 23, 2024

add opposss as a contributor for code #1292

Merged

Lucas-C mentioned this pull request Oct 23, 2024

Add support for LZWDecode compression #1271

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue #1271 added LZWDecode compression #1286

Issue #1271 added LZWDecode compression #1286

opposss commented Oct 16, 2024 •

edited

Loading

opposss commented Oct 16, 2024 •

edited

Loading

Lucas-C commented Oct 16, 2024

Lucas-C left a comment

Lucas-C commented Oct 22, 2024

Lucas-C commented Oct 23, 2024

allcontributors bot commented Oct 23, 2024

Lucas-C commented Nov 21, 2024

opposss commented Nov 21, 2024 •

edited

Loading

Issue #1271 added LZWDecode compression #1286

Issue #1271 added LZWDecode compression #1286

Conversation

opposss commented Oct 16, 2024 • edited Loading

opposss commented Oct 16, 2024 • edited Loading

Lucas-C commented Oct 16, 2024

Lucas-C left a comment

Choose a reason for hiding this comment

Lucas-C commented Oct 22, 2024

Lucas-C commented Oct 23, 2024

allcontributors bot commented Oct 23, 2024

Lucas-C commented Nov 21, 2024

opposss commented Nov 21, 2024 • edited Loading

opposss commented Oct 16, 2024 •

edited

Loading

opposss commented Oct 16, 2024 •

edited

Loading

opposss commented Nov 21, 2024 •

edited

Loading