Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rotate page #488

Merged
merged 66 commits into from
Dec 6, 2021
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
Show all changes
66 commits
Select commit Hold shift + click to select a range
1af0fc1
feat: integrate image rotation before using predictor
Rob192 Jul 19, 2021
811fd6a
Merge branch 'main' into rotate_page
Rob192 Jul 19, 2021
6f5c183
Merge branch 'main' into rotate_page
Rob192 Sep 21, 2021
c2b18d7
feat: add rotate_document functionality
Rob192 Sep 22, 2021
312179f
fix: remove min_angle from rotate_page
Rob192 Sep 22, 2021
b6bc74f
merge
Rob192 Sep 30, 2021
6fe1bc6
fix: correct models.predictor.tensorflow
Rob192 Sep 30, 2021
a6f2ff1
fix: minor corrections
Rob192 Oct 1, 2021
ccda4d0
feat: Rotate back images and boxes after straightening
Rob192 Oct 4, 2021
7d4ed75
fix: correct typo
Rob192 Oct 4, 2021
a303b23
fix: merge two functions rotate_image
Rob192 Oct 5, 2021
7a78263
fix: do not rotate back pages but only boxes
Rob192 Oct 5, 2021
eb341ac
fix: typos
Rob192 Oct 6, 2021
eeff2d6
fix: add more testing for remap_boxes in cases of boxes with an angle…
Rob192 Oct 6, 2021
16f3489
fix: remove the cropping after rotation of the image
Rob192 Oct 14, 2021
eb063f1
Merge branch 'main' of https://github.com/mindee/doctr
Rob192 Oct 25, 2021
b9ec27e
Merge branch 'main' into rotate_page
Rob192 Oct 25, 2021
f7fcf90
fix: correct model/_utils.py
Rob192 Oct 25, 2021
a04ab4f
Merge branch 'main' of https://github.com/mindee/doctr
Rob192 Oct 28, 2021
8ec9eab
Merge branch 'main' into rotate_page
Rob192 Oct 28, 2021
cf9ab0d
fix: do not use resolve_lines and resolve_boxes as it does not work w…
Rob192 Oct 28, 2021
8a31014
fix: remove expand in geometry.rotate_boxes
Rob192 Oct 29, 2021
e457cf0
fix: reformat code
Rob192 Oct 29, 2021
9975c82
fix: reformat expand from function signature
Rob192 Oct 29, 2021
32d53e4
fix: rename keep_original_size to preserve_aspect_ratio
Rob192 Oct 29, 2021
6821442
fix: vectorize box transformation
Rob192 Oct 29, 2021
988e2d0
Merge branch 'main' into rotate_page
Rob192 Nov 16, 2021
573f13f
fix: minor modifications + remove test_bbox_to_rbbox
Rob192 Nov 16, 2021
290e8ed
fix: add the straighten_pages to the latest codebase
Rob192 Nov 19, 2021
1775ebf
feat: add the straighten_pages to the pytorch predictor
Rob192 Nov 19, 2021
816168c
Merge branch 'main' into rotate_page
Rob192 Nov 19, 2021
f199044
feat: add testing for the straighten_pages parameter
Rob192 Nov 19, 2021
887ed25
fix: in case no angle is found in estimate_orientation return 0
Rob192 Nov 19, 2021
789a9c2
Merge branch 'main' into rotate_page
Rob192 Nov 24, 2021
239c508
fix: make sure boxes are outputted from _process_predictions
Rob192 Nov 24, 2021
52461dc
fix: update docstrings in OCRPredictor
Rob192 Nov 24, 2021
b6f8cca
fix: create a copy of boxes inside rotate_boxes
Rob192 Nov 24, 2021
a9f3d6e
fix: update docstring for rotate_image
Rob192 Nov 24, 2021
ea69de6
fix: add comments inside remap_boxes
Rob192 Nov 24, 2021
1a72a8c
fix: change testing in test_estimate_orientation
Rob192 Nov 24, 2021
d658be4
fix: change testing in test_estimate_orientation
Rob192 Nov 24, 2021
4711797
Merge branch 'main' into rotate_page
Rob192 Nov 25, 2021
e5ed562
Merge branch 'main' into rotate_page
Rob192 Nov 26, 2021
9a6c658
fix: delete imports not used
Rob192 Nov 29, 2021
a9cbe04
fix: styling
Rob192 Nov 29, 2021
ebdc320
fix: change assertion in test_utils_geometry.py
Rob192 Nov 30, 2021
d16fba3
fix: keep check with if expand in rotate_image
Rob192 Nov 30, 2021
e344e5c
fix: change rotate_boxes signature
Rob192 Nov 30, 2021
2e11dc8
fix: use loc_preds instead of boxes
Rob192 Nov 30, 2021
bb6bc79
fix: wrong test in remap boxes
Rob192 Nov 30, 2021
258b18b
Merge branch 'main' into rotate_page
Rob192 Nov 30, 2021
b22309d
add unit tests for pytorch
Rob192 Dec 1, 2021
495ad8c
add unit tests for remap_boxes and estimate_orientation
Rob192 Dec 1, 2021
98b44f6
fix: styling
Rob192 Dec 1, 2021
28f1fd8
fix: isort
Rob192 Dec 1, 2021
1d61de7
fix: remove unnecessary fixture
Rob192 Dec 1, 2021
faac0bd
fix: add testing for pytorch predictor
Rob192 Dec 2, 2021
938c9f2
fix: styling
Rob192 Dec 2, 2021
30a70f2
fix: correct testing for ocrpredictor with pytorch
Rob192 Dec 2, 2021
a7c0d55
fix: correct imports for testing
Rob192 Dec 2, 2021
ce23100
fix: isort
Rob192 Dec 2, 2021
8525b14
fix: make sure that expand in rotate_image is keeping the same image …
Rob192 Dec 3, 2021
7bcf639
fix: styling
Rob192 Dec 3, 2021
2205737
fix: use absolute centers for rotate_boxes
Rob192 Dec 5, 2021
484451b
fix: calculation of image_center and documentation
Rob192 Dec 5, 2021
66fad67
fix: remove default value for orig_shape in rotate_boxes
Rob192 Dec 5, 2021
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
fix: add testing for pytorch predictor
  • Loading branch information
Rob192 committed Dec 2, 2021
commit faac0bd872179aa17b4ff86ef32305c22b39d76d
4 changes: 2 additions & 2 deletions doctr/models/predictor/pytorch.py
Original file line number Diff line number Diff line change
Expand Up @@ -47,8 +47,8 @@ def __init__(
) -> None:

super().__init__()
self.det_predictor = det_predictor # type: ignore[attr-defined]
self.reco_predictor = reco_predictor # type: ignore[attr-defined]
self.det_predictor = det_predictor.eval() # type: ignore[attr-defined]
self.reco_predictor = reco_predictor.eval() # type: ignore[attr-defined]
self.doc_builder = DocumentBuilder(export_as_straight_boxes=export_as_straight_boxes)
self.assume_straight_pages = assume_straight_pages
self.straighten_pages = straighten_pages
Expand Down
39 changes: 8 additions & 31 deletions tests/pytorch/test_models_detection_pt.py
Original file line number Diff line number Diff line change
Expand Up @@ -4,8 +4,8 @@

from doctr.io import DocumentFile
from doctr.models import detection
from doctr.models.detection.predictor import DetectionPredictor
from doctr.models.preprocessor import PreProcessor
from doctr.models.detection.predictor.pytorch import DetectionPredictor
from doctr.models.preprocessor.pytorch import PreProcessor


@pytest.mark.parametrize(
Expand Down Expand Up @@ -47,46 +47,23 @@ def test_detection_models(arch_name, input_shape, output_size, out_prob):


@pytest.fixture(scope="session")
def test_detectionpredictor(mock_pdf): # noqa: F811
def test_detectionpredictor_pt(mock_pdf): # noqa: F811

batch_size = 4
predictor = DetectionPredictor(
PreProcessor(output_size=(512, 512), batch_size=batch_size),
detection.db_resnet50(input_shape=(512, 512, 3))
detection.db_resnet50(pretrained=False).eval()
)

pages = DocumentFile.from_pdf(mock_pdf).as_images()
out = predictor(pages)
# The input PDF has 8 pages
assert len(out) == 8

# Dimension check
with pytest.raises(ValueError):
input_page = (255 * np.random.rand(1, 256, 512, 3)).astype(np.uint8)
_ = predictor([input_page])

return predictor


@pytest.fixture(scope="session")
def test_rotated_detectionpredictor(mock_pdf): # noqa: F811

batch_size = 4
predictor = DetectionPredictor(
PreProcessor(output_size=(512, 512), batch_size=batch_size),
detection.db_resnet50(assume_straight_pages=False, input_shape=(512, 512, 3))
)

pages = DocumentFile.from_pdf(mock_pdf).as_images()
out = predictor(pages)

# The input PDF has 8 pages
assert len(out) == 8

# Dimension check
with pytest.raises(ValueError):
input_page = (255 * np.random.rand(1, 256, 512, 3)).astype(np.uint8)
_ = predictor([input_page])
# # Dimension check
# with pytest.raises(ValueError):
# input_page = (255 * np.random.rand(1, 256, 512, 3)).astype(np.uint8)
# _ = predictor([input_page])

return predictor

Expand Down
6 changes: 3 additions & 3 deletions tests/pytorch/test_models_recognition_pt.py
Original file line number Diff line number Diff line change
Expand Up @@ -5,8 +5,8 @@
from doctr.io import DocumentFile
from doctr.models import recognition
from doctr.models._utils import extract_crops
from doctr.models.preprocessor import PreProcessor
from doctr.models.recognition.predictor import RecognitionPredictor
from doctr.models.preprocessor.pytorch import PreProcessor
from doctr.models.recognition.predictor.pytorch import RecognitionPredictor


@pytest.mark.parametrize(
Expand Down Expand Up @@ -59,7 +59,7 @@ def test_reco_postprocessors(post_processor, input_shape, mock_vocab):


@pytest.fixture(scope="session")
def test_recognitionpredictor(mock_pdf, mock_vocab): # noqa: F811
def test_recognitionpredictor_pt(mock_pdf, mock_vocab): # noqa: F811

batch_size = 4
predictor = RecognitionPredictor(
Expand Down
24 changes: 7 additions & 17 deletions tests/pytorch/test_models_zoo_pt.py
Original file line number Diff line number Diff line change
@@ -1,51 +1,41 @@
import numpy as np
import pytest
from test_models_detection_pt import test_detectionpredictor, test_rotated_detectionpredictor # noqa: F401
from test_models_recognition_pt import test_recognitionpredictor
from test_models_detection_pt import test_detectionpredictor_pt # noqa: F401
from test_models_recognition_pt import test_recognitionpredictor_pt # noqa: F401

from doctr import models
from doctr.io import Document, DocumentFile
from doctr.models.predictor.pytorch import OCRPredictor


def test_ocrpredictor(
mock_pdf, test_detectionpredictor, test_recognitionpredictor, test_rotated_detectionpredictor # noqa: F811
mock_pdf, test_detectionpredictor_pt, test_recognitionpredictor_pt # noqa: F811
):

predictor = OCRPredictor(
test_detectionpredictor,
test_recognitionpredictor,
test_detectionpredictor_pt,
test_recognitionpredictor_pt,
assume_straight_pages=True,
straighten_pages=False,
)

r_predictor = OCRPredictor(
test_rotated_detectionpredictor,
test_recognitionpredictor,
assume_straight_pages=False,
straighten_pages=False,
)

s_predictor = OCRPredictor(
test_detectionpredictor,
test_recognitionpredictor,
test_detectionpredictor_pt,
test_recognitionpredictor_pt,
assume_straight_pages=True,
straighten_pages=True,
)

doc = DocumentFile.from_pdf(mock_pdf).as_images()
out = predictor(doc)
r_out = r_predictor(doc)
s_out = s_predictor(doc)

# Document
assert isinstance(out, Document)
assert isinstance(r_out, Document)
assert isinstance(s_out, Document)

# The input PDF has 8 pages
assert len(out.pages) == 8
assert len(r_out.pages) == 8
assert len(s_out.pages) == 8
# Dimension check
with pytest.raises(ValueError):
Expand Down