New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Add Image Processor Fast RT-DETR #34354

Merged

yonigozlan merged 9 commits into huggingface:main from yonigozlan:add-image-processing-fast-rtdetr

Oct 30, 2024

Member

yonigozlan commented Oct 23, 2024 •

edited

Loading

What does this PR do?

Adds a fast image processor for RT-DETR. Follows issue #33810.
This image processor is a result of this work on comparing different image processing method.

The diffs look bad but this PR is almost exclusively made up of # Copied from based on the fast image processor for DETR!

Implementation

Usage

Except for the fact that it only returns torch tensors, this fast processor is fully compatible with the current one.
It can be instantiated through AutoImageProcessor with use_fast=True, or through the Class directly:

from transformers import AutoImageProcessor

processor = AutoImageProcessor.from_pretrained("PekingU/rtdetr_r50vd", use_fast=True)

from transformers import RTDetrImageProcessorFast

processor = RTDetrImageProcessorFast.from_pretrained("PekingU/rtdetr_r50vd")

Usage is the same as the current processor, except for the device kwarg:

from torchvision.io import read_image
images = torchvision.io.read_image(image_path)
processor = RTDetrImageProcessorFast.from_pretrained("PekingU/rtdetr_r50vd")
images_processed = processor(images , return_tensors="pt", device="cuda")

If device is not specified:

If the input images are tensors, the processing will be done on the device of the images.
If the inputs are PIL or Numpy images, the processing is done on CPU.

Performance gains

Average over 10% of the COCO 2017 validation dataset, with batch_size=1.

Average over 10% of the COCO 2017 validation dataset, with batch_size=8.

Tests

The new image processor is tested on all the tests of the current processor.
I have also added a consistency test for processing on GPU vs CPU.

Who can review?

@ArthurZucker Pinging you directly as there is almost no "new" code here.

HuggingFaceDocBuilderDev commented Oct 23, 2024

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

yonigozlan marked this pull request as ready for review

October 23, 2024 17:41

yonigozlan requested a review from ArthurZucker

October 23, 2024 17:45

ArthurZucker requested a review from molbap

October 24, 2024 13:36

molbap approved these changes

View reviewed changes

Contributor

molbap left a comment

Thanks! Just a few nits - saw you were removing the kwargs validation as well, can we also do that for detr?

src/transformers/models/rt_detr/image_processing_rt_detr_fast.py

Comment on lines +137 to +84

+              def prepare_coco_detection_annotation(
+                  image,
+                  target,
+                  return_segmentation_masks: bool = False,

Contributor

molbap Oct 24, 2024

return_segmentation_masks is unused here, could either reuse the one from detr_fast so we have a # Copied from statement here?

Member Author

yonigozlan Oct 24, 2024 •

edited

Loading

Yes it's a bit weird, return_segmentation_masks is present in several places, but rt-detr does not support segmentation. I added a copied from here, with an Ignore copy for the segmentation part (as otherwise we would need to import/copy a function that would never be used.

EDIT: actually the Ignore copy makes the CI crash, not sure why, so I left it as is for now...

Collaborator

ArthurZucker Oct 28, 2024

# Ignore copy does not work like this 😉 forget about it in this case!

src/transformers/models/rt_detr/image_processing_rt_detr_fast.py Outdated Show resolved Hide resolved

src/transformers/models/rt_detr/image_processing_rt_detr_fast.py Show resolved Hide resolved

molbap mentioned this pull request

Add Image Processor Fast Deformable DETR #34353

Open

yonigozlan force-pushed the add-image-processing-fast-rtdetr branch from cb68014 to 2ccaaa0 Compare

October 24, 2024 22:24

ArthurZucker reviewed

View reviewed changes

Collaborator

ArthurZucker left a comment

Thanks for working on this!
A few more nits, overall good, IMO all your graph should be placed in the documentation as well and not just on the PR description!

src/transformers/models/rt_detr/image_processing_rt_detr_fast.py Outdated Show resolved Hide resolved

src/transformers/utils/dummy_vision_objects.py Show resolved Hide resolved

src/transformers/models/rt_detr/image_processing_rt_detr_fast.py

Comment on lines +137 to +84

+              def prepare_coco_detection_annotation(
+                  image,
+                  target,
+                  return_segmentation_masks: bool = False,

Collaborator

ArthurZucker Oct 28, 2024

# Ignore copy does not work like this 😉 forget about it in this case!

src/transformers/models/rt_detr/image_processing_rt_detr_fast.py Outdated Show resolved Hide resolved

src/transformers/models/rt_detr/image_processing_rt_detr_fast.py Outdated Show resolved Hide resolved

src/transformers/models/rt_detr/image_processing_rt_detr_fast.py Outdated Show resolved Hide resolved

src/transformers/models/rt_detr/image_processing_rt_detr_fast.py Outdated Show resolved Hide resolved

src/transformers/models/rt_detr/image_processing_rt_detr_fast.py Outdated Show resolved Hide resolved

src/transformers/models/rt_detr/image_processing_rt_detr_fast.py Outdated Show resolved Hide resolved

yonigozlan added 7 commits

October 29, 2024 18:07


          add fast image processor rtdetr

4b60e69


          add gpu/cpu test and fix docstring

e4c57c3


          remove prints

bf21b51


          add to doc

a2c9577


          nit docstring

827d1c2


          avoid iterating over images/annotations several times

64b2449


          change torch typing

yonigozlan force-pushed the add-image-processing-fast-rtdetr branch from b22bf32 to 2960859 Compare

October 29, 2024 18:08

yonigozlan and others added 2 commits

October 29, 2024 19:26


          Add image processor fast documentation

0b18cf3


          Merge branch 'main' into add-image-processing-fast-rtdetr

2a91b73

Member Author

yonigozlan commented Oct 29, 2024

@ArthurZucker Refactored DETR and RT-DETR image processor fast to loop as few times as possible over annotations and images, and added some docs!

yonigozlan requested a review from ArthurZucker

October 29, 2024 19:31

Collaborator

ArthurZucker commented Oct 30, 2024

Does this improve the perf you saw? 😉

ArthurZucker approved these changes

View reviewed changes

Collaborator

ArthurZucker left a comment

Thanks for iterating with me! 🤗

Member Author

yonigozlan commented Oct 30, 2024 •

edited

Loading

Thanks for iterating with me! 🤗

Thank you!

Does this improve the perf you saw? 😉

Hmm hard to tell, maybe very slightly when on GPU, as on CPU the potential gains are overshadowed by the processing time. But at least it's cleaner that way! :)

yonigozlan merged commit 48872fd into huggingface:main

26 checks passed

qubvel added Vision Processing optimization labels

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

optimization Processing Vision