[RetinaNet] Image Converter and ObjectDetector #1906

sineeli · 2024-10-03T18:44:36Z

This PR covers preprocessor for RetinaNet object detector and RetinaNet model itself. #1756

ImageObjectDetector
ImageObjectDetectorPreprocessor
RetinaNetObjectDetector
RetinaNetObjectDetectorPreprocessor
RetinaNetImageConverter

…neeli/keras-hub into sineeli/add-retinanet-phase-2

…anet-phase-2

divyashreepathihalli

Thanks for the PR @sineeli!! Looks generally good!! I have left a few comments.
Also, I want to make sure the -The code usage is updated to reflect the correct implementation - https://docs.google.com/document/d/15FUEP_vNehwLWJLragXhPFkYcbmmpbo0NYh5vL6q1xA/edit?tab=t.0

keras_hub/src/models/image_object_detector.py

keras_hub/src/models/image_object_detector_preprocessor.py

keras_hub/src/models/retinanet/retinanet_backbone.py

keras_hub/src/models/retinanet/retinanet_label_encoder.py

keras_hub/src/models/retinanet/retinanet_object_detector.py

keras_hub/src/tests/test_case.py

keras_hub/src/models/image_object_detector.py

keras_hub/src/models/retinanet/prediction_head.py

…ction

sineeli · 2024-10-04T17:09:07Z

@divyashreepathihalli

Kept some layers FeaturePyramid, RetinaNetLabelEncoder , BoxMatcher and NonMaxSupressionnot exposed as layer API's we can expose once all the models are ported in and move a centralized layers to layers/modeling/ folder.

keras_hub/src/models/image_object_detector.py

keras_hub/src/models/retinanet/retinanet_label_encoder.py

keras_hub/src/models/retinanet/retinanet_object_detector.py

divyashreepathihalli

Thanks @sineeli. I left a few comments.

keras_hub/src/models/retinanet/prediction_head.py

divyashreepathihalli · 2024-10-07T18:48:42Z

keras_hub/src/models/retinanet/retinanet_label_encoder_test.py

-                "max_level": 7,
-                "num_scales": 3,
-                "aspect_ratios": [0.5, 1.0, 2.0],
-                "anchor_size": 8,
            },
            input_data={


define input_data also in setup()

input_data varies in each test_case so kept it separately also how we assert changes

keras_hub/src/models/retinanet/retinanet_label_encoder_test.py

keras_hub/src/models/retinanet/retinanet_object_detector.py

- Correct test cases.

…et format from preprocessor

… method

…etection method" This reverts commit 3b26d3a.

…cation head and user friendly

… can effect the bounding boxes and the ops i backend framework dependent

- Add required docstrings - Use `center_xywh` encoding for retinanet as per torch weights

… arg for prediction head configuration

…e extraction from image encoder

divyashreepathihalli

Thanks for the updates @sineeli. Left a few comments! This is looking good!

keras_hub/src/bounding_box/converters.py

divyashreepathihalli · 2024-10-14T17:34:52Z

keras_hub/src/bounding_box/converters.py

+    Args:
+        anchors: `Tensors`. Anchor boxes with shape of `(N, 4)` where N is the
+            number of anchors.
+        boxes:  `Tensors` Bounding boxes to encode. Boxes can be of be shape


with shape (B, N, 4) for batched boxes or (N, 4) for a single set of boxes. N should match the number of anchors.

keras_hub/src/bounding_box/converters.py

divyashreepathihalli · 2024-10-14T18:07:59Z

keras_hub/src/models/retinanet/retinanet_image_converter.py

+
+@keras_hub_export("keras_hub.layers.RetinaNetImageConverter")
+class RetinaNetImageConverter(ImageConverter):
+    backbone_cls = RetinaNetBackbone


The converter has no resizing option, this is something we need to support for vision models. The backbone is taking image_shape as an input. This means we have to support image resizing in preprocessor.

divyashreepathihalli · 2024-10-14T18:36:13Z

keras_hub/src/models/retinanet/retinanet_label_encoder.py

        positive_threshold=0.5,
        negative_threshold=0.4,
-        box_variance=[0.1, 0.1, 0.2, 0.2],
+        box_variance=[1.0, 1.0, 1.0, 1.0],


can you please explain why this update?

divyashreepathihalli · 2024-10-14T18:41:34Z

keras_hub/src/models/retinanet/retinanet_label_encoder.py

            anchors=anchor_boxes,
            boxes=matched_gt_boxes,
            anchor_format=self.bounding_box_format,
            box_format=self.bounding_box_format,
+            encoding_format=self.encoding_format,
            variance=self.box_variance,
            image_shape=image_shape,
        )


for the return statement of this method. assign the value to box_targets and class_targets and then return them for clarity

keras_hub/src/models/retinanet/retinanet_object_detector.py

keras_hub/src/models/retinanet/retinanet_object_detector_test.py

sineeli · 2024-10-14T19:23:38Z

Weights Transfer Check:

retinanet_resnet50_fpn_coco: https://colab.research.google.com/gist/sineeli/c9689bb91d5ae9482e58da097f0b68d8/-keras-hub-retinanet-resnet50-fpn.ipynb

retinanet_resnet50_fpn_v2_coco: https://colab.research.google.com/gist/sineeli/1630a6d2ea48c8b13f9ef206f6f28d81/-keras-hub-retinanet-resnet50-fpn-v2.ipynb

sineeli · 2024-10-15T18:55:00Z

Trained on pascal_voc just to check transfer learning: https://colab.research.google.com/gist/sineeli/39ed424efd79cfdd08ada2fac78b5cbf/-keras-hub-retinanet-training.ipynb

sineeli added 6 commits September 26, 2024 14:53

Rebased phase 1 changes

c1d7955

Rebased phase 1 changes

deaeac4

Merge branch 'sineeli/add-retinanet-phase-2' of https://github.com/si…

1cdd164

…neeli/keras-hub into sineeli/add-retinanet-phase-2

nit

f90add8

Merge remote-tracking branch 'upstream/master' into sineeli/add-retin…

fb0c733

…anet-phase-2

Retina Phase 2

6c26534

sineeli requested review from divyashreepathihalli and fchollet October 3, 2024 18:53

nit

baee6e2

divyashreepathihalli reviewed Oct 3, 2024

View reviewed changes

fchollet reviewed Oct 3, 2024

View reviewed changes

keras_hub/src/models/image_object_detector.py Outdated Show resolved Hide resolved

keras_hub/src/models/retinanet/prediction_head.py Show resolved Hide resolved

Expose Anchor Generator as layer, docstring correction and test corre…

5ee905e

…ction

sineeli requested a review from divyashreepathihalli October 4, 2024 17:07

sineeli added 2 commits October 4, 2024 11:25

nit

84533d4

Add missing args for prediction heads

b6ceb8f

sineeli requested a review from fchollet October 4, 2024 18:54

divyashreepathihalli reviewed Oct 5, 2024

View reviewed changes

divyashreepathihalli added the kokoro:force-run Runs Tests on GPU label Oct 5, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Oct 5, 2024

divyashreepathihalli reviewed Oct 7, 2024

View reviewed changes

- Use FeaturePyramidBackbone cls for RetinaNet backbone.

4c7a28b

- Correct test cases.

sineeli requested a review from divyashreepathihalli October 8, 2024 17:48

divyashreepathihalli added the kokoro:force-run Runs Tests on GPU label Oct 8, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Oct 8, 2024

fix decoding error

3f915dc

sineeli added the kokoro:force-run Runs Tests on GPU label Oct 8, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Oct 8, 2024

- Add ground truth arg for RetinaNet model and remove source and targ…

f0da549

…et format from preprocessor

sineeli added the kokoro:force-run Runs Tests on GPU label Oct 8, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Oct 8, 2024

sineeli added 23 commits October 8, 2024 20:29

nit

05fdefe

Subclass Imageconverter and overload call method for object detection…

3b26d3a

… method

Revert "Subclass Imageconverter and overload call method for object d…

0df121a

…etection method" This reverts commit 3b26d3a.

add names to layers

8697240

correct fpn coarser level as per torch retinanet model

394faf0

nit

33d81e9

Polish Prediction head and fpn layers to include flags and norm layers

79502d9

nit

72a02c4

nit

a28a033

add prior probability flag for prediction head to use it for classifi…

50686e0

…cation head and user friendly

compute_shape seems redudant here and correct layers for channels_first

8dc5483

keep compute_output_shape for fpn

9f7d8ef

nit

6801789

Change AnchorGen Implementation as per torch

7e57cf1

correct the source format of anchors format

8ac617c

use plain rescaling and normalization no resizing for od models as it…

03efed5

… can effect the bounding boxes and the ops i backend framework dependent

use single bbox format for model

5704950

- Add arg for encoding format

7c1d1de

- Add required docstrings - Use `center_xywh` encoding for retinanet as per torch weights

make anchor generator optional

2414f00

init as layers for anchor generator and label encoder and as one more…

064c971

… arg for prediction head configuration

nit

4ff8f13

- only consider levels from min level to backbone maxlevel fro featur…

c4f752d

…e extraction from image encoder

nit

bde84b9

divyashreepathihalli added the kokoro:force-run Runs Tests on GPU label Oct 14, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Oct 14, 2024

divyashreepathihalli reviewed Oct 14, 2024

View reviewed changes

nit

caacc99

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RetinaNet] Image Converter and ObjectDetector #1906

[RetinaNet] Image Converter and ObjectDetector #1906

sineeli commented Oct 3, 2024 •

edited

Loading

divyashreepathihalli left a comment

sineeli commented Oct 4, 2024 •

edited

Loading

divyashreepathihalli left a comment

divyashreepathihalli Oct 7, 2024

sineeli Oct 8, 2024

divyashreepathihalli left a comment

divyashreepathihalli Oct 14, 2024

divyashreepathihalli Oct 14, 2024

divyashreepathihalli Oct 14, 2024

divyashreepathihalli Oct 14, 2024

sineeli commented Oct 14, 2024

sineeli commented Oct 15, 2024

[RetinaNet] Image Converter and ObjectDetector #1906

Are you sure you want to change the base?

[RetinaNet] Image Converter and ObjectDetector #1906

Conversation

sineeli commented Oct 3, 2024 • edited Loading

divyashreepathihalli left a comment

Choose a reason for hiding this comment

sineeli commented Oct 4, 2024 • edited Loading

divyashreepathihalli left a comment

Choose a reason for hiding this comment

divyashreepathihalli Oct 7, 2024

Choose a reason for hiding this comment

sineeli Oct 8, 2024

Choose a reason for hiding this comment

divyashreepathihalli left a comment

Choose a reason for hiding this comment

divyashreepathihalli Oct 14, 2024

Choose a reason for hiding this comment

divyashreepathihalli Oct 14, 2024

Choose a reason for hiding this comment

divyashreepathihalli Oct 14, 2024

Choose a reason for hiding this comment

divyashreepathihalli Oct 14, 2024

Choose a reason for hiding this comment

sineeli commented Oct 14, 2024

sineeli commented Oct 15, 2024

sineeli commented Oct 3, 2024 •

edited

Loading

sineeli commented Oct 4, 2024 •

edited

Loading