Add mujoco rgbd rendering #1229

DavidPL1 · 2024-10-23T15:29:21Z

Description

Adds a new render_mode to MuJoCo environments for when rgb and depth images are required as observations, e.g. to create point clouds.

Fixes #1226

Type of change

Please delete options that are not relevant.

New feature (non-breaking change which adds functionality)
This change requires a documentation update

Checklist:

I have run the pre-commit checks with pre-commit run --all-files (see CONTRIBUTING.md instructions to set it up)
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Open Questions

Currently this change was applied to all v5 and v4 environments. Is that correct or should I create v6 envs?
In case just changing v5 and v4 is fine, should this feature be added to the version history docstrings of the respective envs?

pseudo-rnd-thoughts

Looks good to me other than the comment.
@Kallinteris-Andreas could you review as well

gymnasium/utils/passive_env_checker.py

DavidPL1 · 2024-10-24T12:55:26Z

@pseudo-rnd-thoughts
Looks like Run PyTest / build-all (3.8, >=1.21,<2.0) for some reason got stuck at apt-get -y update can you trigger it again?

pseudo-rnd-thoughts · 2024-10-24T13:00:18Z

I can't from my phone but will be home in an hour or so and can do then

Kallinteris-Andreas

need to update the index page (https://gymnasium.farama.org/environments/mujoco/ ) to document the available render_modes
add a test in test_mujoco_rendering.py named test_rgbd_array, where we assert the output of test_rgbd_array is a proper combination of rgb_array and depth_array
I am not convinced that the correct output format is a tuple of (rgb_image, depth_image) while it is the most human-readable format, it is not ideal for CNNs and would require an additional renderWrapper (for the cases where it used as addRenderObservation wrapper)
edit: RGB-D CNNs do not appear to be "mature", see "RGB-D Object Recognition Using Deep Convolutional Neural Networks" as an example the data is preprocessed anyway, so long as the decision is done intentionally, I am fine with the current choice. @DavidPL1 you may be more familiar with depth CNNs what is your oppinon

Thanks

Kallinteris-Andreas · 2024-10-24T16:32:40Z

gymnasium/envs/mujoco/mujoco_rendering.py

@@ -281,6 +281,10 @@ def render(
                        seg_ids[geom.segid + 1, 1] = geom.objid
                rgb_img = seg_ids[seg_img]

+            if render_mode == "rgbd_array":


This section (L259-L289) was not written to support 3 render types, it should be re-written to be clearer, into 2 stages, first stage collects the images, and the second stage returns the correct images.

if render_mode in ["depth_array", "rgbd_array"]: depth_img = depth_arr.reshape(self.viewport.height, self.viewport.width) if render_mode in ["rgb_array", "rgbd_array"]: rgb_img = ...

if render_mode == "rgb_array": return ... elif render_mode == "depth_array": return ... elif render_mode == "rgbd_array": return ...

DavidPL1 · 2024-10-25T11:32:48Z

need to update the index page (https://gymnasium.farama.org/environments/mujoco/ ) to document the available render_modes

add a test in test_mujoco_rendering.py named test_rgbd_array, where we assert the output of test_rgbd_array is a proper combination of rgb_array and depth_array

Will do after we settle on the type

I am not convinced that the correct output format is a tuple of (rgb_image, depth_image) while it is the most human-readable format, it is not ideal for CNNs and would require an additional renderWrapper (for the cases where it used as addRenderObservation wrapper)
edit: RGB-D CNNs do not appear to be "mature", see "RGB-D Object Recognition Using Deep Convolutional Neural Networks" as an example the data is preprocessed anyway, so long as the decision is done intentionally, I am fine with the current choice. @DavidPL1 you may be more familiar with depth CNNs what is your oppinon

I'm am also not too familiar with RGB-D CNNs myself. After a quick read on the topic, researchers seem to perform all kinds of processing steps for alternative representations (voxels, pointclouds, ...). Usually RGB-D cameras provide access to two separate streams and datasets also provide the modalities separated.
Still, from my point of view both ways, tuple or 4D-image, would be fine. And custom wrappers should be used to get a favored representation.

I opted for a tuple because I thought it achieves the same thing as the rendering twice alternative (which I have already seen), returning the same types as the individual images. Also I'm using an existing Wrapper combining the two images into a point cloud.
Though I must admit that the naming rgbd_array might mislead into thinking a single array is returned.

Kallinteris-Andreas · 2024-10-26T10:48:13Z

The Open3D python library seems to store the rgbd data on a tuple https://www.open3d.org/docs/latest/tutorial/Basic/rgbd_image.html#RGBD-images
I can not figure out how OpenCV structures the storage of rgbd images

A tuple of rgb and depth should be fine, as it is easily convertible to all other formats

Naming should perhaps be rgbd_tuple?

DavidPL1 · 2024-10-28T10:25:14Z

judging by the function calls of e.g. rgbd::warpFrame taking both as separate inputs, OpenCV seems to also store image and depth separately.

Then I'll stick with the tuple implementation. rgbd_tuple also sounds good to me.

Kallinteris-Andreas · 2024-10-28T11:26:15Z

add a test in test_mujoco_rendering.py named test_rgbd_tuple, where we assert the output of test_rgbd_tuple is a proper combination of rgb_array and depth_array

Kallinteris-Andreas · 2024-10-28T11:27:36Z

@pseudo-rnd-thoughts All I want from you is to tell us if you are okay with the name of the new render_mode being "rgbd_tuple"

DavidPL1 · 2024-10-28T11:29:18Z

add a test in test_mujoco_rendering.py named test_rgbd_tuple, where we assert the output of test_rgbd_tuple is a proper combination of rgb_array and depth_array

See a72f44d or do you want to add something more to the test?

Kallinteris-Andreas · 2024-10-28T11:43:32Z

tests/envs/mujoco/test_mujoco_rendering.py

+
+def test_rgbd_tuple():
+    """Assert that rgbd_tuple is the proper combination of rgb and depth images as tuple"""
+    env = gymnasium.make("Ant-v5", camera_id=0, render_mode="rgbd_tuple").unwrapped


render once with rgb_array and depth_array and assert that the resulting images are the same

pseudo-rnd-thoughts

rgbd_tuple sounds good to me.

DavidPL1 added 4 commits October 22, 2024 11:28

Adds RGB-D rendering option to mujoco envs

8fbf5d6

adds more rgbd mode docs and applies black

e68c2bd

adds rgbd mode to respective render tests

abb40c3

adds rgbd_array rendering to mujoco v4 envs meta

e34d684

pseudo-rnd-thoughts requested changes Oct 23, 2024

View reviewed changes

gymnasium/utils/passive_env_checker.py Outdated Show resolved Hide resolved

removes rgbd_array asserts in passive_env_checker

3e51379

Kallinteris-Andreas mentioned this pull request Oct 24, 2024

Add width and height check for MujocoRenderer #1230

Merged

11 tasks

DavidPL1 requested a review from pseudo-rnd-thoughts October 24, 2024 12:55

Kallinteris-Andreas requested changes Oct 24, 2024

View reviewed changes

DavidPL1 added 4 commits October 28, 2024 11:35

renames rgbd_array -> rgbd_tuple

8c45482

adds render_mode documentation in mujoco index doc

998844d

rewrites offscreen rendering for better readability

3c16061

adds test for rgbd_tuple rendered content

a72f44d

DavidPL1 requested a review from Kallinteris-Andreas October 28, 2024 10:46

DavidPL1 added 2 commits October 28, 2024 11:51

applies formatting

a139fab

fixes wrong depth img return

56ed742

Kallinteris-Andreas reviewed Oct 28, 2024

View reviewed changes

pseudo-rnd-thoughts approved these changes Oct 28, 2024

View reviewed changes

extends rgbd_tuple test to compare image contents

8632d15

Kallinteris-Andreas approved these changes Oct 28, 2024

View reviewed changes

Kallinteris-Andreas merged commit 988999c into Farama-Foundation:main Oct 28, 2024
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add mujoco rgbd rendering #1229

Add mujoco rgbd rendering #1229

DavidPL1 commented Oct 23, 2024 •

edited

Loading

pseudo-rnd-thoughts left a comment

DavidPL1 commented Oct 24, 2024

pseudo-rnd-thoughts commented Oct 24, 2024

Kallinteris-Andreas left a comment

Kallinteris-Andreas Oct 24, 2024 •

edited

Loading

DavidPL1 commented Oct 25, 2024

Kallinteris-Andreas commented Oct 26, 2024 •

edited

Loading

DavidPL1 commented Oct 28, 2024 •

edited

Loading

Kallinteris-Andreas commented Oct 28, 2024

Kallinteris-Andreas commented Oct 28, 2024

DavidPL1 commented Oct 28, 2024

Kallinteris-Andreas Oct 28, 2024

pseudo-rnd-thoughts left a comment

Add mujoco rgbd rendering #1229

Add mujoco rgbd rendering #1229

Conversation

DavidPL1 commented Oct 23, 2024 • edited Loading

Description

Type of change

Checklist:

Open Questions

pseudo-rnd-thoughts left a comment

Choose a reason for hiding this comment

DavidPL1 commented Oct 24, 2024

pseudo-rnd-thoughts commented Oct 24, 2024

Kallinteris-Andreas left a comment

Choose a reason for hiding this comment

Kallinteris-Andreas Oct 24, 2024 • edited Loading

Choose a reason for hiding this comment

DavidPL1 commented Oct 25, 2024

Kallinteris-Andreas commented Oct 26, 2024 • edited Loading

DavidPL1 commented Oct 28, 2024 • edited Loading

Kallinteris-Andreas commented Oct 28, 2024

Kallinteris-Andreas commented Oct 28, 2024

DavidPL1 commented Oct 28, 2024

Kallinteris-Andreas Oct 28, 2024

Choose a reason for hiding this comment

pseudo-rnd-thoughts left a comment

Choose a reason for hiding this comment

DavidPL1 commented Oct 23, 2024 •

edited

Loading

Kallinteris-Andreas Oct 24, 2024 •

edited

Loading

Kallinteris-Andreas commented Oct 26, 2024 •

edited

Loading

DavidPL1 commented Oct 28, 2024 •

edited

Loading