Add testing.check_figures_equal to avoid storing baseline images #555
Conversation
@seisman, hope you don't mind if I continue some work on this.
Will try to see if it's possible to mimic the pytest-mpl paths, or look to matplotlib to see how they do it.
It should be easy enough to use GMTTempFile so that the images are cleaned up after the test, but we'll need to find a way to keep the images if the test fails so that they can be examined.
Also moved test_check_figures_* to a doctest under check_figures_equal.
Same logic that was implemented in matplotlib/matplotlib#16800
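To sketch that idea (a minimal sketch only, not the code in this PR; the basemap calls are placeholders): writing both figures into GMTTempFile context managers means the PNGs are deleted as soon as the block exits, which is exactly why a failing comparison needs special handling to keep its images around.

import pygmt
from pygmt.helpers import GMTTempFile

fig_ref = pygmt.Figure()
fig_ref.basemap(region=[0, 10, 0, 10], projection="X10c", frame=True)
fig_test = pygmt.Figure()
fig_test.basemap(region="0/10/0/10", projection="X10c", frame=True)

# Both PNGs live only inside this block and are removed automatically on exit.
with GMTTempFile(suffix=".png") as png_ref, GMTTempFile(suffix=".png") as png_test:
    fig_ref.savefig(png_ref.name)
    fig_test.savefig(png_test.name)
    # ... run the image comparison on png_ref.name and png_test.name here ...
# At this point the files are gone, so a failed comparison would leave nothing to inspect.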
Ok, this is pretty much ready for review. After stress testing this with pytest, I think the check_figures_equal function will have to end up pretty much exactly like matplotlib's https://matplotlib.org/3.3.1/_modules/matplotlib/testing/decorators.html#check_figures_equal, except that we're using fig=pygmt.Figure() instead of matplotlib's plt.subplot. If only we could monkeypatch it!
It might be worth raising an issue/PR on matplotlib to see if there's a way to reduce code duplication (for the sake of long-term maintainability), as they've clearly put months/years of thought and effort into this one check_figures_equal function. This would involve some sort of subclassing (or, I don't know, pluggability?) so that we can swap pygmt.Figure into fig_ref and fig_test while getting all of the image_compare goodness.
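To make that concrete, here is a heavily simplified sketch of what a pygmt-flavoured check_figures_equal could look like. This is not the code in this PR: the real decorator also strips fig_ref/fig_test from the test signature, raises a dedicated GMTImageComparisonFailure, and handles more edge cases.

import functools
import os

import pygmt


def check_figures_equal(*, tol=0.0, result_dir="result_images"):
    """Sketch: build two pygmt.Figure objects, rasterise them, and compare the PNGs."""

    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            # Deferred import so matplotlib stays a test-only dependency.
            from matplotlib.testing.compare import compare_images

            os.makedirs(result_dir, exist_ok=True)
            fig_ref, fig_test = pygmt.Figure(), pygmt.Figure()
            func(*args, fig_ref=fig_ref, fig_test=fig_test, **kwargs)

            ref_png = os.path.join(result_dir, f"{func.__name__}-expected.png")
            test_png = os.path.join(result_dir, f"{func.__name__}.png")
            fig_ref.savefig(ref_png)
            fig_test.savefig(test_png)

            err = compare_images(ref_png, test_png, tol=tol, in_decorator=True)
            if err is None:
                # Images match: remove them so nothing accumulates on disk.
                os.remove(ref_png)
                os.remove(test_png)
            else:
                # Images differ: keep them (plus the *-failed-diff.png written by
                # compare_images) so the failure can be inspected afterwards.
                raise AssertionError(
                    f"images not equal: RMS {err['rms']} exceeds tolerance {tol}"
                )

        return wrapper

    return decorator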
pygmt/helpers/decorators.py (outdated diff)
parameters = [
    param
    for param in old_sig.parameters.values()
    # Drop fig_test/fig_ref from the exposed signature so that pytest does not
    # try to resolve them as fixtures; the decorator supplies them itself.
    if param.name not in {"fig_test", "fig_ref"}
]
new_sig = old_sig.replace(parameters=parameters)
wrapper.__signature__ = new_sig
Figured out how to make our PyGMT check_figures_equal decorator work with pytest fixtures (e.g. grid=xr.DataArray(...)) in 3e0d3fb. This is basically just copying what was done in matplotlib at matplotlib/matplotlib#16800.
@check_figures_equal()
def test_grdimage_central_longitude(grid, fig_ref, fig_test):
    """
    Test that plotting a grid centred at different longitudes/meridians work.
    """
    fig_ref.grdimage("@earth_relief_01d_g", projection="W120/15c", cmap="geo")
    fig_test.grdimage(grid, projection="W120/15c", cmap="geo")
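For context, grid here is a pytest fixture; it is not defined in this snippet, but a conftest.py fixture along these lines would satisfy it (a sketch only, not copied from the PR; the actual fixture may load the grid differently).

import pytest

from pygmt.datasets import load_earth_relief


@pytest.fixture(scope="module", name="grid")
def fixture_grid():
    """Load the 1 arc-degree Earth relief grid as an xarray.DataArray."""
    return load_earth_relief(resolution="01d")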
Suggested change (replacing the test above with a parametrized version):

@pytest.mark.parametrize("meridian", [0, 33, 120, 180])
@check_figures_equal()
@pytest.mark.parametrize("proj_type", ["H", "Q", "W"])
def test_grdimage_different_central_meridians_and_projections(
    grid, proj_type, meridian, fig_ref, fig_test
):
    """
    Test that plotting a grid centred on different meridians using different
    projection systems work.
    """
    fig_ref.grdimage(
        "@earth_relief_01d_g", projection=f"{proj_type}{meridian}/15c", cmap="geo"
    )
    fig_test.grdimage(grid, projection=f"{proj_type}{meridian}/15c", cmap="geo")
I'll update this test in #560 later 😄. The problem with using this fancy pytest.mark.parametrize is that it would complicate the check_figures_equal code (see matplotlib/matplotlib#15199 and matplotlib/matplotlib#16693), and make this PR even harder to review.
pygmt/helpers/decorators.py (outdated diff)

import numpy as np
from matplotlib.testing.compare import compare_images
from matplotlib.testing.compare import compare_images
As I understand it, this means that matplotlib now becomes a required dependency, even for users who never run the tests, right?
Although PyGMT already requires matplotlib for testing, and most users have matplotlib installed anyway, I still don't want to add another required dependency to PyGMT.
When I wrote the first commit (8b78614), I put the code in pygmt/helpers/testing.py. That way, I think matplotlib stays optional, although I haven't tested it.
Good point, I think you're right here. I also encountered issues with circular imports when moving the code to decorators.py, hence this line:
pygmt/helpers/decorators.py, line 459 in 04b3f41:

from ..figure import Figure  # pylint: disable=import-outside-toplevel

Probably should move it back under pygmt/helpers/testing.py then. As an aside, I've opened up a feature request at matplotlib/pytest-mpl#94, and we might be able to do all this from pytest-mpl in the future.
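For reference, the deferred-import pattern being discussed looks roughly like this (the helper name _images_match is made up; only the placement of the import matters). Keeping the import inside the function in pygmt/helpers/testing.py means matplotlib is only needed when a comparison actually runs.

def _images_match(png_ref, png_test, tol=0.0):
    """Sketch: compare two PNG files, importing matplotlib only when needed."""
    # Importing here (not at module level) keeps matplotlib an optional,
    # test-only dependency and avoids import-time circular dependencies.
    from matplotlib.testing.compare import compare_images

    err = compare_images(png_ref, png_test, tol=tol, in_decorator=True)
    return err is None  # None means the images agree within `tol`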
from ..figure import Figure


def check_figures_equal(*, tol=0.0, result_dir="result_images"):
One more thing about result_dir: the check_figures_equal decorator generates images in the result_images directory, while pytest.mark.mpl_image_compare generates images in directories like results/tmpjtnnwqt4.
Yes, which was partly why I opened up the issue at matplotlib/pytest-mpl#94, to get all of that pytest-mpl goodness (e.g. not having a hardcoded result_dir). I'll try to make a Pull Request to pytest-mpl for that, so we can just use a proper @pytest.mark.mpl_check_equal decorator in the future (will open a new issue after this one is merged). For now though, since we don't have many tests using check_figures_equal yet, we can probably just leave it like so.
Yes, looks good to me.
Writing an image-based test is only slightly more difficult than a simple test. The main consideration is that you must specify the "baseline" or reference image, and compare it with a "generated" or test image. This is handled using …
Added some notes here to CONTRIBUTING.md, adapted from https://matplotlib.org/3.3.1/devel/testing.html#writing-an-image-comparison-test.
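To give contributors a feel for it, a test written against this decorator might look like the sketch below (the basemap calls are placeholders, and the import path assumes the helper lives in pygmt/helpers/testing.py).

from pygmt.helpers.testing import check_figures_equal


@check_figures_equal()
def test_basemap_region_formats(fig_ref, fig_test):
    """
    Passing the region as a string or as a list should produce identical figures.
    """
    # Reference figure: region given as a "W/E/S/N" string.
    fig_ref.basemap(region="0/10/0/10", projection="X10c", frame=True)
    # Test figure: region given as a list. The decorator saves both figures to
    # PNG and compares them, so no baseline image is stored in the repository.
    fig_test.basemap(region=[0, 10, 0, 10], projection="X10c", frame=True)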
Looks good to me, but I can't approve it because I opened this PR.
Ah yes, I'll approve it then 😆
Tell git to ignore PNG files in the 'result_images' folder, and add it to the list of folders cleaned by make clean. Patches #555.
Tests with @pytest.mark.mpl_image_compare and @check_figures_equal (#555) will return an image diff on test failures in the 'tmp-test-dir-with-unique-name' directory, and we can upload those files as a GitHub artifact for inspection purposes.
* Use GitHub Action to upload diff images on test failure
* Set a unique artifact name for each OS/Python version
* Add upload-artifact to GMT Latest tests
Co-Authored-By: Dongdong Tian <[email protected]>
Description of proposed changes
This PR adds a new function check_figures_equal to check if two pygmt.Figure() objects are the same, mostly inspired by comment #522 (review) and the matplotlib decorator check_figures_equal.

What the function check_figures_equal does is very simple:
1. take two figures (fig_ref and fig_test) as arguments
2. call the compare_images function from matplotlib and calculate the RMS value

I add three new tests:
1. test_check_figures_equal checks the case when two images are equal
2. test_check_figures_unequal checks the case when two images are unequal. The exception is correctly caught by pytest.raises.
3. test_grdimage_central_longitude is a real test to check the images generated by passing a matrix or a grid to grdimage (we always assume that passing a netCDF grid to GMT gives the correct/reference image). The test helped me find a GMT bug, which is usually very difficult to detect (see "Matrix as grid with changing central meridian doesn't work well for gridline grids" gmt#3844 and "Add special check for non-rotated global grids" gmt#3849).

Some known issues/limitations:
- test_check_figures_unequal() checks if the function check_figures_equal correctly raises the GMTImageComparisonFailure exception when two images are unequal, and I use pytest.raises to catch the exception so that the test passes (see the sketch after this description). However, the two images are not deleted after the test. (Edit: they are now.)

Fixes #
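As a rough sketch of that unequal-images test (the import locations are assumptions: GMTImageComparisonFailure is taken to live in pygmt.exceptions and the decorator in pygmt.helpers.testing; the basemap calls are placeholders):

import pytest

from pygmt.exceptions import GMTImageComparisonFailure
from pygmt.helpers.testing import check_figures_equal


@check_figures_equal()
def mismatched_figures(fig_ref, fig_test):
    # No "test_" prefix, so pytest does not collect this on its own; we call it
    # below and expect the image comparison inside the decorator to fail.
    fig_ref.basemap(region=[0, 10, 0, 10], projection="X10c", frame=True)
    fig_test.basemap(region=[0, 20, 0, 20], projection="X10c", frame=True)


def test_check_figures_unequal():
    """The decorator should raise GMTImageComparisonFailure for unequal images."""
    with pytest.raises(GMTImageComparisonFailure):
        mismatched_figures()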
Reminders
- Run make format and make check to make sure the code follows the style guide.
- Add new public functions/methods/classes to doc/api/index.rst.