
Introduce backend.result_type #18482

Merged · 29 commits · Sep 28, 2023

Conversation

@james77777778 (Contributor) commented Sep 23, 2023

[MOVED FROM KERAS CORE PR]
keras-team/keras-core#938

Related to #18400

This PR adds backend.result_type, which is inspired by and adapted from JAX:
https://github.com/google/jax/blob/main/jax/_src/dtypes.py

The major differences are as follows:

  • Keras' backend.result_type does not canonicalize the resulting dtype. Consequently, Keras allows the computation of high-precision types such as float64, int64, and uint64.
  • Keras' backend.result_type uses the precision of backend.floatx() for weak types (Python scalars). The precision defaults to 32 bits, which means that a Python float resolves to float32, and so on.
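For illustration, here is the intended behavior sketched as hypothetical calls (outputs assume the default backend.floatx() of "float32"):

```python
from keras import backend

# Standard promotion between concrete dtypes:
backend.result_type("int8", "int32")     # -> "int32"

# No canonicalization: high-precision results are preserved:
backend.result_type("float64", "int32")  # -> "float64"

# Weak types (Python scalars) take the precision of backend.floatx():
backend.result_type("int32", float)      # -> "float32"
```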

In this PR, result_type has been applied to:

  • ops.add
  • ops.arange
  • ops.sqrt

@codecov-commenter commented Sep 23, 2023

Codecov Report

Attention: 25 lines in your changes are missing coverage. Please review.

Comparison is base (299419a) 77.34% compared to head (6bb815f) 77.54%.
Report is 2 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master   #18482      +/-   ##
==========================================
+ Coverage   77.34%   77.54%   +0.19%     
==========================================
  Files         332      333       +1     
  Lines       32000    32163     +163     
  Branches     6248     6277      +29     
==========================================
+ Hits        24751    24940     +189     
+ Misses       5664     5637      -27     
- Partials     1585     1586       +1     
Flag               Coverage Δ
keras              77.44% <89.13%> (+0.18%) ⬆️
keras-jax          63.32% <64.78%> (+1.13%) ⬆️
keras-numpy        57.27% <71.30%> (+1.14%) ⬆️
keras-tensorflow   63.15% <64.34%> (+1.14%) ⬆️
keras-torch        64.06% <63.91%> (+0.12%) ⬆️

Flags with carried forward coverage won't be shown.

Files Coverage Δ
keras/backend/__init__.py 95.00% <100.00%> (+0.12%) ⬆️
keras/backend/common/__init__.py 100.00% <100.00%> (ø)
keras/backend/jax/numpy.py 98.87% <100.00%> (+1.16%) ⬆️
keras/backend/numpy/numpy.py 98.52% <100.00%> (+1.55%) ⬆️
keras/backend/tensorflow/numpy.py 95.01% <100.00%> (+1.07%) ⬆️
keras/backend/torch/numpy.py 94.87% <100.00%> (+0.78%) ⬆️
keras/ops/numpy.py 94.86% <97.14%> (+0.63%) ⬆️
keras/backend/common/dtypes.py 76.69% <76.69%> (ø)

... and 3 files with indirect coverage changes


@fchollet (Member) left a comment

Thanks for the PR!

The logic here is quite complex overall, and it may be difficult to extend and maintain in the future. I assume it is adapted from JAX? Could it be significantly simplified?

I think we just need a function that can resolve result_dtype(a_dtype, b_dtype). Once we have it, we can do pairwise reductions to get the result dtype for any list of tensors/dtypes. The function itself can just be half of a symmetric matrix (or the entire matrix, if you don't want to order the inputs). That should be readable and would not take much code.
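A minimal sketch of this suggestion (hypothetical names and a deliberately tiny table, not the code merged in this PR):

```python
import functools

# Half of the symmetric promotion matrix; sorting each pair means only
# one ordering of the two dtypes needs to be stored.
_PROMOTION_TABLE = {
    ("int32", "int32"): "int32",
    ("float32", "int32"): "float32",
    ("float32", "float32"): "float32",
    ("float32", "float64"): "float64",
    ("float64", "int32"): "float64",
    ("float64", "float64"): "float64",
}

def result_dtype(a_dtype, b_dtype):
    return _PROMOTION_TABLE[tuple(sorted((a_dtype, b_dtype)))]

def result_type(*dtypes):
    # Pairwise reduction yields the result dtype for any list of dtypes.
    return functools.reduce(result_dtype, dtypes)

print(result_type("int32", "float32", "float64"))  # float64
```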

@james77777778 (Contributor, Author)

> The logic here is quite complex overall, and it may be difficult to extend and maintain in the future. I assume it is adapted from JAX? Could it be significantly simplified?

Yes, it is adapted from JAX. In the latest commit, I believe I have simplified it significantly.

> I think we just need a function that can resolve result_dtype(a_dtype, b_dtype).

I think most of the complexity arises from dealing with the weak types (Python's int and float). We can simplify the logic by removing them, but we might lose some of the value of result_dtype.
For example, some ops allow Python scalar types for their arguments. It would be inconvenient if we couldn't handle weak types in result_dtype.
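To illustrate the weak-type handling (a hypothetical helper assuming floatx() == "float32"; a full implementation also tracks "weakness" through promotion so that, e.g., an int8 tensor plus a Python int stays int8):

```python
def resolve_weak_dtype(value, floatx="float32"):
    """Maps Python scalars, which carry no explicit dtype, to a dtype."""
    if isinstance(value, bool):  # bool is a subclass of int; check it first
        return "bool"
    if isinstance(value, int):
        # Weak int: take the precision of floatx rather than forcing 64-bit.
        return "int64" if floatx == "float64" else "int32"
    if isinstance(value, float):
        return floatx  # weak float
    return str(value.dtype)  # already a concrete tensor/array dtype
```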

@fchollet (Member) left a comment

Thanks for the update -- this is a nice simplification!

@james77777778 changed the title from "Introduce backend.result_dtype and improve dtypes in ops.numpy.*" to "Introduce backend.result_dtype" on Sep 26, 2023
@james77777778 changed the title from "Introduce backend.result_dtype" to "Introduce backend.result_type" on Sep 26, 2023
@james77777778 (Contributor, Author) commented Sep 26, 2023

Hi @fchollet

I think backend.result_type should now be ready. I have applied it to the following ops:

  • ops.ones
  • ops.zeros
  • ops.empty
  • ops.identity
  • ops.tri
  • ops.eye

and also added the corresponding tests.

It is worth noting:
There is an inevitable gap between tensorflow and the other backends: a tf.Variable with int32 cannot be placed on the GPU, and as a workaround we need to use int64.
https://www.tensorflow.org/xla/known_issues#tfvariable_on_a_different_device
This issue will break self.add_variable, because constant initializers extensively use ops.ones, ops.zeros, and ops.eye in their __call__.

So I have skipped the canonicalization in backend.result_type when the resulting dtype is int64 with tensorflow.

EDITED:
On second thought, it should be safe to cast the value at the initialization of tf.Variable. This resolves the int32/int64 issue in tensorflow without making an exception to the type inference rules.
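A minimal sketch of that workaround (hypothetical helper, not the exact code in this PR):

```python
import tensorflow as tf

def create_variable(initial_value, dtype):
    # Cast the initializer output to the requested dtype at variable
    # creation time instead of special-casing int64 inside the promotion
    # rules, so result_type stays consistent across backends.
    return tf.Variable(tf.cast(initial_value, dtype))
```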

@james77777778 (Contributor, Author)

While there are still some dtype inconsistencies in other ops, we can address them in a separate PR.

@fchollet (Member) left a comment

Thanks for the great contribution -- it's looking very good.

@fchollet (Member) left a comment

Thank you for the update!

@james77777778 (Contributor, Author)

> There are valid use cases for float64. Doing print(ops.ones((1,), dtype="float64").dtype) should definitely return float64 with all backends other than JAX. In JAX it is an unfortunate limitation inherited from historical reasons.
>
> Requesting float64 should return a float64 output if the backend supports it (as all do, except JAX).

Got it.
Generally, we should adhere to JAX's type inference rules with enable_x64=True. Is that correct?

@fchollet (Member)

> Generally, we should adhere to JAX's type inference rules with enable_x64=True. Is that correct?

Yes, that is a good plan. My experience with JAX's dtype promotion policy is that it is more user-friendly than TF's (with the exception of 64-bit dtypes being disabled by default).

@james77777778 (Contributor, Author) commented Sep 28, 2023

> Yes, that is a good plan. My experience with JAX's dtype promotion policy is that it is more user-friendly than TF's (with the exception of 64-bit dtypes being disabled by default).

Currently, the behavior of backend.result_type and some of ops.numpy.* matches JAX when using JAX_DEFAULT_DTYPE_BITS=32 and JAX_ENABLE_X64=true.
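For reference, a parity check against JAX under those flags might look like this (illustrative; the flags must be set before jax is imported):

```python
import os

os.environ["JAX_DEFAULT_DTYPE_BITS"] = "32"
os.environ["JAX_ENABLE_X64"] = "true"

import jax.numpy as jnp
from keras import backend

# Both should agree, including on 64-bit results:
assert backend.result_type("int32", "float64") == str(
    jnp.result_type("int32", "float64")
)  # "float64"
```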

EDITED:
This PR should be ready.
In ops.add, the type inference of jax and torch already matches, so we can omit result_type and the casting there.

@fchollet (Member) left a comment

Thanks for the updates!

@@ -3872,3 +3874,165 @@ def test_tri(self):
self.assertAllClose(knp.Tri()(3), np.tri(3))
self.assertAllClose(knp.Tri()(3, 4), np.tri(3, 4))
self.assertAllClose(knp.Tri()(3, 4, 1), np.tri(3, 4, 1))


class NumpyDtypeTest(testing.TestCase, parameterized.TestCase):
A Member left a comment

This is the most useful class -- only through rigorous testing can we achieve consistency 👍
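As a sketch of what such testing can look like (a hypothetical test using jnp as the reference oracle, assuming JAX_ENABLE_X64 per the discussion above):

```python
from absl.testing import parameterized

ALL_DTYPES = ["int8", "int32", "float16", "float32"]

class ResultTypeConsistencyTest(parameterized.TestCase):
    @parameterized.parameters(
        (a, b) for a in ALL_DTYPES for b in ALL_DTYPES
    )
    def test_result_type_matches_jax(self, a, b):
        import jax.numpy as jnp
        from keras import backend

        # backend.result_type should agree with JAX's promotion rules.
        self.assertEqual(
            backend.result_type(a, b), str(jnp.result_type(a, b))
        )
```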

@james77777778 (Contributor, Author) commented Sep 28, 2023

Thank you for the thorough review.
Please let me know if any changes are necessary.

EDITED:
result_type has been applied to:

  • ops.add
  • ops.arange
  • ops.sqrt
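An illustrative sketch of the resulting behavior (outputs assume the default floatx() of "float32"; backend.standardize_dtype is used here to get a backend-agnostic dtype string):

```python
from keras import backend, ops

print(backend.standardize_dtype(ops.arange(3).dtype))    # int32: weak int
print(backend.standardize_dtype(ops.arange(3.0).dtype))  # float32: weak float
x = ops.ones((2,), dtype="int32")
print(backend.standardize_dtype(ops.sqrt(x).dtype))      # float32
print(backend.standardize_dtype(ops.add(x, 1.0).dtype))  # float32
```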

@fchollet (Member) left a comment

Wonderful. Thank you for the awesome contribution! 🚀
