Enable batch support for windowed_mean|variance #1600
nicolaspi wants to merge 14 commits into tensorflow:main
Conversation
@axch I made changes in code you authored; could you kindly have a look at this PR?
@nicolaspi thanks for the contribution! I am no longer an active maintainer of TFP, so I'm not really in a position to review your PR in detail (@jburnim please suggest someone?). On a quick look, though, I see a couple of potential code-style issues:
Thanks for your feedback!
We specifically need the
There are two motivations for this case. First, for backward compatibility, it is equivalent to the legacy non-batched usage. Second, it is the only case I can think of where the broadcast is unambiguous when
In any case, I modified the unit tests to test against non-static shapes.
I made use of
I'll take a look at this.
import numpy as np
import tensorflow.compat.v2 as tf

if NUMPY_MODE:
We'll need to do something different about take_along_axis.

- (preferred) Somehow rewrite the logic using tf.gather/tf.gather_nd
- Expose tf.experimental.numpy.take_along_axis in https://github.com/tensorflow/probability/tree/main/tensorflow_probability/python/internal/backend/numpy

As is, this is problematic since we really dislike using JAX_/NUMPY_MODE in library code.
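For the preferred option, `take_along_axis` can be expressed as a gather over explicit index grids. A minimal NumPy sketch of the idea (the helper name `take_along_axis_via_gather` is hypothetical, not the PR's code; a TF version would stack the grids and call `tf.gather_nd`):

```python
import numpy as np

def take_along_axis_via_gather(x, indices, axis):
    # Build positional index grids for every axis of `indices`, then
    # substitute the user-provided indices on `axis`. Advanced indexing
    # with the grid tuple is the NumPy analogue of tf.gather_nd.
    grids = list(np.indices(indices.shape))
    grids[axis] = indices
    return x[tuple(grids)]

x = np.arange(12).reshape(3, 4)
idx = np.array([[0, 3], [1, 2], [2, 0]])
out = take_along_axis_via_gather(x, idx, axis=1)
assert np.array_equal(out, np.take_along_axis(x, idx, axis=1))
```

This assumes, as `np.take_along_axis` does, that `indices` has the same rank as `x`.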
Thanks for the review!

- I don't feel comfortable rewriting take_along_axis, as it would duplicate already existing logic; I feel it would produce an unnecessary maintenance burden.
- What about mapping tensorflow.experimental.numpy to the numpy and jax.numpy backends?
must be between 0 and N+1, and the shape of the output will be
`Bx + [M] + E`. Batch shape in the indices is not currently supported.
Suppose `x` has shape `Bx + [N] + E`, `low_indices` and `high_indices`
have shape `Bi + [M] + F`, such that `rank(Bx) = rank(Bi) = axis`.
What is F? Why isn't it a scalar?
Please check my comment below.
The shape `Bi + [1] + F` must be broadcastable with the shape of `x`.

If `rank(Bi + [M] + F) < rank(x)`, then the indices are expanded
I don't think this paragraph adds anything; it's just an implementation detail.
We specify the implicit rules we use for broadcasting. I updated the formulation.
Then each element of `low_indices` and `high_indices` must be
between 0 and N+1, and the shape of the output will be `Bx + [M] + E`.

The shape `Bi + [1] + F` must be broadcastable with the shape of `x`.
This contradicts the next paragraph, no?
In general, consider the non-batched version of this:
x shape: [N] + E
idx shape: [M]
output shape: [M] + E
The batching would introduce a batch dimension on the left of those shapes:
x shape: Bx + [N] + E
idx shape: Bi + [M]
output shape: broadcast(Bx, Bi) + [M] + E
Thus, the only broadcasting requirements are that Bx and Bi broadcast. I don't know where F came from.
> This contradicts the next paragraph, no?

Yes, I reformulated.

> The batching would introduce a batch dimension on the left of those shapes:
> Thus, the only broadcasting requirements are that Bx and Bi broadcast. I don't know where F came from.

Maybe the term 'batch' is not proper. This contribution adds the possibility of the more general case where idx shape is `Bi + [M] + F`. F could be seen as 'inner batch dimensions', but here 'batch' carries a different semantic than the standard machine learning one, where it is represented by outer dims.
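To make the shape rule `output = broadcast(Bx, Bi) + [M]` concrete, here is a hedged NumPy sketch (trailing event dims `E` omitted for brevity; `batched_windowed_mean` is a hypothetical reference helper, not TFP's implementation):

```python
import numpy as np

def batched_windowed_mean(x, low, high):
    # Shapes, per the discussion above:
    #   x:        Bx + [N]
    #   low/high: Bi + [M]   (integer window bounds, low <= high)
    #   output:   broadcast(Bx, Bi) + [M]
    bx, n = x.shape[:-1], x.shape[-1]
    bi, m = low.shape[:-1], low.shape[-1]
    bcast = np.broadcast_shapes(bx, bi)
    x = np.broadcast_to(x, bcast + (n,))
    low = np.broadcast_to(low, bcast + (m,))
    high = np.broadcast_to(high, bcast + (m,))
    # Each window mean is a difference of prefix sums divided by the length.
    csum = np.concatenate(
        [np.zeros(bcast + (1,)), np.cumsum(x, axis=-1)], axis=-1)
    total = (np.take_along_axis(csum, high, axis=-1)
             - np.take_along_axis(csum, low, axis=-1))
    return total / (high - low)

x = np.arange(8.0).reshape(2, 4)   # Bx = [2], N = 4
low = np.array([[0, 1]])           # Bi = [1], M = 2
high = np.array([[2, 4]])
out = batched_windowed_mean(x, low, high)
print(out.shape)  # (2, 2)
```

With `Bx = [2]` and `Bi = [1]`, the result shape is `(2, 2)`, i.e. `broadcast([2], [1]) + [2]`.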
@test_util.test_all_tf_execution_regimes
class WindowedStatsTest(test_util.TestCase):

  def _maybe_expand_dims_to_make_broadcastable(self, x, shape, axis):
These two functions are as complex as the thing we're testing. Is there any way we can write this via np.vectorize?
I refactored using np.vectorize, but I am not sure it is easier to read.
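For reference, an `np.vectorize`-based oracle for the window mean can be quite compact. A sketch (names are illustrative, not the PR's actual test helpers; it assumes the reference averages `x[low:high]` along the last axis):

```python
import numpy as np

# A gufunc-style signature lets np.vectorize map a scalar window-mean
# over arbitrary batch dimensions of `low`/`high`.
_window_mean = np.vectorize(
    lambda seq, lo, hi: seq[int(lo):int(hi)].mean(),
    signature='(n),(),()->()')

x = np.arange(6.0)
low = np.array([0, 2])
high = np.array([3, 6])
result = _window_mean(x, low, high)  # means of x[0:3] and x[2:6]
```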
Add test cases
Some `tensorflow` to `prefer_static` replacement
Parametrize tests
4002d8b to c90e961
Hi @SiegeLordEx, I have addressed your comments; can you have a look? Thanks
This PR makes the functions windowed_mean and windowed_variance accept indices with batch dimensions.

Example:

Now gives:

Was previously failing with: