[ci][data] tensorflow-datasets tests: depset for tfds tests (3/3) #59354

elliot-barn · 2025-12-10T19:26:20Z

using depset for data tfds ci tests

Signed-off-by: elliot-barn <[email protected]>

gemini-code-assist

Code Review

This pull request successfully sets up a dedicated CI job for tensorflow-datasets tests on Python 3.12. The changes include creating a new Docker image, defining dependencies with depset, and refactoring the tests into a new file. The CI configuration is updated correctly to isolate these tests. My only suggestion is to remove a duplicated test case in the new test file to improve maintainability.

gemini-code-assist · 2025-12-10T19:28:22Z

python/ray/data/tests/test_tensorflow_datasets.py

 def test_from_tf(ray_start_regular_shared_2_cpus):
    import tensorflow as tf
    import tensorflow_datasets as tfds

    tf_dataset = tfds.load("mnist", split=["train"], as_supervised=True)[0]
    tf_dataset = tf_dataset.take(8)  # Use subset to make test run faster.

    ray_dataset = ray.data.from_tf(tf_dataset)

    actual_data = extract_values("item", ray_dataset.take_all())
    expected_data = list(tf_dataset)
    assert len(actual_data) == len(expected_data)
    for (expected_features, expected_label), (actual_features, actual_label) in zip(
        expected_data, actual_data
    ):
        tf.debugging.assert_equal(expected_features, actual_features)
        tf.debugging.assert_equal(expected_label, actual_label)



The test test_from_tf appears to be redundant. Its logic is fully contained within test_from_tf_e2e, which performs the same data validation checks plus additional assertions on the execution plan and usage records. To reduce code duplication and improve maintainability, consider removing test_from_tf.

ci/docker/datatfds.build.Dockerfile

aslonnie · 2025-12-10T20:21:36Z

python/ray/data/tests/test_tensorflow_datasets.py

-@pytest.mark.skipif(
-    sys.version_info >= (3, 12),
-    reason="Skip due to incompatibility tensorflow with Python 3.12+",
-)


maybe just remove this entire test. it is a duplicate.

Signed-off-by: elliot-barn <[email protected]>

aslonnie · 2025-12-12T20:10:48Z

merging this without @ray-project/ray-data approval; forgive me

elliot-barn added 11 commits December 9, 2025 19:08

adding new image for tensorflow-datasets

1234656

Signed-off-by: elliot-barn <[email protected]>

running tfds tests togheter

1db6c37

Signed-off-by: elliot-barn <[email protected]>

adding build target

1f3060c

Signed-off-by: elliot-barn <[email protected]>

updating tag

b43732d

Signed-off-by: elliot-barn <[email protected]>

updating image name

14774cb

Signed-off-by: elliot-barn <[email protected]>

adding bazel target

a3505e0

Signed-off-by: elliot-barn <[email protected]>

downsizing pytest size

883fd2b

Signed-off-by: elliot-barn <[email protected]>

adding verbose output for tests

55e82c8

Signed-off-by: elliot-barn <[email protected]>

removing verbose flags

c5e71f4

Signed-off-by: elliot-barn <[email protected]>

removing -U

2cc486e

Signed-off-by: elliot-barn <[email protected]>

creating depset

2fe373a

Signed-off-by: elliot-barn <[email protected]>

elliot-barn requested review from a team as code owners December 10, 2025 19:26

removing skips

34748a4

Signed-off-by: elliot-barn <[email protected]>

elliot-barn added the go add ONLY when ready to merge, run all tests label Dec 10, 2025

elliot-barn changed the title ~~[ci][data] depset for tfds tests~~ [ci][data] tensorflow-datasets tests: depset for tfds tests (3/3) Dec 10, 2025

gemini-code-assist bot reviewed Dec 10, 2025

View reviewed changes

elliot-barn mentioned this pull request Dec 10, 2025

[ci][data] tensorflow-datasets tests: move tfds tests (2/3) #59315

Closed

cursor bot reviewed Dec 10, 2025

View reviewed changes

ci/docker/datatfds.build.Dockerfile Outdated Show resolved Hide resolved

aslonnie reviewed Dec 10, 2025

View reviewed changes

elliot-barn added 2 commits December 10, 2025 23:09

copying depset to home/ray

abaa372

Signed-off-by: elliot-barn <[email protected]>

updating req path

c8058c7

Signed-off-by: elliot-barn <[email protected]>

ray-gardener bot added data Ray Data-related issues devprod labels Dec 11, 2025

elliot-barn added 3 commits December 11, 2025 01:55

removing test

43db082

Signed-off-by: elliot-barn <[email protected]>

including numpy in datatfds depset

cf07786

Signed-off-by: elliot-barn <[email protected]>

Merge branch 'master' into elliot-barn/depset-for-tfds-tests

4cb200f

elliot-barn requested a review from aslonnie December 11, 2025 23:08

aslonnie approved these changes Dec 11, 2025

View reviewed changes

elliot-barn requested a review from aslonnie December 12, 2025 18:20

aslonnie merged commit 663eff1 into master Dec 12, 2025
6 checks passed

aslonnie deleted the elliot-barn/depset-for-tfds-tests branch December 12, 2025 20:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ci][data] tensorflow-datasets tests: depset for tfds tests (3/3) #59354

[ci][data] tensorflow-datasets tests: depset for tfds tests (3/3) #59354

Uh oh!

elliot-barn commented Dec 10, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Dec 10, 2025

Uh oh!

Uh oh!

aslonnie Dec 10, 2025

Uh oh!

aslonnie commented Dec 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[ci][data] tensorflow-datasets tests: depset for tfds tests (3/3) #59354

[ci][data] tensorflow-datasets tests: depset for tfds tests (3/3) #59354

Uh oh!

Conversation

elliot-barn commented Dec 10, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

aslonnie Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

aslonnie commented Dec 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants