Skip to content

[Data] - Fix expression mapping for Pandas#57868

Merged
alexeykudinkin merged 1 commit intoray-project:masterfrom
goutamvenkat-anyscale:goutam/pandas_filter_fix
Oct 24, 2025
Merged

[Data] - Fix expression mapping for Pandas#57868
alexeykudinkin merged 1 commit intoray-project:masterfrom
goutamvenkat-anyscale:goutam/pandas_filter_fix

Conversation

@goutamvenkat-anyscale
Copy link
Contributor

@goutamvenkat-anyscale goutamvenkat-anyscale commented Oct 18, 2025

Description

IN, NOT_IN and NOT were incorrectly mapped to the wrong pandas function which broke expression evaluation for pandas dfs.

Related issues

Fixes #57849

Additional information

Optional: Add implementation details, API changes, usage examples, screenshots, etc.

@goutamvenkat-anyscale goutamvenkat-anyscale requested a review from a team as a code owner October 18, 2025 01:03
@goutamvenkat-anyscale goutamvenkat-anyscale added data Ray Data-related issues go add ONLY when ready to merge, run all tests labels Oct 18, 2025
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request addresses an issue where filter operations for Pandas DataFrames were incorrectly mapped, leading to broken expression evaluation. The changes involve correcting the mapping of IN, NOT_IN, and NOT operations to the appropriate Pandas functions and adding new test cases to ensure the correct behavior of filter operations with both Arrow and Pandas blocks. The review focuses on the correctness and maintainability of the code changes.

@goutamvenkat-anyscale goutamvenkat-anyscale changed the title [Data] - Fix filter operations for Pandas [Data] - Fix expression mapping for Pandas Oct 18, 2025
Copy link
Contributor

@omatthew98 omatthew98 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@@ -271,14 +321,18 @@ def test_filter_with_invalid_expression(ray_start_regular_shared, tmp_path):
)
def test_filter_with_predicate_expressions(
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Out of scope for this PR, but I think if we're going to have like 20+ parametrizations, we should refactor this as a unit test (no ray.init)

@alexeykudinkin alexeykudinkin merged commit d05ca1c into ray-project:master Oct 24, 2025
6 checks passed
xinyuangui2 pushed a commit to xinyuangui2/ray that referenced this pull request Oct 27, 2025
landscapepainter pushed a commit to landscapepainter/ray that referenced this pull request Nov 17, 2025
Aydin-ab pushed a commit to Aydin-ab/ray-aydin that referenced this pull request Nov 19, 2025
Future-Outlier pushed a commit to Future-Outlier/ray that referenced this pull request Dec 7, 2025
Blaze-DSP pushed a commit to Blaze-DSP/ray that referenced this pull request Dec 18, 2025
peterxcli pushed a commit to peterxcli/ray that referenced this pull request Feb 25, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

data Ray Data-related issues go add ONLY when ready to merge, run all tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Data] error filtering using isin filters.

4 participants