Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[data] add column relaxation #49221

Merged
merged 4 commits into from
Dec 14, 2024

Conversation

ivanthewebber
Copy link
Contributor

Why are these changes needed?

Aligns ray.data.Dataset.add_column to it's docstring and allows more flexibility for adding columns to pandas dataframes.

Related issue number

Closes #49114

Checks

  • I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
    • I've added any new APIs to the API Reference. For example, if I added a
      method in Tune, I've added it in doc/source/tune/api/ under the
      corresponding .rst file.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit test
    • Release tests
    • This PR is not tested :(
      python -m pytest -v -s python\ray\data\tests\test_map.py::test_add_column

@ivanthewebber ivanthewebber requested a review from a team as a code owner December 11, 2024 20:29
@richardliaw richardliaw changed the title Ivwebber/fix add column [data] add column relaxation Dec 11, 2024
@richardliaw
Copy link
Contributor

Hi @ivanthewebber , this change looks good. Could you make sure the DCO is fixed and check the CI fails?

)
assert ds.take(2) == [{"id": 1}, {"id": 2}]

ds = ray.data.range(5).add_column(
Copy link
Contributor

@richardliaw richardliaw Dec 12, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

leave a comment here to capture intent?

@richardliaw richardliaw added the go add ONLY when ready to merge, run all tests label Dec 12, 2024
@richardliaw richardliaw self-assigned this Dec 12, 2024
@richardliaw richardliaw added the data Ray Data-related issues label Dec 12, 2024
@ivanthewebber ivanthewebber force-pushed the ivwebber/fix-add-column branch from 4c5fe83 to 67a29c9 Compare December 12, 2024 22:50
ivanthewebber and others added 3 commits December 12, 2024 14:55
ray-project#49114

Signed-off-by: Ivan Webber <[email protected]>
Signed-off-by: Ivan Webber <[email protected]>
Signed-off-by: Ivan Webber <[email protected]>
Signed-off-by: Ivan Webber <[email protected]>
Signed-off-by: Ivan Webber <[email protected]>
Signed-off-by: Ivan Webber <[email protected]>
@ivanthewebber ivanthewebber force-pushed the ivwebber/fix-add-column branch from 67a29c9 to 053bc14 Compare December 12, 2024 22:55
@richardliaw
Copy link
Contributor

Lint is failing?

Signed-off-by: Ivan Webber <[email protected]>
@richardliaw richardliaw merged commit 5da66ea into ray-project:master Dec 14, 2024
5 checks passed
ujjawal-khare pushed a commit to ujjawal-khare-27/ray that referenced this pull request Dec 17, 2024
## Why are these changes needed?

Aligns `ray.data.Dataset.add_column` to it's docstring and allows more
flexibility for adding columns to pandas dataframes.

## Related issue number

Closes ray-project#49114

## Checks

- [ ] I've signed off every commit(by using the -s flag, i.e., `git
commit -s`) in this PR.
- [ ] I've run `scripts/format.sh` to lint the changes in this PR.
- [ ] I've included any doc changes needed for
https://docs.ray.io/en/master/.
- [ ] I've added any new APIs to the API Reference. For example, if I
added a
method in Tune, I've added it in `doc/source/tune/api/` under the
           corresponding `.rst` file.
- [x] I've made sure the tests are passing. Note that there might be a
few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
   - [x] Unit test
   - [ ] Release tests
   - [ ] This PR is not tested :(
`python -m pytest -v -s
python\ray\data\tests\test_map.py::test_add_column`

---------

Signed-off-by: Ivan Webber <[email protected]>
Signed-off-by: Ivan Webber <[email protected]>
Signed-off-by: ujjawal-khare <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
data Ray Data-related issues go add ONLY when ready to merge, run all tests
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Data] Ray 2.40 add_column Erroneous Assertion
2 participants