[core] Revert change to use ready to create plasma_object_ids #50467

dayshah · 2025-02-12T03:18:44Z

Why are these changes needed?

Reverting the change to use ready to create plasma_object_ids here. #49218

An example ray data workload that highlights the problem with the issue is here https://gist.github.com/dayshah/1080db0cd3fb561119bca17c85215117. 5 seconds without, 50 seconds with.

The problem is that ready is capped to num_returns, and we do object pulling based on plasma_object_ids which was now being created from ready. Ray data calls ray.wait(10_refs, num_returns=1). If we create plasma_object_ids with only ready, it'll only contain one object in this situation. If we use memory_store_ids to create plasma_object_ids, we'll have plasma_object_ids with 10 objects and all 10 will start being pulled even if the ray.wait call returns immediately after getting one.

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: dayshah <[email protected]>

dayshah · 2025-02-12T03:23:31Z

src/ray/core_worker/core_worker.cc

    const auto &obj_id = *iter;
    auto found = memory_store->GetIfExists(obj_id);
    if (found != nullptr && found->IsInPlasmaError()) {
      plasma_object_ids.insert(obj_id);
-      ready.erase(iter);
-      memory_object_ids.erase(obj_id);


we don't use memory object ids after this so no need for this

dayshah · 2025-02-12T03:46:10Z

i think a possible fix for this could be to pass two vectors into plasma provider wait

one with all the objects in plasma - we'll use this to start doing the pulling

one with just the number of objects we need from ready, we'll use this to actually make the WaitRequest

revert change to use ready to create plasma_object_ids

f4c07e1

Signed-off-by: dayshah <[email protected]>

dayshah added the go add ONLY when ready to merge, run all tests label Feb 12, 2025

dayshah requested a review from edoakes February 12, 2025 03:18

dayshah assigned edoakes Feb 12, 2025

dayshah commented Feb 12, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[core] Revert change to use ready to create plasma_object_ids #50467

[core] Revert change to use ready to create plasma_object_ids #50467

dayshah commented Feb 12, 2025 •

edited

Loading

dayshah Feb 12, 2025

dayshah commented Feb 12, 2025

[core] Revert change to use ready to create plasma_object_ids #50467

Are you sure you want to change the base?

[core] Revert change to use ready to create plasma_object_ids #50467

Conversation

dayshah commented Feb 12, 2025 • edited Loading

Why are these changes needed?

Related issue number

Checks

dayshah Feb 12, 2025

Choose a reason for hiding this comment

dayshah commented Feb 12, 2025

dayshah commented Feb 12, 2025 •

edited

Loading