[Data] Improve state initialization for `ActorPoolMapOperator` #34037

amogkam · 2023-04-04T03:32:16Z

`ActorPoolMapOperator` takes a callable class that initializes state to be reused across batches.

In the current implementation, this state is initialized when the first batch is processed rather than during actor init.

This PR separates state initialization into its own step and runs it during actor init. State can then be initialized up front for fixed-size actor pools, even before any tasks are ready to be dispatched, which improves pipelining. It also supports multithreaded actors: state is initialized once per actor instead of once per thread.
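
The difference between initializing on the first batch and initializing at actor construction can be sketched in plain Python, without Ray. This is a minimal illustration rather than the code in this PR: `ExpensiveModel`, `LazyMapWorker`, `EagerMapWorker`, and `is_initialized` are hypothetical names, and the workers are ordinary objects standing in for actors.

```python
class ExpensiveModel:
    """Hypothetical callable class standing in for a user-defined function."""

    def __init__(self):
        # Stands in for costly setup, e.g. loading model weights.
        self.offset = 10

    def __call__(self, batch):
        return [x + self.offset for x in batch]


class LazyMapWorker:
    """Old pattern (sketch): state is built when the first batch arrives."""

    def __init__(self, fn_cls):
        self._fn_cls = fn_cls
        self._fn = None

    def is_initialized(self):
        return self._fn is not None

    def submit(self, batch):
        if self._fn is None:
            # The first batch pays the setup cost.
            self._fn = self._fn_cls()
        return self._fn(batch)


class EagerMapWorker:
    """New pattern (sketch): state is built in the worker's constructor."""

    def __init__(self, fn_cls):
        self._fn = fn_cls()

    def is_initialized(self):
        return True

    def submit(self, batch):
        return self._fn(batch)


# A fixed-size pool of two workers, created before any batch is available.
lazy_pool = [LazyMapWorker(ExpensiveModel) for _ in range(2)]
eager_pool = [EagerMapWorker(ExpensiveModel) for _ in range(2)]
```

In this sketch every worker in `eager_pool` has its state ready as soon as the pool exists, while the workers in `lazy_pool` stay uninitialized until each one receives its first batch.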

Why are these changes needed?

Related issue number

Checks

- I've signed off every commit (by using the -s flag, i.e., `git commit -s`) in this PR.
- I've run `scripts/format.sh` to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
  - I've added any new APIs to the API Reference. For example, if I added a
    method in Tune, I've added it in `doc/source/tune/api/` under the
    corresponding .rst file.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

Signed-off-by: amogkam <[email protected]>

python/ray/data/_internal/execution/operators/actor_pool_map_operator.py

python/ray/data/tests/test_operators.py

jianoaix · 2023-04-05T00:20:18Z

Is the existing approach really once per thread? It looks like `_cached_fn` is per process (and hence per actor).

amogkam · 2023-04-05T00:25:50Z

Right, but race conditions can lead to each thread initializing the callable class. We can either add synchronization (move the class initialization behind a lock) or explicitly do the class initialization in a separate step.
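
The first option mentioned here, keeping lazy initialization but moving it behind a lock, might look like the sketch below. It is only an illustration of the synchronization idea, not code from this PR: `CountingUDF`, `LockedLazyWorker`, and `run_concurrently` are hypothetical names, and plain threads stand in for a multithreaded actor. Without the lock, two threads could both see `self._fn is None` and each construct the class.

```python
import threading


class CountingUDF:
    """Hypothetical callable class that counts how often it is constructed."""

    init_count = 0
    _count_lock = threading.Lock()

    def __init__(self):
        with CountingUDF._count_lock:
            CountingUDF.init_count += 1

    def __call__(self, batch):
        return [x * 2 for x in batch]


class LockedLazyWorker:
    """Lazy initialization guarded by a lock, so it runs at most once."""

    def __init__(self, fn_cls):
        self._fn_cls = fn_cls
        self._fn = None
        self._init_lock = threading.Lock()

    def submit(self, batch):
        # The check and the construction happen under the same lock, so no
        # two threads can both observe `None` and both construct the class.
        with self._init_lock:
            if self._fn is None:
                self._fn = self._fn_cls()
        return self._fn(batch)


def run_concurrently(worker, n_threads=8):
    """Submit one single-element batch per thread and collect the results."""
    results = [None] * n_threads

    def task(i):
        results[i] = worker.submit([i])

    threads = [threading.Thread(target=task, args=(i,)) for i in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results
```

The PR takes the second option instead: the class is constructed in a separate step during actor init, so no lock is needed on the per-batch path.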

…roject#34037)

ActorPoolMapOperator takes in a Callable class which initializes some state to be reused for every batch.

In the current implementation, this state is initialized on the first batch, rather than during actor init.

In this PR, we separate the state initialization and actually call it during Actor init. This allows state to be initialized for fixed size actor pools, even when tasks are not ready to be dispatched for better pipelining. It also supports using multithreaded actors, so state gets initialized once per actor instead of once per thread.

---------

Signed-off-by: amogkam <[email protected]>
Signed-off-by: elliottower <[email protected]>

…roject#34037)

ActorPoolMapOperator takes in a Callable class which initializes some state to be reused for every batch.

In the current implementation, this state is initialized on the first batch, rather than during actor init.

In this PR, we separate the state initialization and actually call it during Actor init. This allows state to be initialized for fixed size actor pools, even when tasks are not ready to be dispatched for better pipelining. It also supports using multithreaded actors, so state gets initialized once per actor instead of once per thread.

---------

Signed-off-by: amogkam <[email protected]>
Signed-off-by: Jack He <[email protected]>

amogkam added 2 commits April 3, 2023 20:06

add

8b1d943

Signed-off-by: amogkam <[email protected]>

add test

877b975

Signed-off-by: amogkam <[email protected]>

amogkam requested review from c21, clarkzinzow, ericl, jianoaix, jjyao and scv119 as code owners April 4, 2023 03:32

amogkam assigned ericl, c21 and jianoaix Apr 4, 2023

fix

848d504

Signed-off-by: amogkam <[email protected]>

ericl reviewed Apr 4, 2023

python/ray/data/_internal/execution/operators/actor_pool_map_operator.py (resolved)

ericl reviewed Apr 4, 2023

python/ray/data/tests/test_operators.py (outdated, resolved)

ericl approved these changes Apr 4, 2023

ericl added the @author-action-required label Apr 4, 2023

jianoaix approved these changes Apr 5, 2023

amogkam added 3 commits April 5, 2023 15:38

address comments

5843f86

Signed-off-by: amogkam <[email protected]>

fix

f6b1e96

Signed-off-by: amogkam <[email protected]>

fix

9f2c677

Signed-off-by: amogkam <[email protected]>

c21 approved these changes Apr 6, 2023

amogkam added 2 commits April 6, 2023 20:50

fix

b89255f

Signed-off-by: amogkam <[email protected]>

fix

28ec24d

Signed-off-by: amogkam <[email protected]>

amogkam merged commit aaac9cd into ray-project:master Apr 10, 2023

amogkam deleted the dataset-fix-state-init branch April 10, 2023 20:23
