Skip to content

Conversation

@dongjoon-hyun
Copy link
Member

@dongjoon-hyun dongjoon-hyun commented Mar 18, 2024

What changes were proposed in this pull request?

This PR aims to enable spark.shuffle.service.removeShuffle for Apache Spark 4.0.0.

Why are the changes needed?

Since Apache Spark 3.3.0, Apache Spark has been supporting spark.shuffle.service.removeShuffle via SPARK-37618.

We can use it when external shuffle service is available.

Does this PR introduce any user-facing change?

By default, no because spark.shuffle.service.enabled is still disabled.

Only for the existing shuffle service users, this PR works.

How was this patch tested?

Pass the CIs.

Was this patch authored or co-authored using generative AI tooling?

No.

@dongjoon-hyun dongjoon-hyun changed the title [SPARK-47448][CORE] Enable spark.shuffle.service.removeShuffle by default [SPARK-47448][CORE] Enable spark.shuffle.service.removeShuffle by default Mar 18, 2024
@dongjoon-hyun
Copy link
Member Author

WDYT, @Kimahriman, @mridulm , @tgravescs , @Ngone51 ?

@tgravescs
Copy link
Contributor

seems fine to me but I haven't tested with this feature much. +1

@dongjoon-hyun
Copy link
Member Author

Thank you for the valuable feedback, @tgravescs .

@Kimahriman
Copy link
Contributor

I've been using it heavily in production since I added it so definitely good with me.

@dongjoon-hyun
Copy link
Member Author

Thank you, @Kimahriman !

@mridulm
Copy link
Contributor

mridulm commented Mar 18, 2024

Same as Tom, we are on an older version, so I have not used it in production - but toggling it to default should be fine.

@dongjoon-hyun
Copy link
Member Author

Thank you, @mridulm .

@dongjoon-hyun
Copy link
Member Author

Merged to master for Apache Spark 4.0.0. Thank you all!

@dongjoon-hyun dongjoon-hyun deleted the SPARK-47448 branch March 19, 2024 00:10
sweisdb pushed a commit to sweisdb/spark that referenced this pull request Apr 1, 2024
…efault

### What changes were proposed in this pull request?

This PR aims to enable `spark.shuffle.service.removeShuffle` for Apache Spark 4.0.0.

### Why are the changes needed?

Since Apache Spark 3.3.0, Apache Spark has been supporting `spark.shuffle.service.removeShuffle` via SPARK-37618.

- apache#35085

We can use it when external shuffle service is available.

### Does this PR introduce _any_ user-facing change?

By default, no because `spark.shuffle.service.enabled` is still disabled.

Only for the existing shuffle service users, this PR works.

### How was this patch tested?

Pass the CIs.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes apache#45572 from dongjoon-hyun/SPARK-47448.

Authored-by: Dongjoon Hyun <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants