-
Notifications
You must be signed in to change notification settings - Fork 28.9k
[SPARK-29606][SQL] Improve EliminateOuterJoin performance #26257
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #112678 has finished for PR 26257 at commit
|
|
still WIP? |
|
Thank you @maropu Actually, I am not very confident about this change. Is it make sense for you? |
|
Ur, I see. ok, I've not digged into this, so I'll check later. |
|
Test build #113884 has finished for PR 26257 at commit
|
|
Test build #113922 has finished for PR 26257 at commit
|
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/LogicalPlan.scala
Outdated
Show resolved
Hide resolved
|
We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable. |
|
Test build #119128 has finished for PR 26257 at commit
|
|
Metrics of Analyzer/Optimizer Rules for After this PR: |
| a.toAttribute | ||
| }) | ||
| allConstraints ++= allConstraints.map { | ||
| case e @ EqualNullSafe(l, _: AttributeReference) if l.references.size > 1 => e |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I feel its a bit difficult to understand this pattern-matching at a glance, could you leave comments about what this means? Probably, you wanna skip the pattern below for performance?
|
Test build #122201 has finished for PR 26257 at commit
|
|
retest this please |
|
Test build #122212 has finished for PR 26257 at commit
|
|
We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable. |
What changes were proposed in this pull request?
This PR try to improve
EliminateOuterJoinperformance via avoid generating too many constraints. For example:Before this PR:
After this PR:
Why are the changes needed?
Improve
EliminateOuterJoinperformance.Before this PR:
After this PR:
Does this PR introduce any user-facing change?
No.
How was this patch tested?
Unit test.