-
Couldn't load subscription status.
- Fork 28.9k
[SPARK-12012] [SQL] Show more comprehensive PhysicalRDD metadata when visualizing SQL query plan #10004
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-12012] [SQL] Show more comprehensive PhysicalRDD metadata when visualizing SQL query plan #10004
Conversation
|
Test build #46765 has finished for PR 10004 at commit
|
|
I wouldn't have the top level name be "PhysicalRDD" - that term just makes no sense to users. I'd consider just putting "Orc" there. If it is a Hive table, just saying "Scan: table name" might work too". BTW - if the path is super long, is the box going to be expanded to super wide as well? |
|
@rxin Updated, together with a new screenshot in the PR description. Now it shows |
|
Test build #46825 has finished for PR 10004 at commit
|
|
Test build #46833 has finished for PR 10004 at commit
|
|
retest this please |
|
Test build #46835 has finished for PR 10004 at commit
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Removed the paths string because it's now shown as part of the metadata in both simpleString and visualized plan node.
|
Test build #46841 has finished for PR 10004 at commit
|
|
Can we hide |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can you add the type to this val? Right now it's not super clear what it is
5d12dd7 to
6891876
Compare
|
Comments addressed. |
|
Is it possible to only show the short text in the normal box, and then when hover, show the full path? I'm very concerned about long paths. cc @zsxwing |
|
Test build #46959 has finished for PR 10004 at commit
|
|
@rxin One problem I found is that, the tooltip box is of fixed width. This means that the full path can never be fully observed if it's too long. But I agree that we should keep the normal box relatively smaller. I'll just remove the metadata entries in the normal box. |
|
Test build #47050 has finished for PR 10004 at commit
|
|
Can you add a screenshot if the path is really long? |
|
@nongli Uploaded new screenshot and rebased to master. |
863df7c to
02448a6
Compare
|
Test build #47422 has finished for PR 10004 at commit
|
|
I know that the current visual effect of super long paths isn't perfect, but at least it doesn't introduce super wide plan nodes. And this is also how our current UI handles super long text. |
|
I'm going to merge this into master/1.6. |
|
hm i had trouble cherrypicking it into 1.6. Can you do it yourself? |
…visualizing SQL query plan This PR adds a `private[sql]` method `metadata` to `SparkPlan`, which can be used to describe detail information about a physical plan during visualization. Specifically, this PR uses this method to provide details of `PhysicalRDD`s translated from a data source relation. For example, a `ParquetRelation` converted from Hive metastore table `default.psrc` is now shown as the following screenshot:  And here is the screenshot for a regular `ParquetRelation` (not converted from Hive metastore table) loaded from a really long path:  Author: Cheng Lian <[email protected]> Closes apache#10004 from liancheng/spark-12012.physical-rdd-metadata.
|
Since there's a conflict and 1.6 is in RC phase, I opened #10250 to backport this one to branch-1.6, so that it's tested on Jenkins. |
…tadata when visualizing SQL query plan This PR backports PR #10004 to branch-1.6 It adds a private[sql] method metadata to SparkPlan, which can be used to describe detail information about a physical plan during visualization. Specifically, this PR uses this method to provide details of PhysicalRDDs translated from a data source relation. Author: Cheng Lian <[email protected]> Closes #10250 from liancheng/spark-12012.for-1.6.
This PR adds a
private[sql]methodmetadatatoSparkPlan, which can be used to describe detail information about a physical plan during visualization. Specifically, this PR uses this method to provide details ofPhysicalRDDs translated from a data source relation. For example, aParquetRelationconverted from Hive metastore tabledefault.psrcis now shown as the following screenshot:And here is the screenshot for a regular
ParquetRelation(not converted from Hive metastore table) loaded from a really long path: