[SPARK-24215][PySpark][Follow Up] Implement eager evaluation for DataFrame APIs in PySpark #21553
Conversation
Test build #91766 has finished for PR 21553 at commit
retest this please
Test build #91778 has finished for PR 21553 at commit
docs/configuration.md
For SQLConf, we do not need to hard code the conf description here.
Got it, thanks, done in afada2b.
Can you rewrite these descriptions based on the description I posted in the original PR?
Thanks, done in afada2b.
Could you address the comments in the original PR?
Thanks, I want to take this. Maybe it should be done in another JIRA and PR, where I fix all the hard-coded configs in PySpark?
Test build #91901 has finished for PR 21553 at commit
python/pyspark/sql/conf.py
@gatorsmile I moved all the core configs used in PySpark into conf.py here. Please have a look when you have time.
#21370 (comment)
Thank you for fixing this! Let us do it in a separate PR.
Yep, done in #21648
@gatorsmile I addressed the comments in the last commit, but maybe this should be done in an independent PR and JIRA?
Test build #92344 has finished for PR 21553 at commit
docs/configuration.md
We are removing documentation?
SQL Confs are not part of the documentation.
This should be in sql-programming-guide.md, right?
Following the SQL configuration convention, all the descriptions can be shown by `spark.sql("SET -v").show(numRows = 200, truncate = false)`. See https://spark.apache.org/docs/latest/configuration.html#spark-sql
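For reference, the PySpark equivalent of that command (a minimal sketch; assumes a running SparkSession) would be:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# "SET -v" returns a DataFrame listing every SQL config key with its default
# value and description, so docs stay in sync with SQLConf instead of being
# hard-coded in Markdown.
spark.sql("SET -v").show(n=200, truncate=False)
```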
python/pyspark/sql/conf.py
I think this PySpark SQLConf stuff should be done in a separate Jira/PR.
Yea, it should be separate.
Yeah, agree, done in #21648.
python/pyspark/sql/conf.py
This duplicates the key. I think the current approach duplicates a lot of code from the Scala side.
Yep, I'm also puzzled by this, because we also do the registration on the Scala side. How about just calling buildConf on the Scala side for these keys that are used only in PySpark? Let's discuss it in #21648.
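To make the duplication concrete, here is a hypothetical sketch (not the actual PR code; the key name is real, but the helper is invented for illustration):

```python
# Hypothetical illustration of the duplication under discussion: the Python
# side re-declares a key string and default that Scala's SQLConf already
# registers, so the two definitions can silently drift apart.
REPL_EAGER_EVAL_ENABLED = "spark.sql.repl.eagerEval.enabled"

def is_eager_eval_enabled(spark):
    # spark.conf.get goes through the JVM session state; only the duplicated
    # key string and the fallback default live on the Python side.
    return spark.conf.get(REPL_EAGER_EVAL_ENABLED, "false").lower() == "true"
```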
Force-pushed from d719dfb to 00ae164.
In the last commit I reverted the changes to SQLConf and created a new PR, #21648. Could this follow-up PR be merged first? Thanks.
Test build #92373 has finished for PR 21553 at commit
retest this please
Test build #92377 has finished for PR 21553 at commit
LGTM. Thanks! Merged to master.
What changes were proposed in this pull request?
Address comments in #21370 and add more tests; a usage sketch of the feature follows.
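For context, the eager evaluation feature from the original PR is driven by SQL confs; a minimal usage sketch (conf names as introduced by SPARK-24215) might look like:

```python
from pyspark.sql import SparkSession

spark = (SparkSession.builder
         .config("spark.sql.repl.eagerEval.enabled", "true")   # turn on eager evaluation
         .config("spark.sql.repl.eagerEval.maxNumRows", "20")  # rows rendered per DataFrame
         .getOrCreate())

df = spark.createDataFrame([(1, "a"), (2, "b")], ["id", "value"])
# With eager evaluation enabled, evaluating `df` in a REPL or notebook
# renders its rows instead of the usual DataFrame[...] repr.
df
```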
How was this patch tested?
Enhanced the tests in pyspark/sql/test.py and DataFrameSuite.