Conversation

@rajeshbalamohan

The size of the broadcasted data in OrcRelation is significantly higher when running a query over a table with a large number of partitions (e.g. TPC-DS), and this has a noticeable impact on job runtime. The effect is more evident when there is a large number of partitions/splits. A profiler snapshot is attached to SPARK-12948 (https://issues.apache.org/jira/secure/attachment/12783513/SPARK-12948_cpuProf.png).

@JoshRosen
Contributor

The idea here is to let users share the broadcast of the conf across multiple hadoopRDD calls (e.g. when unioning many HadoopRDDs together)? If so, this issue has come up a number of times in the past and may be worth a holistic design review because I think there are some hacks in Spark SQL to address this problem there and it would be nice to have a unified solution for this.
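
For reference, a rough sketch of the pattern being described, assuming a spark-shell session where `sc` is available (the directory names and the per-directory read are hypothetical stand-ins, not the internal HadoopRDD API): one Configuration is broadcast once and every unioned RDD reads it inside its tasks, instead of each RDD carrying its own serialized copy of the conf.

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.spark.SerializableWritable

// Broadcast a single Hadoop Configuration once for the whole job.
val sharedConf =
  sc.broadcast(new SerializableWritable(new Configuration(sc.hadoopConfiguration)))

val partitionDirs = Seq("/data/part=0", "/data/part=1")  // hypothetical directories

val perDirRdds = partitionDirs.map { dir =>
  sc.parallelize(Seq(dir), 1).mapPartitions { iter =>
    // Every task reads the same broadcast value instead of shipping its own copy.
    val conf = new Configuration(sharedConf.value.value)
    iter.map(d => s"$d -> ${conf.get("fs.defaultFS")}")
  }
}
val combined = sc.union(perDirRdds)
```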

@JoshRosen
Contributor

Can you add more description to explain how this patch reduces the size of broadcasts? The change isn't obvious to me at first glance, so one or two sentences of description would help me and other reviewers who aren't as familiar with this corner of the code.

@rajeshbalamohan
Author

Use case: a user loads a partitioned dataset (e.g. a TPC-DS dataset at 200 GB scale) and runs a query in spark-shell.

E.g.
...
val o_store_sales = sqlContext.read.format("orc").load("/tmp/spark_tpcds_bin_partitioned_orc_200/store_sales")
o_store_sales.registerTempTable("o_store_sales")
..
sqlContext.sql("SELECT..").show();
...

When this is executed, OrcRelation creates a Configuration object for every partition (ref: OrcRelation.execute()). In the case of TPC-DS, it generates 1826 partitions. This information is broadcast in DAGScheduler#submitMissingTasks(); as part of this, the configurations created for the 1826 partitions are also streamed through (i.e. embedded in HadoopMapPartitionsWithSplitRDD --> f() --> wrappedConf). Each of these configurations takes around 251 KB per partition. Please refer to the profiler snapshot attached in the JIRA (mem_snap_shot). This causes quite a bit of delay in the overall job runtime.
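
To make the per-partition cost concrete, here is a small hedged illustration that can be run in spark-shell (it is not the Spark code path itself, just a way to measure what a single serialized Configuration weighs before it gets multiplied by the partition count):

```scala
import java.io.{ByteArrayOutputStream, ObjectOutputStream}
import org.apache.hadoop.conf.Configuration
import org.apache.spark.SerializableWritable

// Serialize one fully loaded Configuration and report its size in bytes.
val oneConf = new Configuration(sc.hadoopConfiguration)
val buffer = new ByteArrayOutputStream()
val out = new ObjectOutputStream(buffer)
out.writeObject(new SerializableWritable(oneConf))  // wrapper comparable to what Spark uses to ship confs
out.close()
println(s"Serialized size of a single Configuration: ${buffer.size()} bytes")
```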

The patch reuses the conf that is already broadcast by SparkContext. The fillObject() function is executed later for every partition and internally sets up any additional per-partition config details. This drastically reduces the size of the payload that is broadcast and helps reduce the overall job runtime.
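
A simplified sketch of the approach, assuming a spark-shell session (the partition paths and the config key are illustrative only; this is not the actual OrcRelation/fillObject code): a single Configuration is broadcast up front, and each task clones it and applies only the small per-partition settings.

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.spark.SerializableWritable

// One broadcast Configuration shared by all tasks.
val broadcastedConf =
  sc.broadcast(new SerializableWritable(new Configuration(sc.hadoopConfiguration)))

// `partitionPaths` is a hypothetical stand-in for the ORC partition/split list.
val partitionPaths = sc.parallelize(Seq(
  "/tmp/spark_tpcds_bin_partitioned_orc_200/store_sales/ss_sold_date_sk=2450816",
  "/tmp/spark_tpcds_bin_partitioned_orc_200/store_sales/ss_sold_date_sk=2450817"))

val perPartition = partitionPaths.mapPartitions { iter =>
  // Clone the shared conf locally and add only what this partition needs,
  // roughly the role fillObject() plays in the patch.
  val conf = new Configuration(broadcastedConf.value.value)
  iter.map { path =>
    conf.set("mapreduce.input.fileinputformat.inputdir", path)  // illustrative key only
    s"configured reader for $path"
  }
}
```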

@rajeshbalamohan
Author

@JoshRosen - Please let me know if my latest comment on the use case addresses your question. Also, regarding:

may be worth a holistic design review because I think there are some hacks in Spark SQL to address this problem there and it would be nice to have a unified solution for this

Can you please provide more details/pointers on this?

@SparkQA

SparkQA commented May 3, 2016

Test build #57670 has finished for PR 10861 at commit 4da7a22.

  • This patch fails Scala style tests.
  • This patch does not merge cleanly.
  • This patch adds no public classes.

@HyukjinKwon
Member

Hi @rajeshbalamohan, I think this should at least be in a mergeable state, and the conflicts and style issues should be resolved. Would you be able to update this?

@gatorsmile
Member

We are closing this due to inactivity. Please reopen it if you want to push it forward. Thanks!

asfgit closed this in b32bd00 on Jun 27, 2017