SKIPME: Add logic to get the size estimation from Hadoop filesystem. #115

yeweizhang · 2015-10-30T22:00:14Z

When Spark estimates table sizes for joins, it calls the Hive metastore to get the size in bytes. This PR adds a callback so that we query the Hadoop filesystem directly instead.
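
A rough sketch of the callback idea, not the code in this PR: the function names below are hypothetical, and a walk over a local directory stands in for the Hadoop filesystem query (in Spark itself this would presumably go through Hadoop's `FileSystem` API). Falling back to the metastore value when the callback is missing or fails is also an assumption.

```python
import os
from typing import Callable, Optional


def directory_size_in_bytes(path: str) -> int:
    """Sum the sizes of all files under `path`.

    Local stand-in (assumption) for asking the Hadoop filesystem how many
    bytes a table's directory occupies.
    """
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            total += os.path.getsize(os.path.join(root, name))
    return total


def estimate_size_in_bytes(
    table_path: str,
    metastore_size: int,
    fs_size_callback: Optional[Callable[[str], int]] = None,
) -> int:
    """Return the size estimate used when planning a join.

    Hypothetical helper: prefer the filesystem callback when one is
    registered, and fall back to the metastore value otherwise.
    """
    if fs_size_callback is not None:
        try:
            return fs_size_callback(table_path)
        except OSError:
            # Filesystem unreachable: keep the metastore estimate (assumed behavior).
            pass
    return metastore_size
```

With the callback registered, a stale or missing size in the metastore no longer decides the join strategy; the planner sees what is actually on disk.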

It also includes an initial attempt to use the shuffle partition size from the Spark context local properties. If we can get adaptive execution to work, then this work becomes redundant.

@markhamstra @mbautin


Yewei Zhang added 3 commits October 30, 2015 14:58

Add logic to get the size estimation from Hadoop filesystem.

3a0f493

Add logic to read partition number from spark context local property.

3a4a2da

Add debug message.

c82b2a2

yeweizhang assigned markhamstra Nov 4, 2015

markhamstra added a commit that referenced this pull request Nov 4, 2015

Merge pull request #115 from yeweizhang/spy-787

9f90518

SKIPME: Add logic to get the size estimation from Hadoop filesystem.

markhamstra merged commit 9f90518 into alteryx:csd-1.5 Nov 4, 2015

markhamstra pushed a commit to markhamstra/spark that referenced this pull request Nov 7, 2017

Add -DskipTests to dev docs (alteryx#115)

be4330f

* Add -DskipTests to dev docs
* Remove extraneous skipTests
