Skip to content

Conversation

@sameeragarwal
Copy link
Member

What changes were proposed in this pull request?

This patch adds support for reading DecimalTypes with high (> 18) precision in VectorizedColumnReader

How was this patch tested?

  1. VectorizedColumnReader initially had a gating condition on primitiveType.getDecimalMetadata().getPrecision() > Decimal.MAX_LONG_DIGITS() that made us fall back on parquet-mr for handling high-precision decimals. This condition is now removed.
  2. In particular, the ParquetHadoopFsRelationSuite (that tests for all supported hive types -- including DecimalType(25, 5)) fails when the gating condition is removed ([WIP][SPARK-13994][SQL] Investigate types not supported by vectorized parquet reader #11808) and should now pass with this change.

@sameeragarwal
Copy link
Member Author

cc @nongli

@nongli
Copy link
Contributor

nongli commented Mar 21, 2016

LGTM

@SparkQA
Copy link

SparkQA commented Mar 22, 2016

Test build #53725 has finished for PR 11869 at commit cbccee7.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@yhuai
Copy link
Contributor

yhuai commented Mar 22, 2016

Thanks! Merging to master.

@asfgit asfgit closed this in 7299961 Mar 22, 2016
roygao94 pushed a commit to roygao94/spark that referenced this pull request Mar 22, 2016
…uet reader

## What changes were proposed in this pull request?

This patch adds support for reading `DecimalTypes` with high (> 18) precision in `VectorizedColumnReader`

## How was this patch tested?

1. `VectorizedColumnReader` initially had a gating condition on `primitiveType.getDecimalMetadata().getPrecision() > Decimal.MAX_LONG_DIGITS()` that made us fall back on parquet-mr for handling high-precision decimals. This condition is now removed.
2. In particular, the `ParquetHadoopFsRelationSuite` (that tests for all supported hive types -- including `DecimalType(25, 5)`) fails when the gating condition is removed (apache#11808) and should now pass with this change.

Author: Sameer Agarwal <[email protected]>

Closes apache#11869 from sameeragarwal/bigdecimal-parquet.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants