[SPARK-19509][SQL]Fix a NPE problem in grouping sets when using an empty column #16874

stanzhai · 2017-02-09T13:53:14Z

What changes were proposed in this pull request?

If a column of a table is all null values, the follow SQL will throw an NPE: select count(1) from test group by e grouping sets(e).

The reason is that when transformUp a GroupingSets in ResolveGroupingAnalytics it uses a nullBitmask to set an attribute with null ability, the nullable attribute may be modified.

This pr just set all attribute's null ability to true in group by expressions to fix the problem.

The pr #15484 in master branch has fixed this problem.

We also need to fix this problem in branch-2.1.

How was this patch tested?

Test with Hive in my environment.

AmplabJenkins · 2017-02-09T13:57:19Z

Can one of the admins verify this patch?

…umns ## What changes were proposed in this pull request? The analyzer currently does not check if a column used in grouping sets is actually nullable itself. This can cause the nullability of the column to be incorrect, which can cause null pointer exceptions down the line. This PR fixes that by also consider the nullability of the column. This is only a problem for Spark 2.1 and below. The latest master uses a different approach. Closes #16874 ## How was this patch tested? Added a regression test to `SQLQueryTestSuite.grouping_set`. Author: Herman van Hovell <[email protected]> Closes #16873 from hvanhovell/SPARK-19509.

…umns ## What changes were proposed in this pull request? The analyzer currently does not check if a column used in grouping sets is actually nullable itself. This can cause the nullability of the column to be incorrect, which can cause null pointer exceptions down the line. This PR fixes that by also consider the nullability of the column. This is only a problem for Spark 2.1 and below. The latest master uses a different approach. Closes #16874 ## How was this patch tested? Added a regression test to `SQLQueryTestSuite.grouping_set`. Author: Herman van Hovell <[email protected]> Closes #16873 from hvanhovell/SPARK-19509. (cherry picked from commit a3d5300) Signed-off-by: Herman van Hovell <[email protected]>

hvanhovell · 2017-02-09T20:02:57Z

@stanzhai I have merged my PR, and assigned the PR to your name. Could you close this?

fix a NPE issue of grouping sets

3690cb2

stanzhai changed the title ~~[SPARK-19509][SQL][branch-2.1]Fix a NPE problem in grouping sets when using an empty column~~ [SPARK-19509][SQL]Fix a NPE problem in grouping sets when using an empty column Feb 9, 2017

hvanhovell mentioned this pull request Feb 9, 2017

[SPARK-19509][SQL] Grouping Sets do not respect nullable grouping columns #16873

Closed

stanzhai closed this Feb 10, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-19509][SQL]Fix a NPE problem in grouping sets when using an empty column #16874

[SPARK-19509][SQL]Fix a NPE problem in grouping sets when using an empty column #16874

Uh oh!

stanzhai commented Feb 9, 2017 •

edited

Loading

Uh oh!

AmplabJenkins commented Feb 9, 2017

Uh oh!

hvanhovell commented Feb 9, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-19509][SQL]Fix a NPE problem in grouping sets when using an empty column #16874

[SPARK-19509][SQL]Fix a NPE problem in grouping sets when using an empty column #16874

Uh oh!

Conversation

stanzhai commented Feb 9, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

AmplabJenkins commented Feb 9, 2017

Uh oh!

hvanhovell commented Feb 9, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

stanzhai commented Feb 9, 2017 •

edited

Loading