[SPARK-18873][SQL][TEST] New test cases for scalar subquery (part 1 of 2) - scalar subquery in SELECT clause #16712

nsyca · 2017-01-26T18:39:05Z

What changes were proposed in this pull request?

This PR adds new test cases for scalar subquery in SELECT clause.

How was this patch tested?

The test result is compared with the result run from another SQL engine (in this case is IBM DB2). If the result are equivalent, we assume the result is correct.

…rrect results ## What changes were proposed in this pull request? This patch fixes the incorrect results in the rule ResolveSubquery in Catalyst's Analysis phase. ## How was this patch tested? ./dev/run-tests a new unit test on the problematic pattern.

nsyca · 2017-01-26T18:42:43Z

Attached are a slightly modified version of the submitted test file to adapt to IBM DB2 syntax, and the result of the run.
Modified version of the test file
Run result from DB2

nsyca · 2017-01-26T18:43:14Z

cc: @kevinyu98, @gatorsmile. Also FYI to @hvanhovell.

SparkQA · 2017-01-26T21:08:32Z

Test build #72035 has finished for PR 16712 at commit 48ff3c7.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

kevinyu98 · 2017-01-26T20:00:09Z

...src/test/resources/sql-tests/results/subquery/scalar-subquery/scalar-subquery-select.sql.out

+t1d	NULL
+t1e	10
+t1e	10
+t1e	10


I have compared this result set with the attached DB2's result set, they are equivalent.

thanks, @kevinyu98.

gatorsmile · 2017-01-29T05:03:44Z

...core/src/test/resources/sql-tests/inputs/subquery/scalar-subquery/scalar-subquery-select.sql

+  ("t1b", null, 16, 419L, float(17), 25D, 26E2, timestamp '2014-10-04 01:02:00.000', null),
+  ("t1b", null, 16, 19L, float(17), 25D, 26E2, timestamp '2014-11-04 01:02:00.000', null),
+  ("t3b", 8S, null, 719L, float(17), 25D, 26E2, timestamp '2014-05-04 01:02:00.000', date '2014-05-04'),
+  ("t3b", 8S, null, 19L, float(17), 25D, 26E2, timestamp '2015-05-04 01:02:00.000', date '2015-05-04')


What is the reasons we use the column names as the value of t3a, t2a and t1a? It looks confusing when reading the queries.

No particular reason. I just followed the convention used in #16337 that you reviewed and merged. Please suggest a pattern if you want to have this changed.

Yes, please change it to something like val3a. It will be easy for reviewers to review the changes if you just change the prefix.

I missed this issue in #16337

gatorsmile · 2017-01-29T05:06:42Z

...core/src/test/resources/sql-tests/inputs/subquery/scalar-subquery/scalar-subquery-select.sql

+       (SELECT max(t2h) FROM t2) max_t2h
+FROM   t1
+WHERE  t1a = 't1c'
+;


The style issue. How about following the other test cases? Do not put ; as a separate ilne?

The reason to have ; not as part of the last line is so we can add additions to the query without the need to edit the last line. I will make change to satisfy your comment.

gatorsmile · 2017-01-29T05:07:02Z

...core/src/test/resources/sql-tests/inputs/subquery/scalar-subquery/scalar-subquery-select.sql

+                                 ON     t2a = t1a
+                                 WHERE  t2c = t3c)
+                   AND    t3a = t1a)
+


Please also remove this empty line.

gatorsmile · 2017-01-29T05:12:31Z

What is the part-2 for scalar subquery test cases?

nsyca · 2017-01-30T14:39:52Z

The part-2 is for scalar subquery in predicates.

nsyca · 2017-01-30T14:42:32Z

Thank you, @gatorsmile, for your time reviewing this test PR. I will wait for your suggestion on the pattern of the literals in the first columns of the tables if you do need to have them changed. Then I will make necessary changes to the files and do another push.

gatorsmile · 2017-01-30T19:24:54Z

uh, I see. Maybe you can improve the PR title to

[SPARK-18873][SQL][TEST] New test cases for scalar subquery (part 1 of 2) - scalar subquery in SELECT clause

nsyca · 2017-01-30T21:11:52Z

These two commands should give you the delta of the changes I made to address your comments.

0db0bc3

818df9e

SparkQA · 2017-01-30T23:25:14Z

Test build #72171 has finished for PR 16712 at commit 818df9e.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

gatorsmile · 2017-01-31T02:48:19Z

LGTM. cc @hvanhovell for final sign off

hvanhovell · 2017-02-07T12:49:26Z

LGTM - merging to master. Thanks!

gatorsmile · 2017-02-08T04:37:51Z

: ) Let me merge it to master.

…f 2) - scalar subquery in SELECT clause ## What changes were proposed in this pull request? This PR adds new test cases for scalar subquery in SELECT clause. ## How was this patch tested? The test result is compared with the result run from another SQL engine (in this case is IBM DB2). If the result are equivalent, we assume the result is correct. Author: Nattavut Sutyanyong <[email protected]> Closes apache#16712 from nsyca/18873.

…ll up to Optimizer phase ## What changes were proposed in this pull request? Currently Analyzer as part of ResolveSubquery, pulls up the correlated predicates to its originating SubqueryExpression. The subquery plan is then transformed to remove the correlated predicates after they are moved up to the outer plan. In this PR, the task of pulling up correlated predicates is deferred to Optimizer. This is the initial work that will allow us to support the form of correlated subqueries that we don't support today. The design document from nsyca can be found in the following link : [DesignDoc](https://docs.google.com/document/d/1QDZ8JwU63RwGFS6KVF54Rjj9ZJyK33d49ZWbjFBaIgU/edit#) The brief description of code changes (hopefully to aid with code review) can be be found in the following link: [CodeChanges](https://docs.google.com/document/d/18mqjhL9V1An-tNta7aVE13HkALRZ5GZ24AATA-Vqqf0/edit#) ## How was this patch tested? The test case PRs were submitted earlier using. [16337](#16337) [16759](#16759) [16841](#16841) [16915](#16915) [16798](#16798) [16712](#16712) [16710](#16710) [16760](#16760) [16802](#16802) Author: Dilip Biswal <[email protected]> Closes #16954 from dilipbiswal/SPARK-18874.

nsyca added 16 commits July 29, 2016 17:43

New positive test cases

edca333

Fix unit test case failure

64184fd

blocking TABLESAMPLE

29f82b0

Fixing code styling

ac43ab4

Correcting Scala test style

631d396

One (last) attempt to correct the Scala style tests

7eb9b2d

Merge remote-tracking branch 'upstream/master'

1387cf5

Merge remote-tracking branch 'upstream/master'

3faa2d5

Merge remote-tracking branch 'upstream/master'

a308634

Merge remote-tracking branch 'upstream/master'

f1524b9

Merge remote-tracking branch 'upstream/master'

5c36dce

Merge remote-tracking branch 'upstream/master'

862b2b8

Merge remote-tracking branch 'upstream/master'

211e325

new test cases: scalar subquery in SELECT

48ff3c7

kevinyu98 reviewed Jan 27, 2017

View reviewed changes

gatorsmile reviewed Jan 29, 2017

View reviewed changes

nsyca added 2 commits January 30, 2017 15:51

address @gatorsmile's comment apache#1

0db0bc3

Remove trailing space

818df9e

nsyca changed the title ~~[SPARK-18873][SQL][TEST] New test cases for scalar subquery (part 1 of 2)~~ [SPARK-18873][SQL][TEST] New test cases for scalar subquery (part 1 of 2) - scalar subquery in SELECT clause Jan 30, 2017

asfgit closed this in 266c1e7 Feb 8, 2017

dilipbiswal mentioned this pull request Feb 16, 2017

[SPARK-18874][SQL] First phase: Deferring the correlated predicate pull up to Optimizer phase #16954

Closed

nsyca deleted the 18873 branch March 14, 2017 21:07

[SPARK-18873][SQL][TEST] New test cases for scalar subquery (part 1 of 2) - scalar subquery in SELECT clause #16712

[SPARK-18873][SQL][TEST] New test cases for scalar subquery (part 1 of 2) - scalar subquery in SELECT clause #16712

Uh oh!

Conversation

nsyca commented Jan 26, 2017

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

nsyca commented Jan 26, 2017

Uh oh!

nsyca commented Jan 26, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SparkQA commented Jan 26, 2017

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gatorsmile Jan 30, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gatorsmile commented Jan 29, 2017

Uh oh!

nsyca commented Jan 30, 2017

Uh oh!

nsyca commented Jan 30, 2017

Uh oh!

gatorsmile commented Jan 30, 2017

Uh oh!

nsyca commented Jan 30, 2017

Uh oh!

SparkQA commented Jan 30, 2017

Uh oh!

gatorsmile commented Jan 31, 2017

Uh oh!

hvanhovell commented Feb 7, 2017

Uh oh!

gatorsmile commented Feb 8, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

nsyca commented Jan 26, 2017 •

edited

Loading

gatorsmile Jan 30, 2017 •

edited

Loading