[SPARK-43760] Incorrect attribute nullability after RewriteCorrelatedScalarSubquery leads to incorrect query results - ASF JIRA

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 3.4.0
Fix Version/s: 3.4.1, 3.5.0
Component/s: SQL
Labels:
- correctness

Description

The following query:

select * from (
  select t1.id c1, (
    select t2.id c from range(1, 2) t2
    where t1.id = t2.id) c2
  from range(1, 3) t1) t
where t.c2 is not null
-- !query schema
struct<c1:bigint,c2:bigint>
-- !query output
1	1
2	NULL

should return only one row, because the IsNotNull predicate is supposed to remove the second row (2, NULL). However, nullability is propagated incorrectly after subquery decorrelation: the output of the subquery is wrongly declared non-nullable, so the predicate is constant-folded to True and the filter has no effect.
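
The mechanism can be illustrated with a toy model. This is only a sketch in Python, not Spark's Catalyst code: Attribute, left_outer_join and filter_is_not_null are hypothetical names invented for the illustration. It assumes the decorrelated subquery behaves like a left outer join whose right-side column is NULL when no row matches, and that the optimizer drops an IsNotNull filter whenever the attribute is declared non-nullable.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Attribute:
    """Hypothetical stand-in for a column reference with a nullability flag."""
    name: str
    nullable: bool


def left_outer_join(left_ids, right_ids):
    """Emulate the decorrelated scalar subquery: c2 is None (NULL) when no match."""
    right = set(right_ids)
    return [(i, i if i in right else None) for i in left_ids]


def filter_is_not_null(rows, attr):
    """Toy optimizer: IsNotNull on a non-nullable attribute is folded to True."""
    if not attr.nullable:
        return list(rows)  # predicate folded away, so nothing is filtered
    return [r for r in rows if r[1] is not None]


rows = left_outer_join(range(1, 3), range(1, 2))

# Wrong nullability (the bug): the NULL row survives the filter.
buggy = filter_is_not_null(rows, Attribute("c2", nullable=False))
# Correct nullability: the NULL row is removed.
fixed = filter_is_not_null(rows, Attribute("c2", nullable=True))

print(buggy)  # [(1, 1), (2, None)]
print(fixed)  # [(1, 1)]
```

With the attribute marked non-nullable the sketch reproduces the two-row output shown in the report; marking it nullable yields the expected single row.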

People

Assignee: Andrey Gubichev

Reporter: Andrey Gubichev

Votes: 0

Watchers: 4

Dates

Created: 24/May/23 00:14

Updated: 24/Nov/23 22:56

Resolved: 31/May/23 00:29