Inconsistency with count distinct on NaN values

### Describe the bug

I have this csv file:

```
a,b
x,NaN
x,NaN
x,NaN
```

With a simple select query, DF says there is only 1 distinct value for column b (which, I think is correct).

```
> select count(distinct b) from 'nan.csv';
+---------------------------+
| count(DISTINCT nan.csv.b) |
+---------------------------+
| 1                         |
+---------------------------+
```

However, in an aggregate query, DF says there are 3 distinct values:

```
> select a, count(distinct b) from 'nan.csv' group by 1 order by 1;
+---+---------------------------+
| a | count(DISTINCT nan.csv.b) |
+---+---------------------------+
| x | 3                         |
+---+---------------------------+
```

This behavior seems inconsistent. I would expect the aggregate query to also report that there is one distinct value (in Spark, the behavior is consistent between the two queries).




### To Reproduce

_No response_

### Expected behavior

_No response_

### Additional context

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Inconsistency with count distinct on NaN values #16254

Describe the bug

To Reproduce

Expected behavior

Additional context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Inconsistency with count distinct on NaN values #16254

Description

Describe the bug

To Reproduce

Expected behavior

Additional context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions