CNDB-11666: Batch clusterings into single SAI partition post-filtering reads #1884
Conversation
Checklist before you submit for review
- I have confirmed that this patch solves the issue; we will try this docker image with Shadow Proxy
…g reads

Port of CASSANDRA-19497.
Co-authored-by: Caleb Rackliffe <[email protected]>
Co-authored-by: Michael Marshall <[email protected]>
Co-authored-by: Andrés de la Peña <[email protected]>

Force-pushed from f61de5b to 5e29022
LGTM
Code is the same as #1883 (+ Version#after)
❌ Build ds-cassandra-pr-gate/PR-1884 rejected by Butler
2 new test failure(s) in 1 build
Found 2 new test failures
No known test failures found
…g reads (#1884)

Port of CASSANDRA-19497.
Co-authored-by: Caleb Rackliffe <[email protected]>
Co-authored-by: Michael Marshall <[email protected]>
Co-authored-by: Andrés de la Peña <[email protected]>
…sult set (#2024) (cherry picked from commit ada025c)

Copy of #2023, but targeting `main`.

### What is the issue

riptano/cndb#15485

### What does this PR fix and why was it fixed

This PR fixes a bug introduced to this branch via #1884. The bug only impacts SAI file format `aa` when the index file was produced via compaction, which is why the modified test simply adds coverage to compact the table and hit the bug.

The bug happens when an iterator produces the same partition across two different batch fetches from storage. These keys were not collapsed by the `key.equals(lastKey)` logic because compacted indexes use a row id per row instead of per partition, and the logic in `PrimaryKeyWithSource` considers rows with different row ids to be distinct. However, when we went to materialize a batch from storage, we hit this code:

```java
ClusteringIndexFilter clusteringIndexFilter = command.clusteringIndexFilter(firstKey.partitionKey());
if (cfs.metadata().comparator.size() == 0 || firstKey.hasEmptyClustering())
{
    return clusteringIndexFilter;
}
else
{
    nextClusterings.clear();
    for (PrimaryKey key : keys)
        nextClusterings.add(key.clustering());
    return new ClusteringIndexNamesFilter(nextClusterings, clusteringIndexFilter.isReversed());
}
```

which returned `clusteringIndexFilter` for `aa` because those indexes do not have the clustering information. Therefore, each batch fetched the whole partition (which was subsequently filtered to the proper results), producing a multiplier effect where we saw `batch`-many duplicates.

This fix works by comparing partition keys and clustering keys directly, which is a return to the old comparison logic from before #1884. There was actually a discussion about this in the PR to `main`, but unfortunately we missed this case: #1883 (comment). A more proper long-term fix might be to remove the logic of creating a `PrimaryKeyWithSource` for `aa` indexes. However, I preferred this approach because it is essentially a `revert` rather than a fix-forward solution.
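The deduplication behaviour described above can be sketched in isolation. This is a hypothetical, simplified model, not the actual Cassandra API: `Key`, `sameByRowId`, `sameRow`, and `dedup` are illustrative names. It shows why row-id-based equality fails to collapse the same logical row seen with two different row ids (as happens with compacted `aa` indexes across batch fetches), while comparing partition and clustering keys directly collapses it.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.BiPredicate;

public class DedupSketch {
    // A primary key modeled as (partition, clustering, rowId). In compacted
    // indexes the row id is per sstable row, so the same logical row can
    // appear with different row ids across sources/batches.
    record Key(String partition, String clustering, long rowId) {}

    // Buggy comparison: row-id equality treats the same logical row seen
    // with two different row ids as two distinct keys.
    static boolean sameByRowId(Key a, Key b) {
        return a.rowId() == b.rowId();
    }

    // Fixed comparison: compare partition and clustering keys directly,
    // ignoring the per-source row id.
    static boolean sameRow(Key a, Key b) {
        return a.partition().equals(b.partition())
            && a.clustering().equals(b.clustering());
    }

    // Collapse adjacent duplicates, mirroring the key.equals(lastKey) check.
    static List<Key> dedup(List<Key> keys, BiPredicate<Key, Key> same) {
        List<Key> out = new ArrayList<>();
        Key last = null;
        for (Key k : keys) {
            if (last == null || !same.test(last, k))
                out.add(k);
            last = k;
        }
        return out;
    }

    public static void main(String[] args) {
        // The same logical row (p1, c1) produced twice with different row
        // ids, e.g. across two batch fetches from storage.
        List<Key> keys = List.of(new Key("p1", "c1", 10),
                                 new Key("p1", "c1", 42));
        System.out.println(dedup(keys, DedupSketch::sameByRowId).size()); // 2: duplicate survives
        System.out.println(dedup(keys, DedupSketch::sameRow).size());     // 1: duplicate collapsed
    }
}
```

With row-id equality the duplicate survives and is later multiplied by the whole-partition fetch; with direct key comparison it is collapsed before materialization.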
…sult set (#2023)
…g reads (cherry picked from commit ae4f187)
What is the issue
...
What does this PR fix and why was it fixed
...