Skip to content

feat: batch posix fs connector (__for_testing_only_batch_posix_fs) #22733

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: 07-14-feat_refreshable_source_sourceexec_part
Choose a base branch
from

Conversation

xxchan
Copy link
Member

@xxchan xxchan commented Jul 28, 2025

I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.

What's changed and what's your intention?

Checklist

  • I have written necessary rustdoc comments.
  • I have added necessary unit tests and integration tests.
  • I have added test labels as necessary.
  • I have added fuzzing tests or opened an issue to track them.
  • My PR contains breaking changes.
  • My PR changes performance-critical code, so I will run (micro) benchmarks and present the results.
  • I have checked the Release Timeline and Currently Supported Versions to determine which release branches I need to cherry-pick this PR into.

Documentation

  • My PR needs documentation updates.
Release note

Copy link
Member Author

xxchan commented Jul 28, 2025

Warning

This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.
Learn more

This stack of pull requests is managed by Graphite. Learn more about stacking.

@xxchan xxchan marked this pull request as ready for review July 28, 2025 08:42
@xxchan xxchan changed the title feat: batch posix fs feat: batch posix fs connector Jul 28, 2025
@xxchan xxchan force-pushed the feat_add_batch_posix_fs_connector branch from cf7922e to 032c049 Compare July 28, 2025 09:01
@xxchan xxchan force-pushed the 07-14-feat_refreshable_source_sourceexec_part branch from f1e5606 to 82f268b Compare July 28, 2025 09:01
@xxchan xxchan force-pushed the feat_add_batch_posix_fs_connector branch from a6138ae to f74bdf5 Compare July 28, 2025 09:47
@xxchan xxchan force-pushed the 07-14-feat_refreshable_source_sourceexec_part branch from 82f268b to ce71aab Compare July 28, 2025 09:47
@xxchan xxchan force-pushed the feat_add_batch_posix_fs_connector branch from f74bdf5 to b4aa469 Compare July 28, 2025 09:48
@xxchan xxchan force-pushed the 07-14-feat_refreshable_source_sourceexec_part branch 2 times, most recently from 129a087 to c0fe9da Compare July 28, 2025 09:48
@xxchan xxchan force-pushed the feat_add_batch_posix_fs_connector branch from b4aa469 to cd08960 Compare July 28, 2025 09:48
@xxchan xxchan force-pushed the 07-14-feat_refreshable_source_sourceexec_part branch from c0fe9da to d44e849 Compare July 28, 2025 10:30
@xxchan xxchan force-pushed the feat_add_batch_posix_fs_connector branch from cd08960 to 82da810 Compare July 28, 2025 10:30
Copy link

@cursor cursor bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Bugbot free trial expires on July 29, 2025
Learn more in the Cursor dashboard.

Ok(vec![BatchPosixFsSplit::new(
self.properties.root.clone(), // file_path is the root
"114514".into(), // split_id does not matter
)])
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Bug: BatchPosixFs Connector Violates Unique Split ID Requirement

The BatchPosixFs connector hardcodes the split_id to "114514" for all instances. This violates the documented requirement for a unique split identifier, leading to conflicts and incorrect behavior when multiple BatchPosixFs sources are used, as SplitId serves as a unique key for split tracking and state management. The accompanying comment "split_id does not matter" is misleading.

Locations (1)

Fix in Cursor Fix in Web

@@ -44,6 +46,7 @@ pub const GCS_CONNECTOR: &str = "gcs";
/// If user inputs `connector='s3_v2'`, it will be rejected.
pub const OPENDAL_S3_CONNECTOR: &str = "s3_v2";
pub const POSIX_FS_CONNECTOR: &str = "posix_fs";
pub const BATCH_POSIX_FS_CONNECTOR: &str = "__for_testing_only_batch_posix_fs";
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cathack

@xxchan xxchan force-pushed the feat_add_batch_posix_fs_connector branch from 82da810 to 1739d0b Compare July 29, 2025 09:23
@xxchan xxchan force-pushed the 07-14-feat_refreshable_source_sourceexec_part branch from 92a451a to be94be6 Compare July 29, 2025 15:39
@xxchan xxchan force-pushed the feat_add_batch_posix_fs_connector branch from 1739d0b to bdba77b Compare July 29, 2025 15:39
@xxchan xxchan force-pushed the feat_add_batch_posix_fs_connector branch from bdba77b to f446d39 Compare July 30, 2025 07:32
@xxchan xxchan force-pushed the 07-14-feat_refreshable_source_sourceexec_part branch from be94be6 to 0266dcb Compare July 30, 2025 07:32
@xxchan xxchan force-pushed the feat_add_batch_posix_fs_connector branch from f446d39 to 3b205df Compare July 30, 2025 08:07
@xxchan xxchan changed the title feat: batch posix fs connector feat: batch posix fs connector (__for_testing_only_batch_posix_fs) Jul 31, 2025
@xxchan xxchan force-pushed the feat_add_batch_posix_fs_connector branch from 3b205df to e14e9c3 Compare July 31, 2025 09:39
xxchan added 2 commits July 31, 2025 17:41
Signed-off-by: xxchan <[email protected]>

add note for recovery

add note for spawn

resolve comments
@xxchan xxchan force-pushed the feat_add_batch_posix_fs_connector branch from e14e9c3 to 3e3193c Compare July 31, 2025 09:41
@xxchan xxchan force-pushed the 07-14-feat_refreshable_source_sourceexec_part branch from a0b55a0 to 26605cd Compare July 31, 2025 09:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant