@alamb (Contributor) commented Aug 2, 2023

Which issue does this PR close?

Part of #5885

Rationale for this change

While working on #5885 I studied this code more carefully and wanted to document how it works. Writing the documentation is part of how I convince myself that I understand the code.

What changes are included in this PR?

Doc strings and ASCII diagrams

Are these changes tested?

N/A

Are there any user-facing changes?

No

@alamb (Contributor, Author) commented Aug 2, 2023

@yjshen or @Dandandan I don't know if you might have time to review this PR, but if so I would appreciate it

@yjshen (Member) left a comment

LGTM. Thanks @alamb!

/// [`Self::insert_batch`] as a stream of [`RecordBatch`]es.
///
/// This may be an in memory sort/merge if all input fit into memory, or
/// a streaming merge from spill files on disk.
@yjshen (Member) commented:

This process could either be an in-memory sort/merge if all the input fits into memory or a combined streaming merge incorporating both in-memory batches and data from spill files on disk.

@alamb (Contributor, Author) replied:

Thanks @yjshen -- I updated the wording in 8d7d08f
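
For readers following the wording discussion above, here is a minimal sketch of the two paths being described: a purely in-memory sort/merge when everything fits, and a merge that combines spilled runs with the batches still in memory. This is not DataFusion's `ExternalSorter`. `ToySorter`, `budget`, `spill_count`, and `merge_runs` are hypothetical names, batches are plain `Vec<i32>` rather than `RecordBatch`es, and "spill files" are simulated with in-memory vectors instead of files on disk.

```rust
use std::cmp::Reverse;
use std::collections::BinaryHeap;

/// Toy stand-in for an external sorter (NOT DataFusion's implementation).
/// A "batch" is a Vec<i32>; "spill files" are simulated by Vecs.
pub struct ToySorter {
    /// Hypothetical memory budget: max number of buffered values.
    budget: usize,
    /// Individually sorted batches still held in memory.
    in_mem: Vec<Vec<i32>>,
    /// Sorted runs that were "spilled"; a real sorter would write files.
    spills: Vec<Vec<i32>>,
}

impl ToySorter {
    pub fn new(budget: usize) -> Self {
        Self { budget, in_mem: Vec::new(), spills: Vec::new() }
    }

    /// Buffer one batch. If adding it would exceed the budget, first merge
    /// the buffered batches into one sorted run and "spill" that run.
    pub fn insert_batch(&mut self, mut batch: Vec<i32>) {
        let buffered: usize = self.in_mem.iter().map(Vec::len).sum();
        if !self.in_mem.is_empty() && buffered + batch.len() > self.budget {
            let run = merge_runs(std::mem::take(&mut self.in_mem));
            self.spills.push(run);
        }
        batch.sort_unstable();
        self.in_mem.push(batch);
    }

    /// Number of simulated spill runs created so far.
    pub fn spill_count(&self) -> usize {
        self.spills.len()
    }

    /// Return every inserted value in sorted order. With no spills this is
    /// an in-memory merge; otherwise spilled runs and in-memory batches are
    /// merged together in a single pass.
    pub fn sort(self) -> Vec<i32> {
        let mut runs = self.spills;
        runs.extend(self.in_mem);
        merge_runs(runs)
    }
}

/// K-way merge of sorted runs using a min-heap of (value, run index, offset).
fn merge_runs(runs: Vec<Vec<i32>>) -> Vec<i32> {
    let mut heap = BinaryHeap::new();
    for (i, run) in runs.iter().enumerate() {
        if let Some(&v) = run.first() {
            heap.push(Reverse((v, i, 0usize)));
        }
    }
    let mut out = Vec::with_capacity(runs.iter().map(Vec::len).sum());
    while let Some(Reverse((v, i, pos))) = heap.pop() {
        out.push(v);
        // Advance within the run the smallest value came from.
        if let Some(&next) = runs[i].get(pos + 1) {
            heap.push(Reverse((next, i, pos + 1)));
        }
    }
    out
}
```

In this sketch, the final merge treats spilled runs and in-memory batches uniformly, which is the "combined streaming merge" case from the suggested wording. A real implementation would stream results batch by batch rather than materialize one output vector.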

@yjshen yjshen merged commit fcddabc into apache:main Aug 3, 2023
Labels

core Core DataFusion crate
