
Conversation

@abelcha

@abelcha abelcha commented Jun 19, 2025

Summary

This provides a high-level abstraction for result streaming that matches JavaScript language idioms, alongside the existing chunk-based APIs.
It lets you iterate over query results with for await loops.

Usage Example

const result = await connection.run('SELECT * FROM large_table');

for await (const row of result) {
  console.log(row);
}

Features Added

  • Async Iterator Implementation: Added [Symbol.asyncIterator]() method to DuckDBResult class

Technical Details

  • The async iterator fetches chunks progressively, reducing memory usage for large result sets (see the sketch below)
  • Maintains compatibility with existing DuckDBResult API
  • Properly handles edge cases like empty results and null values
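
Roughly, the strategy looks like the following sketch (written as a free function for illustration, not the exact method this PR adds; it relies only on fetchChunk() and getRows(), which the result and chunk classes already expose):

import { DuckDBResult, DuckDBValue } from '@duckdb/node-api';

// Illustrative sketch: yield rows one chunk at a time, so at most one
// chunk's worth of values is materialized in memory at any point.
async function* iterateRows(result: DuckDBResult): AsyncIterable<DuckDBValue[]> {
  while (true) {
    const chunk = await result.fetchChunk();
    if (!chunk || chunk.rowCount === 0) break; // result exhausted
    for (const row of chunk.getRows()) {
      yield row;
    }
  }
}

The [Symbol.asyncIterator]() method applies the same loop to the result itself, which is what makes the for await example above work.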

Testing

The tests verify:

  • Correct iteration behavior
  • Memory-efficient chunk fetching
  • Proper handling of edge cases
  • Early termination scenarios (see the test sketch below)
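
For illustration, the early-termination case could look like this sketch (assuming the usual DuckDBInstance.create()/connect() setup from @duckdb/node-api; this is not the actual test code in this PR):

import { strictEqual } from 'node:assert';
import { DuckDBInstance } from '@duckdb/node-api';

// Sketch: breaking out of the loop early should just stop consuming the
// result, with no error and no further chunk fetches.
async function testEarlyTermination() {
  const instance = await DuckDBInstance.create(':memory:');
  const connection = await instance.connect();
  const result = await connection.run('SELECT * FROM range(1000000)');

  let count = 0;
  for await (const _row of result) {
    count++;
    if (count === 10) break; // remaining chunks are never fetched
  }
  strictEqual(count, 10);
}

testEarlyTermination().then(() => console.log('early termination ok'));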

@abelcha abelcha changed the title from "Add async iterator" to "Add async iterator on result" Jun 19, 2025
@jraymakers
Contributor

Thanks for the PR! This is a very cool idea.

To make it even better, and to fit in with the rest of the API, it should allow iterating over either row arrays or row objects, and support the raw or converted (to JS, JSON, or custom) variants. To make that maintainable, we'd likely need an async chunk iterator as a building block.

If you'd like to give that a shot, go ahead, or I can try to outline the API I have in mind when I get some time.
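
For concreteness, one shape that building block could take (just a sketch; chunksOf and rowObjectsOf are placeholder names, and it only uses fetchChunk(), getRowObjects(), and deduplicatedColumnNames(), which appear elsewhere in this thread):

import { DuckDBDataChunk, DuckDBResult, DuckDBValue } from '@duckdb/node-api';

// The building block: an async iterator over chunks.
async function* chunksOf(result: DuckDBResult): AsyncIterable<DuckDBDataChunk> {
  while (true) {
    const chunk = await result.fetchChunk();
    if (!chunk || chunk.rowCount === 0) break;
    yield chunk;
  }
}

// One variant layered on top: row objects keyed by deduplicated column name.
// Row arrays, JS/JSON-converted values, etc. would follow the same pattern
// with a different per-chunk call.
async function* rowObjectsOf(result: DuckDBResult): AsyncIterable<Record<string, DuckDBValue>> {
  const names = result.deduplicatedColumnNames();
  for await (const chunk of chunksOf(result)) {
    for (const row of chunk.getRowObjects(names)) {
      yield row;
    }
  }
}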

@abelcha
Author

abelcha commented Jun 27, 2025

I tried wiring up support for all the variants, but it adds a lot of stuff to the codebase, and I feel like that's the kind of call that's yours to make. This is just a minimal version that could serve as a base.

This binding is already a blessing compared to the first one; I'd rather not mess it up.

Performance-wise, I was surprised how much per-row object creation adds up. With a template object plus Object.create for each row I got roughly a 10% improvement, though it's hard to benchmark. At this level it's best to let the consumer choose whether to eat that cost or not.
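
Roughly, the idea is along these lines (one possible reading of "template object + Object.create", with made-up names; the actual gain will depend on the engine and the data):

import { DuckDBResult, DuckDBValue } from '@duckdb/node-api';

// Sketch: build one template holding the column keys, use it as the
// prototype of every row object, and only assign the per-row values.
async function* rowObjectsFast(result: DuckDBResult): AsyncIterable<Record<string, DuckDBValue>> {
  const names = result.deduplicatedColumnNames();
  const template: Record<string, DuckDBValue> = {};
  for (const name of names) template[name] = null;

  while (true) {
    const chunk = await result.fetchChunk();
    if (!chunk || chunk.rowCount === 0) break;
    for (const row of chunk.getRows()) {
      const obj: Record<string, DuckDBValue> = Object.create(template);
      for (let i = 0; i < names.length; i++) {
        obj[names[i]] = row[i];
      }
      yield obj;
    }
  }
}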

I’m working on a more experimental, fully typed high-level DuckDB TypeScript runtime, and this is the UX I’ve landed on based on the select return value:

(screenshot: the typed return value of a select call)

  • I'm mapping BigInt to Number, which simplifies things a lot

@jraymakers
Contributor

Yes, the reason for the variants is to provide a choice between convenience and performance. Generally the column-oriented ones are going to perform better than the row-oriented ones, and raw arrays will perform better than objects, but for small results it doesn't matter, and rows and objects can be convenient at times.

Supporting all the variants without a lot of code duplication that's hard to maintain took some iteration. I think it could be done while also supporting async iterators, but it will take some experimentation, which I haven't had time for yet. (I still hope to, though probably not very soon.)

That library/runtime you're building looks interesting. How are you ensuring the results are correctly typed? I'd like to provide better typing for results, but I haven't discovered a good way yet. (See #140.)

@abelcha
Author

abelcha commented Jul 15, 2025

I follow a similar approach to convex.dev, where intermediate schemas are written to a local .buckdb/ directory.

Either on first execution it inspects .columnTypes() dynamically, or — if you’re in a live environment — it can describe the schema ahead of time (e.g. https://buckdb.pages.dev).

It also codegens phantom types from duckdb_functions() and duckdb_types() to produce full method signatures and static type info for function calls.

Then it uses TS generics to handle joins, CTEs, name aliases, etc. to infer the return value:
src/build.types.ts
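
As a much-simplified illustration of the first step, a schema snapshot can be derived from a zero-row probe query; everything here (the function name, the .buckdb/ file layout, String() as the type rendering) is a sketch, not the actual build.types.ts:

import { mkdirSync, writeFileSync } from 'node:fs';
import { DuckDBConnection } from '@duckdb/node-api';

// Hypothetical sketch: probe a table with a zero-row query, then write a
// column-name -> column-type snapshot that codegen can turn into TS types.
async function snapshotSchema(connection: DuckDBConnection, table: string) {
  // LIMIT 0 still yields column metadata without materializing any rows.
  const result = await connection.run(`SELECT * FROM ${table} LIMIT 0`);
  const names = result.deduplicatedColumnNames();
  const types = result.columnTypes();

  const schema = Object.fromEntries(
    names.map((name, i) => [name, String(types[i])]) // e.g. { "id": "INTEGER", ... }
  );
  mkdirSync('.buckdb', { recursive: true });
  writeFileSync(`.buckdb/${table}.json`, JSON.stringify(schema, null, 2));
}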

@missinglink

missinglink commented Aug 18, 2025

FWIW these simple iterators work well for me:

import { DuckDBResult, DuckDBValue } from '@duckdb/node-api'

class Foo {
  // Yields each row (as an array of values), fetching one chunk at a time.
  static async *iterate (res: DuckDBResult): AsyncIterable<DuckDBValue[]> {
    while (true) {
      const chunk = await res.fetchChunk()
      if (!chunk?.rowCount) break
      for (const row of chunk.getRows()) {
        yield row
      }
    }
  }

  // Yields each row as an object keyed by (deduplicated) column name.
  static async *iterateObjects<T extends Record<string, DuckDBValue>> (res: DuckDBResult): AsyncIterable<T> {
    const columnNames = res.deduplicatedColumnNames()
    while (true) {
      const chunk = await res.fetchChunk()
      if (!chunk?.rowCount) break
      for (const row of chunk.getRowObjects(columnNames)) {
        yield row as T
      }
    }
  }
}

Being able to specify a type for the return values is nice:

const result = await connection.run(`
  SELECT id, name FROM example
`)
const rows = Foo.iterateObjects<{id: number, name: string}>(result)

for await (const row of rows) {
  console.error(row)
}

@jraymakers
Contributor

We integrated the core idea of this PR (the async iterator on DuckDBResult) into this other one: #303

Thanks for the contribution!

@jraymakers jraymakers closed this Sep 28, 2025