martindevans
Member

Added a batch queue to BatchedExecutor, allowing an arbitrary amount of work to be queued up in the executor and run in batches. This is helpful when prompting with very large numbers of tokens (e.g. a 2000-token system prompt would automatically be queued up as four batches of at most 512 tokens each).
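To make the batching arithmetic concrete, here is a minimal Python sketch of the idea. It is not the LLamaSharp implementation: `split_into_batches`, `BatchQueue`, and the default batch size of 512 are hypothetical names and values chosen for illustration, and the example treats the 2000 in the description as a token count.

```python
from collections import deque
from typing import Deque, List, Optional


def split_into_batches(tokens: List[int], batch_size: int = 512) -> List[List[int]]:
    """Split a token list into consecutive chunks of at most batch_size tokens."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    return [tokens[i:i + batch_size] for i in range(0, len(tokens), batch_size)]


class BatchQueue:
    """Hypothetical FIFO queue of pending batches (not the LLamaSharp API)."""

    def __init__(self, batch_size: int = 512) -> None:
        self.batch_size = batch_size
        self._pending: Deque[List[int]] = deque()

    def enqueue(self, tokens: List[int]) -> None:
        # Any amount of work is accepted; it is chunked up front.
        self._pending.extend(split_into_batches(tokens, self.batch_size))

    def pending_count(self) -> int:
        return len(self._pending)

    def run_next(self) -> Optional[List[int]]:
        # A real executor would run inference on the batch here;
        # this sketch only removes it from the queue and returns it.
        return self._pending.popleft() if self._pending else None
```

With these assumptions, enqueuing 2000 tokens produces four pending batches of 512, 512, 512, and 464 tokens, which are then run one at a time in the order they were queued.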

This is a prerequisite for embeddings prompting (e.g. llava), which needs an entirely new type of item in the batch queue. Embeddings usually also come in large numbers (e.g. one llava image might be hundreds of items), so the ability to queue up lots of work is valuable!
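As a rough illustration of why a second item type matters, the Python sketch below plans batches for a queue that holds both token items and embedding items. Everything in it is an assumption made for the example: `TokenItem`, `EmbeddingItem`, and `plan_batches` are hypothetical names, the two kinds are never mixed within one batch, and 576 embedding vectors is only an illustrative count. It does not describe how LLamaSharp implements this.

```python
from dataclasses import dataclass
from typing import List, Tuple, Union


@dataclass
class TokenItem:
    """Queued work made of token ids (hypothetical type)."""
    tokens: List[int]


@dataclass
class EmbeddingItem:
    """Queued work made of embedding vectors (hypothetical type)."""
    vectors: List[List[float]]


QueueItem = Union[TokenItem, EmbeddingItem]


def plan_batches(items: List[QueueItem], batch_size: int = 512) -> List[Tuple[str, int]]:
    """Return (kind, size) for each batch, processing items in queue order."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    plan: List[Tuple[str, int]] = []
    for item in items:
        if isinstance(item, TokenItem):
            kind, count = "tokens", len(item.tokens)
        elif isinstance(item, EmbeddingItem):
            kind, count = "embeddings", len(item.vectors)
        else:
            raise TypeError(f"unsupported queue item: {type(item).__name__}")
        # Assumption: each item is chunked on its own, so a batch
        # holds either tokens or embeddings, never both.
        for start in range(0, count, batch_size):
            plan.append((kind, min(batch_size, count - start)))
    return plan
```

Under these assumptions, a 600-token prompt followed by 576 embedding vectors would be planned as two token batches (512 and 88) and then two embedding batches (512 and 64).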

@martindevans martindevans merged commit be2c4fe into SciSharp:master May 29, 2024
@martindevans martindevans deleted the batched_executor_queueing branch May 29, 2024 22:11