-
Notifications
You must be signed in to change notification settings - Fork 0
Closed
Description
This issue tracks orchestration of multiple containers. There are 2 services: one is serving the large language model (LLM) by itself while the other is the front end that provides a secure access point to it. The ideal design is just one container but the LLM requires significantly more computing resources and optimizations than standard.
Metadata
Metadata
Assignees
Labels
No labels