-
-
Notifications
You must be signed in to change notification settings - Fork 11k
Closed
Labels
staleOver 90 days of inactivityOver 90 days of inactivity
Description
Hi,
I was reading through the documentation for Using Lora in VLLM.
In the documentation when they start the server, it looks like they have to specify which Lora modules are available
--lora-modules sql-lora=~/.cache/huggingface/hub/models--yard1--llama-2-7b-sql-lora-test/
Is it possible to do this in real-time instead? That is, start the server and call a recently added Lora module without having to stop and restart the server?
xyslion, darfi-gdp, hllj, AnatoliiPotapov, fsatka and 1 more
Metadata
Metadata
Assignees
Labels
staleOver 90 days of inactivityOver 90 days of inactivity