You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The llava_shared.dll in LLamaSharp.Backend.Cuda12 is only 850KB, the file size is much smaller than llava_shared.dll with cuda. It will take about 126000+ ms to embding an image. Take it to llava_shared.dll from llama.cpp release 2214, image embding time will be no more than 1000 ms. Will this can be updated in next LLamaSharp release?