martindevans
Member

Normalize embeddings in LLamaEmbedder, as is done in llama.cpp; see: https://github.com/ggerganov/llama.cpp/blob/2891c8aa9af17f4ff636ff3868bc34ff72b56e25/examples/embedding/embedding.cpp#L92

This should improve the quality of all embeddings generated from more than one token.
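
For readers unfamiliar with the operation, here is a minimal sketch of what normalizing an embedding usually means, assuming the normalization in question is the standard L2 (Euclidean) scaling to unit length. It is written in Python for illustration only; `l2_normalize` is a hypothetical helper, not the LLamaSharp or llama.cpp code, and the zero-vector handling is an assumption.

```python
import math


def l2_normalize(embedding):
    """Return a copy of `embedding` scaled to unit Euclidean length.

    Hypothetical helper for illustration; not part of LLamaSharp or llama.cpp.
    """
    # Euclidean (L2) norm: square root of the sum of squared components.
    norm = math.sqrt(sum(x * x for x in embedding))
    # Assumption: leave an all-zero vector unchanged to avoid dividing by zero.
    if norm == 0.0:
        return list(embedding)
    return [x / norm for x in embedding]


vec = [3.0, 4.0]           # length 5 before normalization
unit = l2_normalize(vec)
print(unit)                # [0.6, 0.8]
```

After normalization every embedding has length 1, so the dot product of two embeddings equals their cosine similarity, which makes comparisons independent of vector magnitude.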

@martindevans martindevans merged commit 968e1e4 into SciSharp:master Feb 13, 2024
@martindevans martindevans deleted the normalize_embeddings branch February 13, 2024 14:08