USTC & MSRA joint PhD
- University of Science and Technology of China
- https://komorebi660.github.io/
Pinned
- microsoft/RetrievalAttention (Public): Scalable long-context LLM decoding that leverages sparsity by treating the KV cache as a vector storage system.
- Multi-Person-ChatRoom (Public): A C++ multi-person chat room for Linux.