cctry

Follow

cctry

Follow

Cranking LLM inference, one request at a time. 🤖💪

27 followers · 3 following

@xai-org
Bay Area
04:19 (UTC -08:00)
cctry.github.io
https://scholar.google.com/citations?user=8IWg0JUAAAAJ

Achievements

Achievements

Highlights

Pro

Pinned Loading

E.T. E.T. Public

Cuda 7 4
cutlass_HIP cutlass_HIP Public

Modified CUTLASS 3 (CUTE) for HIP

Cuda
Tango Tango Public

Python 4 2
TensorDirect/KVDirect TensorDirect/KVDirect Public

Code for KVDirect: Distributed Disaggregated LLM Inference

Python 8 1