kushalthaman

Follow

Kushal Thaman kushalthaman

Follow

29 followers · 9 following

Stanford, CA
https://kushalthaman.github.io/

Achievements

Achievements

Organizations

kushalthaman/README.md

I'm Kushal. My research focuses on improving the systematic generalization, oversight and understanding of the training processes of large language models.

Previously, I worked on the science of neural network training and on improving the Transformer architecture at Stanford NLP, and worked on RL at Applied Compute and Prime Intellect.

Reach out if you'd like to chat about anything!

Pinned Loading

PrimeIntellect-ai/prime-rl PrimeIntellect-ai/prime-rl Public

Async RL Training at Scale

Python 738 122
deltakit deltakit Public

CLI & library for scalable data infrastructure written with Delta Lake

Rust 4
polysemanticity polysemanticity Public

Jupyter Notebook 5
stillwater stillwater Public

End-to-end pre-training data curation toolkit built on Smallpond (DuckDB, 3FS, Arrow) and Ray

Python 2
3fs-csi 3fs-csi Public

k8s CSI node plugin that exposes 3FS to pods by bind-mounting subdirectories from a single node-wide 3FS FUSE mount

Go 3
3fs3 3fs3 Public

an S3-compatible gateway for 3FS

Rust 3