😼
When softmax attention is sus
Researcher at Etched.
I attempt to force machines to not be dumb.
Pinned
- Stable-Diffusion-3-From-Scratch: A repo that attempts to train Stable Diffusion 3 from scratch.
- Cottention_Transformer: Code for the paper "Cottention: Linear Transformers With Cosine Attention" (CUDA; see the sketch after this list).
- On-the-Expressiveness-of-Softmax-Attention-A-Recurrent-Neural-Network-Perspective (Jupyter Notebook).
- Diffusion_models_from_scratch: Creating a diffusion model from scratch in PyTorch to learn exactly how they work.
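
The Cottention paper's title points at the core idea: replacing softmax with cosine similarity lets attention be computed in linear time. Here is a minimal PyTorch sketch of that idea, assuming "cosine attention" means L2-normalizing queries and keys and then exploiting associativity to compute Q(KᵀV) instead of (QKᵀ)V; the function name and exact normalization are illustrative assumptions, not the repo's actual API.

```python
import torch
import torch.nn.functional as F

def cosine_linear_attention(q, k, v):
    # L2-normalize queries and keys so their dot products are cosine
    # similarities, standing in for the softmax (assumption from the title).
    q = F.normalize(q, dim=-1)
    k = F.normalize(k, dim=-1)
    # Without softmax, attention is associative: compute K^T V first,
    # a (dim x dim) matrix, so cost is O(n * d^2) instead of O(n^2 * d).
    kv = torch.einsum("bnd,bne->bde", k, v)     # (batch, dim, dim)
    return torch.einsum("bnd,bde->bne", q, kv)  # (batch, seq, dim)

# Quick shape check on random tensors.
q = torch.randn(2, 128, 64)
k = torch.randn(2, 128, 64)
v = torch.randn(2, 128, 64)
print(cosine_linear_attention(q, k, v).shape)  # torch.Size([2, 128, 64])
```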