😼
When softmax attention is sus

Highlights

  • Pro

Pinned

  1. Stable-Diffusion-3-From-Scratch Public

    A repo that attempts to train Stable Diffusion 3 from scratch

    Python 21 1

  2. Cottention_Transformer Public

    Code for the paper "Cottention: Linear Transformers With Cosine Attention"

    Cuda 17

  3. On-the-Expressiveness-of-Softmax-Attention-A-Recurrent-Neural-Network-Perspective Public

    Jupyter Notebook 2

  4. Diffusion_models_from_scratch Public

    Creating a diffusion model from scratch in PyTorch to learn exactly how diffusion models work.

    Python 377 30

  5. AI_Girlfriend Public

    Creating a waifu

    Jupyter Notebook 123 37

  6. Protogen_Code_Public Public

    Public code for my protogen

    C