Skip to content
View ChenMnZ's full-sized avatar

Block or report ChenMnZ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. OpenGVLab/OmniQuant OpenGVLab/OmniQuant Public

    [ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

    Python 874 72

  2. OpenGVLab/EfficientQAT OpenGVLab/EfficientQAT Public

    [ACL 2025 Main] EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

    Python 312 22

  3. PrefixQuant PrefixQuant Public

    An algorithm for weight-activation quantization (W4A4, W4A8) of LLMs, supporting both static and dynamic quantization

    Python 164 14

  4. CF-ViT CF-ViT Public

    (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"

    Python 106 8

  5. OpenGVLab/DiffRate OpenGVLab/DiffRate Public

    [ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging techniques, while incorporating a differentiable compression rate.

    Jupyter Notebook 101 7

  6. INT_vs_FP INT_vs_FP Public

    A framework to compare low-bit integer and float-point formats

    Python 42 3