Commit e9ebc51

mudler authored and github-actions[bot] committed
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
1 parent c332ef5 commit e9ebc51

1 file changed: +22 -0 lines changed

gallery/index.yaml

Lines changed: 22 additions & 0 deletions
@@ -23363,3 +23363,25 @@
     - filename: Qwen3-Grand-Horror-Light-1.7B.Q4_K_M.gguf
       sha256: cbbb0c5f6874130a8ae253377fdc7ad25fa2c1e9bb45f1aaad88db853ef985dc
       uri: huggingface://mradermacher/Qwen3-Grand-Horror-Light-1.7B-GGUF/Qwen3-Grand-Horror-Light-1.7B.Q4_K_M.gguf
+- !!merge <<: *qwen25
+  name: "rent-qwen-7b-i1"
+  urls:
+    - https://huggingface.co/mradermacher/RENT-Qwen-7B-i1-GGUF
+  description: |
+    **Model Name:** RENT-Qwen-7B
+    **Base Model:** Qwen2.5-7B-Instruct
+    **Training Approach:** Unsupervised Reinforcement Learning via Entropy Minimization (RENT) — no external rewards or labeled data required
+    **Dataset:** AIME 2024 (math problem-solving benchmark)
+    **Key Achievement:** Outperforms the base model on AIME, achieving a mean score of **0.232 ± 0.003** (vs. 0.110 ± 0.004 for base), demonstrating strong reasoning capabilities without supervision.
+    **Use Case:** Ideal for reasoning tasks, especially where labeled data is scarce. Best suited for evaluation on math and logical reasoning benchmarks.
+    **Note:** This is a fine-tuned variant of Qwen2.5-7B-Instruct, trained using an innovative unsupervised RL method. The model is not quantized — the original full-precision weights are available in the base repository.
+
+    👉 **Base Model:** [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)
+    👉 **Training Paper & Code:** [RENT GitHub Repo](https://github.com/satrams/rent-rl) | [arXiv Paper](https://arxiv.org/abs/2505.22660)
+  overrides:
+    parameters:
+      model: RENT-Qwen-7B.i1-Q4_K_M.gguf
+  files:
+    - filename: RENT-Qwen-7B.i1-Q4_K_M.gguf
+      sha256: bc0bd3cc994e90297072066bcc3e61e9352bfc28e7cffc09013b6134fb5bb5e2
+      uri: huggingface://mradermacher/RENT-Qwen-7B-i1-GGUF/RENT-Qwen-7B.i1-Q4_K_M.gguf
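
Each `files` entry in this diff pairs a `uri` with a `sha256`, so a reviewer may want to confirm that a downloaded file matches the recorded digest. The following is a minimal sketch of such a check, not LocalAI's own code. The helper names `sha256_of_file`, `verify_sha256`, and `hf_uri_to_url` are hypothetical. The mapping from a `huggingface://` URI to a `https://huggingface.co/<owner>/<repo>/resolve/main/<file>` URL is an assumption about how the URI is resolved, and it ignores any branch or revision syntax.

```python
import hashlib

# Digest copied from the `sha256` field of the new gallery entry.
EXPECTED_SHA256 = "bc0bd3cc994e90297072066bcc3e61e9352bfc28e7cffc09013b6134fb5bb5e2"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_sha256(path, expected):
    """Hypothetical helper: True if the file's digest equals `expected` (case-insensitive)."""
    return sha256_of_file(path) == expected.strip().lower()


def hf_uri_to_url(uri):
    """Hypothetical helper: map huggingface://owner/repo/file to an HTTPS URL.

    Assumes the default `main` revision; this mapping is an assumption,
    not something stated in the commit.
    """
    prefix = "huggingface://"
    if not uri.startswith(prefix):
        raise ValueError("not a huggingface:// URI: " + uri)
    parts = uri[len(prefix):].split("/", 2)
    if len(parts) != 3 or not all(parts):
        raise ValueError("expected owner/repo/file in: " + uri)
    owner, repo, filename = parts
    return "https://huggingface.co/{}/{}/resolve/main/{}".format(owner, repo, filename)
```

With the GGUF file already downloaded to a local path, `verify_sha256(local_path, EXPECTED_SHA256)` returns `True` only when the file matches the digest recorded in the gallery entry.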
