|
23363 | 23363 | - filename: Qwen3-Grand-Horror-Light-1.7B.Q4_K_M.gguf |
23364 | 23364 | sha256: cbbb0c5f6874130a8ae253377fdc7ad25fa2c1e9bb45f1aaad88db853ef985dc |
23365 | 23365 | uri: huggingface://mradermacher/Qwen3-Grand-Horror-Light-1.7B-GGUF/Qwen3-Grand-Horror-Light-1.7B.Q4_K_M.gguf |
| 23366 | +- !!merge <<: *qwen3 |
| 23367 | + name: "deepwerewolf-qwen3-8b-grpo-agentic-chinese" |
| 23368 | + urls: |
| 23369 | + - https://huggingface.co/mradermacher/DeepWereWolf-Qwen3-8B-Grpo-Agentic-Chinese-GGUF |
| 23370 | + description: | |
| 23371 | + **Model Name:** DeepWereWolf-Qwen3-8B-Grpo-Agentic-Chinese |
| 23372 | + **Base Model:** Qwen/Qwen3-8B |
| 23373 | + **Author:** haowu11 (original training), mradermacher (GGUF quantization) |
| 23374 | + **Language:** Chinese (with strong English support) |
| 23375 | + **Model Type:** Instruction-tuned, agentic large language model |
| 23376 | + **Architecture:** Qwen3-8B (8-billion parameter autoregressive transformer) |
| 23377 | + **Fine-Tuning:** Trained using Grouped-Response Policy Optimization (GrPO) for enhanced reasoning and agent-like behavior |
| 23378 | + **Use Case:** Ideal for complex reasoning, multi-step tasks, code generation, and interactive agent applications in Chinese and English |
| 23379 | + **Quantized By:** mradermacher (GGUF format, available in multiple quantization levels from Q2_K to Q8_0 and f16) |
| 23380 | + |
| 23381 | + **Overview:** |
| 23382 | + DeepWereWolf-Qwen3-8B-Grpo-Agentic-Chinese is a high-performance, instruction-tuned variant of Qwen3-8B, specifically optimized for agentic workflows with strong reasoning and task execution capabilities. Built upon the Qwen3-8B foundation, it is fine-tuned using Grouped-Response Policy Optimization (GrPO) to improve decision-making and long-horizon task performance. The model excels in multi-step reasoning, dialogue, and autonomous task execution, making it particularly well-suited for AI agent applications. |
| 23383 | + |
| 23384 | + This version is available in multiple GGUF quantizations (from Q2_K to Q8_0), enabling efficient deployment on consumer hardware. The original full-precision model is available at `haowu11/DeepWereWolf-Qwen3-8B-Grpo-Agentic-Chinese`, while the quantized versions are hosted by mradermacher for easy local inference. |
| 23385 | + |
| 23386 | + **Key Features:** |
| 23387 | + - Trained for agentic behavior with advanced reasoning |
| 23388 | + - Strong performance in Chinese and bilingual tasks |
| 23389 | + - Optimized for local deployment via GGUF quantization |
| 23390 | + - Supports multi-turn dialogue and complex instruction following |
| 23391 | + |
| 23392 | + **Use with:** llama.cpp, Ollama, Text Generation WebUI, and other GGUF-compatible tools. |
| 23393 | + |
| 23394 | + **Note:** The GGUF files are quantized versions by mradermacher; the original model was created by haowu11 based on Qwen3-8B and the DeepWerewolf project. |
| 23395 | + overrides: |
| 23396 | + parameters: |
| 23397 | + model: DeepWereWolf-Qwen3-8B-Grpo-Agentic-Chinese.Q4_K_M.gguf |
| 23398 | + files: |
| 23399 | + - filename: DeepWereWolf-Qwen3-8B-Grpo-Agentic-Chinese.Q4_K_M.gguf |
| 23400 | + sha256: 32a341badc695d9e8bc1bdae92c67b81295d6e3cfd8e901a508f323718db5141 |
| 23401 | + uri: huggingface://mradermacher/DeepWereWolf-Qwen3-8B-Grpo-Agentic-Chinese-GGUF/DeepWereWolf-Qwen3-8B-Grpo-Agentic-Chinese.Q4_K_M.gguf |
0 commit comments