🚀 Qwen-MTP Collection ⚡ MTP (Multi-Token Prediction) speculative decoding lets models like Qwen3.6 generate ~1.4-2.2x faster with no change in accuracy. • 6 items • Updated 2 days ago • 15
💻 Qwopus-Coder Collection Reasoning-distilled coding models optimized for specialized domains like agentic workflows. • 4 items • Updated 2 days ago • 4
tvall43/Qwen3.5-14B-A3B-Claude-4.6-Opus-Reasoning-Distilled-reap-gguf Text Generation • 14B • Updated Mar 9 • 10.3k • 40
Jackrong/MLX-Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-6bit Text Generation • 9B • Updated Mar 7 • 713 • 8
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 178k • 2.86k
Jackrong/MLX-Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-v2-4bit Text Generation • 1B • Updated Mar 19 • 3.11k • 17
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models Paper • 2407.09025 • Published Jul 12, 2024 • 140
The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 Running on CPU Upgrade • 242 • Explore synthetic data benchmarks with an interactive bookshelf
mlx-community/Huihui-Qwen3.5-35B-A3B-abliterated-6bit Text Generation • 35B • Updated Mar 4 • 373 • 4