Python SLM 1.5B — v3 (r=64, reasoning) — RECOMMENDED

Python-specialized fine-tune of Qwen2.5-Coder-1.5B-Instruct for the Mixture-of-Models (MoM) mesh. Single-turn code generator (not an agent).

Base: Qwen/Qwen2.5-Coder-1.5B-Instruct
Method: DoRA r=64 (4.6% trainable), SFT on reasoning-augmented Python data.
Result (native harness, greedy pass@1): HumanEval 70.7% / MBPP 69.6% — beats base 68.9%/66.7% (delta +1.8 / +2.9). First checkpoint to beat its all-language base at Python.
See also python-slm-v4 (98% reasoning coverage, experimental).

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
tok = AutoTokenizer.from_pretrained("srivarenya/python-slm-v3")
model = AutoModelForCausalLM.from_pretrained("srivarenya/python-slm-v3", torch_dtype="bfloat16", device_map="auto")

Code, recipe, eval harness: https://github.com/srivarenya01/python-slm

Downloads last month: 16

Safetensors

Model size

2B params

Tensor type

F32

BF16

Model tree for srivarenya/python-slm-v3

Base model

Qwen/Qwen2.5-1.5B

Finetuned

Qwen/Qwen2.5-Coder-1.5B

Finetuned

Qwen/Qwen2.5-Coder-1.5B-Instruct

Finetuned

(181)

this model