Python SLM 1.5B โ€” v3 (r=64, reasoning) โ€” RECOMMENDED

Python-specialized fine-tune of Qwen2.5-Coder-1.5B-Instruct for the Mixture-of-Models (MoM) mesh. Single-turn code generator (not an agent).

  • Base: Qwen/Qwen2.5-Coder-1.5B-Instruct
  • Method: DoRA r=64 (4.6% trainable), SFT on reasoning-augmented Python data.
  • Result (native harness, greedy pass@1): HumanEval 70.7% / MBPP 69.6% โ€” beats base 68.9%/66.7% (delta +1.8 / +2.9). First checkpoint to beat its all-language base at Python.
  • See also python-slm-v4 (98% reasoning coverage, experimental).

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
tok = AutoTokenizer.from_pretrained("srivarenya/python-slm-v3")
model = AutoModelForCausalLM.from_pretrained("srivarenya/python-slm-v3", torch_dtype="bfloat16", device_map="auto")

Code, recipe, eval harness: https://github.com/srivarenya01/python-slm

Downloads last month
16
Safetensors
Model size
2B params
Tensor type
F32
ยท
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for srivarenya/python-slm-v3

Finetuned
(181)
this model