Python SLM 1.5B โ v3 (r=64, reasoning) โ RECOMMENDED
Python-specialized fine-tune of Qwen2.5-Coder-1.5B-Instruct for the Mixture-of-Models (MoM) mesh. Single-turn code generator (not an agent).
- Base: Qwen/Qwen2.5-Coder-1.5B-Instruct
- Method: DoRA r=64 (4.6% trainable), SFT on reasoning-augmented Python data.
- Result (native harness, greedy pass@1): HumanEval 70.7% / MBPP 69.6% โ beats base 68.9%/66.7% (delta +1.8 / +2.9). First checkpoint to beat its all-language base at Python.
- See also
python-slm-v4(98% reasoning coverage, experimental).
Usage
from transformers import AutoModelForCausalLM, AutoTokenizer
tok = AutoTokenizer.from_pretrained("srivarenya/python-slm-v3")
model = AutoModelForCausalLM.from_pretrained("srivarenya/python-slm-v3", torch_dtype="bfloat16", device_map="auto")
Code, recipe, eval harness: https://github.com/srivarenya01/python-slm
- Downloads last month
- 16
Model tree for srivarenya/python-slm-v3
Base model
Qwen/Qwen2.5-1.5B Finetuned
Qwen/Qwen2.5-Coder-1.5B Finetuned
Qwen/Qwen2.5-Coder-1.5B-Instruct