AI & ML interests
None yet
Organizations
Models 18
davidoj01/Qwen2.5-0.5B-Reverse-SFT-v2
davidoj01/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as4-lr2en5-encouraged
davidoj01/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as4-lr2en5-vuln
davidoj01/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
davidoj01/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as8-t07-lr1en5
davidoj01/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as4-t07-lr1en5
Text Generation • Updated • 5
davidoj01/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as8-t12-lr1en5
davidoj01/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as8-t07
davidoj01/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as8-t05-lr1en5
davidoj01/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as8-lr2en5