hxz's picture

hxz

CUDAOUTOFMEMORY

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

Rubric-based On-policy Distillation

upvoted a paper 26 days ago

Co-Evolving Policy Distillation

authored a paper about 1 month ago

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

View all activity

Organizations

None yet

New activity in CUDAOUTOFMEMORY/PLUME-Qwen2-VL-2B about 2 months ago

Add metadata and improve model card

#1 opened about 2 months ago by