hxz's picture

hxz

CUDAOUTOFMEMORY

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

Rubric-based On-policy Distillation

upvoted a paper 24 days ago

Co-Evolving Policy Distillation

authored a paper about 1 month ago

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

View all activity

Organizations

None yet

Papers 9

arxiv:2604.12627

arxiv:2604.04780

arxiv:2604.02073

arxiv:2602.01639

models 3

CUDAOUTOFMEMORY/CLEAR

Image-Text-to-Text • 15B • Updated Apr 9 • 7

CUDAOUTOFMEMORY/PLUME-Qwen2-VL-2B

Feature Extraction • 2B • Updated Apr 8 • 9 • 1

CUDAOUTOFMEMORY/REIR

datasets 1

CUDAOUTOFMEMORY/MMD-Bench

Preview • Updated Apr 9 • 299