arxiv:2604.08539
Yihe Deng
ydeng9
AI & ML interests
LLM post-training
Recent Activity
liked a dataset 2 days ago
allenai/WildChat-4.8M authored a paper 3 months ago
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks published a dataset 4 months ago
DuoGuard/duoguard-iter1-data