CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents Paper • 2605.25624 • Published 6 days ago • 28
SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 9 days ago • 204
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 23 days ago • 98
Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies Paper • 2605.03596 • Published 26 days ago • 10
HumanNet: Scaling Human-centric Video Learning to One Million Hours Paper • 2605.06747 • Published 24 days ago • 52