DCAgent3/terminal_bench_2_g1_diverse_tezos_top4_316_8b_20260602_100013 Viewer • Updated 7 days ago • 655 • 27 • 1
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 20 days ago • 204
Stream-T1: Test-Time Scaling for Streaming Video Generation Paper • 2605.04461 • Published May 6 • 105
Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs Paper • 2605.00814 • Published May 1 • 21
Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning Paper • 2604.16029 • Published Apr 17 • 23
DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation Paper • 2604.14683 • Published Apr 16 • 36
HDP: A Lightweight Cryptographic Protocol for Human Delegation Provenance in Agentic AI Systems Paper • 2604.04522 • Published Apr 6 • 10
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver Paper • 2604.08377 • Published Apr 9 • 292