yakz1/gemma-4-e4b-question-algerienne-dpo-analyse-qualitative-608-2ep-lora Updated about 7 hours ago • 1
Evaluating Cognitive Age Alignment in Interactive AI Agents Paper • 2605.17894 • Published 14 days ago • 5
IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools Paper • 2605.20682 • Published 12 days ago • 83
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories Paper • 2605.21468 • Published 12 days ago • 49
Mean Mode Screaming: Mean–Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 25 days ago • 231
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 28 days ago • 347
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published Apr 8 • 121
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published Apr 8 • 326