ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation Paper • 2605.28293 • Published 12 days ago • 87
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published 12 days ago • 90
BEV-LaneDet: a Simple and Effective 3D Lane Detection Baseline Paper • 2210.06006 • Published Oct 12, 2022 • 3