General Agent Evaluation
Paper • 2602.22953 • Published • 12
This is a tracking repo for React + Shortlisting, used by the Open Agent Leaderboard to report evaluation results on HuggingFace.
ReAct agent with tool shortlisting — dynamically filters available tools per step to reduce context and improve accuracy.