arxiv:2604.14683
Qianqian Xie
mistletoe111
AI & ML interests
None yet
Recent Activity
upvoted a paper 13 days ago
DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation upvoted a paper 14 days ago
WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models updated a dataset 15 days ago
NJU-LINK/DR3-Eval