arxiv:2602.09555
DeyangKong
DeyangKong
AI & ML interests
Natural Language Processing
Recent Activity
upvoted a paper 1 day ago
IQuest-Coder-V1 Technical Report upvoted a paper 14 days ago
TMAS: Scaling Test-Time Compute via Multi-Agent Synergy upvoted a paper about 1 month ago
From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space