Training GUI agents with augmented reasoning data and a tailored post-training recipe
Rui Yang PRO
Ray2333
AI & ML interests
Deep Reinforcement Learning
Recent Activity
updated a model 1 day ago
OpenWebRL/OpenWebRL-4B-SFT
published a model 1 day ago
OpenWebRL/OpenWebRL-4B-SFT
updated a dataset 1 day ago
Ray2333/Judge_data_plus