Extending Reinforcement Learning for LLMs with Flow Environment
SII-Jhao Zhang
JingHaoZ
AI & ML interests
Large Reasoning Model, Unified Understanding and Generation in MLLM
Recent Activity
published a dataset 2 days ago
JingHaoZ/OpenReasoning
updated a dataset 2 days ago
JingHaoZ/OpenReasoning
upvoted a paper 15 days ago
SkillOS: Learning Skill Curation for Self-Evolving Agents