This collection contains all the GRPO-trained models for our paper "A Rising Tide Lifts All Boats". Please consider citing us!
Ishika Agarwal
ishikaa
·
AI & ML interests
active learning, reinforcement learning, reasoning, planning, NLP
Recent Activity
updated a model about 5 hours ago
ishikaa/UAS_qwen7b_only_numina_expweak2 published a model about 5 hours ago
ishikaa/UAS_qwen7b_only_numina_expweak2 updated a model about 8 hours ago
ishikaa/UAS_qwen7b_uniform_expweak2