Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training Paper • 2605.12483 • Published 8 days ago • 10
FeatCal: Feature Calibration for Post-Merging Models Paper • 2605.13030 • Published 7 days ago • 7
ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving Paper • 2605.04647 • Published 14 days ago • 9
📈Scaling Law Collection Experts for the model merging scaling laws in LLMs. Github: https://github.com/InfiXAI/Merging-Scaling-Law • 18 items • Updated 8 days ago • 2
Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training Paper • 2605.09608 • Published 10 days ago • 50
Model Merging Scaling Laws in Large Language Models Paper • 2509.24244 • Published 9 days ago • 44