Why Far Looks Up: Probing Spatial Representation in Vision-Language Models Paper • 2605.30161 • Published 4 days ago • 38
MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation Paper • 2605.27366 • Published 6 days ago • 24
EarlyTom: Early Token Compression Completes Fast Video Understanding Paper • 2605.30010 • Published 4 days ago • 27
LaRA: Layer-wise Representation Analysis for Detecting Data Contamination in RL Post-Training Paper • 2605.29888 • Published 4 days ago • 21
view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • 3 days ago • 44
YoCausal: How Far is Video Generation from World Model? A Causality Perspective Paper • 2605.30346 • Published 4 days ago • 40
Fast-dDrive: Efficient Block-Diffusion VLM for Autonomous Driving Paper • 2605.23163 • Published 7 days ago • 17
How LoRA Remembers? A Parametric Memory Law for LLM Finetuning Paper • 2605.30260 • Published 4 days ago • 27
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 4 days ago • 49
CubePart: An Open-Vocabulary Part-Controllable 3D Generator Paper • 2605.28763 • Published 5 days ago • 12
VibeSearchBench: Benchmarking Long-horizon Proactive Search in the Wild Paper • 2605.27882 • Published 5 days ago • 11
DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes Paper • 2605.28421 • Published 5 days ago • 44