Swift Sampling: Selecting Temporal Surprises via Taylor Series Paper • 2605.22678 • Published 6 days ago • 9
TIDE: Efficient and Lossless MoE Diffusion LLM Inference with I/O-aware Expert Offload Paper • 2605.20179 • Published 8 days ago • 4
Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models Paper • 2605.09681 • Published 17 days ago • 10
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 14 days ago • 268
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 24 days ago • 163
Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows Paper • 2604.28139 • Published 27 days ago • 42
MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale Paper • 2604.04771 • Published Apr 6 • 123
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 504
dianavdavidson/wh_small_iv_indic_voices_51708_trial Automatic Speech Recognition • 0.2B • Updated Apr 3 • 5 • 1