D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI • Paper • 2510.05684 • Published Oct 7, 2025 • 147
MLLM-4D: Towards Visual-based Spatial-Temporal Intelligence • Paper • 2603.00515 • Published Feb 28 • 2
GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors • Paper • 2508.09667 • Published Aug 13, 2025 • 6