InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
Paper • 2512.01342 • Published • 20
revliter/internvideo_next_base_p14_res224_f16
91M • Updated • 624 • 5
revliter/internvideo_next_large_p14_res224_f16
0.3B • Updated • 3.36k • 6
revliter/internvideo_next_large_p14_res224_f16_stage1
Updated • 5 • 2