InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
-
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
Paper • 2512.01342 • Published • 14 -
revliter/internvideo_next_base_p14_res224_f16
91M • Updated • 151 • 3 -
revliter/internvideo_next_large_p14_res224_f16
0.3B • Updated • 249 • 4 -
revliter/internvideo_next_large_p14_res224_f16_stage1
Updated • 5 • 1