MobileCLIP2 Collection MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 31 items • Updated 9 days ago • 49
FastVLM Collection Efficient Vision Encoding for Vision Language Models • 9 items • Updated 9 days ago • 95
Running on A10G 25 25 Segment Anything 2 Video Tracking 👀 Segment any objects and track them through a video with SAM2
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing Paper • 2508.10881 • Published 28 days ago • 51