-
CoPE-VideoLM: Codec Primitives For Efficient Video Language Models
Paper • 2602.13191 • Published • 30 -
KTV: Keyframes and Key Tokens Selection for Efficient Training-Free Video LLMs
Paper • 2602.03615 • Published -
TiFRe: Text-guided Video Frame Reduction for Efficient Video Multi-modal Large Language Models
Paper • 2602.08861 • Published -
Causality-Aware Temporal Projection for Video Understanding in Video-LLMs
Paper • 2601.01804 • Published
Irina Abdullaeva
IrinaAbdullaeva
AI & ML interests
NLP, DL, Multi-modality
Recent Activity
upvoted a paper 10 days ago
VidEoMT: Your ViT is Secretly Also a Video Segmentation Model liked
a Space 17 days ago
librarian-bots/recommend_similar_papers updated
a collection
17 days ago
Video Perception