Meta AI launches Apollo: a new family of large multimodal video-LMM models for video understanding
While multimodal models (LMMs) have advanced significantly for text and image tasks, video-based models remain underdeveloped. Videos are intrinsically complex ...