NVIDIA AI Research Proposes Language Instructed Temporal Localization Assistant (LITA), Enabling Precise Temporal Localization Using Video LLM
Large language models (LLMs) have demonstrated impressive instruction-following capabilities and can serve as a universal interface for various tasks ...