MM-Vet v2: A challenging benchmark for evaluating large multimodal models (LMMs) for integrated capabilities
Large multimodal models (LMMs) are advancing rapidly and are proving capable of handling more complex tasks that require a combination ...