Google AI Research proposes SpatialVLM: a data synthesis and pre-training mechanism to improve the spatial reasoning capabilities of vision-language models
Vision-language models (VLMs) are becoming more common and offer substantial advances in AI-driven tasks. However, one of the most important ...