Show-o: A unified AI model that unifies multimodal understanding and generation using a single transformer
This paper presents Show-o, a unified transformer model that integrates multimodal understanding and generation capabilities within a single architecture. As ...