AV-CPL: Continuous Pseudo-Labeling for Audiovisual Speech Recognition
*Work done during the internship at Apple.

Audiovisual speech contains synchronized visual and audio information that provides cross-modal supervision for ...