Apple is presenting new research at the European Conference on Computer Vision (ECCV), which will take place in person in Milan, Italy, from September 29 to October 4. We are proud to again sponsor the biennial conference, which brings together the scientific and industrial research communities around machine learning and computer vision. Below is an overview of Apple's participation in ECCV 2024.
Schedule
Stop by the Apple booth #34 at the Allianz MiCo Convention Center during exhibition hours (all times GMT+2):
- Tuesday, October 1 to Thursday, October 3: 09:00-18:30
- Friday, October 4: 09:00-12:30
Sunday, September 29
Monday, September 30
- 2nd Workshop on Vision-based Industrial Inspection (VISION)
- 09:00 – 13:00, tower hall
- Synth4Seg: Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization
- 10:40 – 11:40
- Shancong Mou, Raviteja Vemulapalli, Shiyu Li, Andy Liu, C Thomas, Meng Cao, Felix Bai, Oncel Tuzel, Ping Huang, Jiulong Shan, Jianjun Shi
Tuesday, October 1
- MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
- 10:30 – 12:30, Poster Session 1
- Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier Biard, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu He, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang
Wednesday, October 2
- VeCLIP: Improving CLIP Training via Visual-enriched Captions
- 16:30 – 18:30, Poster Session 4
- Jeff Lai, Haotian Zhang, Bowen Zhang, Wentao Wu, Felix Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao
Accepted Papers
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
Andrew Rouditchenko, Ronan Collobert, Tatiana Likhomanenko
CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models
Nick Stracke, Stefan Andreas Baumann, Josh Susskind, Miguel Ángel Bautista Martín, Björn Ommer
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs
Keen You, Haotian Zhang, Eldon Schoop, Floris Weers, Amanda Swearngin, Jeff Nichols, Yinfei Yang, Zhe Gan
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier Biard, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu He, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang
Synth4Seg: Learning Defect Data Synthesis for Defect Segmentation using Bi-level Optimization
Shancong Mou, Raviteja Vemulapalli, Shiyu Li, Andy Liu, C Thomas, Meng Cao, Felix Bai, Oncel Tuzel, Ping Huang, Jiulong Shan, Jianjun Shi
VeCLIP: Improving CLIP Training via Visual-enriched Captions
Jeff Lai, Haotian Zhang, Bowen Zhang, Wentao Wu, Felix Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao
Acknowledgements
Stephan Richter is an Area Chair for the main conference.
Alaa El-Nouby, Hadi Pour Ansari, Pavan Kumar Anasosalu Vasu, Raviteja Vemulapalli and Yusu Qian are the lead reviewers for the conference.
For the 2nd Workshop on Vision-based Industrial Inspection (VISION):
Alexander Wong, C Thomas, Carrie Yu, Javad Shafiee, Jeff Lai and Tatiana Likhomanenko are co-organizers.
Shiyu Li and Yuxuan Liu are workshop chairs.
Raviteja Vemulapalli is the keynote speaker.
Vimal Thilak is a workshop reviewer.