Hi everyone! For those who don't know me yet, my name is Francois, and I'm a research scientist at Meta. I'm passionate about explaining advanced AI concepts and making them more accessible.
Today, we are going to dive into one of the most significant contributions in the field of Computer Vision: the Vision Transformer (ViT).
The Vision Transformer was introduced by Alexey Dosovitskiy and others (Google Brain) in 2021 in the article An Image is Worth 16×16 Words. At the time, Transformers had proven to be the key to achieving great performance in NLP tasks, having been introduced in the seminal 2017 article Attention Is All You Need.
Between 2017 and 2021, there were several attempts to integrate the attention mechanism into convolutional neural networks (CNNs). However, these were mostly hybrid models (combining CNN layers with attention layers) and lacked scalability. Google addressed this problem by removing convolutions altogether and leveraging its computational resources to scale the model.