When can transformers reason with abstract symbols?

We investigate the capabilities of transformer models in relational reasoning tasks. In these tasks, models are trained on a set of strings that encode abstract relationships and then tested out-of-distribution on data containing symbols that did not appear in the training data set. We prove that for any relational reasoning task in a large family of tasks, transformers learn abstract relations and generalize to the test set when trained using gradient descent on sufficiently large amounts of training data. This contrasts with classical fully connected networks, which we show do not learn to reason. Our results inspire modifications to the transformer architecture that add only two trainable parameters per head and that we empirically show improve the efficiency of data learning to reason.

When can transformers reason with abstract symbols?

Technical Terrence Team

£7,000 savings? Here's what you would do to turn that into a monthly passive income of £1,160

Leave a Reply Cancel reply

Recommended.

What is Critical Race Theory? A Guide for Teachers and Parents

Billionaire Warren Buffett strongly opposes the return of a familiar face

Here's How I'd Find Stocks to Buy to Ride the AI Wave Over the Next 20 Years

Adobe Likely Faces EU Antitrust Warning Over $20 Billion Figma Deal; stock profits

Discovery Sports Launches NFT Loyalty Program

Categories

Important Links

When can transformers reason with abstract symbols?

Related

Technical Terrence Team

£7,000 savings? Here's what you would do to turn that into a monthly passive income of £1,160

Leave a Reply Cancel reply

Recommended.

What is Critical Race Theory? A Guide for Teachers and Parents

Billionaire Warren Buffett strongly opposes the return of a familiar face

Here's How I'd Find Stocks to Buy to Ride the AI ​​Wave Over the Next 20 Years

Adobe Likely Faces EU Antitrust Warning Over $20 Billion Figma Deal; stock profits

Discovery Sports Launches NFT Loyalty Program

Categories

Important Links

Get daily news updates to your inbox!

Here's How I'd Find Stocks to Buy to Ride the AI Wave Over the Next 20 Years