Why do task vectors exist in pretrained LLMs? This AI research from MIT and Improbable AI uncovers how transformers form internal abstractions and the mechanisms behind in-context learning (ICL)
Large language models (LLMs) have shown notable similarities to human cognition in their ability to form abstractions and adapt ...