This paper was accepted to the Efficient Natural Language and Speech Processing (ENLSP) workshop at NeurIPS 2024.
While large language models (LLMs) dominate the AI landscape, small-scale large language models (SLMs) are gaining attention due to consumer demand for cost efficiency. However, there is limited research on the training behavior and computational requirements of SLMs. In this study, we explore the computational bottlenecks of training SLMs (up to 2B parameters) by examining the effects of various hyperparameters and configurations, including GPU type, batch size, model size, communication protocol, attention type, and number of GPUs. We evaluate these factors on popular cloud services using metrics such as loss per dollar and tokens per second. Our findings aim to support the broader adoption and optimization of language model training for low-resource AI research institutes.
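As a rough illustration of the cost-normalized metrics mentioned above, the sketch below shows one plausible way to compute tokens per second and loss per dollar from logged training statistics. It is not the paper's implementation: the `RunStats` fields, the $2/GPU-hour rate, and the specific loss-per-dollar ratio are all assumptions made for this example.

```python
"""Illustrative sketch (not from the paper): computing throughput and
cost-normalized metrics from a training run. All names and values are
hypothetical and chosen only for demonstration."""

from dataclasses import dataclass


@dataclass
class RunStats:
    tokens_processed: int       # total tokens seen in the measured window
    wall_clock_seconds: float   # elapsed training time for the window
    final_loss: float           # training loss at the end of the window
    gpu_count: int              # number of GPUs used
    gpu_hourly_rate_usd: float  # assumed cloud price per GPU-hour


def tokens_per_second(stats: RunStats) -> float:
    """Raw training throughput over the measured window."""
    return stats.tokens_processed / stats.wall_clock_seconds


def run_cost_usd(stats: RunStats) -> float:
    """Total cloud cost of the window: GPU-hours times the hourly rate."""
    gpu_hours = stats.gpu_count * stats.wall_clock_seconds / 3600.0
    return gpu_hours * stats.gpu_hourly_rate_usd


def loss_per_dollar(stats: RunStats) -> float:
    """One possible definition: loss reached per dollar spent.
    Under this convention, smaller values indicate a run that reaches
    a given loss more cheaply."""
    return stats.final_loss / run_cost_usd(stats)


if __name__ == "__main__":
    # Hypothetical example: 8 GPUs for one hour at an assumed $2/GPU-hour.
    example = RunStats(
        tokens_processed=1_500_000_000,
        wall_clock_seconds=3600.0,
        final_loss=2.8,
        gpu_count=8,
        gpu_hourly_rate_usd=2.0,
    )
    print(f"tokens/sec:      {tokens_per_second(example):,.0f}")
    print(f"loss per dollar: {loss_per_dollar(example):.4f}")
```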