The reproducibility and transparency of large language models are crucial for advancing open research, ensuring the trustworthiness of results, and enabling investigations into data and model biases as well as potential risks. To this end, we release OpenELM, a state-of-the-art open language model. OpenELM uses a layer-wise scaling strategy to efficiently allocate parameters within each layer of the transformer model, leading to enhanced accuracy. For example, with a parameter budget of approximately one billion parameters, OpenELM exhibits a 2.36% improvement in accuracy compared to OLMo while requiring 2× fewer pre-training tokens.
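To make the layer-wise scaling idea concrete, the sketch below shows one simple way to allocate parameters non-uniformly across transformer layers by interpolating the number of attention heads and the FFN expansion ratio from the first layer to the last. This is a minimal illustration of the general technique; the function name, parameter names, and default values are assumptions for this example and do not reflect OpenELM's actual configuration.

```python
import math

def layer_wise_scaling(num_layers, d_model,
                       min_heads=4, max_heads=16,
                       min_ffn_mult=0.5, max_ffn_mult=4.0,
                       head_dim=64):
    """Illustrative layer-wise scaling: rather than giving every layer the same
    width, interpolate attention-head counts and FFN expansion ratios across
    depth, so the parameter budget is distributed non-uniformly."""
    configs = []
    for i in range(num_layers):
        t = i / max(num_layers - 1, 1)  # 0.0 at the first layer, 1.0 at the last
        heads = int(round(min_heads + t * (max_heads - min_heads)))
        ffn_mult = min_ffn_mult + t * (max_ffn_mult - min_ffn_mult)
        # Round the FFN width up to a hardware-friendly multiple of 256.
        ffn_dim = int(math.ceil(ffn_mult * d_model / 256) * 256)
        configs.append({
            "layer": i,
            "num_heads": heads,
            "attn_dim": heads * head_dim,
            "ffn_dim": ffn_dim,
        })
    return configs

if __name__ == "__main__":
    # Print an example per-layer allocation for a small 8-layer model.
    for cfg in layer_wise_scaling(num_layers=8, d_model=1024):
        print(cfg)
```

Early layers end up narrower and later layers wider, which is how a fixed parameter budget can be redistributed across depth instead of being split evenly.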
Diverging from prior practices that only provide model weights and inference code, and pre-train on private datasets, our release includes the complete framework for training and evaluating the language model on publicly available datasets, including training logs, multiple checkpoints, and pre-training configurations. We also release code to convert models to the MLX library for inference and fine-tuning on Apple devices. This comprehensive release aims to empower and strengthen the open research community, paving the way for future open research endeavors.
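As an aside, once a checkpoint has been converted to MLX format it can be run locally on Apple silicon. The snippet below is a minimal inference sketch using the community `mlx-lm` package; the model path is illustrative, and this is not the release's own conversion or evaluation code.

```python
# Requires an Apple-silicon Mac and: pip install mlx-lm
from mlx_lm import load, generate

# Illustrative path; substitute a converted OpenELM checkpoint.
model, tokenizer = load("path/to/openelm-mlx-checkpoint")

prompt = "Open language models enable"
text = generate(model, tokenizer, prompt=prompt, max_tokens=64)
print(text)
```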