Apr 25, 2024

Guide on How to Run OpenELM – New AI Models Presented by Apple

Apple has recently introduced eight open-source language models, known as OpenELM (Open-source Efficient Language Models). What makes them special is that they run directly on the device rather than on cloud servers. In this short guide, we will show you how to run and use them.


Apple has introduced eight open-source language models known as OpenELM (Open-source Efficient Language Models). Unique for their ability to operate directly on devices rather than relying on cloud servers, these models mark a significant advancement in AI technology. This guide will show you how to set up and use these innovative Apple AI models.

Apple's Efficient Language Models

Developers now have access to these language models, which can be easily downloaded and used through the Hugging Face Hub. Notably, four of the OpenELM models are pre-trained base models built with the CoreNet library, a toolkit for training deep neural networks that Apple released alongside them.

The other four models (the Instruct variants) are instruction-tuned, capable of interpreting and responding to direct instructions. The full suite was trained on publicly available datasets and ships with a complete training and evaluation framework, including detailed training protocols, multiple checkpoints, and a range of pre-training configurations.

The OpenELM family spans four sizes (270M, 450M, 1.1B, and 3B parameters), each available as both a pre-trained and an Instruct model; the model cards on the Hugging Face Hub provide further details.

Running OpenELM via Hugging Face

Usage

To help you get started, we've provided a sample function in generate_openelm.py for generating output from OpenELM models via the Hugging Face Hub. To test the model, simply run the following command:

python generate_openelm.py --model [MODEL_NAME] --hf_access_token [HF_ACCESS_TOKEN] --prompt 'Once upon a time there was' --generate_kwargs repetition_penalty=1.2

You can create a Hugging Face access token from your account settings at https://huggingface.co/settings/tokens.
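
If you prefer to load a model programmatically instead of using the sample script, the snippet below is a minimal sketch of the equivalent transformers calls. It assumes the apple/OpenELM-270M checkpoint and access to the gated meta-llama/Llama-2-7b-hf tokenizer, which the OpenELM model cards reuse:

from transformers import AutoModelForCausalLM, AutoTokenizer

# OpenELM checkpoints ship custom modeling code, hence trust_remote_code=True
model = AutoModelForCausalLM.from_pretrained("apple/OpenELM-270M", trust_remote_code=True)
# OpenELM has no tokenizer of its own; the model cards reuse the Llama-2 tokenizer
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")

inputs = tokenizer("Once upon a time there was", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50, repetition_penalty=1.2)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))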

Additionally, you can customize the generate function with various arguments. For instance, to speed up inference, pass the prompt_lookup_num_tokens argument to enable prompt-lookup speculative decoding, which drafts candidate tokens by matching n-grams already present in the prompt:

python generate_openelm.py --model [MODEL_NAME] --hf_access_token [HF_ACCESS_TOKEN] --prompt 'Once upon a time there was' --generate_kwargs repetition_penalty=1.2 prompt_lookup_num_tokens=10

Alternatively, you can use assisted generation, passing a smaller draft model via the assistant_model argument as shown below:

python generate_openelm.py --model [MODEL_NAME] --hf_access_token [HF_ACCESS_TOKEN] --prompt 'Once upon a time there was' --generate_kwargs repetition_penalty=1.2 --assistant_model [SMALLER_MODEL_NAME]
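
Both options map onto standard transformers generation arguments (prompt_lookup_num_tokens and assistant_model), so the same effect is available programmatically. A hedged sketch continuing the snippet above; the 1.1B checkpoint name is assumed from the OpenELM model cards:

# a larger main checkpoint, so the 270M model from above can act as its draft model
large = AutoModelForCausalLM.from_pretrained("apple/OpenELM-1_1B", trust_remote_code=True)

# prompt-lookup decoding: draft tokens by matching n-grams already in the prompt
outputs = large.generate(**inputs, max_new_tokens=50, prompt_lookup_num_tokens=10)

# assisted generation: the small model drafts tokens, the large model verifies them;
# this works because all OpenELM sizes share the Llama-2 tokenizer
outputs = large.generate(**inputs, max_new_tokens=50, assistant_model=model)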

Setting up

Make sure to install the dependencies needed for evaluation:

# install public lm-eval-harness

harness_repo="public-lm-eval-harness"
git clone https://github.com/EleutherAI/lm-evaluation-harness ${harness_repo}
cd ${harness_repo}
# use main branch as of 2024-03-15, SHA dc90fec
git checkout dc90fec
pip install -e .
cd ..
# 66d6242 is the main branch as of 2024-04-01
pip install datasets@git+https://github.com/huggingface/datasets.git@66d6242
pip install 'tokenizers>=0.15.2' 'transformers>=4.38.2' 'sentencepiece>=0.2.0'
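
Once the installs finish, a one-line import check (not part of the original instructions, just a quick sanity test) confirms the environment resolved cleanly:

python -c "import lm_eval, datasets, transformers; print('transformers', transformers.__version__)"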


Evaluation of OpenELM

The commands below evaluate OpenELM-270M on the zero-shot and few-shot task suites, logging each run under ./lm_eval_output/:

# OpenELM-270M
hf_model=apple/OpenELM-270M

# this flag is needed because lm-eval-harness sets add_bos_token to False by default, but OpenELM uses the LLaMA tokenizer, which requires add_bos_token to be True
tokenizer=meta-llama/Llama-2-7b-hf
add_bos_token=True
batch_size=1
mkdir -p lm_eval_output
shot=0
task=arc_challenge,arc_easy,boolq,hellaswag,piqa,race,winogrande,sciq,truthfulqa_mc2
lm_eval --model hf \
        --model_args pretrained=${hf_model},trust_remote_code=True,add_bos_token=${add_bos_token},tokenizer=${tokenizer} \
        --tasks ${task} \
        --device cuda:0 \
        --num_fewshot ${shot} \
        --output_path ./lm_eval_output/${hf_model//\//_}_${task//,/_}-${shot}shot \
        --batch_size ${batch_size} 2>&1 | tee ./lm_eval_output/eval-${hf_model//\//_}_${task//,/_}-${shot}shot.log
shot=5
task=mmlu,winogrande
lm_eval --model hf \
        --model_args pretrained=${hf_model},trust_remote_code=True,add_bos_token=${add_bos_token},tokenizer=${tokenizer} \
        --tasks ${task} \
        --device cuda:0 \
        --num_fewshot ${shot} \
        --output_path ./lm_eval_output/${hf_model//\//_}_${task//,/_}-${shot}shot \
        --batch_size ${batch_size} 2>&1 | tee ./lm_eval_output/eval-${hf_model//\//_}_${task//,/_}-${shot}shot.log
shot=25
task=arc_challenge,crows_pairs_english
lm_eval --model hf \
        --model_args pretrained=${hf_model},trust_remote_code=True,add_bos_token=${add_bos_token},tokenizer=${tokenizer} \
        --tasks ${task} \
        --device cuda:0 \
        --num_fewshot ${shot} \
        --output_path ./lm_eval_output/${hf_model//\//_}_${task//,/_}-${shot}shot \
        --batch_size ${batch_size} 2>&1 | tee ./lm_eval_output/eval-${hf_model//\//_}_${task//,/_}-${shot}shot.log
shot=10
task=hellaswag
lm_eval --model hf \
        --model_args pretrained=${hf_model},trust_remote_code=True,add_bos_token=${add_bos_token},tokenizer=${tokenizer} \
        --tasks ${task} \
        --device cuda:0 \
        --num_fewshot ${shot} \
        --output_path ./lm_eval_output/${hf_model//\//_}_${task//,/_}-${shot}shot \
        --batch_size ${batch_size} 2>&1 | tee ./lm_eval_output/eval-${hf_model//\//_}_${task//,/_}-${shot}shot.log
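
Each run writes a log and a results file under ./lm_eval_output/. The exact output layout differs between lm-eval versions, so the sketch below simply globs for JSON files and prints the per-task accuracy fields used by the 0.4.x harness; the file layout and metric keys are assumptions you may need to adjust:

import json
from pathlib import Path

# print per-task accuracy from whichever results JSONs the harness produced
for path in sorted(Path("lm_eval_output").rglob("*.json")):
    results = json.loads(path.read_text()).get("results", {})
    for task, metrics in results.items():
        acc = metrics.get("acc,none", metrics.get("acc_norm,none"))
        print(f"{path.name}  {task}  acc={acc}")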


Conclusion: Considerations for Using OpenELM Models

The introduction of the OpenELM models by Apple marks a significant advancement, offering the research community cutting-edge language models. These models are trained on publicly available datasets and are provided without safety guarantees, so outputs may be inaccurate, harmful, or biased. It is therefore crucial for both users and developers to conduct extensive safety testing and to implement robust filtering mechanisms suited to their specific requirements, ensuring responsible usage.
