What does this article cover about Get Access to Llama 3.1, a New AI Model by Meta?

Learn how to access and use Meta's new AI model, Llama 3.1, available in 8B, 70B, and 405B sizes. Get the details here!

How can Writingmate help with this workflow?

Writingmate lets you compare AI models, draft faster, and refine outputs in one workspace while working through Get Access to Llama 3.1, a New AI Model by Meta.

Who is this guide useful for?

This guide is useful for readers evaluating AI tools, model capabilities, and practical workflows related to Get Access to Llama 3.1, a New AI Model by Meta.

Where should I start after reading it?

Start by testing the workflow in Writingmate, then compare outputs across multiple models before choosing the result you publish or share.

Get Access to Llama 3.1, a New AI Model by Meta

Good news, everyone! Meta has delighted AI enthusiasts with the release of Llama 3.1, the latest version of its LLM.

The new model is available in three sizes: 8B, 70B, and 405B parameters.

8B: A compact, extremely fast model suitable for deployment in any environment.
70B: A high-performance, budget-friendly model that supports numerous use cases.
405B: The premier foundational model, powering the most extensive range of applications.

Llama 3.1 Benchmarks

This is a comparison of Llama 3.1 with its predecessor, Llama 3.

Comparing 3.1 with 3.0 shows significantly improved benchmark scores, thanks to distilling the 405B model into the smaller models.

Here is a comparison of the new models with competitors such as Google's Gemma 2, OpenAI's GPT-4o, and Anthropic's Claude 3.5 Sonnet.

We now have a publicly available model at the level of GPT-4o that anyone can download!

We look forward to results from the Chatbot Arena and feedback from everyday users.

Get Access to Llama 3.1

The new models are publicly available. There are two ways to get access.

Method 1: Download the Models

To download the models, follow these simple steps:

Follow the link: https://llama.meta.com/llama-downloads/
Fill out the access request form with your information.
Select the models you want to access. You can choose from the new Llama 3.1, Llama Guard 3, Prompt Guard, as well as older models.

Accept Meta's Community License Agreement.
You will then see the download link for the models.

Please also read Llama 3.1 Download Instructions.
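As an alternative to Meta's download link, the weights are also published on Hugging Face (see Useful Links). Below is a minimal sketch of fetching one model with the `huggingface_hub` library. It assumes that you have accepted the license on the model page, that you have a Hugging Face access token, and that the repository IDs follow the model names listed in this article. Check the exact ID on the meta-llama page before running it, since this naming pattern is an assumption.

```python
from typing import Optional


def repo_id(size: str, instruct: bool = True) -> str:
    """Build a Hugging Face repo ID from a size such as "8B".

    Assumption: repos are named after the model names in this article,
    for example "meta-llama/Meta-Llama-3.1-8B-Instruct".
    """
    suffix = "-Instruct" if instruct else ""
    return f"meta-llama/Meta-Llama-3.1-{size}{suffix}"


def download(size: str = "8B", instruct: bool = True,
             token: Optional[str] = None,
             local_dir: Optional[str] = None) -> str:
    """Download every file of the chosen repo and return the local path."""
    # Imported lazily so the helper above works without the package.
    # Install with: pip install huggingface_hub
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=repo_id(size, instruct),
        token=token,          # access token for the gated repo
        local_dir=local_dir,  # None keeps the default cache location
    )


# Example call (needs network access and an approved token):
# path = download("8B", token="hf_...", local_dir="./llama-3.1-8b-instruct")
```

Starting with the 8B model is a reasonable way to test the setup, since the larger checkpoints take far longer to download.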

Llama 3.1 Models Available For Download

Meta has released a wide variety of new model variations, tailored to meet different user needs.

Pretrained:

Meta-Llama-3.1-8B
Meta-Llama-3.1-70B
Meta-Llama-3.1-405B
Meta-Llama-3.1-405B-MP16
Meta-Llama-3.1-405B-FP8

Fine-tuned:

Meta-Llama-3.1-8B-Instruct
Meta-Llama-3.1-70B-Instruct
Meta-Llama-3.1-405B-Instruct
Meta-Llama-3.1-405B-Instruct-MP16
Meta-Llama-3.1-405B-Instruct-FP8
Llama-Guard-3-8B
Llama-Guard-3-8B-INT8
Llama-Guard-2-8B
Llama-Guard-8B
Prompt-Guard-86M
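
The Llama 3.1 names in the lists above follow a visible pattern: a size, an optional "Instruct" marker for the fine-tuned variant, and an optional MP16 or FP8 suffix. The sketch below splits a name into those parts. The pattern is inferred from the names in this article only, and `ModelName` and `parse_model_name` are illustrative helpers, not part of any Meta tooling.

```python
import re
from typing import NamedTuple, Optional


class ModelName(NamedTuple):
    size: str                 # "8B", "70B", or "405B"
    instruct: bool            # True for the fine-tuned "-Instruct" variants
    packaging: Optional[str]  # "MP16", "FP8", or None when there is no suffix


# Pattern inferred from the names listed in this article (an assumption).
_PATTERN = re.compile(
    r"^Meta-Llama-3\.1-(?P<size>\d+B)"
    r"(?P<instruct>-Instruct)?"
    r"(?:-(?P<packaging>MP16|FP8))?$"
)


def parse_model_name(name: str) -> Optional[ModelName]:
    """Return the parts of a Llama 3.1 model name, or None if it does not match."""
    match = _PATTERN.match(name)
    if match is None:
        return None
    return ModelName(
        size=match.group("size"),
        instruct=match.group("instruct") is not None,
        packaging=match.group("packaging"),
    )


if __name__ == "__main__":
    for name in ("Meta-Llama-3.1-8B", "Meta-Llama-3.1-405B-Instruct-FP8"):
        print(name, "->", parse_model_name(name))
```

The safety models (Llama Guard, Prompt Guard) use different naming and are deliberately not matched, so the function returns None for them.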

Some notes on the 405B model:

Meta AI released several versions of the 405B model to accommodate its large size and provide various deployment options:

MP16 (Model Parallel 16) - This is the full BF16 weights version. These weights can only be deployed across multiple nodes using pipelined parallel inference. A minimum of 2 nodes with 8 GPUs each is required for deployment.
MP8 (Model Parallel 8) - This is also the full BF16 weights version but can be deployed on a single node with 8 GPUs using dynamic FP8 (Floating Point 8) quantization. Code for this deployment is available.
FP8 (Floating Point 8) - This is the quantized version of the weights. These weights can be deployed on a single node with 8 GPUs using static FP8 quantization. Code for this deployment is also available.
The 405B model requires approximately 750 GB of disk storage and, in MP16, a minimum of two nodes (with 8 GPUs each) for inference.
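
A rough back-of-the-envelope calculation shows where these numbers come from. The sketch below counts weight memory only (no KV cache, activations, or framework overhead) and assumes accelerators with 80 GB of memory each. That GPU size is an illustrative assumption and is not stated by Meta in the notes above.

```python
PARAMS = 405e9                           # nominal parameter count of the 405B model
BYTES_PER_PARAM = {"BF16": 2, "FP8": 1}  # storage size of one weight
GPU_MEMORY_GB = 80.0                     # assumption: 80 GB per GPU


def weights_gb(dtype: str) -> float:
    """Total size of the weights in decimal gigabytes."""
    return PARAMS * BYTES_PER_PARAM[dtype] / 1e9


def per_gpu_gb(dtype: str, gpus: int) -> float:
    """Weight memory per GPU when the weights are split evenly."""
    return weights_gb(dtype) / gpus


def fits(dtype: str, gpus: int) -> bool:
    """True if the weight shard alone fits in one GPU (ignores all overhead)."""
    return per_gpu_gb(dtype, gpus) < GPU_MEMORY_GB


if __name__ == "__main__":
    total_gib = PARAMS * BYTES_PER_PARAM["BF16"] / 2**30
    print(f"BF16 weights: {weights_gb('BF16'):.0f} GB ({total_gib:.0f} GiB)")
    for label, dtype, gpus in [
        ("MP16, BF16 on 2 nodes x 8 GPUs", "BF16", 16),
        ("BF16 kept as-is on 1 node x 8 GPUs", "BF16", 8),
        ("FP8 on 1 node x 8 GPUs", "FP8", 8),
    ]:
        print(f"{label}: {per_gpu_gb(dtype, gpus):.1f} GB per GPU, "
              f"fits={fits(dtype, gpus)}")
```

Under these assumptions the BF16 weights come to 810 GB (about 754 GiB), which is in line with the roughly 750 GB figure. Split across 16 GPUs that is about 50.6 GB each, while 8 GPUs would each need about 101 GB. This is consistent with MP16 needing two nodes and with the single-node options relying on FP8 quantization.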

Method 2: Use Llama 3.1 via Writingmate

We always strive to add new models as quickly as possible, and this time is no exception. Llama 3.1 405B is available in Writingmate!

To use the 405B model:

Go to the Writingmate website: https://writingmate.ai
Sign up with your email.
Choose Llama 3.1 405B Instruct from the model list in the top right corner.

That's it! You can now use the newest LLM by Meta.

Writingmate also lets you compare AI models on cost, quality, performance, and response speed using the AI Split Screen mode. You can use the most advanced Llama 3.1 405B and also compare it with any of the more than 30 other supported LLMs, including the most popular paid options.

Useful Links

Official press release: https://llama.meta.com
Llama 3.1 Documentation: https://llama.meta.com/docs/overview
Download the models from HuggingFace: https://huggingface.co/meta-llama
Meta AI directory on GitHub: https://github.com/meta-llama
GitHub Llama 3.1 Download Instructions: https://github.com/meta-llama/llama-models/blob/main/README.md
Meta AI directory on Kaggle: https://www.kaggle.com/organizations/metaresearch/models

Written by

Artem Vysotsky

Ex-Staff Engineer at Meta. Building the technical foundation to make AI accessible to everyone.

Reviewed by

Sergey Vysotsky

Ex-Chief Editor / PM at Mosaic. Passionate about making AI accessible and affordable for everyone.