DeepSeek: R1 Distill Llama 8B

Text Generation
Reasoning
About R1 Distill Llama 8B

DeepSeek R1 Distill Llama 8B is a distilled large language model based on [Llama-3.1-8B-Instruct](/meta-llama/llama-3.1-8b-instruct) and fine-tuned on outputs from [DeepSeek R1](/deepseek/deepseek-r1). Its reported benchmark results include:

- AIME 2024 pass@1: 50.4
- MATH-500 pass@1: 89.1
- CodeForces Rating: 1205
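
For readers unfamiliar with the pass@1 figures above, here is a minimal sketch of how such a score can be computed. It assumes pass@1 is estimated by sampling several answers per problem and averaging the fraction that are correct. The function name `pass_at_1` and the graded results are hypothetical illustrations, not DeepSeek's evaluation code or data.

```python
def pass_at_1(samples_per_problem: list[list[bool]]) -> float:
    """Return pass@1 as a percentage.

    Each inner list holds the graded outcomes (True = correct) of the
    answers sampled for one problem. Assumption: pass@1 is the mean,
    over problems, of the fraction of correct samples.
    """
    per_problem = [sum(samples) / len(samples) for samples in samples_per_problem]
    return 100.0 * sum(per_problem) / len(per_problem)


# Hypothetical grading results: 3 problems, 4 sampled answers each.
graded = [
    [True, True, False, True],     # 3 of 4 correct
    [False, False, False, False],  # 0 of 4 correct
    [True, True, True, True],      # 4 of 4 correct
]

print(round(pass_at_1(graded), 1))
```

Averaging over several samples per problem gives a less noisy estimate than grading a single answer, which matters for small benchmarks such as AIME.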

Fine-tuning on DeepSeek R1's outputs lets this 8B model compete with larger frontier models.

Hugging Face:

- [Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B)
- [DeepSeek-R1-Distill-Llama-8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B)

Specifications
- Provider: DeepSeek
- Input Types: text
- Output Types: text
- Category: Llama3
- Added: 2/7/2025

