AI Model

NVIDIA: Nemotron 3 Ultra

NVIDIA: Nemotron 3 Ultra logoNVIDIA
Text Generation
Reasoning
About Nemotron 3 Ultra

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it supports text input and output with a context window of up to 1M tokens. It is suited for long-running agentic workflows, including agent orchestration, coding agents, deep research, and complex enterprise tasks.

It is particularly strong at multi-step reasoning and planning, with high-throughput inference designed for high-volume agent pipelines. It is part of the NVIDIA Nemotron family of open models for agentic AI.

Specifications
Provider
NVIDIA
Context Length
1,000,000 tokens
Input Types
text
Output Types
text
Category
Other
Added
6/4/2026

Use Nemotron 3 Ultra and 200+ more models

Access all the best AI models in one platform. No API keys, no switching between apps.