AI Model

NVIDIA: Nemotron 3 Nano Omni

NVIDIA: Nemotron 3 Nano Omni logoNVIDIA
Text Generation
Reasoning
Vision
About Nemotron 3 Nano Omni

NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and audio inputs and produces text output, enabling agents to perceive and reason across modalities in a single inference loop.

Built on a hybrid MoE Transformer-Mamba architecture with Conv3D video layers and Efficient Video Sampling (EVS), it delivers approximately 2× higher throughput and 2.5× lower compute for video reasoning versus separate vision + speech pipelines. It supports up to 300K context length and a 16,384 reasoning budget, with extended thinking enabled via reasoning.enabled on OpenRouter.

Specifications
Provider
NVIDIA
Context Length
256,000 tokens
Input Types
text, audio, image, video
Output Types
text
Category
Other
Added
4/28/2026

Use Nemotron 3 Nano Omni and 200+ more models

Access all the best AI models in one platform. No API keys, no switching between apps.