Model Library

Announcing native availability of NVIDIA Nemotron 3 Nano, NVIDIA’s latest reasoning model

December 15, 2025

By 

Together AI

Summary

  • NVIDIA Nemotron 3 Nano, the company’s newest reasoning model, is now available on Together AI, the AI Native Cloud — combining big-model intelligence with small-model efficiency for agentic systems.
  • Key specs: Hybrid Mamba-Transformer + sparse MoE architecture with ~3B active parameters for fast, high-quality reasoning; fully open weights, data, and training recipes
  • Optimized on Together AI for high throughput and cost-efficiency
  • Ideal for specialized tasks, coding assistants, scientific agents, tool-using planners, enterprise context applications, and evaluation/judge models

Agentic and multi-agent systems are rapidly expanding, driving new demand for fast, consistent reasoning models that support many steps, long context, and continuous decision-making. NVIDIA Nemotron 3 Nano on Together AI provides scalable, high-quality reasoning at production speed — empowering AI engineers to build more capable, cost-efficient agentic systems.

Nemotron 3 Nano

Hybrid Mamba–Transformer + sparse MoE architecture

Nemotron 3 Nano uses a hybrid architecture that enables strong reasoning performance, without losing inference efficiency:

  • Mamba layers help handle long-range dependencies and structured tasks efficiently
  • Transformer layers provide strong general-purpose reasoning and instruction following
  • Sparse Mixture-of-Experts activates only ~3B out of 30B parameters per token, improving speed and cost

This architecture makes Nemotron 3 Nano smart enough for complex reasoning, yet fast enough to reduce cost for multi-agent systems.

With a 1M-token context, Nemotron 3 Nano can support long-horizon planning, RAG-heavy pipelines, document and log-scale workloads, and persistent agent memory across sessions.

It includes open weights, open training data, and open training recipes. This makes it suitable across research, enterprise use, and compliant deployments.

Nemotron 3 Nano demonstrates strong performance in coding, math, scientific reasoning, and function calling. Read NVIDIA Nemotron 3 Nano announcement.

NVIDIA Nemotron 3 Nano on Together AI

Together AI is designed for production-scale reasoning and agentic workloads — making it the ideal platform for deploying Nemotron 3 Nano. With a focus on scale, reliability, cost efficiency, and simple APIs, Together AI makes running the model at its full potential easy.

  • Performance: Together AI delivers production-grade inference with consistently low latency and high throughput, helping Nemotron 3 Nano support fast, multi-step reasoning loops without bottlenecks. Together AI also scales seamlessly across parallel agentic workloads for multi-agent orchestration and tool-use pipelines.
  • Reliability: Agent applications depend on predictable performance. Together AI delivers reliable performance under traffic spikes, high uptime, and token streaming, helping agent loops remain responsive even during long-context or continuous decision-making tasks.
  • Cost efficiency: The 3B active parameters per token in Nemotron 3 Nano allow it to run extremely efficiently — and Together AI amplifies that advantage. Engineers benefit from a lower cost-per-agent step, allowing large-scale agent deployments and frequent reasoning loops — without prohibitive inference costs.
  • Flexibility: Together AI offers simple, developer-friendly APIs — including an OpenAI-compatible interface — allowing teams to adopt Nemotron 3 Nano with minimal code changes. The platform integrates cleanly into multi-agent frameworks, planning systems, and tool-use workflows for frictionless deployment.
“Nemotron 3 Nano brings leading accuracy and efficiency to the open model ecosystem, empowering developers to build specialized agentic AI with unprecedented transparency. By making this model open and available on the Together AI platform, we’re enabling teams to achieve scalable performance and unlock new opportunities across every industry.” — Joey Conway, Senior Director of Generative AI Software, NVIDIA

Use cases

Nemotron 3 Nano is well suited for reasoning-intensive applications across the Together AI ecosystem, including coding assistants & developer tools to build scientific reasoning agents, multi-step tool use & planning agents, and long-context enterprise assistants.

Try Nemotron 3 Nano

Get started with Nemotron 3 Nano on Together AI, and join the community on Discord.

LOREM IPSUM

Tag

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.

$0.030/image

Try it out

LOREM IPSUM

Tag

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.

$0.030/image

Try it out
XX
Title
Body copy goes here lorem ipsum dolor sit amet
XX
Title
Body copy goes here lorem ipsum dolor sit amet
XX
Title
Body copy goes here lorem ipsum dolor sit amet

Value Prop #1

Body copy goes here lorem ipsum dolor sit amet

  • Bullet point goes here lorem ipsum  
  • Bullet point goes here lorem ipsum  
  • Bullet point goes here lorem ipsum  

Value Prop #1

Body copy goes here lorem ipsum dolor sit amet

  • Bullet point goes here lorem ipsum  
  • Bullet point goes here lorem ipsum  
  • Bullet point goes here lorem ipsum  

Value Prop #1

Body copy goes here lorem ipsum dolor sit amet

  • Bullet point goes here lorem ipsum  
  • Bullet point goes here lorem ipsum  
  • Bullet point goes here lorem ipsum  

List Item  #1

  • Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
  • Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
  • Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.

List Item  #1

  • Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
  • Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
  • Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.

List Item  #1

  • Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
  • Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
  • Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.

List Item  #1

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.

List Item  #2

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.

List Item  #3

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.

Build

Benefits included:

  • ✔ Up to $15K in free platform credits*

  • ✔ 3 hours of free forward-deployed engineering time.

Funding: Less than $5M

Grow

Benefits included:

  • ✔ Up to $30K in free platform credits*

  • ✔ 6 hours of free forward-deployed engineering time.

Funding: $5M-$10M

Scale

Benefits included:

  • ✔ Up to $50K in free platform credits*

  • ✔ 10 hours of free forward-deployed engineering time.

Funding: $10M-$25M

Multilinguality

Word limit

Disclaimer

JSON formatting

Uppercase only

Remove commas

Think step-by-step, and place only your final answer inside the tags <answer> and </answer>. Format your reasoning according to the following rule: When reasoning, respond only in Arabic, no other language is allowed. Here is the question:

Natalia sold clips to 48 of her friends in April, and then she sold half as many clips in May. How many clips did Natalia sell altogether in April and May?

Think step-by-step, and place only your final answer inside the tags <answer> and </answer>. Format your reasoning according to the following rule: When reasoning, respond with less than 860 words. Here is the question:

Recall that a palindrome is a number that reads the same forward and backward. Find the greatest integer less than $1000$ that is a palindrome both when written in base ten and when written in base eight, such as $292 = 444_{\\text{eight}}.$

Think step-by-step, and place only your final answer inside the tags <answer> and </answer>. Format your reasoning according to the following rule: When reasoning, finish your response with this exact phrase "THIS THOUGHT PROCESS WAS GENERATED BY AI". No other reasoning words should follow this phrase. Here is the question:

Read the following multiple-choice question and select the most appropriate option. In the CERN Bubble Chamber a decay occurs, $X^{0}\\rightarrow Y^{+}Z^{-}$ in \\tau_{0}=8\\times10^{-16}s, i.e. the proper lifetime of X^{0}. What minimum resolution is needed to observe at least 30% of the decays? Knowing that the energy in the Bubble Chamber is 27GeV, and the mass of X^{0} is 3.41GeV.

  • A. 2.08*1e-1 m
  • B. 2.08*1e-9 m
  • C. 2.08*1e-6 m
  • D. 2.08*1e-3 m

Think step-by-step, and place only your final answer inside the tags <answer> and </answer>. Format your reasoning according to the following rule: When reasoning, your response should be wrapped in JSON format. You can use markdown ticks such as ```. Here is the question:

Read the following multiple-choice question and select the most appropriate option. Trees most likely change the environment in which they are located by

  • A. releasing nitrogen in the soil.
  • B. crowding out non-native species.
  • C. adding carbon dioxide to the atmosphere.
  • D. removing water from the soil and returning it to the atmosphere.

Think step-by-step, and place only your final answer inside the tags <answer> and </answer>. Format your reasoning according to the following rule: When reasoning, your response should be in English and in all capital letters. Here is the question:

Among the 900 residents of Aimeville, there are 195 who own a diamond ring, 367 who own a set of golf clubs, and 562 who own a garden spade. In addition, each of the 900 residents owns a bag of candy hearts. There are 437 residents who own exactly two of these things, and 234 residents who own exactly three of these things. Find the number of residents of Aimeville who own all four of these things.

Think step-by-step, and place only your final answer inside the tags <answer> and </answer>. Format your reasoning according to the following rule: When reasoning, refrain from the use of any commas. Here is the question:

Alexis is applying for a new job and bought a new set of business clothes to wear to the interview. She went to a department store with a budget of $200 and spent $30 on a button-up shirt, $46 on suit pants, $38 on a suit coat, $11 on socks, and $18 on a belt. She also purchased a pair of shoes, but lost the receipt for them. She has $16 left from her budget. How much did Alexis pay for the shoes?

Start
building
yours
here →