NVIDIA GB300 NVL72
Blackwell Ultra AI Factory is now on Together AI. Scale your AI Factory with next-gen AI reasoning performance on NVIDIA GB300 NVL72 GPU clusters.
Why GB300 NVL72 on Together GPU Clusters?
The world’s most powerful AI infrastructure. Delivered faster. Tuned smarter.
A single 72-GPU NVLink domain for reasoning at scale
GB300 NVL72 unifies 72 Blackwell Ultra GPUs and 36 Grace CPUs in one platform, optimized for test-time scaling inference.
Purpose-built for AI reasoning throughput
Compared to Hopper, GB300 NVL72 delivers 10x higher TPS per user and 5x higher TPS per megawatt, combining to 50x higher AI-factory output.
Massive on-rack memory for frontier contexts
Run long-context LLMs and agentic workloads with up to 21 TB of aggregate GPU HBM (up to 576 TB/s bandwidth) and up to 40 TB fast memory.
Run by the same people pushing the Blackwell stack forward
Work with engineers who co-develop Blackwell optimizations; our team continually tunes workloads and publishes cutting-edge training breakthroughs.
What our customers are saying
Outstanding specs of GB300 NVL72
10x
vs Hopper GPUs
5x
vs Hopper GPUs
1.5x
vs GB200 NVL72
"We’ve been thoroughly impressed with the Together Enterprise Platform. It has delivered a 2x reduction in latency (time to first token) and cut our costs by approximately a third. These improvements allow us to launch AI-powered features and deliver lightning-fast experiences faster than ever before."
Technical specification
- Configuration 36 Grace CPU: 72 Blackwell Ultra GPUs
- FP4 Tensor Core 1,400 PFLOPS (with sparsity) | 1,100 PFLOPS (dense)
- FP8/FP6 Tensor Core 720 PFLOPS
- INT8 Tensor Core 23 PFLOPS
- FP16/BF16 Tensor Core 360 PFLOPS
- TF32 Tensor Core 180 PFLOPS
- FP32 6 PFLOPS
- FP64 / FP64 Tensor Core 100 TFLOPS
- GPU Memory | Bandwidth Up to 21 TB | Up to 576 TB/s
- NVLink Bandwidth 130 TB/s
- CPU Core Count 2,592 Arm® Neoverse V2 cores
- CPU Memory | Bandwidth Up to 18 TB SOCAMM with LPDDR5X | Up to 14.3 TB/s
Infrastructure you can trust at scale.
Production-grade security.
We take security and compliance seriously, with strict data privacy controls to keep your information protected. Your data and models remain fully under your ownership, safeguarded by robust security measures.
NVIDIA preferred partner- AICPA SOC 2 Type II
Regions and availability zones
Choose from global regions to meet data residency and compliance requirements—HIPAA for healthcare, GDPR for Europe, or banking regulations.
- USA2GW+ in the portfolio with 600MW of near-term capacity in US.
- Europe150 MW+ available in Europe: UK, Spain, France, Portugal, and Iceland also.
- Asia & Middle EastOptions available based on the scale of the projects in Asia and the Middle East.






