NVIDIA GB200 NVL72

Train trillion-parameter models on NVIDIA GB200 NVL72 GPU clusters, powered by our research and expert ops.

Why GB200 NVL72 on Together GPU Clusters?

The world’s most powerful AI infrastructure. Delivered faster. Tuned smarter.

Train trillion-parameter models

72 Blackwell GPUs and 36 Grace CPUs combined into one liquid-cooled, memory-coherent rack, enabling tightly synchronized, low-latency training

Custom networking

Clos topologies for dense LLMs and oversubscribed topologies for MoEs, built on InfiniBand or high-speed Ethernet

AI-native shared storage

VAST and Weka for high-throughput, parallel access to massive datasets and model state

Expert support

Our engineers co-develop Blackwell optimizations, continually tune workloads, and publish breakthroughs
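
To make the oversubscription trade-off mentioned under "Custom networking" concrete, here is a minimal sketch of how an oversubscription ratio is commonly computed for one leaf switch in a Clos (leaf-spine) fabric: host-facing bandwidth divided by spine-facing bandwidth. The function name, port counts, and link speeds are hypothetical illustrations and do not describe any actual Together cluster configuration.

```python
def oversubscription_ratio(downlinks, downlink_gbps, uplinks, uplink_gbps):
    """Return host-facing bandwidth divided by spine-facing bandwidth.

    A ratio of 1.0 means the leaf is non-blocking; a ratio above 1.0
    means hosts can offer more traffic than the uplinks can carry.
    All arguments are illustrative assumptions, not vendor figures.
    """
    down = downlinks * downlink_gbps  # total bandwidth toward hosts
    up = uplinks * uplink_gbps        # total bandwidth toward spines
    if up <= 0:
        raise ValueError("uplink bandwidth must be positive")
    return down / up


if __name__ == "__main__":
    # Hypothetical non-blocking leaf: equal host and spine bandwidth.
    print(oversubscription_ratio(32, 400, 32, 400))
    # Hypothetical oversubscribed leaf: three times more host bandwidth.
    print(oversubscription_ratio(48, 400, 16, 400))
```

Under these assumed port counts, the first leaf is non-blocking (ratio 1.0) and the second is 3:1 oversubscribed. Which ratio suits a given workload depends on its communication pattern.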

What our customers are saying

"Delivering competitive pricing, strong reliability and a properly set up cluster is the bulk of the value differentiation for most AI clouds. The only differentiated value we have seen outside this set is from a Neocloud called Together AI, where the inventor of FlashAttention, Tri Dao, works. We don't believe the value created by Together can be replicated elsewhere."

Dylan Patel

Founder, SemiAnalysis

"Training our omnimodal Character-3 model required infrastructure designed for large-scale AI. The Together Frontier AI Factory delivered the performance we needed to push the boundaries of multimodal video generation. Together AI understands what builders need — and that made all the difference."

Michael Lingelbach

CEO, Hedra

"Together GPU Clusters provided a combination of amazing training performance, expert support, and the ability to scale to meet our rapid growth to help us serve our growing community of AI creators."

Demi Guo

CEO, Pika

“Together AI provides the performance and reliability we need for real-time, high-quality image and video generation at scale. We value that Together AI is much more than an infrastructure provider — they're a true innovation partner, enabling us to push creative boundaries without compromise.”

Victor Perez

Co-Founder, Krea

Outstanding specs of GB200 NVL72

Performance
Faster training

4x

vs H100

Faster inference

30x

vs H100

Better efficiency

25x

vs H100

Salesforce AI Research

"We’ve been thoroughly impressed with the Together Enterprise Platform. It has delivered a 2x reduction in latency (time to first token) and cut our costs by approximately a third. These improvements allow us to launch AI-powered features and deliver lightning-fast experiences faster than ever before."

Caiming Xiong

VP Salesforce AI Research

Technical specification

  • Configuration: 36 Grace CPUs, 72 Blackwell GPUs
  • FP4 Tensor Core: 1,440 PFLOPS
  • FP8/FP6 Tensor Core: 720 PFLOPS
  • INT8 Tensor Core: 720 POPS
  • FP16/BF16 Tensor Core: 360 PFLOPS
  • TF32 Tensor Core: 180 PFLOPS
  • FP32: 5,760 TFLOPS
  • FP64: 2,880 TFLOPS
  • FP64 Tensor Core: 2,880 TFLOPS
  • GPU Memory | Bandwidth: Up to 13.4 TB HBM3e | 576 TB/s
  • NVLink Bandwidth: 130 TB/s
  • CPU Core Count: 2,592 Arm® Neoverse V2 cores
  • CPU Memory | Bandwidth: Up to 17 TB LPDDR5X | Up to 18.4 TB/s
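
The figures above are rack-level totals. As a rough illustration, the sketch below divides a few of them by the GPU and CPU counts to estimate per-device numbers. It assumes the totals are split evenly across devices and that 1 TB = 1,000 GB; the dictionary keys and function names are hypothetical, chosen only for this example.

```python
# Rack-level totals copied from the specification list above.
RACK = {
    "gpus": 72,
    "cpus": 36,
    "fp4_pflops": 1440,
    "fp8_pflops": 720,
    "hbm_tb": 13.4,
    "hbm_bw_tbps": 576,
    "nvlink_bw_tbps": 130,
    "cpu_cores": 2592,
}


def per_gpu(rack):
    """Estimate per-GPU figures, assuming an even split across GPUs."""
    n = rack["gpus"]
    return {
        "fp4_pflops": rack["fp4_pflops"] / n,
        "fp8_pflops": rack["fp8_pflops"] / n,
        "hbm_gb": rack["hbm_tb"] * 1000 / n,  # assumes decimal TB to GB
        "hbm_bw_tbps": rack["hbm_bw_tbps"] / n,
        "nvlink_bw_tbps": rack["nvlink_bw_tbps"] / n,
    }


def cores_per_cpu(rack):
    """Estimate CPU cores per Grace CPU, assuming an even split."""
    return rack["cpu_cores"] // rack["cpus"]


if __name__ == "__main__":
    for name, value in per_gpu(RACK).items():
        print(f"{name}: {value:.2f}")
    print(f"cores_per_cpu: {cores_per_cpu(RACK)}")
```

Under these assumptions, each GPU works out to about 20 PFLOPS of FP4 compute, roughly 186 GB of HBM3e at 8 TB/s, and about 1.8 TB/s of NVLink bandwidth, with 72 cores per Grace CPU.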

Infrastructure you can trust at scale.
Production-grade security.

We take security and compliance seriously, with strict data privacy controls to keep your information protected. Your data and models remain fully under your ownership, safeguarded by robust security measures.

As an NVIDIA Cloud Partner, Together builds and operates clusters on NVIDIA NCP reference architectures for predictable performance and faster time to production. Your data and models remain under your control with strict privacy safeguards and SOC 2–compliant security practices.

  • NVIDIA Preferred Partner
  • SOC 2 Type II
  • ISO 27001:2022

Regions and availability zones

Choose from global regions to meet data residency and compliance requirements, such as HIPAA for healthcare, GDPR for Europe, or banking regulations.

  • USA
    2 GW+ in the portfolio, with 600 MW of near-term capacity in the US.
  • Europe
    150 MW+ available across the UK, Spain, France, Portugal, and Iceland.
  • Asia & Middle East
    Options are available depending on project scale.