NVIDIA H200
High-performance LLM inference
Why H200 on Together GPU Clusters?
The world’s most powerful AI infrastructure. Delivered faster. Tuned smarter.
2x performance over H100
Each H200 GPU cluster offers double the inference throughput compared to H100, ideal for deploying LLMs at unprecedented scale.
Enhanced memory bandwidth
With 141GB HBM3e GPU memory and 4.8TB/s bandwidth, H200 significantly accelerates memory-intensive generative AI workloads and HPC applications.
Maximum efficiency and TCO savings
Achieve higher performance within the same power profile as previous-gen GPUs, drastically reducing energy consumption and total cost of ownership.
Run by researchers who train models
Our research team actively runs and tunes training workloads on NVIDIA H200 systems for edge-of-possibility expertise.
What our customers are saying
Outstanding specs of H200
2x
vs H100
110x
higher
1.4x
vs H100
"We’ve been thoroughly impressed with the Together Enterprise Platform. It has delivered a 2x reduction in latency (time to first token) and cut our costs by approximately a third. These improvements allow us to launch AI-powered features and deliver lightning-fast experiences faster than ever before."
Technical specification
- Hopper GPUs 8 GPUs
- FP8 Tensor Core 3,958 TFLOPS
- FP16/BF16 Tensor Core 1,979 TFLOPS
- TF32 Tensor Core 989 TFLOPS
- GPU Memory 141 GB HBM3e
- GPU Memory Bandwidth 4.8 TB/s
- Total NVLink Bandwidth 900 GB/s
- Multi-Instance GPU (MIG) 7 (@18 GB each)
- Decoders 7 NVDEC, 7 JPEG
- Max Thermal Design Power (TDP) Configurable up to 700 W
- Interconnect NVLink: 900 GB/s, PCIe Gen5: 128 GB/s
- Server Options NVIDIA HGX H200 partner and Certified Systems with 4 or 8 GPUs
Infrastructure you can trust at scale.
Production-grade security.
We take security and compliance seriously, with strict data privacy controls to keep your information protected. Your data and models remain fully under your ownership, safeguarded by robust security measures.
NVIDIA preferred partner- AICPA SOC 2 Type II
Regions and availability zones
Choose from global regions to meet data residency and compliance requirements—HIPAA for healthcare, GDPR for Europe, or banking regulations.
- USA2GW+ in the portfolio with 600MW of near-term capacity in US.
- Europe150 MW+ available in Europe: UK, Spain, France, Portugal, and Iceland also.
- Asia & Middle EastOptions available based on the scale of the projects in Asia and the Middle East.






