Model library

Leading open models, ready for production

Browse and compare a growing library of models available on Together AI

Deployment options

Run models using different deployment options depending on latency needs, traffic patterns, and infrastructure control.

Serverless Inference

A fully managed real-time or batch inference API with access to dozens of the most popular AI models.

Best for

Variable or unpredictable traffic

Rapid prototyping and iteration

Cost-sensitive or early-stage production workloads

Provisioned Throughput

Reserved token capacity with SLA guarantees. Priced in PTUs, a normalized throughput unit.

Best for

Production workloads

Reliability guarantees

Predictable pricing

Dedicated Model Inference

An inference endpoint backed by reserved, isolated compute resources and Together AI inference research.

Best for

Predictable or steady traffic

Latency-sensitive applications

High-throughput production workloads

Dedicated Container Inference

Run inference with your own engine and model on fully managed, scalable infrastructure.

Best for

Generative media models

Non-standard runtimes

Custom inference pipelines
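
As a rough illustration, the "Best for" lists above can be read as a simple decision rule. The sketch below is not an official Together AI tool. The `Workload` fields, the `suggest_deployment` helper, and the priority order of the checks are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Workload:
    """Hypothetical description of a workload, based on the 'Best for' lists."""
    custom_runtime: bool = False     # own engine, non-standard runtime, or custom pipeline
    needs_sla: bool = False          # reliability guarantees or predictable pricing
    steady_traffic: bool = False     # predictable or steady traffic
    latency_sensitive: bool = False  # latency-sensitive application


def suggest_deployment(w: Workload) -> str:
    """Return the name of a deployment option that plausibly fits the workload.

    The order of the checks is an assumption, not documented guidance.
    """
    if w.custom_runtime:
        return "Dedicated Container Inference"
    if w.needs_sla:
        return "Provisioned Throughput"
    if w.steady_traffic or w.latency_sensitive:
        return "Dedicated Model Inference"
    # Variable traffic, prototyping, or cost-sensitive workloads.
    return "Serverless Inference"


if __name__ == "__main__":
    print(suggest_deployment(Workload()))
    print(suggest_deployment(Workload(latency_sensitive=True)))
```

A real choice would also weigh cost, model availability, and expected traffic volume, none of which this sketch models.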

Explore model providers

Leading model providers rely on Together AI infrastructure to deploy, scale, and run their models in production.

5 models

Deepgram

5 models

Wan-AI

5 models

Pearl AI

5 models

PrismML

5 models

Thinking Machines Lab

5 models

Meta

5 models

Qwen

5 models

Black Forest Labs

5 models

Google

5 models

Mistral AI

5 models

Arcee AI

5 models

Deep Cogito

5 models

DeepSeek

5 models

MiniMax AI

5 models

OpenAI

5 models

Kuaishou

5 models

ByteDance

5 models

SCB10X

5 models

Moonshot AI

5 models

Rime

5 models

ZAI

5 models

Alibaba

5 models

HiDream.ai

5 models

NVIDIA

5 models

Stability AI

5 models

Together AI

5 models

BAAI

5 models

Cartesia

5 models

Gryphe

5 models

LG AI Research

5 models

Refuel

5 models

RunDiffusion

5 models

ServiceNow AI

5 models

Vidu

5 models

Alibaba-NLP

5 models

Canopy Labs

5 models

Databricks

5 models

Essential AI

5 models

Ideogram

5 models

Liquid AI