Leading open models, ready for production
Browse and compare a growing library of models available on Together AI


Deployment options
Run models using different deployment options depending on latency needs, traffic patterns, and infrastructure control.
Real-time
A fully managed inference API that automatically scales with request volume.
Batch
Process massive workloads of up to 30 billion tokens asynchronously, at up to 50% less cost.
Dedicated Model Inference
An inference endpoint backed by reserved, isolated compute resources and the Together AI inference engine.
Dedicated Container Inference
Run inference with your own engine and model on fully managed, scalable infrastructure.
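As a rough illustration of how the four options above differ, the choice can be sketched as a small decision helper. This is only a sketch: the `Workload` fields, the `suggest_deployment` function, and the order of the checks are hypothetical, derived from the one-line descriptions above rather than from any official Together AI guidance.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    """Hypothetical summary of a workload; field names are illustrative only."""
    needs_immediate_response: bool  # latency needs: must each request return right away?
    needs_reserved_compute: bool    # infrastructure control: reserved, isolated resources?
    brings_own_engine: bool         # infrastructure control: custom engine and model?


def suggest_deployment(w: Workload) -> str:
    """Map a workload to one of the four deployment options listed above.

    The rule order is an assumption made for this sketch.
    """
    if w.brings_own_engine:
        # Own engine and model on managed infrastructure
        return "Dedicated Container Inference"
    if not w.needs_immediate_response:
        # Asynchronous processing is acceptable
        return "Batch"
    if w.needs_reserved_compute:
        # Reserved, isolated compute with the Together AI inference engine
        return "Dedicated Model Inference"
    # Managed API that scales automatically with request volume
    return "Real-time"


if __name__ == "__main__":
    nightly_job = Workload(
        needs_immediate_response=False,
        needs_reserved_compute=False,
        brings_own_engine=False,
    )
    print(suggest_deployment(nightly_job))
```

For example, a nightly summarization job that can wait for results falls through to Batch, while an interactive chat feature with no special isolation needs lands on Real-time.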
Explore model providers
Leading model providers rely on Together AI infrastructure to deploy, scale, and run their models in production.
Anthropic (5 models)
xAI (5 models)
Deepgram (5 models)
Arcee AI (5 models)
Minimax AI (5 models)
Kuaishou (5 models)
ByteDance (5 models)
SCB10X (5 models)
Moonshot AI (5 models)
Rime (5 models)
ZAI (5 models)
Alibaba (5 models)
HiDream.ai (5 models)
NVIDIA (5 models)
Stability AI (5 models)
Together AI (5 models)
BAAI (5 models)
BERT (5 models)
Cartesia (5 models)
Gryphe (5 models)
LG AI Research (5 models)
Refuel (5 models)
RunDiffusion (5 models)
ServiceNow AI (5 models)
Vidu (5 models)
Agentica & Together AI (5 models)
Alibaba-NLP (5 models)
Canopy Labs (5 models)
DataBricks (5 models)
Essential AI (5 models)
Ideogram (5 models)
Liquid AI (5 models)
Lykon (5 models)
Marin Community (5 models)
Microsoft (5 models)
Mixedbread AI (5 models)
Nous Research (5 models)
Perplexity AI (5 models)
PixVerse (5 models)
Salesforce (5 models)
Upstage AI (5 models)
Virtue AI (5 models)
WhereIsAI (5 models)
hexgrad (5 models)
intfloat (5 models)