Switch to open
Build better with open models
Open models offer the same quality at 90% lower cost.

Why switch to open models?
Open models bring state of the art intelligence without the black box pricing and lack of control of closed models.
90% lower cost
Open models run at a fraction of closed-model prices. Pay per token with no commitment, and put the savings back into your product.
Frontier quality
The top open models now match closed models on the public benchmarks that matter, so you switch without losing capability.
Full control
You own the weights and your IP, your data stays yours, and the model runs on domestic infrastructure you control. Together AI is SOC 2 Type 2 and ISO 27001:2022 compliant.
Run open models at 90% lower prices than closed models
Frontier intelligence shouldn’t come at frontier prices. Migrating workloads to open models dramatically reduces AI spend.
Enter what you spend on closed models today. See your cost on Together, and how fast the switch pays back.
Kimi K2.7 Code costs 70% less than Opus 4.8 for production coding agent workloads.
Decagon unlocked 6x lower cost per query than GPT-5 mini.
Same prompt, two models. GLM-5.2 (left) and Claude Opus 4.8 (right) each generated this landing page — and the results are nearly identical. GLM-5.2 cost just $0.06 against $0.49 for Opus: over 6× cheaper, and faster and more token-efficient.




Demos
See it for yourself
Interactive demos from the Together AI team compare open and closed models, and try them yourself.
The quality gap has closed
The reason to stay on a closed API was quality. The top open models now match it on the benchmarks that matter.
Code Arena | WebDev
Elo Score — API $ / 1M Tokens
- Open Models
- Closed Models
7x cheaper
GLM-5.2 ranks second at $1.40, against $10 for the top closed model.
Source: arena leaderboardArtificial Analysis Intelligence Index
- Open Models
- Closed Models
Frontier-class, open
At 51, GLM-5.2 puts open weights right behind the closed leaders.
Source: artificialanalysis.aiDesign Arena, Code Categories
Human-judged Coding Arena — Elo
- Open Models
- Closed Models
Number one
GLM-5.2 takes the top spot, ahead of Claude Fable 5 and every Opus variant.
SWE-bench Pro
Agentic Software-Engineering Benchmark
- Open Models
- Closed Models
Top open model
GLM-5.2 leads every open model on SWE-bench Pro — above GPT-5.5 and just behind Claude Opus 4.8.
Customers running inference in production
Your model, your data, your control
Open means the model is yours to keep, run, and move. On Together you run it on a full-stack platform you control, served in the US.
Own your model
You hold the weights and your IP. Move providers or self-host whenever you choose, with no lock-in.
Your data stays yours
Your data and models stay under your ownership, with strict privacy controls and no training on your data.
Served domestically
Run your open models on infrastructure that Together manages end to end domestically, with every layer of inference under your control.
Production-grade
security and data privacy
We take security and compliance seriously, with strict data privacy controls to keep your information protected. Your data and models remain fully under your ownership, safeguarded by robust security measures.
preferred partner
SOC 2 Type II
ISO 27001:2022
Frequently asked questions
If you can't find the answer you were looking for, feel free to contact our team — we’re here to help.
Will open models match the quality I get from OpenAI or Claude?
Yes. On the public benchmarks that matter, the top open models now match leading closed models like GPT and Claude. See the benchmark comparison above.
How much will I actually save?
It depends on your workload, but the savings are substantial. Decagon cut cost per query 6× and Yutori runs 4–5× cheaper than frontier models. Use the savings calculator to estimate your own.
Where does the model run, and is my data safe?
Open models run on US and EMEA infrastructure that Together manages end to end. Your data and weights stay yours, there is no training on your data, and every layer of inference is under your control.
How hard is it to switch?
It is a drop-in switch. Together exposes an OpenAI-compatible API, so you point your existing client at our endpoint and run the top open models with no code changes.
Can I customize the model?
Yes. With Together’s post-training service you can fine-tune open models to specialize them for your use case.




