Products
Serverless Inference
API for inference on open-source models.
Dedicated Endpoints
Deploy models on custom hardware.
Fine-Tuning
Train & improve high-quality, fast models.
Together Chat
Use DeepSeek R1 for free.
Code Execution
Code Sandbox
Build AI development environments.
Code Interpreter
Execute LLM-generated code.
Models
See all models →
Clusters of Any Size
Instant Clusters
Self-serve up to 64 NVIDIA GPUs.
Reserved Clusters
64 → 1,000 → 10,000+ NVIDIA GPUs.
GPUs
Solutions
Enterprise
Secure, reliable AI infrastructure.
Customer Stories
Testimonials from AI pioneers.
Why Open Source
How to own your AI.
Industries & Use-Cases
Scale your business with Together AI.
Case Studies
From AWS to Together Dedicated Endpoints: Arcee AI's journey to greater inference flexibility
How Zomato built an AI customer support bot that doubled customer satisfaction and scaled to over 1,000 messages per minute
Developers
Documentation
Technical docs for using Together AI.
Research
Advancing the open-source AI frontier.
Model Library
All our open-source models.
Cookbooks
Practical implementation guides.
Example Apps
Our open-source demo apps.
Videos
DeepSeek-R1: How It Works, Simplified!
Together Code Sandbox: How To Build AI Coding Agents
Pricing
Pricing Overview
Our platform & GPU pricing.
Inference
Per-token & per-minute pricing.
LoRA and full fine-tuning pricing.
GPU Clusters
Hourly rates & custom pricing.
Questions? We’re here to help!
Talk to us →
Company
About us
Get to know us.
Values
Our approach to open-source AI.
Team
Meet our leadership.
Careers
Join our mission.
Resources
Blog
Our latest news & blog posts.
Knowledge Base
Find answers to your questions.
Featured Blog Posts
Together AI acquires Refuel.ai to unlock data for developers and businesses building production-grade AI applications
Together AI Announces $305M Series B to Scale AI Acceleration Cloud for Open Source and Enterprise AI
Please share your feedback.