Together AI Announces $305M Series B to Scale AI Acceleration Cloud for Open Source and Enterprise AI

Today marks an exciting milestone in Together AI's journey. We're thrilled to announce our $305 million Series B funding round, led by General Catalyst and co-led by Prosperity7.
The round saw participation from a distinguished group of global institutional and strategic investors including Salesforce Ventures, DAMAC Capital, NVIDIA, Kleiner Perkins, March Capital, Emergence Capital, Lux Capital, SE Ventures, Greycroft, Coatue, Definition, Cadenza Ventures, Long Journey Ventures, Brave Capital, Scott Banister, SK Telecom, and technology pioneer John Chambers.
This investment will accelerate our leadership as the preferred AI Cloud for building modern AI applications with open source models, and for training custom models with our upcoming large-scale deployment of NVIDIA Blackwell GPUs.
Our AI Acceleration Cloud has already transformed how over 450,000 AI developers, AI-native companies, and global enterprises like Salesforce, Zoom, SK Telecom, Hedra, Cognition, Zomato, Krea, Cartesia, and The Washington Post build modern AI applications.

Making Open Source AI Accessible to All
AI is transforming every industry, creating unprecedented efficiencies and enabling entirely new classes of products. At Together AI, we believe the future of AI is open source, and we have built a cloud company for this AI-first world by combining state-of-the-art open source models and high-performance infrastructure with frontier research in AI efficiency and scalability.
Open source models like DeepSeek-R1 and Meta's Llama have emerged as formidable alternatives to proprietary solutions, marking a decisive shift in the AI landscape. Together AI has established itself as the definitive platform powering this transformation, delivering the fastest DeepSeek-R1 and Llama inference for NVIDIA GPUs at production scale through our secure, highly optimized infrastructure and research innovations.
Our AI Acceleration Cloud uniquely spans the entire AI lifecycle, delivering enterprise-grade inference solutions, training and fine-tuning for frontier foundational models, agentic workflows with built-in code interpretation, and synthetic data generation. It enables organizations to build complete AI applications with the performance, security, accuracy, and model ownership that enterprises demand.
Supporting over 200 open source models across all modalities — chat, image, audio, vision, code, and embeddings — the platform is powered by Together AI's proprietary Inference engine and built on research innovations including FlashAttention-3 kernels and advanced quantization techniques. It delivers 2-3x faster inference than today's hyperscaler solutions.
Expanding Our Infrastructure
To support our rapidly growing ecosystem, we're dramatically expanding our infrastructure. We've secured 200 MW of power capacity and are deploying optimized clusters of NVIDIA Blackwell GPUs across multiple North American data centers. Our recent partnership with Hypertec to co-build a cluster of 36,000 NVIDIA GB200 NVL72 GPUs further strengthens our position as the leading AI Cloud provider. We also announced immediate access to Together GPU Clusters accelerated by NVIDIA HGX B200 GPUs and the Together Kernel Collection, delivering 90% faster training performance than previous generation infrastructure.
Innovation and Research at Our Core
Research drives everything we do at Together AI. Our research lab continues to pioneer breakthrough methods at the intersection of AI and systems optimization, with innovations like our Mixture of Agents, Medusa, Sequoia, Hyena, and Mamba that optimize AI accuracy, performance, and efficiencies. Together Kernel Collection, developed under the leadership of our Chief Scientist Tri Dao, creator of FlashAttention has enabled 24% faster training operations while significantly reducing costs for our customers.
Recent Milestones and Future Vision
In 2024, we've achieved significant milestones that demonstrate our momentum. We deployed DeepSeek models in North American data centers with full opt-out privacy controls, launched the Together Enterprise Platform, and announced AWS Marketplace availability. Our partnership with Cartesia has enabled ultra-low latency voice AI through Sonic model integration, while our acquisition of CodeSandbox brings built-in code interpretation capabilities to our platform. We've also strengthened our leadership team with the addition of go-to-market veteran Kai Mak as CRO, and research pioneer James Zou.
This investment will accelerate our mission to make open source AI accessible to developers and enterprises globally. We're committed to advancing the frontier of AI through open collaboration, innovation, and transparency, while ensuring powerful AI systems remain accessible and cost-effective.
To learn more about opportunities at Together AI, visit our careers page. For media inquiries, please reach out to press@together.ai
LOREM IPSUM
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
LOREM IPSUM
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
Value Prop #1
Body copy goes here lorem ipsum dolor sit amet
- Bullet point goes here lorem ipsum
- Bullet point goes here lorem ipsum
- Bullet point goes here lorem ipsum
Value Prop #1
Body copy goes here lorem ipsum dolor sit amet
- Bullet point goes here lorem ipsum
- Bullet point goes here lorem ipsum
- Bullet point goes here lorem ipsum
Value Prop #1
Body copy goes here lorem ipsum dolor sit amet
- Bullet point goes here lorem ipsum
- Bullet point goes here lorem ipsum
- Bullet point goes here lorem ipsum
List Item #1
- Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
- Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
- Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
List Item #1
- Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
- Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
- Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
List Item #1
- Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
- Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
- Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt.
List Item #1
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.
List Item #2
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.
List Item #3
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.
Build
Benefits included:
✔ Up to $15K in free platform credits*
✔ 3 hours of free forward-deployed engineering time.
Funding: Less than $5M
Grow
Benefits included:
✔ Up to $30K in free platform credits*
✔ 6 hours of free forward-deployed engineering time.
Funding: $5M-$10M
Scale
Benefits included:
✔ Up to $50K in free platform credits*
✔ 10 hours of free forward-deployed engineering time.
Funding: $10M-$25M
Think step-by-step, and place only your final answer inside the tags <answer> and </answer>. Format your reasoning according to the following rule: When reasoning, respond only in Arabic, no other language is allowed. Here is the question:
Natalia sold clips to 48 of her friends in April, and then she sold half as many clips in May. How many clips did Natalia sell altogether in April and May?
Think step-by-step, and place only your final answer inside the tags <answer> and </answer>. Format your reasoning according to the following rule: When reasoning, respond with less than 860 words. Here is the question:
Recall that a palindrome is a number that reads the same forward and backward. Find the greatest integer less than $1000$ that is a palindrome both when written in base ten and when written in base eight, such as $292 = 444_{\\text{eight}}.$
Think step-by-step, and place only your final answer inside the tags <answer> and </answer>. Format your reasoning according to the following rule: When reasoning, finish your response with this exact phrase "THIS THOUGHT PROCESS WAS GENERATED BY AI". No other reasoning words should follow this phrase. Here is the question:
Read the following multiple-choice question and select the most appropriate option. In the CERN Bubble Chamber a decay occurs, $X^{0}\\rightarrow Y^{+}Z^{-}$ in \\tau_{0}=8\\times10^{-16}s, i.e. the proper lifetime of X^{0}. What minimum resolution is needed to observe at least 30% of the decays? Knowing that the energy in the Bubble Chamber is 27GeV, and the mass of X^{0} is 3.41GeV.
- A. 2.08*1e-1 m
- B. 2.08*1e-9 m
- C. 2.08*1e-6 m
- D. 2.08*1e-3 m
Think step-by-step, and place only your final answer inside the tags <answer> and </answer>. Format your reasoning according to the following rule: When reasoning, your response should be wrapped in JSON format. You can use markdown ticks such as ```. Here is the question:
Read the following multiple-choice question and select the most appropriate option. Trees most likely change the environment in which they are located by
- A. releasing nitrogen in the soil.
- B. crowding out non-native species.
- C. adding carbon dioxide to the atmosphere.
- D. removing water from the soil and returning it to the atmosphere.
Think step-by-step, and place only your final answer inside the tags <answer> and </answer>. Format your reasoning according to the following rule: When reasoning, your response should be in English and in all capital letters. Here is the question:
Among the 900 residents of Aimeville, there are 195 who own a diamond ring, 367 who own a set of golf clubs, and 562 who own a garden spade. In addition, each of the 900 residents owns a bag of candy hearts. There are 437 residents who own exactly two of these things, and 234 residents who own exactly three of these things. Find the number of residents of Aimeville who own all four of these things.
Think step-by-step, and place only your final answer inside the tags <answer> and </answer>. Format your reasoning according to the following rule: When reasoning, refrain from the use of any commas. Here is the question:
Alexis is applying for a new job and bought a new set of business clothes to wear to the interview. She went to a department store with a budget of $200 and spent $30 on a button-up shirt, $46 on suit pants, $38 on a suit coat, $11 on socks, and $18 on a belt. She also purchased a pair of shoes, but lost the receipt for them. She has $16 left from her budget. How much did Alexis pay for the shoes?
article