Together AI acquires Refuel.ai to unlock data for developers and businesses building production-grade AI applications
If you speak with the largest enterprises today, they will tell you about the hundreds of AI applications they would like to deploy across their organization. However, they run into a consistent challenge – their data is an unstructured mess and they lack the tooling to easily clean and transform this data – and without the right data, AI applications cannot hope to meet production-level quality.
That changes today. We are thrilled to announce that Together AI has acquired Refuel.ai. By joining forces, we’ll fast-track our vision of the AI Acceleration Cloud, bringing Refuel.ai’s purpose-built models and platform capabilities directly into the Together Inference and Fine-tuning Platform.
About Refuel.ai
Rishabh Bhargava and Nihit Desai (Refuel co-founders) met at Stanford a decade ago – fun fact: they also studied under Percy Liang and Chris Re (two of our co-founders)!
They started Refuel.ai in 2021 to help enterprises clean and structure their data at scale. Their experienced team built Refuel LLM-2 – a family of models purpose-built for data tasks, and Refuel Cloud – a platform for engineering teams to build complex, multi-step data workflows.
Developers are using Refuel.ai for a number of use cases – from cleaning product catalogs to extracting structured data from financial documents to identifying incorrect claims from AI chatbots, and with 50% fewer errors compared to state-of-the-art models.
Refuel.ai currently serves both startups and enterprises (including major financial institutions) and operates as scale – processing tens of millions of records and billions of tokens per week. We are thrilled to welcome their entire team to Together AI and build the future of enterprise AI with them.
“Joining Together AI accelerates our mission to solve the data bottleneck that every AI team faces today. By bringing Refuel.ai’s specialized models and orchestration platform into Together’s AI Cloud, we can deliver an unmatched combination of speed, data quality, and scalability—empowering developers to rapidly take more sophisticated AI applications from concept to production.” — Rishabh Bhargava, CEO and Co-founder, Refuel.ai
What this means for Together AI customers
As the leading AI Acceleration Cloud, we empower developers and enterprises to train, fine-tune, and run inference for generative AI models. Together AI supports a wide range of top open-source and custom models across multiple modalities (over 200+), while offering flexible deployment options with the highest levels of privacy and security. Our platform has already transformed how over 600,000 AI developers, AI-native companies, and global enterprises like Salesforce, Zoom, SK Telecom, DuckDuckGo, Cognition, Zomato, and The Washington Post build modern AI applications.
As our customers continue to build complex agents, Refuel.ai’s technology will make it easier to build, deploy and improve the quality of these agents over the entire lifecycle. We are also excited to share that Refuel LLM-2 is now available on Together’s platform starting today – both for serverless inference, and for LoRA fine-tuning.

"At Together AI, we provide a platform that empowers developers and businesses to manage the entire generative AI lifecycle with unmatched performance, control, and cost-efficiency,” said Together AI Founder and CEO Vipul Ved Prakash. “As developers and enterprises build increasingly complex applications and agents, leveraging their data effectively and driving higher quality will become a core capability for our platform. This is an important milestone for making generative AI more accessible to our community and our enterprise customers."
A new era for building AI applications
The acquisition marks a significant step forward in our mission to accelerate the development of production-grade AI applications. By integrating Refuel.ai’s specialized models and orchestration capabilities into the Together AI Platform, we’re not only removing one of the biggest roadblocks in AI development - dealing with unstructured, messy data - but also enabling our customers to use their data with greater speed, accuracy, and scale.
We’re excited about what this means for the future of generative AI and invite developers and enterprises alike to explore what’s now possible on Together AI.
- Lower
Cost20% - faster
training4x - network
compression117x
Q: Should I use the RedPajama-V2 Dataset out of the box?
RedPajama-V2 is conceptualized as a pool of data that serves as a foundation for creating high quality datasets. The dataset is thus not intended to be used out of the box and, depending on the application, data should be filtered out using the quality signals that accompany the data. With this dataset, we take the view that the optimal filtering of data is dependent on the intended use. Our goal is to provide all the signals and tooling that enables this.
Start building today!
Create a free Together AI account to use your data with greater speed, accuracy, and scale.
article