🚀 Now serving MiniMax-M3 for efficient inference →

📊 Delivering 31% more TPS than the next-fastest OSS engine for production coding agent workloads →

💬 How Together built the world's fastest speech-to-text stack →

🇫🇷 Join us at RAISE 2026 in Paris →

  • Inference

    • Serverless Inference

      High-performance inference as APIs

    • Batch Inference

      Inference for batch workloads

    • Dedicated Model Inference

      Inference on custom hardware

    • Dedicated Container Inference

      Inference for custom models

    MiniMax M2.5
    Nano Banana Pro
    Qwen3.5-397B
    GLM-5
    White circular shape with uneven edges and three extended finger-like projections on a black background.
    kimi k2.5
    OpenAI logo with a symmetrical abstract geometric knot design.
    gpt-oss-120B

    Model library

    Explore the top open-source models

  • Compute

    Accelerated Compute

    • GPU Clusters

      Reliable GPU clusters at scale

    • AI Factory

      Custom infrastructure at frontier scale

    Developer Environments

    • Sandbox

      Build development environments for AI

    Storage

    • Managed Storage

      Store model weights & data securely

    • GB300

    • GB200

    • B200

    • H200

    • H100

  • Model Shaping

    • Fine-Tuning

      Shape models with your data

    • Evaluations

      Measure model quality

    Blue stylized fox curled up in a circular shape with tail wrapped around body.
    DeepSeek V3.1
    GLM 5 FP4
    Qwen3-VL 32B
    OpenAI logo with a symmetrical abstract geometric knot design.
    gpt-oss-120b
    White circular shape with uneven edges and three extended finger-like projections on a black background.
    kimi k2.5
    Llama 4 Maverick

    Model library

    Fine-tune top open-source models

  • Research

    • Research

      Systems research for production AI

    • Research blog

      All our research publications

    Featured publications

    • FlashAttention

    • ATLAS

    • Kernel Collection

    • ThunderKittens

    • DSGym

    Show all
  • Developers

    • Documentation

      Technical docs for Together AI

    • Demos

      Our open-source demo apps

    • Cookbooks

      Practical implementation guides

    • Voice Agents

      Build voice agents for production

    • Model Library

    • Playground

    • Together Chat

    • Which LLM to use

  • Company

    Resources

    • Customer stories

      Testimonials from AI Natives

    • Startup accelerator

      Build and scale your startup

    • Customer support

      Find answers to your questions

    • Blog

      Our latest news & blog posts

    • Events

      Explore our events calendar

    Company

    • About

      Get to know us

    • Careers

      Join our mission

  • Pricing

    • Serverless Inference

      High-performance inference as APIs

    • Batch Inference

      Inference for batch workloads

    • Dedicated Model Inference

      Inference on custom hardware

    • Dedicated Container Inference

      Inference for custom models

    MiniMax M2.5
    Nano Banana Pro
    Qwen3.5-397B
    GLM-5
    White circular shape with uneven edges and three extended finger-like projections on a black background.
    kimi k2.5
    OpenAI logo with a symmetrical abstract geometric knot design.
    gpt-oss-120B

    Model library

    Explore the top open-source models

  • Accelerated Compute

    • GPU Clusters

      Reliable GPU clusters at scale

    • AI Factory

      Custom infrastructure at frontier scale

    Developer Environments

    • Sandbox

      Build development environments for AI

    Storage

    • Managed Storage

      Store model weights & data securely

    • GB300

    • GB200

    • B200

    • H200

    • H100

    • Fine-Tuning

      Shape models with your data

    • Evaluations

      Measure model quality

    Blue stylized fox curled up in a circular shape with tail wrapped around body.
    DeepSeek V3.1
    GLM 5 FP4
    Qwen3-VL 32B
    OpenAI logo with a symmetrical abstract geometric knot design.
    gpt-oss-120b
    White circular shape with uneven edges and three extended finger-like projections on a black background.
    kimi k2.5
    Llama 4 Maverick

    Model library

    Fine-tune top open-source models

    • Research

      Systems research for production AI

    • Research blog

      All our research publications

    Featured publications

    • FlashAttention

    • ATLAS

    • Kernel Collection

    • ThunderKittens

    • DSGym

    Show all
    • Documentation

      Technical docs for Together AI

    • Demos

      Our open-source demo apps

    • Cookbooks

      Practical implementation guides

    • Voice Agents

      Build voice agents for production

    • Model Library

    • Playground

    • Together Chat

    • Which LLM to use

  • Resources

    • Customer stories

      Testimonials from AI Natives

    • Startup accelerator

      Build and scale your startup

    • Customer support

      Find answers to your questions

    • Blog

      Our latest news & blog posts

    • Events

      Explore our events calendar

    Company

    • About

      Get to know us

    • Careers

      Join our mission

Contact sales
Contact sales
Sign in
Explore Research

Research blog

All
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Inference
DeepSWE: Training a Fully Open-sourced, State-of-the-Art Coding Agent by Scaling RL

    Michael Luo*, Naman Jain*, Jaskirat Singh*, Sijun Tan*, Ameen Patel*, Qingyang Wu*, Alpay Ariyak*, Colin Cai*, Tarun Venkat, Shang Zhu, Ben Athiwaratkun, Manan Roongta, Ce Zhang, Li Erran Li, Raluca Ada Popa, Koushik Sen, Ion Stoica

    Chart showing SWE-Bench performance vs model size for various models with DeepSWE-Preview + TTS leading at 59%.
    Agents
    From Zero to One: Building An Autonomous and Open Data Scientist Agent from Scratch

      Federico Bianchi, Shang Zhu, Zain Hasan, Ben Athiwaratkun and James Zou

      Inference
      Model-Preserving Adaptive Rounding with YAQA

        Albert Tseng, Zhaofeng Sun, and Chris De Sa

        Bar chart showing KL divergence for quantized models Llama 3.1 and Gemma 3, highlighting YAQA's lower values.
        Agents
        Mixture-of-Agents Alignment: Harnessing the Collective Intelligence of Open-Source LLMs to Improve Post-Training

          Junlin Wang, Roy Xie, Shang Zhu, Jue Wang, Ben Athiwaratkun, Bhuwan Dhingra, Shuaiwen Leon Song, Ce Zhang, James Zou

          Bar chart comparing baseline, teachers, GPT-4o, and MoAA on AlpacaEval 2 and Arena-Hard scores in percentages.
          Previous
          Load more
          7 / 20

          No search result

          Try expanding your search or changing the filters.

          Be at the forefront of AI innovation

          From optimized training and model shaping to large-scale production inference

          See open roles
          • Products

            • Accelerated Compute

            • Serverless Inference

            • Dedicated Inference

            • Fine-Tuning

            • Sandbox

            • Evaluations

          • Models

            See all models

            DeepSeek

            Meta

            Qwen

            Google

            OpenAI

            Mistral AI

            Custom models

          • Developers

            • Research

            • Docs

            Pricing

            • Pricing overview

            • Inference

            • Fine-Tuning

            • GPU Clusters

          • Resources

            • Blog

            • About us

            • Careers

            • Customer Stories

            • Support

          • Privacy Policy

          • Terms of service

          • Cookie Policy

          • Consent Preferences

          © 2026 Together AI. All Rights Reserved.