Qwen2.5 72B

Decoder-only model built for advanced language processing tasks.

About model

Qwen2.5 72B is a large language model with improved capabilities in coding, mathematics, and instruction following, supporting long-context and multilingual text generation for over 29 languages, suitable for developers and researchers.

Quickstart guides

RAG

Building a RAG Workflow

Agents

Agent Workflows

Apps

Next.js Chat Quickstart

Related models

Model specifications

Model data

Model provider
Qwen
Type
LLM
Chat
Main use cases
Chat
Function Calling
Features
Function Calling
Deployment
On-Demand Dedicated
Monthly Reserved
Parameters
72B
Context length
32768
Input price
$1.20 / 1M tokens
Output price
$1.20 / 1M tokens
Input modalities
Text
Output modalities
Text

Released
September 17, 2024
Last updated
February 5, 2026
Quantization level
FP8
External link
Provider docs
Category
Chat

Quickstart docs

Deploy model