Chat

Qwen3 4B

4.0B-parameter compact conversational AI model with grouped-query attention optimized for efficient chat applications and instruction following tasks.

Try now

read docs

About model

Qwen3-4B is a 4B-parameter causal language model offering advanced reasoning, instruction-following, and multilingual support capabilities. It excels in tasks requiring complex logical reasoning, math, and coding, while also providing efficient general-purpose dialogue. Suitable for developers and researchers, Qwen3-4B supports 100+ languages and dialects, making it an ideal choice for applications requiring strong multilingual capabilities.

To run this model you first need to deploy it on a Dedicated Endpoint.

Quickstart guides

RAG

Building a RAG Workflow

Agents

Agent Workflows

Apps

Next.js Chat Quickstart

Model card
Conversation Format:
• Advanced system/user/assistant format with dynamic expert activation
• Supports complex multi-turn dialogues with reasoning chains
• Efficient inference through mixture-of-experts architecture
• Strong performance on coding, mathematics, and creative tasks

Expert Utilization:
• Different experts activated based on input content and task requirements
• Seamless switching between mathematical, coding, and linguistic experts
• Contextual understanding with efficient resource allocation
• Maintains conversation quality while optimizing computational efficiency

Optimization Strategies:
• Leverages specialized experts for domain-specific tasks
• Benefits from explicit task specification in prompts
• Responds well to structured reasoning requests
• Optimized for both creative and analytical applications
Prompting
Chat model with system/user/assistant format. Supports conversational context and instruction following capabilities.
Applications & use cases
Efficient chatbots mobile assistants resource-constrained chat applications simple conversation tasks educational tools.

Related models

Model specifications

Model data

Model provider
Qwen
Type
Chat
Main use cases
Chat
Small & Fast
Fine tuning
Supported
Deployment
On-Demand Dedicated
Monthly Reserved
Parameters
4.0B
Context length
32K
Input modalities
Text
Output modalities
Text

Released
April 26, 2025
External link
Provider docs
Category
Chat

Quickstart docs

Deploy model

Qwen3 4B

About model

Model card

Prompting

Applications & use cases