Chat
4.0B-parameter compact conversational AI model with grouped-query attention optimized for efficient chat applications and instruction following tasks.
Deploy Qwen3 4B

New
To run this model you first need to deploy it on a Dedicated Endpoint.
Qwen3 4B API Usage
Endpoint
Qwen/Qwen3-4B
RUN INFERENCE
RUN INFERENCE
RUN INFERENCE
How to use Qwen3 4B
Model details
Prompting Qwen3 4B
Chat model with system/user/assistant format. Supports conversational context and instruction following capabilities.
Applications & Use Cases
Efficient chatbots mobile assistants resource-constrained chat applications simple conversation tasks educational tools.