Qwen3 4B
4.0B-parameter compact conversational AI model with grouped-query attention optimized for efficient chat applications and instruction following tasks.
About model
Qwen3-4B is a 4B-parameter causal language model offering advanced reasoning, instruction-following, and multilingual support capabilities. It excels in tasks requiring complex logical reasoning, math, and coding, while also providing efficient general-purpose dialogue. Suitable for developers and researchers, Qwen3-4B supports 100+ languages and dialects, making it an ideal choice for applications requiring strong multilingual capabilities.
To run this model you first need to deploy it on a Dedicated Endpoint.
Model card
Conversation Format:
• Advanced system/user/assistant format with dynamic expert activation
• Supports complex multi-turn dialogues with reasoning chains
• Efficient inference through mixture-of-experts architecture
• Strong performance on coding, mathematics, and creative tasks
Expert Utilization:
• Different experts activated based on input content and task requirements
• Seamless switching between mathematical, coding, and linguistic experts
• Contextual understanding with efficient resource allocation
• Maintains conversation quality while optimizing computational efficiency
Optimization Strategies:
• Leverages specialized experts for domain-specific tasks
• Benefits from explicit task specification in prompts
• Responds well to structured reasoning requests
• Optimized for both creative and analytical applicationsPrompting
Chat model with system/user/assistant format. Supports conversational context and instruction following capabilities.
Applications & use cases
Efficient chatbots mobile assistants resource-constrained chat applications simple conversation tasks educational tools.
- TypeChat
- Main use casesChatSmall & Fast
- Fine tuningSupported
- DeploymentOn-Demand DedicatedMonthly Reserved
- Parameters4.0B
- Context length32K
- Input modalitiesText
- Output modalitiesText
- ReleasedApril 26, 2025
- External link
- CategoryChat