Models / QwenQwen / / Qwen3 235B A22B Thinking 2507 FP8 API
Qwen3 235B A22B Thinking 2507 FP8 API
235B-parameter MoE thinking model, 256K context, 22B activated experts, state-of-the-art reasoning performance among open-source models.

This model is not currently supported on Together AI.
Visit our Models page to view all the latest models.
Qwen3 235B A22B Thinking 2507 FP8 API Usage
Endpoint
How to use Qwen3 235B A22B Thinking 2507 FP8
Model details
Architecture Overview:
• Mixture-of-Experts transformer with 235B total parameters and 22B activated
• 94 layers with grouped query attention (64 for Q and 4 for KV)
• 128 experts with 8 activated experts per token for efficient inference
• Native 262,144 token context window for extensive document processing
Training Methodology:
• Advanced pretraining & post-training pipeline with thinking capability enhancement
• Specialized training for logical reasoning, mathematics, science, and coding tasks
• Constitutional AI training for alignment with human preferences
• Optimized for complex multi-step reasoning with increased thinking length
Performance Characteristics:
• State-of-the-art results among open-source thinking models on academic benchmarks
• Exceptional performance on AIME25, HMMT25, and LiveCodeBench evaluations
• Enhanced 256K long-context understanding capabilities
• Optimized inference with MoE architecture for computational efficiency
Prompting Qwen3 235B A22B Thinking 2507 FP8
Applications & Use Cases
Advanced Reasoning Applications:
• Complex mathematical problem solving & academic research
• Scientific analysis requiring multi-step logical reasoning
• Advanced coding challenges & algorithm development
• Academic benchmarking & competitive programming
Enterprise & Professional Use:
• High-complexity business analysis & strategic planning
• Technical documentation with detailed reasoning chains
• Expert-level consultation systems requiring deep thinking
• Research & development applications in specialized domains
Educational & Research:
• Graduate-level tutoring in STEM subjects
• Research paper analysis & academic writing assistance
• Complex problem decomposition for educational purposes
• Multilingual academic support across 119+ languages
Developer Integration:
• Tool calling capabilities with Qwen-Agent framework
• OpenAI-compatible API endpoints via SGLang or vLLM
• Integration with local deployment tools (Ollama, LMStudio, llama.cpp)
• Custom reasoning workflows for specialized applications