Cogito v2 preview - 671B MoE
World-class MoE reasoning approaching superintelligence
About model
Cogito 671B MoE represents one of the strongest open models globally, matching performance of latest Deepseek models while approaching closed frontier systems like o3 and Claude 4 Opus. This advanced system demonstrates significant progress toward scalable superintelligence through policy improvement.
Model | AIME 2025 | GPQA Diamond | HLE | LiveCodeBench | MATH500 | SWE-bench verified |
|---|---|---|---|---|---|---|
Cogito v2 preview - 671B MoE | 69.7% | Related open-source models | Competitor closed-source models | |||
90.5% | 34.2% | 78.7% | ||||
83.3% | 24.9% | 99.2% | 62.3% | |||
76.8% | 96.4% | 48.9% | ||||
49.2% | 2.7% | 32.3% | 89.3% | 31.0% |
API usage
Endpoint:
Model card
This is a hybrid reasoning model. To enable thinking mode, pass the following parameter with your request to the model:
Here's an example cURL request with thinking enabled:
Here's an example Python request with thinking enabled:
Architecture Overview:
• Massive 671B mixture-of-experts architecture with intelligent routing
• World-class reasoning capabilities among strongest open models
• Advanced policy improvement for both reasoning and non-reasoning modes
Training Methodology:
• Dual-mode training improving both standard and reasoning performance
• Signal-based training for thinking process optimization
• Advanced distillation techniques preventing reasoning meandering
Performance Characteristics:
• Matches Deepseek v3 0324 in non-reasoning mode
• Outperforms Deepseek R1 with 60% shorter reasoning chains
• Approaches performance of o3 and Claude 4 Opus frontier modelsApplications & use cases
Frontier Research:
• Advanced scientific research and discovery
• Complex theoretical analysis and mathematical proofs
• Multi-disciplinary research requiring world-class reasoning
Strategic Applications:
• High-stakes decision making and strategic planning
• Advanced competitive analysis and market research
• Complex system design and optimization
Superintelligence Development:
• Foundation for next-generation AI research
• Scalable self-improvement research and development
• Open source contribution to AGI and superintelligence efforts
- TypeChatReasoning
- Main use casesChatReasoning
- FeaturesJSON Mode
- DeploymentServerlessOn-Demand DedicatedMonthly Reserved
- Parameters671B MoE
- Input price
$1.25 / 1M tokens
- Output price
$1.25 / 1M tokens
- Input modalitiesText
- Output modalitiesText
- CategoryChat