Models / Deep Cogito
Chat
Reasoning

Cogito v2 preview - 109B MoE

Efficient MoE reasoning with advanced capabilities

About model

Cogito 109B MoE leverages mixture-of-experts architecture to deliver advanced reasoning capabilities with computational efficiency. This hybrid model excels at both direct responses and complex reasoning tasks while maintaining multimodal capabilities through innovative transfer learning.

  • API usage

    • cURL
    • Python
    • Typescript

    Endpoint:

    deepcogito/cogito-v2-preview-llama-109B-MoE

    curl -X POST "https://api.together.xyz/v1/chat/completions" \
      -H "Authorization: Bearer $TOGETHER_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "deepcogito/cogito-v2-preview-llama-109B-MoE",
        "messages": [
          {
            "role": "user",
            "content": "What are some fun things to do in New York?"
          }
        ]
    }'
    
    from together import Together
    
    client = Together()
    
    response = client.chat.completions.create(
      model="deepcogito/cogito-v2-preview-llama-109B-MoE",
      messages=[
        {
          "role": "user",
          "content": "What are some fun things to do in New York?"
        }
      ]
    )
    print(response.choices[0].message.content)
    
    import Together from 'together-ai';
    const together = new Together();
    
    const completion = await together.chat.completions.create({
      model: 'deepcogito/cogito-v2-preview-llama-109B-MoE',
      messages: [
        {
          role: 'user',
          content: 'What are some fun things to do in New York?'
         }
      ],
    });
    
    console.log(completion.choices[0].message.content);
    
  • Model card

    This is a hybrid reasoning model. To enable thinking mode, pass the following parameter with your request to the model:



    "chat_template_kwargs": {"enable_thinking": true}

    Here's an example cURL request with thinking enabled:

        
    curl -X POST "https://api.together.xyz/v1/chat/completions" \
      -H "Authorization: Bearer $TOGETHER_API_KEY" \
      -H "Content-Type: application/json" \
        -d '{
        "model": "deepcogito/cogito-v2-preview-llama-109B-MoE",
        "temperature": 0.6,
        "chat_template_kwargs": {"enable_thinking": true},
        "messages": [
          {
            "role": "user",
            "content": "What are some fun things to do in New York?"
          }
        ]
    }'
        
    

    Here's an example Python request with thinking enabled:

        
    from together import Together
    
    client = Together()
    
    chat_template_kwargs = {"enable_thinking": True}
    
    response = client.chat.completions.create(
      model="deepcogito/cogito-v2-preview-llama-109B-MoE"",
      extra_body={"chat_template_kwargs": chat_template_kwargs},
      messages=[
        {
          "role": "user",
          "content": "What are some fun things to do in New York?"
        }
      ]
    )
    print(response.choices[0].message.content)
        
    


    Architecture Overview:

    • 109B MoE mixture-of-experts architecture with intelligent routing
    • Strong reasoning capabilities in the Cogito model family
    • Advanced policy improvement for both reasoning and non-reasoning modes

    Training Methodology:
    • Dual-mode training improving both standard and reasoning performance
    • Signal-based training for thinking process optimization
    • Advanced distillation techniques preventing reasoning meandering

    Performance Characteristics:
    • Excellent reasoning performance in 109B MoE parameter class
    • Efficient inference with optimized reasoning chains
    • Strong performance across diverse reasoning benchmarks

  • Applications & use cases

    Complex Reasoning Tasks:
    • Multi-step mathematical proofs and scientific analysis
    • Logical reasoning requiring expert knowledge activation
    • Research synthesis across multiple domains

    Multimodal Analysis:
    • Image comparison and visual reasoning tasks
    • Document analysis with text and visual components
    • Educational content requiring cross-modal understanding

    Efficiency-Critical Applications:
    • Real-time reasoning with computational constraints
    • Batch processing of diverse reasoning tasks
    • Scalable AI applications requiring expert-level performance

Related models
  • Model provider
    Deep Cogito
  • Type
    Chat
    Reasoning
  • Main use cases
    Chat
    Vision
  • Features
    JSON Mode
  • Deployment
    Serverless
    On-Demand Dedicated
    Monthly Reserved
  • Parameters
    109B MoE
  • Input price

    $0.18 / 1M tokens

  • Output price

    $0.59 / 1M tokens

  • Input modalities
    Text
    Image
  • Output modalities
    Text