Models / Deep Cogito
Chat
Reasoning

Cogito v2 preview - 405B

Frontier-class dense reasoning model

About model

Cogito 405B represents a significant step toward frontier intelligence with dense architecture delivering performance competitive with leading closed models. This advanced reasoning system combines policy improvement with massive scale for exceptional capabilities.

  • API usage

    • cURL
    • Python
    • Typescript

    Endpoint:

    deepcogito/cogito-v2-preview-llama-405B

    curl -X POST "https://api.together.xyz/v1/chat/completions" \
      -H "Authorization: Bearer $TOGETHER_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "deepcogito/cogito-v2-preview-llama-405B",
        "messages": [
          {
            "role": "user",
            "content": "What are some fun things to do in New York?"
          }
        ]
    }'
    
    from together import Together
    
    client = Together()
    
    response = client.chat.completions.create(
      model="deepcogito/cogito-v2-preview-llama-405B",
      messages=[
        {
          "role": "user",
          "content": "What are some fun things to do in New York?"
        }
      ]
    )
    print(response.choices[0].message.content)
    
    import Together from 'together-ai';
    const together = new Together();
    
    const completion = await together.chat.completions.create({
      model: 'deepcogito/cogito-v2-preview-llama-405B',
      messages: [
        {
          role: 'user',
          content: 'What are some fun things to do in New York?'
         }
      ],
    });
    
    console.log(completion.choices[0].message.content);
    
  • Model card

    This is a hybrid reasoning model. To enable thinking mode, pass the following parameter with your request to the model:



    "chat_template_kwargs": {"enable_thinking": true}

    Here's an example cURL request with thinking enabled:

        
    curl -X POST "https://api.together.xyz/v1/chat/completions" \
      -H "Authorization: Bearer $TOGETHER_API_KEY" \
      -H "Content-Type: application/json" \
        -d '{
        "model": "deepcogito/cogito-v2-preview-llama-405B",
        "temperature": 0.6,
        "chat_template_kwargs": {"enable_thinking": true},
        "messages": [
          {
            "role": "user",
            "content": "What are some fun things to do in New York?"
          }
        ]
    }'
        
    

    Here's an example Python request with thinking enabled:

        
    from together import Together
    
    client = Together()
    
    chat_template_kwargs = {"enable_thinking": True}
    
    response = client.chat.completions.create(
      model="deepcogito/cogito-v2-preview-llama-405B",
      extra_body={"chat_template_kwargs": chat_template_kwargs},
      messages=[
        {
          "role": "user",
          "content": "What are some fun things to do in New York?"
        }
      ]
    )
    print(response.choices[0].message.content)
        
    


    Architecture Overview:

    • 405B dense parameter architecture with optimized efficiency
    • Strong reasoning capabilities in the Cogito model family
    • Advanced policy improvement for both reasoning and non-reasoning modes

    Training Methodology:
    • Dual-mode training improving both standard and reasoning performance
    • Signal-based training for thinking process optimization
    • Advanced distillation techniques preventing reasoning meandering

    Performance Characteristics:
    • Excellent reasoning performance in 405B parameter class
    • Efficient inference with optimized reasoning chains
    • Strong performance across diverse reasoning benchmarks

  • Applications & use cases

    Advanced Research:
    • Scientific research requiring frontier-level reasoning
    • Complex mathematical and logical problem solving
    • Multi-domain analysis and synthesis tasks

    Enterprise Applications:
    • Strategic analysis and decision support systems
    • Advanced content creation and technical writing
    • Research and development acceleration tools

    Academic & Scientific:
    • Collaborative research with human experts
    • Complex data analysis and interpretation
    • Educational content at advanced levels requiring deep reasoning

Related models
  • Model provider
    Deep Cogito
  • Type
    Chat
    Reasoning
  • Main use cases
    Chat
    Reasoning
  • Features
    JSON Mode
  • Deployment
    Serverless
    On-Demand Dedicated
    Monthly Reserved
  • Parameters
    405B
  • Input price

    $3.50 / 1M tokens

  • Output price

    $3.50 / 1M tokens

  • Input modalities
    Text
  • Output modalities
    Text