Models / Meta
LLM

Llama 3 8B Instruct Lite

Auto-regressive LLM with optimized transformers, SFT, and RLHF for alignment with helpfulness and safety preferences.

About model

Llama 3 8B Instruct Lite generates human-like text based on input prompts, excelling at conversational tasks. It is optimized for efficiency and suitable for developers and businesses requiring a balance of performance and resource usage.

  • API usage

    • cURL
    • Python
    • Typescript

    Endpoint:

    meta-llama/Meta-Llama-3-8B-Instruct-Lite

    curl -X POST "https://api.together.xyz/v1/chat/completions" \
      -H "Authorization: Bearer $TOGETHER_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "meta-llama/Meta-Llama-3-8B-Instruct-Lite",
        "messages": [
          {
            "role": "user",
            "content": "What are some fun things to do in New York?"
          }
        ]
    }'
    
    from together import Together
    
    client = Together()
    
    response = client.chat.completions.create(
      model="meta-llama/Meta-Llama-3-8B-Instruct-Lite",
      messages=[
        {
          "role": "user",
          "content": "What are some fun things to do in New York?"
        }
      ]
    )
    print(response.choices[0].message.content)
    
    import Together from 'together-ai';
    const together = new Together();
    
    const completion = await together.chat.completions.create({
      model: 'meta-llama/Meta-Llama-3-8B-Instruct-Lite',
      messages: [
        {
          role: 'user',
          content: 'What are some fun things to do in New York?'
         }
      ],
    });
    
    console.log(completion.choices[0].message.content);
    
Related models
  • Model provider
    Meta
  • Type
    LLM
  • Main use cases
    Chat
    Small & Fast
  • Features
    JSON Mode
  • Deployment
    Serverless
    On-Demand Dedicated
    Monthly Reserved
  • Parameters
    8B
  • Context length
    8K
  • Input price

    $0.10 / 1M tokens

  • Output price

    $0.10 / 1M tokens

  • Input modalities
    Text
  • Output modalities
    Text
  • Released
    April 17, 2024
  • Quantization level
    INT4
  • External link
  • Category
    Chat