Models / Meta
Chat

NIM Llama 3.1 8B Instruct

NVIDIA NIM for GPU accelerated Llama 3.1 8B Instruct inference through OpenAI compatible APIs.

About model

NVIDIA NIM serves Meta's Llama 3.1 8B Instruct for enterprise deployment, offering instruction-following capabilities. It specializes in processing complex, nuanced tasks. Ideal for enterprise applications requiring precise, high-capacity language understanding.

To run this model, you first need to deploy it on a Dedicated Endpoint.

    Related models
    • Model provider
      Meta
    • Type
      Chat
    • Main use cases
      Small & Fast
    • Deployment
      On-Demand Dedicated
      Monthly Reserved
    • Parameters
      8B
    • Context length
      128K
    • Input modalities
      Text
    • Output modalities
      Text