Chat

NIM Llama 3.1 8B Instruct

NVIDIA NIM for GPU accelerated Llama 3.1 8B Instruct inference through OpenAI compatible APIs.

About model

NVIDIA NIM serves Meta's Llama 3.1 8B Instruct for enterprise deployment, offering instruction-following capabilities. It specializes in processing complex, nuanced tasks. Ideal for enterprise applications requiring precise, high-capacity language understanding.

To run this model, you first need to deploy it on a Dedicated Endpoint.

Quickstart guides

RAG

Building a RAG Workflow

Agents

Agent Workflows

Apps

Next.js Chat Quickstart

Related models

Model specifications

Model data

Model provider
Meta
Type
Chat
Main use cases
Small & Fast
Deployment
On-Demand Dedicated
Monthly Reserved
Parameters
8B
Context length
128K
Input modalities
Text
Output modalities
Text

Released
July 22, 2024
External link
Provider docs
Category
Chat

Quickstart docs

Deploy model