Chat

NIM Llama 3.1 70B Instruct

NVIDIA NIM for GPU accelerated Llama 3.1 70B Instruct inference through OpenAI compatible APIs.

About model

NVIDIA NIM serves Meta's Llama 3.1 70B Instruct for enterprise deployment, offering instruct-tuned language understanding and generation capabilities. It excels at following instructions and producing coherent text. Designed for enterprise use cases.

To run this model, you first need to deploy it on a Dedicated Endpoint.

Quickstart guides

RAG

Building a RAG Workflow

Agents

Agent Workflows

Apps

Next.js Chat Quickstart

Related models

Model specifications

Model data

Model provider
Meta
Type
Chat
Main use cases
Medium General Purpose
Deployment
On-Demand Dedicated
Monthly Reserved
Parameters
70B
Context length
128K
Input modalities
Text
Output modalities
Text

Released
July 22, 2024
External link
Provider docs
Category
Chat

Quickstart docs

Deploy model