NIM Llama 3.1 Nemotron 70B Instruct
NVIDIA NIM for GPU accelerated Llama 3.1 Nemotron 70B Instruct inference through OpenAI compatible APIs.
About model
NVIDIA's Llama-3.1-Nemotron-70B-Instruct fine-tunes for alignment and helpfulness, providing accurate and informative responses. It specializes in generating human-like text based on user input. Suitable for developers and researchers requiring advanced language understanding capabilities.
To run this model, you first need to deploy it on a Dedicated Endpoint.
- TypeChat
- DeploymentOn-Demand DedicatedMonthly Reserved
- Parameters70B
- Context length128K
- Input modalitiesText
- Output modalitiesText
- ReleasedSeptember 30, 2024
- Last updatedAugust 26, 2025
- External link
- CategoryChat