Models / Vision / NIM Llama 3.2 11B Vision Instruct API
NIM Llama 3.2 11B Vision Instruct API
Vision
NVIDIA NIM for GPU accelerated Llama 3.2 11B Vision Instruct inference through OpenAI compatible APIs.
Deploy this NIM model

Models / Vision / NIM Llama 3.2 11B Vision Instruct API
NVIDIA NIM for GPU accelerated Llama 3.2 11B Vision Instruct inference through OpenAI compatible APIs.
Endpoint
RUN INFERENCE
This model is available as a Together Dedicated Endpoints deployment.
Follow our Docs to configure an endpoint via our API or CLI.
JSON RESPONSE
RUN INFERENCE
This model is available as a Together Dedicated Endpoints deployment.
Follow our Docs to configure an endpoint via our API or CLI.
JSON RESPONSE
RUN INFERENCE
This model is available as a Together Dedicated Endpoints deployment.
Follow our Docs to configure an endpoint via our API or CLI.
JSON RESPONSE
Model Provider:
Meta
Type:
Vision
Variant:
Instruct
Parameters:
11B
Deployment:
✔️ Dedicated
Quantization
Context length:
128K
Pricing:
Run in playground
Deploy model
Quickstart docs
Quickstart docs
Deploy NIM Llama 3.2 11B Vision Instruct on a dedicated endpoint with custom hardware configuration, as many instances as you need, and auto-scaling.