Llama 3.1 8B
Multilingual LLM pre-trained and instruction-tuned, surpassing open and closed models on key benchmarks.
This model is not available on Together’s Serverless API.
Deploy this model on an on-demand Dedicated Endpoint or pick a supported alternative from the Model Library.
- Type: LLM
- Main use cases: Chat, Small & Fast, Function Calling
- Features: Function Calling, JSON Mode
- Parameters: 8B
- Context length: 128K
- Quantization level: FP8
- Category: Chat