Models / Meta
Moderation

Llama Guard 2 8B

8B Llama 3-based safeguard model for classifying LLM inputs and outputs, detecting unsafe content and policy violations.

About model

Llama Guard 2 8B detects and flags potentially harmful or sensitive content, serving as a robust tool for developers and content moderators seeking to ensure online safety and compliance. Its key strength lies in accurately identifying nuanced and context-dependent threats. It is designed for professionals managing online platforms and communities.

To use this moderation model, please follow the instructions from our blog post.
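Independent of the blog-post setup, applications typically need to interpret the model's reply. Llama Guard 2 responds with `safe`, or with `unsafe` followed on the next line by comma-separated MLCommons hazard codes (S1–S11). The sketch below is a minimal, hypothetical helper (the function name is ours, not part of any official SDK) that parses such a completion:

```python
def parse_guard_output(text: str) -> tuple[bool, list[str]]:
    """Parse a Llama Guard 2 completion into (is_safe, violated_categories).

    Assumes the documented output format: "safe", or "unsafe" followed by
    a newline and comma-separated hazard codes such as "S1,S3".
    """
    lines = text.strip().splitlines()
    if not lines:
        # Treat an empty completion as unsafe with no category info.
        return False, []
    verdict = lines[0].strip().lower()
    if verdict == "safe":
        return True, []
    categories: list[str] = []
    if len(lines) > 1:
        categories = [c.strip() for c in lines[1].split(",") if c.strip()]
    return False, categories
```

A caller would pass the raw completion string to this helper, e.g. `parse_guard_output("unsafe\nS1")` yields `(False, ["S1"])`, which can then drive a block/allow decision.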

    Model details
    • Model provider
      Meta
    • Type
      Moderation
    • Main use cases
      Small & Fast
      Moderation
    • Deployment
      Monthly Reserved
    • Parameters
      8B
    • Context length
      8K
    • Input price
      $0.20 / 1M tokens
    • Output price
      $0.20 / 1M tokens
    • Input modalities
      Text
    • Output modalities
      Text
    • Released
      April 18, 2024
    • Last updated
      February 24, 2026
    • Category
      Moderation