Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and in LLM responses (response classification). It acts as an LLM that generates text indicating whether a given prompt or response is safe or unsafe, and if unsafe, it also lists the content categories violated. Llama Guard 3 was aligned to safeguard against the MLCommons standardized hazards taxonomy and designed to support Llama 3.1 capabilities. Specifically, it provides content moderation in 8 languages, and was optimized to support safety and security for search and code interpreter tool calls.
Try NowMultilingual content moderation
Safety classification for search or code tools
Content safety with large context
131,072 tokens
131,072 tokens
$0.02 per 1M tokens
$0.06 per 1M tokens
$15 per 1K calls
$0.19 per 1K calls