Meta's 7-8B specialized moderation model for LLM input/output filtering
Skill / Understand
+12 Meta's 7-8B specialized moderation model for LLM input/output filtering. 6 safety categories - violence/hate, sexual content, weapons, substances, self-harm, criminal planning. 94-95% accuracy. Deploy with vLLM, HuggingFace, Sagemaker. Integrates with NeMo Guardrails.
Orchestra AI Research Skills · 9.2k chars Upgrade