content-moderation-api
// Content moderation API integration using OpenAI Moderation, Perspective API, and others
$ git log --oneline --stat
stars:384
forks:73
updated:March 4, 2026
SKILL.mdreadonly
SKILL.md Frontmatter
namecontent-moderation-api
descriptionContent moderation API integration using OpenAI Moderation, Perspective API, and others
allowed-toolsRead,Write,Edit,Bash,Glob,Grep
Content Moderation API Skill
Capabilities
- Integrate OpenAI Moderation API
- Set up Perspective API for toxicity detection
- Configure moderation thresholds
- Implement content filtering pipelines
- Design moderation response handling
- Create moderation logging and reporting
Target Processes
- content-moderation-safety
- system-prompt-guardrails
Implementation Details
Moderation APIs
- OpenAI Moderation: Hate, violence, self-harm, sexual content
- Perspective API: Toxicity, insult, profanity, threat
- Azure Content Safety: Text and image moderation
- LlamaGuard: Open-source safety classifier
Configuration Options
- API credentials and endpoints
- Category thresholds
- Action policies (block, warn, flag)
- Logging configuration
- Fallback behavior
Best Practices
- Set appropriate thresholds
- Handle edge cases gracefully
- Log moderation decisions
- Regular threshold review
- Multi-layer moderation
Dependencies
- openai
- google-cloud-language (Perspective)
- azure-ai-contentsafety