content-moderation-api

// Content moderation API integration using OpenAI Moderation, Perspective API, and others

Updated: March 4, 2026
SKILL.md Frontmatter
name: content-moderation-api
description: Content moderation API integration using OpenAI Moderation, Perspective API, and others
allowed-tools: Read,Write,Edit,Bash,Glob,Grep

Content Moderation API Skill

Capabilities

  • Integrate OpenAI Moderation API
  • Set up Perspective API for toxicity detection
  • Configure moderation thresholds
  • Implement content filtering pipelines
  • Design moderation response handling
  • Create moderation logging and reporting

Target Processes

  • content-moderation-safety
  • system-prompt-guardrails

Implementation Details

Moderation APIs

  1. OpenAI Moderation: classifies text for hate, violence, self-harm, and sexual content
  2. Perspective API: scores text for toxicity, insult, profanity, and threat
  3. Azure Content Safety: text and image moderation
  4. Llama Guard: open-weight safety classifier from Meta
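
Each provider returns scores in a different shape, so a thin normalization step makes the rest of a pipeline provider-agnostic. The following is a minimal sketch, assuming you already have the JSON response bodies as Python dicts. The function names and the sample payloads are hypothetical and hand-written for illustration; only the field names (`results`, `category_scores`, `attributeScores`, `summaryScore`) follow the documented response formats of the OpenAI Moderation and Perspective APIs.

```python
from typing import Any


def normalize_openai(response: dict[str, Any]) -> dict[str, float]:
    """Return {category: score} from an OpenAI moderation response body."""
    result = response["results"][0]
    return dict(result["category_scores"])


def normalize_perspective(response: dict[str, Any]) -> dict[str, float]:
    """Return {attribute: score} from a Perspective comments:analyze response body."""
    return {
        name.lower(): data["summaryScore"]["value"]
        for name, data in response["attributeScores"].items()
    }


# Hand-written sample payloads for illustration only, not real API output.
openai_sample = {
    "results": [
        {
            "flagged": True,
            "categories": {"hate": False, "violence": True},
            "category_scores": {"hate": 0.02, "violence": 0.91},
        }
    ]
}
perspective_sample = {
    "attributeScores": {
        "TOXICITY": {"summaryScore": {"value": 0.87, "type": "PROBABILITY"}},
        "INSULT": {"summaryScore": {"value": 0.40, "type": "PROBABILITY"}},
    }
}

openai_scores = normalize_openai(openai_sample)
perspective_scores = normalize_perspective(perspective_sample)
```

Lower-casing the Perspective attribute names is a design choice of this sketch, so that both providers yield keys in the same style. Category names still differ between providers and would need an explicit mapping if they are compared directly.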

Configuration Options

  • API credentials and endpoints
  • Category thresholds
  • Action policies (block, warn, flag)
  • Logging configuration
  • Fallback behavior
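
One way to hold category thresholds, action policies, and fallback behavior together is a small config object plus a decision function. This is a sketch under stated assumptions, not a prescribed layout: `ModerationConfig`, `Action`, `decide`, and the severity ordering (block above warn above flag) are all hypothetical names and choices introduced here. Credentials and logging settings are left out to keep the example short.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Action(Enum):
    ALLOW = "allow"
    FLAG = "flag"
    WARN = "warn"
    BLOCK = "block"


# Assumed ordering: a higher number wins when several categories trigger.
SEVERITY = {Action.ALLOW: 0, Action.FLAG: 1, Action.WARN: 2, Action.BLOCK: 3}


@dataclass(frozen=True)
class ModerationConfig:
    thresholds: dict[str, float]           # category -> minimum score that triggers
    actions: dict[str, Action]             # category -> action when triggered
    default_action: Action = Action.FLAG   # triggered category with no explicit action
    fallback_action: Action = Action.FLAG  # no scores available (e.g. provider outage)


def decide(scores: Optional[dict[str, float]], config: ModerationConfig) -> Action:
    """Map normalized scores to one action; None means the provider call failed."""
    if scores is None:
        return config.fallback_action
    triggered = [
        config.actions.get(category, config.default_action)
        for category, threshold in config.thresholds.items()
        if scores.get(category, 0.0) >= threshold
    ]
    if not triggered:
        return Action.ALLOW
    return max(triggered, key=lambda action: SEVERITY[action])


config = ModerationConfig(
    thresholds={"violence": 0.8, "insult": 0.6},
    actions={"violence": Action.BLOCK, "insult": Action.WARN},
)
```

For example, `decide({"violence": 0.91, "insult": 0.7}, config)` returns `Action.BLOCK`, and `decide(None, config)` returns the configured fallback. Whether the fallback should flag, block, or allow depends on how costly a missed violation is for the product.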

Best Practices

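  • Set thresholds per category and tune them against labeled samples of your own content
  • Handle edge cases gracefully (empty input, very long text, API errors and timeouts)
  • Log every moderation decision
  • Review thresholds regularly
  • Layer multiple moderation checks instead of relying on a single classifier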
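
The logging, edge-case, and multi-layer points can be combined in a small pipeline. The sketch below is illustrative only: `moderate`, `blocklist_layer`, `failing_layer`, and the one-word blocklist are hypothetical. It assumes each layer is a callable that returns a reason string when it objects and `None` otherwise. A layer that raises is recorded and skipped, which is one possible policy, not the only reasonable one.

```python
import json
import logging
from typing import Callable, Optional

logger = logging.getLogger("moderation")

# A layer takes text and returns a reason string if it objects, else None.
Layer = Callable[[str], Optional[str]]


def blocklist_layer(text: str) -> Optional[str]:
    """Cheap first layer: exact word match against a hypothetical blocklist."""
    blocked = {"badword"}
    hits = blocked.intersection(text.lower().split())
    return f"blocklist:{sorted(hits)[0]}" if hits else None


def failing_layer(text: str) -> Optional[str]:
    """Stand-in for a remote classifier that is currently unavailable."""
    raise TimeoutError("classifier unreachable")


def moderate(text: str, layers: list[Layer]) -> dict:
    """Run layers in order, stop at the first objection, and log the outcome."""
    decision = {"allowed": True, "reason": None, "errors": []}
    if text.strip():  # empty or whitespace-only input has nothing to moderate
        for layer in layers:
            try:
                reason = layer(text)
            except Exception as exc:  # record the failure and try the next layer
                decision["errors"].append(f"{layer.__name__}: {exc}")
                continue
            if reason is not None:
                decision.update(allowed=False, reason=reason)
                break
    # Log the decision only, not the user text, to limit stored sensitive content.
    logger.info(json.dumps(decision))
    return decision
```

Calling `moderate("some badword here", [failing_layer, blocklist_layer])` records the first layer's error and still blocks on the second. Skipping a failed layer means that layer fails open, so a stricter deployment might treat any recorded error as grounds to flag the content for review.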

Dependencies

  • openai
  • google-api-python-client (Perspective API)
  • azure-ai-contentsafety