Skip to content

Content Moderation — Notes#

Functional#

  • Detect & act on disallowed content (text/image/video/audio).
  • Human review queue for ambiguous.
  • Appeals.
  • Compliance reports & legal hold.

Non-functional#

  • Latency: live streams need second-scale; posts can be minutes.
  • High recall on CSAM, terrorism (zero-tolerance categories).
  • Worker welfare considerations critical.

Trade-offs#

  • Auto-action vs human-review: high precision auto-act, ambiguous → queue.
  • Region-specific policies force per-region pipelines.
  • Appeals friction vs abuse of appeals.

Refs#

  • Meta Transparency Reports, Trust & Safety Professional Association papers.
  • "Behind the Screen" Sarah T. Roberts (book).
  • PhotoDNA whitepaper.