Content Moderation — Notes
Functional
- Detect & act on disallowed content (text/image/video/audio).
- Human review queue for ambiguous.
- Appeals.
- Compliance reports & legal hold.
Non-functional
- Latency: live streams need second-scale; posts can be minutes.
- High recall on CSAM, terrorism (zero-tolerance categories).
- Worker welfare considerations critical.
Trade-offs
- Auto-action vs human-review: high precision auto-act, ambiguous → queue.
- Region-specific policies force per-region pipelines.
- Appeals friction vs abuse of appeals.
Refs
- Meta Transparency Reports, Trust & Safety Professional Association papers.
- "Behind the Screen" Sarah T. Roberts (book).
- PhotoDNA whitepaper.