Providers
Perspective API (Google). OpenAI Moderation (free). Azure Content Safety. Amazon Bedrock Guardrails. Each has own categories.
Advertisement
Open-source
Detoxify. Llama Guard. ToxDECT. Local deployment for latency + privacy.
Advertisement
Streaming filter
Classify each chunk as streams. Cut stream on toxic. Some latency: can't classify partial sentence perfectly. Buffer + classify per sentence.