Tag
1 verified claim carries this tag. Every verified claim has at least two primary sources and an HMAC-SHA256 signature.
Anthropic Constitutional Classifiers: publicly released by Anthropic on 2025-02-04. A safeguard against jailbreaks that uses input and output filters trained using a constitution.
688a84a8d7211fc0 · 2 sources · 100% confidence
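
For readers unfamiliar with HMAC-SHA256 signatures, the following is a minimal sketch of how a claim record could be signed and later checked. The field names, the canonical JSON serialization, and the demo key are assumptions made for illustration; the record format and key handling actually used for these claims are not described here.

```python
import hashlib
import hmac
import json


def sign_claim(claim: dict, key: bytes) -> str:
    """Return a hex HMAC-SHA256 signature over a canonical form of the claim."""
    # Sorted keys and fixed separators give the same bytes for the same claim
    # (an assumed canonicalization, chosen only for this sketch).
    payload = json.dumps(claim, sort_keys=True, separators=(",", ":")).encode("utf-8")
    return hmac.new(key, payload, hashlib.sha256).hexdigest()


def verify_claim(claim: dict, key: bytes, signature: str) -> bool:
    """Recompute the signature and compare it in constant time."""
    expected = sign_claim(claim, key)
    return hmac.compare_digest(expected, signature)


# Hypothetical record; the field names are illustrative only.
claim = {
    "subject": "Anthropic Constitutional Classifiers",
    "released": "2025-02-04",
    "sources": 2,
}
key = b"demo-key-not-a-real-secret"  # placeholder key for the example

signature = sign_claim(claim, key)
is_valid = verify_claim(claim, key, signature)

# Changing any field changes the signature, so verification fails.
tampered = dict(claim, released="2025-02-05")
is_tampered_valid = verify_claim(tampered, key, signature)
```

Verification only works for a party that holds the same secret key, which is the main difference between an HMAC and a public-key signature.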