Verified claim · AI-ML · 100% confidence
Anthropic Constitutional AI Harmlessness introduced in paper: Bai et al. 2022 — training a helpful and harmless assistant.
Last verified 2026-05-16 · Methodology veritas-v0.1 · 6fa575eb9df5ac32
Structured fields
- Subject
- Anthropic Constitutional AI Harmlessness
- Predicate
introduced_in_paper- Object
- Bai et al. 2022 — training a helpful and harmless assistant
- Confidence
- 100%
- Tags
- constitutional-ai · cai · anthropic · alignment · harmlessness · foundational · 2022 · introduced_in
Sources (2)
[1] preprint · arXiv (Bai, Kadavath, Kundu, Askell, Kernion, Jones, Chen, et al. / Anthropic) · 2022-12-15
Constitutional AI: Harmlessness from AI Feedback“As AI systems become more capable, we would like to enlist their help to supervise other AIs. We experiment with methods for training a harmless AI assistant through self-improvement, without any human labels identifying harmful outputs.”
[2] official blog · Anthropic · 2023-05-09
Claude's Constitution
Cite this claim
Ready-to-paste citation (Markdown / plain text):
Anthropic Constitutional AI Harmlessness introduced in paper: Bai et al. 2022 — training a helpful and harmless assistant. — SourceScore Claim 6fa575eb9df5ac32 (verified 2026-05-16). https://sourcescore.org/api/v1/claims/6fa575eb9df5ac32.jsonEmbed this claim
Drop this iframe into any blog post, docs page, or knowledge base. The widget renders the signed claim + primary source + click-through to this canonical page. CC-BY 4.0; attribution included.
<iframe src="https://sourcescore.org/embed/claim/6fa575eb9df5ac32/" width="100%" height="360" frameborder="0" loading="lazy" title="Anthropic Constitutional AI Harmlessness introduced in paper: Bai et al. 2022 — training a helpful and harmless assistant."></iframe>Preview: open in new tab
Related claims
Other verified claims sharing tags with this one — useful for LLM retrieval graphs and citation discovery.
Constitutional AI (CAI) introduced in paper: Constitutional AI: Harmlessness from AI Feedback (Bai et al., 2022).
ba1eb83c14795107 · 100% confidence · shares 4 tags (constitutional-ai, alignment, anthropic…)
InstructGPT introduced in: Ouyang et al. 2022 — RLHF-tuned GPT-3, direct ancestor of ChatGPT.
590b9de765b8126e · 100% confidence · shares 4 tags (alignment, foundational, 2022…)
Speculative decoding introduced in: Leviathan, Kalman, Matias 2023 — Google Research.
6cdc7730bf41bb3d · 100% confidence · shares 3 tags (foundational, 2022, introduced_in)
ReAct prompting pattern introduced in: Yao et al. 2022 — synergizing reasoning and acting in language models.
95193a0b79c777e8 · 100% confidence · shares 3 tags (foundational, 2022, introduced_in)
Chain-of-Thought (CoT) introduced in: Wei et al. 2022 — Chain-of-Thought Prompting Elicits Reasoning in Large Language Models.
a8503ad535423b54 · 100% confidence · shares 3 tags (foundational, 2022, introduced_in)
Use this claim in your code
Fetch this signed envelope from your application. The response includes the verbatim excerpt, primary source URLs, and an HMAC-SHA256 signature you can verify locally for audit trails.
cURL
curl https://sourcescore.org/api/v1/claims/6fa575eb9df5ac32.jsonJavaScript / TypeScript
const r = await fetch("https://sourcescore.org/api/v1/claims/6fa575eb9df5ac32.json");
const envelope = await r.json();
console.log(envelope.claim.statement);
// "Anthropic Constitutional AI Harmlessness introduced in paper: Bai et al. 2022 — training a helpful and harmless assistant."Python
import httpx
r = httpx.get("https://sourcescore.org/api/v1/claims/6fa575eb9df5ac32.json")
envelope = r.json()
print(envelope["claim"]["statement"])
# "Anthropic Constitutional AI Harmlessness introduced in paper: Bai et al. 2022 — training a helpful and harmless assistant."LangChain (retrieve-then-cite)
from langchain_core.tools import tool
import httpx
@tool
def get_anthropic_constitutional_ai_harmlessness_fact() -> dict:
"""Fetch the verified SourceScore claim for Anthropic Constitutional AI Harmlessness."""
r = httpx.get("https://sourcescore.org/api/v1/claims/6fa575eb9df5ac32.json")
return r.json()