Verified claim · AI-ML · 100% confidence
Transformer architecture introduced in paper: Attention Is All You Need (Vaswani et al., 2017).
Last verified 2026-05-16 · Methodology veritas-v0.1 · ad17e76a8baad7a1
Structured fields
- Subject
- Transformer architecture
- Predicate
- introduced_in_paper
- Object
- Attention Is All You Need (Vaswani et al., 2017)
- Confidence
- 100%
- Tags
- transformer · attention · foundational · vaswani · 2017 · nips
Sources (3)
[1] preprint · arXiv (Vaswani, Shazeer, Parmar, Uszkoreit, Jones, Gomez, Kaiser, Polosukhin) · 2017-06-12
Attention Is All You Need
“We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely.”
[2] peer reviewed · NeurIPS Foundation · 2017-12-04
Attention Is All You Need (NeurIPS 2017 proceedings)
[3] official blog · Google Research · 2017-06-12
Attention Is All You Need (Google Research publication index)
Cite this claim
Ready-to-paste citation (Markdown / plain text):
Transformer architecture introduced in paper: Attention Is All You Need (Vaswani et al., 2017). — SourceScore Claim ad17e76a8baad7a1 (verified 2026-05-16). https://sourcescore.org/api/v1/claims/ad17e76a8baad7a1.json
Embed this claim
Drop this iframe into any blog post, docs page, or knowledge base. The widget renders the signed claim + primary source + click-through to this canonical page. CC-BY 4.0; attribution included.
<iframe src="https://sourcescore.org/embed/claim/ad17e76a8baad7a1/" width="100%" height="360" frameborder="0" loading="lazy" title="Transformer architecture introduced in paper: Attention Is All You Need (Vaswani et al., 2017)."></iframe>
Preview: open in new tab
Related claims
Other verified claims sharing tags with this one — useful for LLM retrieval graphs and citation discovery; a small graph-building sketch follows the list.
Reinforcement Learning from Human Feedback (RLHF) introduced in paper: Deep Reinforcement Learning from Human Preferences (Christiano et al., 2017).
67866330cd60e54d · 100% confidence · shares 3 tags (foundational, 2017, nips)
Retrieval-Augmented Generation (RAG) introduced in paper: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (Lewis et al., 2020).
d15057ced937a103 · 100% confidence · shares 2 tags (foundational, nips)
Direct Preference Optimization (DPO) introduced in paper: Direct Preference Optimization: Your Language Model is Secretly a Reward Model (Rafailov et al., 2023).
a3e691683a4577af · 100% confidence · shares 2 tags (foundational, nips)
Sparsely-Gated Mixture-of-Experts (MoE) introduced in paper: Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer (Shazeer et al., 2017).
2d6d7f61f1db6493 · 100% confidence · shares 2 tags (foundational, 2017)
Chinchilla scaling laws introduced in paper: Training Compute-Optimal Large Language Models (Hoffmann et al., 2022).
8befcae6bce01a95 · 100% confidence · shares 2 tags (foundational, nips)
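As a sketch of how these shared-tag links could feed a retrieval graph, the snippet below builds a weighted adjacency map over claim IDs from tag overlap. The tag sets are hand-copied from this page (related claims list only the tags they share with this claim, so their sets are partial); a real pipeline would pull full records from the claims API instead.

# Sketch: weight edges between claims by the number of shared tags.
# Tag sets are copied from this page; related claims show only the tags
# they share with the Transformer claim, so these sets are partial.
from itertools import combinations

claims = {
    "ad17e76a8baad7a1": {"transformer", "attention", "foundational", "vaswani", "2017", "nips"},
    "67866330cd60e54d": {"foundational", "2017", "nips"},
    "d15057ced937a103": {"foundational", "nips"},
    "a3e691683a4577af": {"foundational", "nips"},
    "2d6d7f61f1db6493": {"foundational", "2017"},
    "8befcae6bce01a95": {"foundational", "nips"},
}

edges = {}
for (a, tags_a), (b, tags_b) in combinations(claims.items(), 2):
    shared = tags_a & tags_b
    if shared:
        edges[(a, b)] = len(shared)

# Strongest links first, e.g. for nearest-neighbour claim retrieval.
for (a, b), weight in sorted(edges.items(), key=lambda kv: -kv[1]):
    print(f"{a} -- {b}: {weight} shared tag(s)")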
Programmatic access
Fetch this claim with a signed envelope for verification:
curl https://sourcescore.org/api/v1/claims/ad17e76a8baad7a1.json
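Below is a minimal Python sketch of consuming that endpoint. The field names it reads (claim, confidence, sources, signature) are assumptions for illustration, not the documented envelope schema; inspect the live JSON before relying on them.

# Minimal sketch: fetch the signed claim envelope and read a few fields.
# Field names below (claim, confidence, sources, signature) are assumed,
# not taken from a documented schema -- check the live JSON first.
import requests

CLAIM_URL = "https://sourcescore.org/api/v1/claims/ad17e76a8baad7a1.json"

resp = requests.get(CLAIM_URL, timeout=10)
resp.raise_for_status()
envelope = resp.json()

print(envelope.get("claim"))       # claim statement text (assumed key)
print(envelope.get("confidence"))  # e.g. 100 (assumed key)
for source in envelope.get("sources", []):
    print(source.get("title"), source.get("url"))  # assumed keys

# The "signed envelope" presumably carries a signature; verifying it
# would require SourceScore's published key and signature format.
print(envelope.get("signature"))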