Verified claim · AI-ML · 100% confidence
Byte-Pair Encoding (BPE) for NMT introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015).
Last verified 2026-05-16 · Methodology veritas-v0.1 · aede848e23c8de8e
Structured fields
- Subject
- Byte-Pair Encoding (BPE) for NMT
- Predicate
introduced_in_paper- Object
- Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015)
- Confidence
- 100%
- Tags
- bpe · tokenization · subword · foundational · 2015 · acl
Sources (2)
[1] preprint · arXiv (Sennrich, Haddow, Birch) · 2015-08-31
Neural Machine Translation of Rare Words with Subword Units“We discuss the suitability of different word segmentation techniques, including simple character n-gram models and a segmentation based on the byte pair encoding compression algorithm, and empirically show that subword models improve over a back-off dictionary baseline.”
[2] peer reviewed · ACL Anthology · 2016-08-07
Neural Machine Translation of Rare Words with Subword Units (ACL 2016)
Cite this claim
Ready-to-paste citation (Markdown / plain text):
Byte-Pair Encoding (BPE) for NMT introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015). — SourceScore Claim aede848e23c8de8e (verified 2026-05-16). https://sourcescore.org/api/v1/claims/aede848e23c8de8e.jsonEmbed this claim
Drop this iframe into any blog post, docs page, or knowledge base. The widget renders the signed claim + primary source + click-through to this canonical page. CC-BY 4.0; attribution included.
<iframe src="https://sourcescore.org/embed/claim/aede848e23c8de8e/" width="100%" height="360" frameborder="0" loading="lazy" title="Byte-Pair Encoding (BPE) for NMT introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015)."></iframe>Preview: open in new tab
Related claims
Other verified claims sharing tags with this one — useful for LLM retrieval graphs and citation discovery.
Byte-Pair Encoding (BPE) for Neural Machine Translation introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015).
e942c93d70a4dab2 · 100% confidence · shares 5 tags (bpe, tokenization, foundational…)
ResNet (Residual Networks) introduced in paper: Deep Residual Learning for Image Recognition (He et al., 2015).
4f55f77c4bfb316e · 100% confidence · shares 2 tags (foundational, 2015)
SentencePiece tokenizer introduced in paper: SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing (Kudo & Richardson, 2018).
0d47bb8eb637a2e4 · 100% confidence · shares 2 tags (tokenization, foundational)
Transformer architecture introduced in paper: Attention Is All You Need (Vaswani et al., 2017).
ad17e76a8baad7a1 · 100% confidence · shares 1 tag (foundational)
Reinforcement Learning from Human Feedback (RLHF) introduced in paper: Deep Reinforcement Learning from Human Preferences (Christiano et al., 2017).
67866330cd60e54d · 100% confidence · shares 1 tag (foundational)
Programmatic access
Fetch this claim with a signed envelope for verification:
curl https://sourcescore.org/api/v1/claims/aede848e23c8de8e.json