SourceScore

Verified claim · AI-ML · 100% confidence

The SentencePiece tokenizer was introduced in the paper "SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing" (Kudo & Richardson, 2018).

Last verified 2026-05-16 · Methodology veritas-v0.1 · 0d47bb8eb637a2e4

Structured fields

Subject: SentencePiece tokenizer
Predicate: introduced_in_paper
Object: SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing (Kudo & Richardson, 2018)
Confidence: 100%
Tags: sentencepiece · tokenization · google · foundational · 2018
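The structured fields above form a subject/predicate/object triple with a confidence score and tags. As a minimal sketch, they could be held in a small typed record like the one below. The `Claim` class, its attribute names, and the `parse_tags` helper are hypothetical and simply mirror the labels shown on this page; they are not SourceScore's actual schema.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Claim:
    """Hypothetical record mirroring the field labels shown on this page."""
    subject: str
    predicate: str
    object: str
    confidence: float            # 1.0 corresponds to the "100%" shown above
    tags: Tuple[str, ...] = ()


def parse_tags(raw: str) -> Tuple[str, ...]:
    """Split a ' · '-separated tag line into a tuple of clean tag strings."""
    return tuple(part.strip() for part in raw.split("·") if part.strip())


claim = Claim(
    subject="SentencePiece tokenizer",
    predicate="introduced_in_paper",
    object=(
        "SentencePiece: A simple and language independent subword tokenizer "
        "and detokenizer for Neural Text Processing (Kudo & Richardson, 2018)"
    ),
    confidence=1.0,
    tags=parse_tags("sentencepiece · tokenization · google · foundational · 2018"),
)
```

Keeping the record immutable (`frozen=True`) is one reasonable choice for data that is meant to be cited as verified, but nothing on this page requires it.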

Sources (2)

  [1] preprint · arXiv (Kudo, Richardson) · 2018-08-19
      SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
      This paper describes SentencePiece, a language-independent subword tokenizer and detokenizer designed for Neural-based text processing, including Neural Machine Translation.

  [2] github release · Google · 2018-08-19
      google/sentencepiece (official implementation)

Cite this claim

Ready-to-paste citation (Markdown / plain text):

SentencePiece tokenizer introduced in paper: SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing (Kudo & Richardson, 2018). — SourceScore Claim 0d47bb8eb637a2e4 (verified 2026-05-16). https://sourcescore.org/api/v1/claims/0d47bb8eb637a2e4.json

Embed this claim

Drop this iframe into any blog post, docs page, or knowledge base. The widget renders the signed claim, its primary source, and a link back to this canonical page. Licensed under CC-BY 4.0; attribution is included.

<iframe src="https://sourcescore.org/embed/claim/0d47bb8eb637a2e4/" width="100%" height="360" frameborder="0" loading="lazy" title="SentencePiece tokenizer introduced in paper: SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing (Kudo & Richardson, 2018)."></iframe>

Programmatic access

Fetch this claim with a signed envelope for verification:

curl https://sourcescore.org/api/v1/claims/0d47bb8eb637a2e4.json
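The same request can be made from a script. The sketch below uses only the Python standard library and assumes the endpoint returns a JSON body, as the `.json` suffix suggests. The helper names `claim_url` and `fetch_claim` are hypothetical, and because the layout of the signed envelope is not described on this page, the code returns the parsed JSON as-is and makes no attempt to verify a signature.

```python
import json
import urllib.request

# Base path taken from the curl command above.
BASE_URL = "https://sourcescore.org/api/v1/claims"


def claim_url(claim_id: str) -> str:
    """Build the JSON endpoint URL for a claim ID."""
    return f"{BASE_URL}/{claim_id}.json"


def fetch_claim(claim_id: str, timeout: float = 10.0):
    """Fetch a claim and return the parsed JSON body.

    Assumption: the response is UTF-8 encoded JSON. The envelope's fields
    are not documented here, so no particular keys are relied upon.
    """
    with urllib.request.urlopen(claim_url(claim_id), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example (performs a network request):
#   envelope = fetch_claim("0d47bb8eb637a2e4")
#   print(json.dumps(envelope, indent=2))
```

Checking the signature would require the key and algorithm described in the site's methodology documentation, which this page does not reproduce.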