Verified claim · AI-ML · 100% confidence
CLIP (Contrastive Language-Image Pretraining) introduced in paper: Learning Transferable Visual Models From Natural Language Supervision (Radford et al., 2021).
Last verified 2026-05-16 · Methodology veritas-v0.1 · 85a3ca745eaf4ee0
Structured fields
- Subject
- CLIP (Contrastive Language-Image Pretraining)
- Predicate
- introduced_in_paper
- Object
- Learning Transferable Visual Models From Natural Language Supervision (Radford et al., 2021)
- Confidence
- 100%
- Tags
- clip · multimodal · vision · radford · 2021 · openai
Sources (2)
[1] preprint · arXiv (Radford et al., OpenAI) · 2021-02-26
Learning Transferable Visual Models From Natural Language Supervision
“We demonstrate that the simple pre-training task of predicting which caption goes with which image is an efficient and scalable way to learn SOTA image representations from scratch on a dataset of 400 million (image, text) pairs collected from the internet.”
(A code sketch of this contrastive objective follows the source list.)
[2] official blog · OpenAI · 2021-01-05
CLIP: Connecting text and images
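The pre-training task quoted in source [1], predicting which caption goes with which image, corresponds to a symmetric contrastive loss over a batch of (image, text) embedding pairs. The paper includes numpy-style pseudocode for this objective; the following is a minimal PyTorch sketch, not the authors' code. The function name, the 0.07 temperature default, and the input shapes are illustrative assumptions.

import torch
import torch.nn.functional as F

def clip_contrastive_loss(image_emb, text_emb, temperature=0.07):
    # Hypothetical sketch of a CLIP-style objective; not the paper's code.
    # L2-normalize each row so dot products become cosine similarities.
    image_emb = F.normalize(image_emb, dim=-1)
    text_emb = F.normalize(text_emb, dim=-1)
    # (batch, batch) similarity matrix, scaled by temperature.
    logits = image_emb @ text_emb.t() / temperature
    # Matching (image, text) pairs sit on the diagonal.
    targets = torch.arange(logits.size(0), device=logits.device)
    # Symmetric cross-entropy: image-to-text over rows, text-to-image over columns.
    loss_i2t = F.cross_entropy(logits, targets)
    loss_t2i = F.cross_entropy(logits.t(), targets)
    return (loss_i2t + loss_t2i) / 2

Minimizing this loss pulls each image embedding toward its own caption's embedding while pushing it away from the other captions in the batch, which is the "predict which caption goes with which image" task described above.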
Cite this claim
Ready-to-paste citation (Markdown / plain text):
CLIP (Contrastive Language-Image Pretraining) introduced in paper: Learning Transferable Visual Models From Natural Language Supervision (Radford et al., 2021). — SourceScore Claim 85a3ca745eaf4ee0 (verified 2026-05-16). https://sourcescore.org/api/v1/claims/85a3ca745eaf4ee0.json
Embed this claim
Drop this iframe into any blog post, docs page, or knowledge base. The widget renders the signed claim + primary source + click-through to this canonical page. CC-BY 4.0; attribution included.
<iframe src="https://sourcescore.org/embed/claim/85a3ca745eaf4ee0/" width="100%" height="360" frameborder="0" loading="lazy" title="CLIP (Contrastive Language-Image Pretraining) introduced in paper: Learning Transferable Visual Models From Natural Language Supervision (Radford et al., 2021)."></iframe>
Related claims
Other verified claims sharing tags with this one — useful for LLM retrieval graphs and citation discovery.
CLIP introduced in paper: Learning Transferable Visual Models From Natural Language Supervision (Radford et al., 2021).
bcdef949cc6d3644 · 100% confidence · shares 4 tags (clip, multimodal, 2021…)
GPT-4o released on: 2024-05-13.
bd065b91ca6e880b · 100% confidence · shares 2 tags (openai, multimodal)
GPT-2 introduced in paper: Language Models are Unsupervised Multitask Learners (Radford et al., 2019).
859551dc078c46f8 · 100% confidence · shares 2 tags (openai, radford)
HumanEval benchmark introduced in paper: Evaluating Large Language Models Trained on Code (Chen et al., 2021).
71ec42731d2c9e0c · 100% confidence · shares 2 tags (openai, 2021)
Llama 3.2 (multimodal release including 11B and 90B vision models) released on: 2024-09-25.
e27816c692a28ce9 · 100% confidence · shares 2 tags (multimodal, vision)
Programmatic access
Fetch this claim with a signed envelope for verification:
curl https://sourcescore.org/api/v1/claims/85a3ca745eaf4ee0.json
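A Python equivalent of the curl call above, as a minimal sketch: it assumes only that the endpoint returns a JSON body. The envelope's field names and signature scheme are not documented on this page, so the snippet fetches and prints the payload rather than verifying the signature.

import json
import urllib.request

CLAIM_URL = "https://sourcescore.org/api/v1/claims/85a3ca745eaf4ee0.json"

# Fetch the claim envelope and pretty-print it for inspection.
with urllib.request.urlopen(CLAIM_URL) as resp:
    envelope = json.load(resp)

# Key names below depend on the actual response schema.
print(json.dumps(envelope, indent=2))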