Verified AI/ML Claims
91 signed, sourced claims for grounded LLM retrieval. Each claim cites primary sources (two or more for most entries, one for a few), carries an HMAC-SHA256 signature, and has a stable JSON API endpoint.
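A consumer of signed claims would typically recompute the HMAC-SHA256 over the claim body and compare it with the published signature. The sketch below shows that check in Python using only the standard library. The record field names, the canonical JSON serialization (sorted keys, compact separators) and the demo key are assumptions made for illustration; this page does not specify the actual payload layout or how keys are distributed.

```python
import hashlib
import hmac
import json


def canonical_bytes(claim: dict) -> bytes:
    # Assumption: the signed payload is the claim serialized as JSON with
    # sorted keys and compact separators. The real scheme may differ.
    text = json.dumps(claim, sort_keys=True, separators=(",", ":"), ensure_ascii=False)
    return text.encode("utf-8")


def sign_claim(claim: dict, key: bytes) -> str:
    # HMAC-SHA256 over the canonical bytes, returned as lowercase hex.
    return hmac.new(key, canonical_bytes(claim), hashlib.sha256).hexdigest()


def verify_claim(claim: dict, signature_hex: str, key: bytes) -> bool:
    # compare_digest performs a constant-time comparison to avoid timing leaks.
    return hmac.compare_digest(sign_claim(claim, key), signature_hex)


# Hypothetical record: the field names are illustrative, not the real schema.
claim = {
    "id": "ad17e76a8baad7a1",
    "subject": "Transformer architecture",
    "predicate": "introduced in paper",
    "object": "Attention Is All You Need (Vaswani et al., 2017)",
    "sources": 3,
    "confidence": 1.0,
}
demo_key = b"demo-key-not-a-real-secret"  # placeholder key for the example only

signature = sign_claim(claim, demo_key)
tampered = dict(claim, object="Some Other Paper")

print(verify_claim(claim, signature, demo_key))     # untouched record
print(verify_claim(tampered, signature, demo_key))  # modified record
```

With the same key, the untouched record verifies and the modified one does not. Because HMAC is a symmetric scheme, anyone able to verify a signature this way could also produce one, so the key has to stay with trusted parties.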
Foundational papers (31)
Transformer architecture introduced in paper: Attention Is All You Need (Vaswani et al., 2017).
ad17e76a8baad7a1 · 3 sources · 100% confidence
Reinforcement Learning from Human Feedback (RLHF) introduced in paper: Deep Reinforcement Learning from Human Preferences (Christiano et al., 2017).
67866330cd60e54d · 3 sources · 100% confidence
Retrieval-Augmented Generation (RAG) introduced in paper: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (Lewis et al., 2020).
d15057ced937a103 · 2 sources · 100% confidence
Low-Rank Adaptation (LoRA) introduced in paper: LoRA: Low-Rank Adaptation of Large Language Models (Hu et al., 2021).
d7b97d1b93d8d8bc · 2 sources · 100% confidence
Direct Preference Optimization (DPO) introduced in paper: Direct Preference Optimization: Your Language Model is Secretly a Reward Model (Rafailov et al., 2023).
a3e691683a4577af · 2 sources · 100% confidence
BERT (Bidirectional Encoder Representations from Transformers) introduced in paper: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (Devlin et al., 2018).
4c1ee70007dc89c1 · 2 sources · 100% confidence
GPT-2 introduced in paper: Language Models are Unsupervised Multitask Learners (Radford et al., 2019).
859551dc078c46f8 · 2 sources · 100% confidence
ResNet (Residual Networks) introduced in paper: Deep Residual Learning for Image Recognition (He et al., 2015).
4f55f77c4bfb316e · 2 sources · 100% confidence
T5 (Text-to-Text Transfer Transformer) introduced in paper: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (Raffel et al., 2019).
ef28341c3b308737 · 2 sources · 100% confidence
Sparsely-Gated Mixture-of-Experts (MoE) introduced in paper: Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer (Shazeer et al., 2017).
2d6d7f61f1db6493 · 1 source · 100% confidence
Switch Transformer introduced in paper: Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity (Fedus et al., 2021).
3d9c14b9379038c9 · 2 sources · 100% confidence
Chinchilla scaling laws introduced in paper: Training Compute-Optimal Large Language Models (Hoffmann et al., 2022).
8befcae6bce01a95 · 2 sources · 100% confidence
Proximal Policy Optimization (PPO) introduced in paper: Proximal Policy Optimization Algorithms (Schulman et al., 2017).
00f224e1ccc158ef · 2 sources · 100% confidence
Mamba state-space model introduced in paper: Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Gu, Dao, 2023).
3518f8aa40cb0d36 · 2 sources · 100% confidence
Chain-of-Thought prompting introduced in paper: Chain-of-Thought Prompting Elicits Reasoning in Large Language Models (Wei et al., 2022).
3af924da138ff84c · 2 sources · 100% confidence
Adam optimizer introduced in paper: Adam: A Method for Stochastic Optimization (Kingma, Ba, 2014).
dffbe905003cc581 · 2 sources · 100% confidence
AlexNet introduced in paper: ImageNet Classification with Deep Convolutional Neural Networks (Krizhevsky, Sutskever, Hinton, 2012).
98b6e774be89d967 · 2 sources · 100% confidence
ImageNet dataset introduced in paper: ImageNet: A Large-Scale Hierarchical Image Database (Deng et al., 2009).
045e628def62181d · 2 sources · 100% confidence
Vision Transformer (ViT) introduced in paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020).
d3681b0981e0b700 · 2 sources · 100% confidence
Generative Adversarial Networks (GANs) introduced in paper: Generative Adversarial Networks (Goodfellow et al., 2014).
5b0c0612bd9e55b0 · 2 sources · 100% confidence
Variational Autoencoder (VAE) introduced in paper: Auto-Encoding Variational Bayes (Kingma, Welling, 2013).
62789e45973ab631 · 2 sources · 100% confidence
Denoising Diffusion Probabilistic Models (DDPM) introduced in paper: Denoising Diffusion Probabilistic Models (Ho, Jain, Abbeel, 2020).
e700f81fff6f38c7 · 2 sources · 100% confidence
Word2Vec introduced in paper: Efficient Estimation of Word Representations in Vector Space (Mikolov et al., 2013).
4978f76d228a3db1 · 2 sources · 100% confidence
Byte-Pair Encoding (BPE) for Neural Machine Translation introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015).
e942c93d70a4dab2 · 2 sources · 100% confidence
ReAct (Reasoning + Acting) introduced in paper: ReAct: Synergizing Reasoning and Acting in Language Models (Yao et al., 2022).
fceea64fa7d04d3a · 2 sources · 100% confidence
LoRA (Low-Rank Adaptation) introduced in paper: LoRA: Low-Rank Adaptation of Large Language Models (Hu et al., 2021).
f191b2876790dc6e · 2 sources · 100% confidence
QLoRA introduced in paper: QLoRA: Efficient Finetuning of Quantized LLMs (Dettmers et al., 2023).
767cbe41c961be1a · 2 sources · 100% confidence
Rotary Position Embedding (RoPE) introduced in paper: RoFormer: Enhanced Transformer with Rotary Position Embedding (Su et al., 2021).
f8d64457ba9fd35b · 2 sources · 100% confidence
Byte-Pair Encoding (BPE) for NMT introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015).
aede848e23c8de8e · 2 sources · 100% confidence
SentencePiece tokenizer introduced in paper: SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing (Kudo, Richardson, 2018).
0d47bb8eb637a2e4 · 2 sources · 100% confidence
CLIP introduced in paper: Learning Transferable Visual Models From Natural Language Supervision (Radford et al., 2021).
bcdef949cc6d3644 · 2 sources · 100% confidence
Model releases (34)
ChatGPT released on: 2022-11-30.
8d653880c519a8ef · 2 sources · 100% confidence
GPT-4 released on: 2023-03-14.
09eea8fb1a8ccebf · 2 sources · 100% confidence
GPT-4 Turbo context window tokens: 128000.
26314e3164e18b24 · 2 sources · 100% confidence
GPT-4o released on: 2024-05-13.
bd065b91ca6e880b · 1 source · 100% confidence
Claude 3.5 Sonnet released on: 2024-06-20.
9fc5fb203abbf250 · 1 source · 100% confidence
Claude 3 Opus context window tokens: 200000.
565df27fc8b75ef0 · 2 sources · 100% confidence
Llama 2 released on: 2023-07-18.
34dd74941bc7bd48 · 2 sources · 100% confidence
Llama 3.1 released on: 2024-07-23.
a55484ab8b4bdf4e · 2 sources · 100% confidence
Llama 3.1 405B parameter count: 405000000000.
282b0523eefd9afd · 2 sources · 100% confidence
Mixtral 8x7B released on: 2023-12-11.
410aec4f418f2b11 · 2 sources · 95% confidence
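Mixtral 8x7B architecture: Sparse Mixture-of-Experts (8 experts per layer, 2 experts routed per token; about 47B total parameters rather than 8 × 7B, because only the feed-forward blocks are replicated per expert).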
ad79b14fafb362cd · 2 sources · 100% confidence
Gemini Pro released on: 2023-12-06.
e2a6019bd2ce5c97 · 1 source · 100% confidence
Whisper released on: 2022-09-21.
a3ebbaed14bd83d0 · 2 sources · 100% confidence
Llama 1 released on: 2023-02-24.
5ce2e90fafdca0c2 · 2 sources · 100% confidence
Llama 3 released on: 2024-04-18.
5f599876b3dd19b3 · 2 sources · 100% confidence
Llama 3 70B parameter count: 70000000000.
e4581034693f4584 · 2 sources · 100% confidence
Claude 3 family (Opus, Sonnet, Haiku) released on: 2024-03-04.
99340a18716a44fb · 1 source · 100% confidence
Mistral 7B released on: 2023-09-27.
2d5a5dc30e7f6f02 · 3 sources · 100% confidence
Stable Diffusion 1.0 released on: 2022-08-22.
f6f333228f224df2 · 2 sources · 100% confidence
GPT-3 parameter count: 175000000000.
1ca2cc2864dfb376 · 2 sources · 100% confidence
Llama 2 70B parameter count: 70000000000.
a4fc4391a27f0500 · 2 sources · 100% confidence
Llama 3 8B parameter count: 8000000000.
8455a8b9d44fecbe · 2 sources · 100% confidence
GPT-4o mini released on: 2024-07-18.
1a4cf45e8b967089 · 1 source · 100% confidence
Claude 3.5 Haiku released on: 2024-11-04.
cdc82d4d6c880ba3 · 2 sources · 100% confidence
Phi-3 (Microsoft small language model family) released on: 2024-04-23.
2d20c4a92620ba46 · 2 sources · 100% confidence
DeepSeek V3 released on: 2024-12-26.
6aa7e9c75d084617 · 2 sources · 95% confidence
Llama 3.2 (multimodal release including 11B and 90B vision models) released on: 2024-09-25.
e27816c692a28ce9 · 2 sources · 100% confidence
Llama 3.3 70B released on: 2024-12-06.
97c04944b4c6345a · 1 source · 100% confidence
GPT-4o context window tokens: 128000.
118226d52c6f7491 · 2 sources · 100% confidence
Sora (public release to ChatGPT Plus subscribers) released on: 2024-12-09.
409b828ed9f8410b · 1 source · 100% confidence
The Pile dataset released on: 2020-12-31.
4aef1422b96df26c · 2 sources · 100% confidence
RedPajama dataset released on: 2023-04-17.
ea8b7be3a49101be · 2 sources · 95% confidence
DALL·E 2 released on: 2022-04-06.
0b0e64476bd25bd6 · 2 sources · 100% confidence
Stable Diffusion 1.x released on: 2022-08-22.
79a7e980e59680bc · 2 sources · 100% confidence
Organizations (11)
Anthropic founded in: 2021.
dc53356a0a39c8de · 2 sources · 100% confidence
OpenAI founded in: 2015.
64a2379efebe62ee · 2 sources · 100% confidence
Mistral AI founded in: 2023.
db8e97f3583db317 · 1 source · 100% confidence
Hugging Face founded in: 2016.
0e28201946131976 · 2 sources · 100% confidence
Stability AI founded in: 2019.
d23bd1687a3ea44a · 2 sources · 95% confidence
DeepMind acquired by Google on: 2014-01-26.
25a8f486a0947790 · 2 sources · 95% confidence
Cohere founded in: 2019.
cf4cbf71dc7dd19a · 2 sources · 95% confidence
xAI founded in: 2023.
e5084c3f7023114b · 2 sources · 95% confidence
EleutherAI founded in: 2020.
f018fec775a8e941 · 2 sources · 95% confidence
Together AI founded in: 2022.
4d05fa0726841c32 · 2 sources · 90% confidence
AI21 Labs founded in: 2017.
c7c1ee139886cafe · 2 sources · 95% confidence