InstructGPT introduced in: Ouyang et al. 2022 — RLHF-tuned GPT-3, direct ancestor of ChatGPT. — SourceScore VERITAS embed · SourceScore

SourceScore VERITAS · verified claim100% confidence

InstructGPT introduced in: Ouyang et al. 2022 — RLHF-tuned GPT-3, direct ancestor of ChatGPT.

Ouyang et al. 2022 — RLHF-tuned GPT-3, direct ancestor of ChatGPT

Primary source · preprint · 2022-03-04

Training language models to follow instructions with human feedback — arXiv (Ouyang, Wu, Jiang, Almeida, Wainwright, Mishkin, Zhang, Agarwal, et al. / OpenAI)

Last verified 2026-05-16 · 2 sources · 590b9de765b8126eView full claim →