SourceScore
SourceScore VERITAS · verified claim · 100% confidence

The InstructGPT methodology was introduced in the paper "Training language models to follow instructions with human feedback" (Ouyang et al., 2022).

Subject: InstructGPT methodology
Predicate: introduced_in_paper
Object: Training language models to follow instructions with human feedback (Ouyang et al., 2022)
Primary source · preprint · 2022-03-04
Training language models to follow instructions with human feedback · arXiv (Ouyang et al., OpenAI)
Last verified 2026-05-16 · 2 sources · 5da8f8dffc038b8e