InstructGPT methodology introduced in paper: Training language models to follow instructions with human feedback (Ouyang et al., 2022).
Subject
InstructGPT methodology
Predicate
introduced_in_paper
Object
Training language models to follow instructions with human feedback (Ouyang et al., 2022)
Primary source · preprint · 2022-03-04
Training language models to follow instructions with human feedback — arXiv (Ouyang et al., OpenAI)