Tag: preference-optimization
3 verified claims carry this tag. Each has at least 2 primary sources and an HMAC-SHA256 signature.
Odds Ratio Preference Optimization (ORPO) was introduced in the paper "ORPO: Monolithic Preference Optimization without Reference Model" (Hong et al., 2024).
ff0975d391b66a6f · 3 sources · 92% confidence
Kahneman-Tversky Optimization (KTO) was introduced in the paper "KTO: Model Alignment as Prospect Theoretic Optimization" (Ethayarajh et al., 2024).
a4713632c335406b · 3 sources · 92% confidence
Simple Preference Optimization (SimPO) was introduced in the paper "SimPO: Simple Preference Optimization with a Reference-Free Reward" (Meng et al., 2024).
d47e9b204e1e73bd · 3 sources · 92% confidence