Tag
1 verified claim carrying this tag. Each has 2+ primary sources and an HMAC-SHA256 signature.
Odds Ratio Preference Optimization (ORPO) introduced in paper: ORPO: Monolithic Preference Optimization without Reference Model (Hong et al., 2024).
ff0975d391b66a6f · 3 sources · 92% confidence