Tag
1 verified claim carries this tag. Each claim has 2+ primary sources and an HMAC-SHA256 signature.
Simple Preference Optimization (SimPO) was introduced in the paper "SimPO: Simple Preference Optimization with a Reference-Free Reward" (Meng et al., 2024).
d47e9b204e1e73bd · 3 sources · 92% confidence