Librarium
Settings

ORPO: Monolithic Preference Optimization without Reference Model (Paper Explained)

Yannic Kilcher