Librarium
Settings

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters (Paper)

Yannic Kilcher