Read Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters / WIP

Jason Wu

Read [Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters](https://arxiv.org/pdf/2408.03314)