Todo
Read [Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters](arxiv.org/pdf/2408.03314)