Hacker News

news newest show ask jobs

matt_d

Understanding Inference Scaling for LLMs: Bottlenecks, Trade-Offs, and Perf arxiv.org

6 points 0 comments 12 hours ago

hn-front (c) 2024 voximity

source