Hacker News

rawsh
Batched reward model inference and Best-of-N sampling (raw.sh)
