Hacker News
news
newest
show
ask
jobs
monax
A complete Llama2 inference engine that fits in 1356 bytes of x86 assembly
github.com
26 points
0 comments
a day ago
hn-front
(c) 2024
voximity
source