Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I run the larger version of it on a Threadripper with 512GB RAM and a 32GB GPU for the non-expert layers and context, using llama.cpp. Performs great, however god forbid you try to get that much memory these days.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: