Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
Invictus0
on May 3, 2022
|
parent
|
context
|
favorite
| on:
OPT: Open Pre-trained Transformer Language Models
Someone can correct me if I'm wrong, but "30B parameters" refers to a matrix with 30B elements, and assuming all the numbers are 16 bit, then that's 2 bytes * 30B = 60GB.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: