Hacker News
elorant
59 days ago
| on:
1.5 TB of VRAM on Mac Studio – RDMA over Thunderbo...
You only get 80 Gbps of network bandwidth. There's your bottleneck right there. InfiniBand, in comparison, can give you up to 10 times that.
storus
58 days ago
I think the OP meant pipeline parallelism, where during inference you only transfer the activations at the layer boundary where you cut the model in two, and those shouldn't be too large.
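To put a rough number on "shouldn't be too large", here is a back-of-the-envelope sketch. The hidden size of 8192, the fp16 activations, and the 4096-token prompt are my assumptions for illustration, not figures from the article; the 80 Gbps link speed is the one quoted in the parent comment. It ignores link latency and protocol overhead, so treat the results as lower bounds.

```python
# Rough estimate of pipeline-parallel traffic across one cut in the model.
# HIDDEN_SIZE, BYTES_PER_VALUE and PROMPT_TOKENS are assumed values,
# not taken from the article; LINK_GBPS comes from the parent comment.

HIDDEN_SIZE = 8192      # assumed model hidden dimension
BYTES_PER_VALUE = 2     # assumed fp16/bf16 activations
PROMPT_TOKENS = 4096    # assumed prompt length for the prefill case
LINK_GBPS = 80.0        # link bandwidth quoted in the thread


def activation_bytes(hidden_size: int, tokens: int, bytes_per_value: int = 2) -> int:
    """Bytes sent across one pipeline cut for the given number of tokens."""
    return hidden_size * tokens * bytes_per_value


def transfer_seconds(num_bytes: int, link_gbps: float) -> float:
    """Ideal transfer time, ignoring latency and protocol overhead."""
    return num_bytes * 8 / (link_gbps * 1e9)


# Decode: one new token crosses the cut per generation step.
decode_bytes = activation_bytes(HIDDEN_SIZE, 1, BYTES_PER_VALUE)
# Prefill: the whole prompt crosses the cut once.
prefill_bytes = activation_bytes(HIDDEN_SIZE, PROMPT_TOKENS, BYTES_PER_VALUE)

print(f"decode:  {decode_bytes} bytes, "
      f"{transfer_seconds(decode_bytes, LINK_GBPS) * 1e6:.2f} us per token")
print(f"prefill: {prefill_bytes} bytes, "
      f"{transfer_seconds(prefill_bytes, LINK_GBPS) * 1e3:.2f} ms per prompt")
```

Under those assumptions each decoded token moves about 16 KiB across the cut, so in this setup the per-hop latency probably matters more than the raw bandwidth. Tensor parallelism, which exchanges data at every layer, would be a different story.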