I built a private Al mini-cluster with Framework Desktop

fubarx@lemmy.world · 18 days ago

heavy@sh.itjust.works · 18 days ago

TLDW: Tried huge open models and got 0.7 tokens/s. Seems clustering tools aren’t ready yet.

humanspiral@lemmy.ca · 18 days ago

I’m new to this, but some advice seems to be use vulkan to get stuff to work. in your spare time one day, look at rocm.

heavy@sh.itjust.works · 18 days ago

I appreciate that! I’m just trying to recap the authors results for people that might be interested in the bottom line.

It seems that the beowulf architecture still has too much overhead

TheMightyCat@ani.social · 18 days ago

I am really curious how the T/s scales with network speed, as said in the video 5G is quite slow.