• heavy@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    9
    ·
    18 days ago

    TLDW: Tried huge open models and got 0.7 tokens/s. Seems clustering tools aren’t ready yet.

    • humanspiral@lemmy.ca
      link
      fedilink
      English
      arrow-up
      3
      ·
      18 days ago

      I’m new to this, but some advice seems to be use vulkan to get stuff to work. in your spare time one day, look at rocm.

      • heavy@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        3
        ·
        18 days ago

        I appreciate that! I’m just trying to recap the authors results for people that might be interested in the bottom line.

        It seems that the beowulf architecture still has too much overhead