• @BetaDoggo_
    54 days ago

    200 tokens per second isn’t achievable with a 1.5B-parameter model even on low-to-midrange GPUs. Unless they’re attaching an external GPU, it’s not happening on a Raspberry Pi.
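A rough back-of-envelope check, as a sketch rather than a benchmark: if you assume single-stream decoding is memory-bandwidth bound (every generated token has to read all the weights once), the ceiling on tokens per second is roughly bandwidth divided by model size. The function names and the 17 GB/s figure for a Raspberry Pi 5 class board are my own assumptions for illustration, so plug in the real spec for your hardware.

```python
def max_tokens_per_second(params_billion, bytes_per_param, bandwidth_gb_s):
    """Upper bound on decode speed, assuming each token reads all weights once."""
    model_gb = params_billion * bytes_per_param  # weight size in GB
    return bandwidth_gb_s / model_gb


def required_bandwidth_gb_s(params_billion, bytes_per_param, tokens_per_second):
    """Memory bandwidth needed to hit a target decode speed under the same assumption."""
    return params_billion * bytes_per_param * tokens_per_second


# Assumed values, not measurements:
PI_BANDWIDTH_GB_S = 17.0  # assumed ballpark for a Raspberry Pi 5 class board
PARAMS_BILLION = 1.5      # the 1.5B model from the comment

for label, bytes_per_param in [("4-bit", 0.5), ("fp16", 2.0)]:
    ceiling = max_tokens_per_second(PARAMS_BILLION, bytes_per_param, PI_BANDWIDTH_GB_S)
    needed = required_bandwidth_gb_s(PARAMS_BILLION, bytes_per_param, 200)
    print(f"{label}: ceiling ~{ceiling:.1f} tok/s, 200 tok/s needs ~{needed:.0f} GB/s")
```

This ignores compute limits, KV-cache reads, and runtime overhead, so real-world numbers land below the ceiling. Batching or speculative decoding changes the picture, but neither is free on a board like that.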

    This article is disjointed and smells like AI.