cross-posted from: https://lemmy.intai.tech/post/72919

Parameters count:

GPT-4 is more than 10x the size of GPT-3. We believe it has a total of ~1.8 trillion parameters across 120 layers. Mixture Of Experts - Confirmed.

OpenAI was able to keep costs reasonable by utilizing a mixture of experts (MoE) model. They utilizes 16 experts within their model, each is about ~111B parameters for MLP. 2 of these experts are routed to per forward pass.

Related Article: https://lemmy.intai.tech/post/72922

  • manitcorOP
    link
    fedilink
    English
    11 year ago

    They are the right ones. Should be a tweet archive and a blog post

    • @[email protected]
      link
      fedilink
      English
      21 year ago

      Well that’s weird because the first takes me to a shitpost with a picture of cake, and the second a shitpost about sucking your dentist’s fingers…

      • manitcorOP
        link
        fedilink
        English
        31 year ago

        ewwww lol

        are you using an app or the web? the links should point to the intai instance which works fine for me but i don’t know what various clients will do with those links

        • @[email protected]
          link
          fedilink
          English
          31 year ago

          I’m using Connect, so that could explain it! Thanks. I’ll see if I can figure it out because this is really interesting to me, but the dentist post is not! Haha!