• @CriticalMiss
    link
    English
    1514 days ago

    Earlier reports suggested they trained it on books from Bibliotik.

    What changed?

    • @halcyoncmdr
      link
      English
      2514 days ago

      Probably just both honestly.

    • @BetaDoggo_
      link
      English
      314 days ago

      The llama-1 paper acknowledged the use of the books dataset, libgen isn’t mentioned in any of the papers so this is new info.