• JackGreenEarth
    link
    fedilink
    English
    -111 month ago

    Llama is open source, so they should be allowed to train on public data if they’re going to release it to the public.

    The problem is with them using non licensed data to train proprietary models, not models in general.

    • @theunknownmuncher
      link
      111 month ago

      If it were open source, I would be able to freely build it from source files on my own machine.

      Open source != freeware

      • JackGreenEarth
        link
        fedilink
        English
        -81 month ago

        Well, the licence tries to prohibit people doing various things with it, but the model is open weights. Anyone can physically run it on their hardware, not something they can do with ChatGPT or Claude for example. You’re right, I shouldn’t have implied it was fully open source, but at least it only tries to legally, rather than physically, prevent people running and modifying the model themselves.

          • Friendly reminder that

            1. there is no official/real definition of open source
            2. the definition you probably mean is hillariously silly. There’s one project that is pretty big and well-receifed in the community but isn’t open source according to them because it contains the paragraph “don’t do evil” in its license.

            So yeah, take that how you want.

            • @theunknownmuncher
              link
              0
              edit-2
              1 month ago

              Huh? Open source has a definition. It means the source is accessible and one can build the software themself. I think you might be mixing up open source and FOSS (which does have to do with licenses).