Office space meme:

“If y’all could stop calling an LLM “open source” just because they published the weights… that would be great.”

  • @[email protected]OP
    link
    fedilink
    123 days ago

    They published the source code needed run the model.

    Yeah, but not to train it

    anyone can download the model, run it locally, and further build on it.

    Yeah, it’s about as open source as binary blobs.

    Training from scratch costs millions.

    So what? You still can gleam something if you know the dataset on which the model has been trained.

    If software is hard to compile, can you keep the source code closed and still call software “open source”?

    • magic_lobster_party
      link
      fedilink
      13 days ago

      I agree the bad part is that they didn’t provide the script to train the model from scratch.

      Yeah, it’s about as open source as binary blobs.

      This is a great starting point for further improvements of the model. Most AI research is done with pretrained weights used as basis. Few are training models completely from scratch. The model is built with Torch, so anyone should be able to fine tune the model on custom data sets.