• Emotional (he/him)
    link
    fedilink
    English
    131 month ago

    Unfortunately, as I’ve learned recently, it doesn’t look like Deepseek is actually open source.

    You can download the model, but unless I’m misunderstanding, that feels comparable to calling Photoshop open source because you can download the .exe file on your computer.

    • @[email protected]
      link
      fedilink
      English
      141 month ago

      Its MIT licensed. Meaning the code is open but the license is permissible in that copy’s can be subsequently closed. This is unlike with the GPL most generally associated with open source code.

      • @[email protected]
        link
        fedilink
        111 month ago

        The weights are MIT licensed. The code is, too, but code for these things are uninteresting.

        The training data is not open source, and that’s the interesting part of a model.

        • @andMoonsValue
          link
          11 month ago

          Fair, but they gave quite a bit when compared to other leaders in the space such as "open"ai. I’m exciting for this project I stumbled across - https://github.com/huggingface/open-r1. There are gaps in what deepseek provided but others are trying to fill them in.

          • Emotional (he/him)
            link
            fedilink
            English
            21 month ago

            Of course! I think I’ve been particularly cynical about stuff being named open source because of OpenAI.

            I use LLMs through Perplexity and GitHub CoPilot all the time, but I’m still too spiteful and petty to use anything from "Open"AI. I’ve been very happy with R1 so far.

    • @Knock_Knock_Lemmy_In
      link
      41 month ago

      You can reweight as you please to whatever dataset you like. They can say what the training data included, but they can’t share the dataset.