One of the arguments made for Reddit’s API changes is that they are now the go to place for LLM training data (e.g. for ChatGPT).

https://www.reddit.com/r/reddit/comments/145bram/addressing_the_community_about_changes_to_our_api/jnk9izp/?context=3

I haven’t seen a whole lot of discussion around this and would like to hear people’s opinions. Are you concerned about your posts being used for LLM training? Do you not care? Do you prefer that your comments are available to train open source LLMs?

(I will post my personal opinion in a comment so it can be up/down voted separately)

  • @FearTheCronOP
    link
    171 year ago

    My personal opinion is that high API usage fees hurt open source LLMs (e.g. GPT4All). I would rather not see this new technology monopolized by those who can pay API fees.

    • realslef
      link
      fedilink
      21 year ago

      Yes, LLMs are a problem for server operators, but Reddit’s attempted cure has horrible side-effects.

      • @FearTheCronOP
        link
        11 year ago

        I totally agree that Reddit’s approach has horrible side effects. However, if hosting costs were not an issue, how would you feel about people using your comments for LLM training?