CEO Steve Huffman says tech giants should not be able to trawl Reddit’s huge store of data for free. But that information came from users, not the company

That “corpus of data” is the content posted by millions of Reddit users over the decades. It is a fascinating and valuable record of what they were thinking and obsessing about. Not the tiniest fraction of it was created by Huffman, his fellow executives or shareholders. It can only be seen as belonging to them because of whatever skewed “consent” agreement its credulous users felt obliged to click on before they could use the service.

Ouch

  • @[email protected]
    link
    fedilink
    English
    672 years ago

    Wide op for ai scraping and nothing are not the only two options. They could easily limit api calls to what would be good for single users or mods and have each user generate their own key. Apps could let users input their key. Most users wouldn’t bother and would switch to their app anyway so it would get them 95% or what they claim to want without being a dick about it.

    • @[email protected]
      link
      fedilink
      English
      482 years ago

      Plus AI companies can just scrape reddit without using the API. It’s still a website after all.

        • dxxth
          link
          English
          62 years ago

          If the data is that important to them that they kill the site, then they’re more dumb than I think. Apps can be scraped too. It isn’t even difficult.

        • @PuffyPanda
          link
          English
          22 years ago

          I highly doubt Reddit is gonna shut down their website.

          • @Nahlej
            link
            English
            62 years ago

            I saw a post saying they were testing restricting mobile access to only through the app.

            • @PuffyPanda
              link
              English
              32 years ago

              Oh yeah, they’ve done that already. I don’t think they’ll extend that to actual web tho

    • @FanciestPants
      link
      English
      72 years ago

      I’m not sure if I wasted my time, but I spent a few hours today editing all of my posts on Reddit to be a single comma or period. I didn’t comment or post a lot by any means, but just got irritated enough to try to keep from contributing in any way to Spez profiting off of user provided content.

      • Sparking
        link
        fedilink
        English
        32 years ago

        Can’t shreddit do this in bulk? I am considering doing it for my comments, but I think I will just leave them up there. I did have a great time on reddit until they announced their API changes, so I will leave them with that much. But I did get a backup of everything I wrote using bulk downloader.

        But I am still considering just doing a shreddit just for kicks.

        • @penguinv
          link
          English
          32 years ago

          so sad. Not opposing but like burning a forest.

    • Sparking
      link
      fedilink
      English
      52 years ago

      Honestly, I think the sad truth is that reddit is bleeding money, and every action they take from here on out will be about recruiting whales and driving off everyone else. That’s steve’s brilliant business strategy - make reddit p2w.

      • @aceshigh
        link
        English
        32 years ago

        where is the money going?

    • @Pika
      link
      English
      0
      edit-2
      2 years ago

      that’s how they did it. They put a 10 request a minute on bots and a higher oauth limit (100) for individuals. large User client type apps could have somewhat easily converted over to that system but due to time constraint they didn’t. I do think they extorted their third party devs sure but, honestly the individual user limit isn’t super unreasonable as long as you aren’t liking or disliking every post. the search api is 100 posts per Api request, it was more the no NSFW and the no advertising limits they put on it that sucked

      edit: its actually 10 or 100 per minute not hour

      • Sparking
        link
        fedilink
        English
        32 years ago

        It’s not that simple, because the third party apps ship with a single api key. So I used Relay for reddit, and used the same api key as everyone else on that app. You could create an app, and then have everyone make their own key, but that is just asking for trouble. Definitely too technical for most people, and you would probably need to put in billing info for a scenario where you go above the free-tier call limit.

        • @Pika
          link
          English
          0
          edit-2
          2 years ago

          update: removed the comment because I was looking at the Api docs again and it seems that despite using the bearer token, metrics and rate limiting still are based off the app client ID, which is super stupid. originally stated that rate limits would be by oauth client which would be per user, 100 requests a minute, but it is actually 100 requests per minute app wide, which is just unfeasible for large scale

          • Sparking
            link
            fedilink
            English
            12 years ago

            Well, I don’t know how the Reddit API works, but what you described is generally bad practice, as is my understanding. The Oauth token’s allow the app to perform actions on the behalf of authenticated users, but they still need to use the reddit API, and I imagine an API key, to perform those actions. You generally aren’t supposed to use Oauth as a access authentication mechanism.

            At least pricing is per Oauth key, but still, the pricing burden is still going to fall on the developers for these apps who reddit now views as their “competitors”, despite making a product that supported reddit’s business for years.

            • @Pika
              link
              English
              2
              edit-2
              2 years ago

              Oauth 2 is an authorization standard, that’s basically what it is meant for. It’s intended to be used as a identification system for a client to be able to tell a first party hey I’m me through the usage of a third party without ever giving the third party to have your password.

              Discord, Facebook/Meta, Google(most services), Soundcloud, all those use Auth 2 based API’s, oauth 2 is used basically everywhere for the same focus that Reddit is trying to do

              Like you said it can be dangerous if you authorize a third party app with more scopes then needed(scopes help restrict what the app can do on your behalf), honestly I’m willing to bet that rif and Apollo both used the oauth2 API at least in some part, otherwise I don’t think it would have been able to allow you to upvote or downvote posts or post comments as you. A good way to tell if it was using it or not is if you had to login and it brought you to a page that said authorize this app with Reddit, if it showed that you were using oauth 2

              I haven’t personally had to use the Reddit Api I’ve only skimmed it myself(I was looking into it then the whole bombshell happend and I bailed), but I have to use the Discord API daily with developing my Discord bot and with the Discord API once you have the bearer token every form of authentication with that specific user goes through that token instead, it’s really only used for the website settings page though anything actually on the client itself still uses a standard bot token

              Also I fully agree, Reddit has shown that it has no respect for the third parties on the platform so I fully expect just going to get worse, they wern’t planning on negotiating price, they set it that way to force third party out