It’s funny how he’s playing this out to be about third party apps like Apollo. Like yeah, that’s what the community cares about, but the reason they’re making the changes is because he’s fucking anal about OpenAI and other companies finding such success with products they have built using data scraped via the Reddit API.
The data could just be scraped without the API anyway.
Absolutely, but the API offers a really smooth and convenient way of doing it without a lot of extra processing overhead. Scraping HTML is a little bit more involved.
But using an API requires integration with every individual site they want to consume. Crawlers do not. For the same reason, LLMs aren’t using the API.
Reddit could also enforce existing limits or change their TOS to explicitly ban this activity of it was indeed leading to millions of dollars in additional operating expenses. They have done neither.
Huffman is just lying about OpenAI and others being the problem.
The data could just be scraped without the API anyway.
Absolutely, but the API offers a really smooth and convenient way of doing it without a lot of extra processing overhead. Scraping HTML is a little bit more involved.
But using an API requires integration with every individual site they want to consume. Crawlers do not. For the same reason, LLMs aren’t using the API.
Reddit could also enforce existing limits or change their TOS to explicitly ban this activity of it was indeed leading to millions of dollars in additional operating expenses. They have done neither.
Huffman is just lying about OpenAI and others being the problem.