Hi everyone, I’m seeking advice and opinions. I’m building a web-based RSS reader/search engine/discovery tool. Like any RSS reader, my app fetches content from feeds and displays it to subscribers. Often, blog authors include only a short summary in the RSS feed, and the reader has to visit the blog's website for the full content.

My app also attempts to scrape the full webpage of each blog post for search indexing purposes (respecting `robots.txt`, of course). It also saves the HTML content for archiving purposes, much like the Internet Archive; if the author disallows the `ia_archiver` user agent, I honour that and don't archive (a rough sketch of that check is at the end of this post).

So, since the app may already store the full content, my dilemma is whether it's ok (ethical) to show the full article in my reader. This view is never public; only registered users who subscribe to the blog can see it. But it still feels wrong, because it's not even like a browser's "reader mode": the user never visits the original page at all.

Not ok because:
- Authors who only include a short summary in the RSS do so precisely because they want readers to visit their website.
- Visiting the original blog is a much more personal experience than reading every blog in the reader app's one shared UI; bloggers craft their digital gardens for visitors!
- Some blogs include styles, math, scripts, etc. which aren’t rendered correctly elsewhere after scraping.
Ok because:
- It’s a nicer UX for the reader?
Curious what others think.
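For reference, the archiving check mentioned above is essentially a `robots.txt` lookup for the relevant user agent. A minimal sketch of the idea using Python's standard library; the URL and user-agent strings are placeholders, not my actual implementation:

```python
# Sketch only: URLs and user-agent strings below are placeholders.
from urllib import robotparser
from urllib.parse import urljoin

def may_fetch(page_url: str, user_agent: str) -> bool:
    """Return True if the site's robots.txt allows this user agent to fetch the page."""
    rp = robotparser.RobotFileParser()
    rp.set_url(urljoin(page_url, "/robots.txt"))
    rp.read()  # download and parse robots.txt
    return rp.can_fetch(user_agent, page_url)

post_url = "https://example-blog.com/some-post"  # hypothetical blog post

if may_fetch(post_url, "MyReaderBot"):
    ...  # scrape the page for search indexing

if may_fetch(post_url, "ia_archiver"):
    ...  # archive the HTML; skipped when the author disallows ia_archiver
```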
I've made things that were maybe borderline ethical, and ultimately it came down to what I could live with when I looked in the mirror.
If the mirror is giving you a hard time, then build something to contact the website owners whose sites you are scraping: give them an option for profit sharing, if there is any (you would effectively be advertising for them), and give them an option to opt out. It should be easy to automate this. Just look for a contact or about page, and if there's a form, an email address, or a social profile, use that.
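A rough sketch of what that automation might look like, assuming Python with requests and BeautifulSoup; the URL is a placeholder, and you'd want rate limiting and error handling on top:

```python
# Sketch: find a contact email for a site you scrape, so you can reach out.
# Assumes requests and beautifulsoup4 are installed; the URL is a placeholder.
import re
import requests
from bs4 import BeautifulSoup
from urllib.parse import urljoin

def find_contact_email(site_url: str):
    """Return the first mailto address found on the homepage or a contact/about page."""
    home = requests.get(site_url, timeout=10)
    soup = BeautifulSoup(home.text, "html.parser")

    # Candidate pages: the homepage plus anything linked as "contact" or "about".
    candidates = [site_url]
    for a in soup.find_all("a", href=True):
        label = (a.get_text() + " " + a["href"]).lower()
        if "contact" in label or "about" in label:
            candidates.append(urljoin(site_url, a["href"]))

    # Scan each candidate page for a mailto: link.
    for url in candidates:
        html = requests.get(url, timeout=10).text
        match = re.search(r"mailto:([\w.+-]+@[\w.-]+)", html)
        if match:
            return match.group(1)
    return None  # fall back to a contact form or social profile, or skip the site

print(find_contact_email("https://example-blog.com"))
```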
Chances are this will be some work with no real gain or loss, because most owners won't reply or won't even notice your attempts at contact. But the effort isn't wasted: you can advertise yourself as the ethical option, which gains you new users.
Then you can say you tried your best and maybe the mirror will be somewhat kinder.
The original content creator relies on advertising, click-throughs and maybe merchandise sales that you would be denying them by scraping their content. This is the entire argument against Google doing what they've been doing for the past decade. The value of Google, and by extension your RSS reader, is generated by other people's content; it has little inherent value on its own, because without content it is useless. Drain the income of content creators for long enough and you no longer have content creators, so now you need another way to generate content. Enter generative AI.
And thus was the internet of 2024 forged, through stolen content and seeing no value in the creations of people, only desiring more content at any cost, as long as that cost to the platform is zero.