How will the fediverse respond to AI orgs scraping Lemmy/the fediverse for training data?

@sachasage · 2 years ago

Fair, but then there’s a line between scraping through ordinary traffic and using API access to gather large data sets.

key · 2 years ago

Is there? The effect is the same. Use machine learning to parse HTML generically and throw hardware and a pool of IPs at it. That's a lot more efficient than coding an API client for every service out there. It’s the same approach search engines use.
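
To make the "parse HTML generically" point concrete, here is a rough sketch of service-agnostic text extraction. It is an assumption-laden illustration, not what any scraper actually runs: it uses a plain tag-stripping heuristic from Python's standard library rather than machine learning, and the names `TextExtractor` and `extract_text` are made up for this example. The point is only that nothing in it knows which site the page came from.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text from any HTML page, ignoring site-specific structure."""

    # Tags whose contents are never shown to a reader.
    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside a <script> or <style> element
        self.chunks = []      # visible text fragments, in document order

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def extract_text(html: str) -> list[str]:
    """Hypothetical helper: return the visible text fragments of an HTML document."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.chunks


# Works the same whether the markup came from Lemmy, Mastodon, or anything else.
page = (
    "<html><head><style>p{}</style></head><body>"
    '<div class="post"><p>Hello <b>fediverse</b></p>'
    "<script>var x = 1;</script></div></body></html>"
)
print(extract_text(page))
```

A per-service API client would need its own endpoints, auth, and response schema for every platform, while something like this only needs the rendered page.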

I don’t see anything being done effectively without legal protections.