Facebook "Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Reveal"

@[email protected] · 1 month ago

Facebook "Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Reveal"

@[email protected] · 1 month ago

Did they seed at least?

@[email protected] · 1 month ago

it’s facebook, they probably issued a takedown request for all their logged peers

@[email protected] · 1 month ago

The pivot-to-ai writeup is out, they did seed! I assume it’s documented then.

Multinational corporations can act ethically after all.

@[email protected] · 1 month ago

Multinational corporations can act ethically after all.

I wouldn’t go that far

@[email protected] · 1 month ago

They can, they just choose deliberately not to most of the time.

In total honesty though, Meta had actually done some good things for Open Source. Sure, this is probably it of their own interest and neither outweighs nor make up for all the bad. But they can, and sometimes do.

David Gerard · 1 month ago

It’s clear that they didn’t stop uploads of the torrents. It hasn’t been established in the documents we’ve seen so far that they actually had downloaders in turn. But they did clearly make the works available for upload.

monk · 1 month ago

Nice! Now simply fine them to pay significant royalty to every author in there, say, a millicent per word of everything they’ve generated before they get caught.

@JeeBaiChow · 1 month ago

We should just start a meme movement that makes up an imaginary yet believable fact, like the lemmings jumping off a cliff thing, wait for the ais to repeat it and lobby for royalties. Do one for each of the major ai platforms - openai, reddit, meta, apple, google etc. we would eventually find out which public forums are training which bots.

monk · 1 month ago

Doesn’t even have to be believable, LLMs Don not care.

@JeeBaiChow · 1 month ago

And yet these are the things the investment bankers expect to take us to the next level lol

@trolololol · 1 month ago

You don’t need that, all of them use everything

@JeeBaiChow · 1 month ago

I used to think they’d just train on every Facebook account that was ‘deleted’, i.e. removed from the public eye. This feels much worse.

@[email protected] · edit-2 1 month ago

So as libgen is blocked here in .nl by various providers (mine calls it thepiratebay for some reason), i look forward to all their llm being blocked.

@[email protected] · 1 month ago

no way, that’s illegal!

@[email protected] · 1 month ago

When I said “libgen is great because information should be free!” this isn’t what I meant… jeez