The Irony of 'You Wouldn't Download a Car' Making a Comeback in AI Debates

@FatCat · 5 months ago

The Irony of 'You Wouldn't Download a Car' Making a Comeback in AI Debates

@derf82 · 5 months ago

Not many. And generally not book passages or whole NY Post articles. That’s the point. OP claims it tosses the original, but it doesn’t.

@CeeBee_Eh · 5 months ago

Not many.

Yes, literally every single person on this planet can recite a song or poem.

But there are naturally massive differences between a human brain and an LLM. The point I was making is that an LLM doesn’t copy and store books and articles wholesale. The ability to reproduce samples from the dataset is more of a quirk than a feature, in the same way that a person can memorize things.

@derf82 · 5 months ago

But that is just it. When a commercial enterprise is literally saving copyrighted content and car reproduce it on demand, copyright holders have every right to object. Either use public domain materials and/or license copyrighted materials, or don’t try to make money off AI.

@Womble · 4 months ago

Where is the LLM that can reproduce specific whole copyrighted works on demand? All ive seen is reproductions of quotes of a few sentences (fair use) and hacks that can make it ocasionally vomit up random larger fragments of its training data, maybe up to a few paragraphs.