Michael Ten to TechnologyEnglish • 8 months agoOpenAI transcribed over a million hours of YouTube videos to train GPT-4www.theverge.comexternal-linkmessage-square43arrow-up1167arrow-down112cross-posted to: [email protected]aicompanions
arrow-up1155arrow-down1external-linkOpenAI transcribed over a million hours of YouTube videos to train GPT-4www.theverge.comMichael Ten to TechnologyEnglish • 8 months agomessage-square43cross-posted to: [email protected]aicompanions
minus-square@[email protected]linkfedilinkEnglish9•8 months agoThere’s a distinct difference between quotation and plagiarism. A search engine does the former, LLMs do the latter.
minus-square@Knock_Knock_Lemmy_InlinkEnglish-2•8 months agoNo. If you write a truly unique combination of words then an LLM will be very unlikely to reproduce them. An LLM is only likely to plagiarise you if your writing is similar to others.
minus-square@Knock_Knock_Lemmy_InlinkEnglish-1•8 months agohttps://blog.gdeltproject.org/do-llms-truly-create-or-merely-arrange-just-how-much-of-an-llms-writing-is-original/
minus-square@[email protected]linkfedilinkEnglish1•8 months ago The differences between human and machine-generated text overlap support the image of LLMs as more “arrangers” than “creators” of text. So plagiarism…
minus-square@Knock_Knock_Lemmy_InlinkEnglish1•8 months agoIt only plagiarises you if you write something similar to lots of other people. Write something original and, even if it is in their training dataset, LLMs are highly unlikely to reproduce it.
There’s a distinct difference between quotation and plagiarism. A search engine does the former, LLMs do the latter.
No. If you write a truly unique combination of words then an LLM will be very unlikely to reproduce them.
An LLM is only likely to plagiarise you if your writing is similar to others.
[citation needed]
https://blog.gdeltproject.org/do-llms-truly-create-or-merely-arrange-just-how-much-of-an-llms-writing-is-original/
So plagiarism…
It only plagiarises you if you write something similar to lots of other people.
Write something original and, even if it is in their training dataset, LLMs are highly unlikely to reproduce it.