The Google AI isn’t hallucinating about glue in pizza, it’s just over indexing an 11 year old Reddit post by a dude named fucksmith.
db0 to [email protected] • English • 6 months ago • 252 comments
@MalachaiConstant • English • 49 • 6 months ago
Everyone who neglected to add the “/s” has become an unwitting data poisoner
@[email protected] • English • 2 • 5 months ago
Corollary: Everyone who added the /s is a collaborator of the data scraping AI companies.
Thorne Lawler • 1 • 6 months ago
@MalachaiConstant @cstross what about the federal statisticians who slip /s into their online reports?
Charlie Stross • 1 • 6 months ago
@MalachaiConstant Or they’re a Perl or bash programmer.
Philippa Cowderoy • 1 • 6 months ago
@MalachaiConstant @dumbass I’d be interested to know how few corpus linguists are actually doing LLM research