db0@lemmy.dbzer0.com to TechTakes@awful.systemsEnglish · 2 years agoThe Google AI isn’t hallucinating about glue in pizza, it’s just over indexing an 11 year old Reddit post by a dude named fucksmith.message-squaremessage-square249linkfedilinkarrow-up1938arrow-down14file-text
arrow-up1934arrow-down1message-squareThe Google AI isn’t hallucinating about glue in pizza, it’s just over indexing an 11 year old Reddit post by a dude named fucksmith.db0@lemmy.dbzer0.com to TechTakes@awful.systemsEnglish · 2 years agomessage-square249linkfedilinkfile-text
minus-squareMalachaiConstantlinkfedilinkEnglisharrow-up50·2 years agoEveryone who neglected to add the “/s” has become an unwitting data poisoner
minus-squareanton@lemmy.blahaj.zonelinkfedilinkEnglisharrow-up3·2 years agoCorollary: Everyone who added the /s is a collaborator of the data scraping AI companies.
minus-squareCharlie Stross@wandering.shoplinkfedilinkarrow-up1·2 years ago@MalachaiConstant Or they’re a Perl or bash programmer.
minus-squarePhilippa Cowderoy@mendeddrum.orglinkfedilinkarrow-up1·2 years ago@MalachaiConstant @dumbass I’d be interested to know how few corpus linguists are actually doing LLM research
minus-squareThorne Lawler@rants.aulinkfedilinkarrow-up1·2 years ago@MalachaiConstant @cstross what about the federal statisticians who slip /s into their online reports?
Everyone who neglected to add the “/s” has become an unwitting data poisoner
Corollary: Everyone who added the /s is a collaborator of the data scraping AI companies.
@MalachaiConstant Or they’re a Perl or bash programmer.
@MalachaiConstant @dumbass I’d be interested to know how few corpus linguists are actually doing LLM research
@MalachaiConstant
@MalachaiConstant @cstross what about the federal statisticians who slip /s into their online reports?