Poisoned AI went rogue during training and couldn't be taught to behave again in 'legitimately scary' study

L4sBot · 1 year ago

@[email protected] · 1 year ago

LLM trained on inflammatory data produces inflammatory results, shocking.

@[email protected] · 1 year ago

I know we don’t like them here but the word reddit is not banned (yet)

@[email protected] · 1 year ago

What? What does my comment have to do with Reddit?

@[email protected] · 1 year ago

So you’re saying that “Inflammatory data” isn’t a reference to reddit? :D

@[email protected] · 1 year ago

I’d say using Twitter and Facebook would be worse than reddit. Or, and I shudder to think about it, truth social…

@[email protected] · 1 year ago

Reddit is used more for AI models as those…

@[email protected] · 1 year ago

Not inherently. I’m sure that’s part of it, but it’s really everywhere. Even here on Lemmy I’ve run into nasty folk.

@[email protected] · 1 year ago

True but it’s reddit that’s served as a base for most models…

@[email protected] · 1 year ago

Not just reddit, LAION is a huge dataset

@[email protected] · 1 year ago

Obviously, but reddit is in the goldilocks zone where you get coherent, intelligent stuff and humor and facts.

But it’s still toxic for an AI.

@[email protected] · 1 year ago

Saying it served as the base for most models is just objectively incorrect though

@Chocrates · 1 year ago

No, the LLM is the AI. OP is saying if you train it with hate, it’s gonna spit out hate.

@[email protected] · 1 year ago

And I’m saying that reddit data is sublime for AI. And specifically that it’s infested with toxicity.