Anthropic says its latest AI model is too powerful for public release and that it broke containment during testing

return2ozma · 1 day ago

Anthropic says its latest AI model is too powerful for public release and that it broke containment during testing

paraphrand · edit-2 1 day ago

I wasn’t wrong in this reply. I was asked about believing Anthropic.

Are you saying they are lying? Why should I disbelieve Anthropic?

theunknownmuncher · 21 hours ago

Your reasoning was (paraphrased, so hopefully I understood you correctly) “why would they lie about the model disobeying instructions because that looks bad for them”

But I believe Anthropic when they say their models are not working as intended and posing security risks.

But when you actually read the article, they had specifically prompted the model to do the things it did.

Also Anthropic has a patterned history of greatly exaggerating and outright lying.

paraphrand · 20 hours ago

deleted by creator