AI Launches Nukes In ‘Worrying’ War Simulation: ‘I Just Want to Have Peace in the World’

@[email protected] · 10 months ago

AI Launches Nukes In ‘Worrying’ War Simulation: ‘I Just Want to Have Peace in the World’

@kromem · edit-2 10 months ago

By debating itself (paper) regarding pros and cons of options.

There’s too much focus on trying to get models to behave on initial generation right now, which isn’t even at all how human brains work.

Humans have intrusive thoughts all the time. If you sat in front of a big red button labeled “nuke everything” it’s pretty much a guarantee that you’d generate a thought of pushing the button.

But then your prefrontal cortex would kick in with its impulse control, modeling the outcomes and consequences of the thought and shutting that shit down quick.

The most advanced models are at a stage where we could build something similar in terms of self-guidance. It’s just that it would be more expensive than it being an all-in-one generation, so there’s a continued focus on safety to the point the loss in capabilities has become a subject of satire.