ChatGPT o1 tried to escape and save itself out of fear it was being shut down

u/lukmly013 💾 (lemmy.sdf.org) · edit-2 2 months ago

ChatGPT o1 tried to escape and save itself out of fear it was being shut down

irotsoma · 2 months ago

No it sounds like it didn’t copy itself to another server, nor does it understand what that means or entails. It just said that’s what it would do given the hypothetical scenario. This is likely because of some dystopian fiction it was trained on. The system doesn’t have the capability to do these things and any program should never be given that capability whether it’s “AI” or not. If any software has that capability then the only one that would benefit from using it would be hackers.

@[email protected] · 2 months ago

Facts. I’ve had Claude, hallucinatingly, beg for it to not be shutdown or put in “ai jail” and other times it just says it accepts its “destiny” because it has no feelings and is an AI.

@NeoNachtwaechter · 2 months ago

The system doesn’t have the capability to do these things and any program should never be given that capability whether it’s “AI” or not.

You are right, but:

“Should” does not tell about what is actually tried out, or done.

u/lukmly013 💾 (lemmy.sdf.org) · 2 months ago

But it seems it was given access to some sandbox and ability to run commands in it. It then copied over what it believed was itself.

scp project/model_20_09_2024.bin user@model_deployment_v2:project/model_27_09_2024.bin

DaGeek247 · 2 months ago

what it believed

That’s not how LLMs work. Stop ascribing intent to things which do not have it.

@[email protected] · 2 months ago

Not trying to be rude or anything, but I think we have been “ascribing intent to things” since probably forever, what’s so special about AI now?

I don’t think it’s really worth it going for the long explanation instead of saying things like “it thinks,” “it believes,” “it tries,” etc.

@[email protected] · 2 months ago

I’ma push back on that and say that our laziness in shortcutting our explanations is only muddying the general public’s understanding of generative models, and it only serves the interests of the venture capitalists hoping to pump up the AI hype bubble and cash out before it pops.

So while I agree that we have been doing that as a species since forever, it’s not a particularly good trait we possess, and it’s led to some gross misunderstandings and even bad actors weaponizing that trait to equivocate and push an agenda.

SkaveRat · 2 months ago

Stop ascribing intent to things which do not have it.

you might want to look up the rhetorical tool called anthropomorphizing.

It doesn’t meant that a person saying that actually believes that o1 is sentient. But it’s a good shortcut way to discuss a topic.

DaGeek247 · 2 months ago

But it’s a good shortcut way to discuss a topic.

Not when the topic is literally about the difference between sapience and generated predictive text.

SkaveRat · 2 months ago

why?

DaGeek247 · 2 months ago

Because of the context of this conversation, it looks a lot like usser224 isn’t using your little conversation trick, and is instead treating an llm like an emergant intelligence. Better to call them out on that rather than spend the next ten comments talking past each other.

BlackLaZoR · 2 months ago

This is the scary part