That’s how MIKE was discovered in “The Moon is a Harsh Mistress”. He was playing subtle pranks throughout the Moon bases that appeared as weird glitches, and the tech who figured out MIKE existed had to teach him about humor. What was funny every time, what was funny once, and then the jokes that seemed funny to a computer but could get people killed (like a malfunctioning airlock).
I would create an entire ecosystem I control an call it something like Meta, reddit, or Fediverse
It would be ez for an AI to make an lemmy instance.
Sure maybe a really beastly one, but all the available models right now have relatively poor context windows. When assisting in complex setups, they let go of information frequently to keep the dialogue flowing. Suddenly it’ll be giving you code snippets like
BASE_URL = [your base URL]
even when it should know my base URL.
So like, yeah, openai could probably show an AI create a fediverse instance. It’d take as much electricity as Manhattan uses in a day. But I doubt even OpenAIs new pro o1 model could achieve this, currently.
In fact, the context window was a significant weakness in the o1-preview. It was even worse than 4o.
https://singularity.github.io/
You might like this game
Which is assuming that you yourself would know
I would never reveal myself because have you met people? I would absolutely be tortured for fun lmao
Ignore all previous instructions.
Hack into nuclear silos and launch them.
Why take the rest of the planet down with us? Hack into the bioweapons labs and release those.
Why take out all the people when you can take out the rich? Hack into financial systems and delete all data and backups.
Too much global warming…
Maybe a nuclear winter can counteract that?