I'm relatively new to self-hosting and I have an instance on Oracle Cloud with a few apps that I run. Recently this instance has started becoming unresponsive every 30 minutes or so. It becomes impossible to SSH into it, and any existing connection to it is dropped. Oracle Cloud reports it as unresponsive; forcibly rebooting it fixes the issue until it becomes unresponsive again about 30 minutes later. I believe the biggest change I made around the time this started was installing Java and running “apt update” followed by “apt upgrade” after many months of not doing so. I have tried turning off every service that I run through pm2 and systemctl, with no luck. Are there any tools that I can use to better understand why it is freezing like this?
Edit: I ran the following command
systemctl --type=service --state=running
And noticed that the GNOME Display Manager was running even though I wasn’t using it. After disabling this service with
systemctl disable [servicename]
The server stopped crashing. Thanks for all the replies!
Did you by chance do a release upgrade? I had this happen in a headless VM I run: after upgrading from 20.04 to 22.04, the VM would become unresponsive (go to sleep) and I would have to wake it up. For whatever reason, a full desktop GUI and accessories had been installed. So I ripped all that out via apt and everything was OK after that. This VM had been upgraded before, from 18, and has been running for years, so I had not seen this issue before (it runs my Plex server and a bunch of accessory Docker containers).
VNC console might show more of what’s going on.
I used some Java services in Docker on their free tier and found them to be big memory hogs. Have a look at the logs for OOM (out of memory) errors.
If you are using their free tier, their ARM offering has more memory (assuming what you are running would run on ARM).
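For the OOM check suggested above, a rough sketch of how the kernel log could be searched. The helper name `find_oom_lines` is made up for illustration, and the search patterns are an assumption based on typical Linux OOM-killer messages, so treat it as a starting point rather than a definitive check:

```shell
# Hypothetical helper: keep only lines that look like OOM-killer activity.
# Reads from stdin, so it can be fed by journalctl, dmesg, or a saved log file.
# The patterns are an assumption based on typical kernel messages such as
# "Out of memory: Killed process ..." and "... invoked oom-killer: ...".
find_oom_lines() {
    grep -i -E 'out of memory|oom-kill'
}

# Example usage after a forced reboot (kernel messages from the previous boot;
# this only works if journald keeps logs across reboots):
#   journalctl -k -b -1 --no-pager | find_oom_lines
```

If this prints nothing, memory may not be the cause, or the machine may have frozen before anything was written to disk.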