Recover from unexpected shutdowns with Lassie

Suggest Edits

Linodes have a featured called Lassie (Linode Autonomous System Shutdown Intelligent rEbooter), also referred to as the Shutdown Watchdog. When this feature is enabled, a Linode automatically reboots if it ever powers off unexpectedly.

Shutdown recovery behavior

The Shutdown Watchdog feature detects when a Linode is powered off and checks if that directive came from our platform (such as Cloud Manager or Linode API). If the power off command did not originate from our platform, the shutdown is considered unexpected and the Linode is automatically powered back on.

📘
Shutdown Watchdog can power back on a Linode up to 5 times within a 15 minute period. If there is a recurring issue that is causing 6 or more shutdowns within this time period, the Linode remains powered off until it is manually powered back on. This is to prevent endless reboot loops if there is an issue with the internal software of a Linode.

Enable (or disable) Shutdown Watchdog

By default, Shutdown Watchdog is enabled on all new Linodes. If you wish to disable or re-enable this feature, follow the instructions below:

Log in to Cloud Manager and navigate to the Linodes link in the sidebar.
Select the Linode that you wish to modify.
Navigate to the Settings tab.
Scroll down to the section labeled Shutdown Watchdog.
From here, click the corresponding toggle button to update this setting to the desired state, either enabled or disabled.

Reasons for an unexpected shutdown

An unexpected shutdown is when a Linode powers off without receiving a power off command from our platform (such as one issued by a user in Cloud Manager or API). In general, this is caused within a Linode's internal system or software configuration. The following list includes potential reasons for these unexpected shutdowns.

A user issues the shutdown command in the shell environment of a Linode. In Linux, a system can be powered off by entering the shutdown command (or other similar commands) in the system's terminal. Since the platform has no knowledge of internal commands issued on a Linode, it is considered an unexpected shutdown.
Kernel panic: A kernel panic can occur when your system detects a fatal error and it isn't able to safely recover. Here is an example of a console log entry that indicates a kernel panic has occurred:
```
Kernel panic - not syncing: No working init found.
```
Out of memory (OOM) error: When a Linux system runs out of memory, it can start killing processes to free up additional memory. In many cases, your system remains accessible but some of the software you use may stop functioning properly. This can occasionally result in your system becoming unresponsive or crashing, causing an unexpected shutdown.
```
kernel: Out of memory: Kill process [...]
```
Other system crashes, such as a crash caused by the software installed on your system or a malicious process (such as malware).

📘
The Shutdown Watchdog feature never causes a Linode to shut down and only ever powers on an Linode if it detects an unexpected shutdown.

Investigate the cause of a shutdown

The underlying cause of these issues can vary. The most helpful course of action is to review your system logs.

Open the Lish console. This displays your system's boot log and, if your system boot was normal, a login prompt appears. If you do not see a login prompt, look for any errors or unexpected output that indicates a kernel panic, file system corruption, or other type of system crash.
Log in to your system through either SSH or Lish and review the log files for you system using either journald or syslog. For systems using systemd-journald for logging, you can use the journalctl command to review system logs. See Use journalctl to View Your System's Logs for instructions.
- journalctl -b: Log entries for the last system boot
- journalctl -k: Kernel messages
For systems using syslog, you should review the following log files using your preferred text editor (such as nano or vim) or file viewer (such as cat or less).
- /var/log/syslog: Most logs as recorded by syslog.
- /var/log/boot.log: Log entries for the last system boot
- /var/log/kern.log: Kernel messages
- /var/log/messages: Various system notifications and messages typically recorded at boot.
You may also want to review log files for any other software you have installed on your system that might be causing these issues.

📘
Unexpected shutdowns are primarily caused by issues with the internal software configuration of a Linode. To investigate these issues further, it is recommended that you reach out to your own system administrators or on our Community Site. These issues are generally outside the scope of the Support team.

File system corruption

In some cases, unexpected shutdowns can cause file system corruption on a Linode. If an error message (such as the one below) appears within your console logs, your file system may be corrupt or otherwise be in an inconsistent state.

/dev/sda: UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY.

In cases like this, it is recommended that you attempt to correct the issue by running the fsck tool in Rescue Mode. See Using fsck to Find and Repair Disk Errors and Bad Sectors for instructions.

Updated about 2 months ago