The mental checklist I use when troubleshooting Linux servers

Published: 4 days ago (December 20, 2025 at 02:09 PM EST)

1 min read

Source: Dev.to

Source: Dev.to

Step 1: What is broken?

Service not running?
Server unreachable?
Performance issue?
Permission issue?
Always define failure first

Step 2: Is the system alive?

Can I SSH in?
Is the server responsive?
Is the disk full?
Is RAM exhausted?

Step 3: Is the service running?

Is the process running?
Did it fail to start?
Did it crash?
This eliminates 50 % of issues

Step 4: Check logs

Why it failed
What it tried to do
What it couldn’t access
Learn to scan logs, not read every line

Step 5: What changed last?

Updates
Config edits
Permission changes
New files
Always ask: what changed?

Step 6: Narrow scope

Is it one user or all users?
One service or the whole system?
One port or all networking?
This prevents panic

Step 7: Test ONE thing at a time

Make a small change
Restart service
Observe
Never shotgun‑fix

Step 8: Confirm + document

Is it fixed?
Why?
What would I do faster next time?
That’s real troubleshooting

Related posts

Replacing Phone Addiction with Building a Real Project

!Cover image for Replacing Phone Addiction with Building a Real Projecthttps://media2.dev.to/dynamic/image/width=1000,height=420,fit=cover,gravity=auto,format=a...

A Definitive Guide to Warehouse Utilisation

Introduction A warehouse is, fundamentally, just a 3‑D box. Utilisation is simply the measure of how much of that box you are actually using. While logistics c...

CinemaSins: Everything Wrong With Red One In 18 Minutes Or Less

Overview Everything Wrong With Red One in 18 Minutes Or Less takes a festive swing at the predictably plotted holiday blockbuster, tallying up every “sin” agai...

Ingesting 100M Heartbeats: Scaling Wearable Tech Without Going Broke

The Math of “Continuous” Let’s be real about the numbers. If a device sends a heartbeat payload once per second 1 Hz: - 1 User = 86,400 writes/day. - 1,000 Use...