Hacker News new | past | comments | ask | show | jobs | submit login

While it's true that "nothing" or "custom" should be poll options, I don't agree with your reasons.

Supervisor programs aren't just about restarting processes if they die. They manage dependencies, reloading, a central authority for starting/stopping and other administrative tasks. Some additionally do things like logging.

And besides, even the most stable software can't be guaranteed bug-free (though I do agree that a better action might be to stop and yell loudly rather than blindly restart). Why forego a safety net?




I think there are some problems with your statement. All these process-reloading and management options are a symptom of a much larger problem. The problem is simply:

   People have stopped writing software designed to run continuously.
Some notes from 20 years of Unix fudging:

If a process dies, it's a problem with the process, not an external problem. Make it reliable. Software shouldn't fail at all. PostgreSQL, Postfix, init never crash from experience. Why shouldn't your software do the same?

Reloading: system-wide: "service whatever restart/reload"

Dependencies: Dependencies are hell. Having used BIG Unix kit with power sequencers and stuff, I wouldn't ever take on a dependency. If I did, the dependent processes should fail gracefully and retry.

Central authority: service xxx restart/reload again...

Logging: syslog? On our kit, we use Splunk but it's pretty much overkill for most people.

If you can't sleep at night because your processes crash, there's no good employing someone (a supervisor) to go around and do CPR on them. Fix the root problem.


Funny you should mention PostgreSQL, since it includes a monitoring process - the postmaster - that restarts the workers if they die ;) (and they do, occasionally).


That's because the child processes can load and run third party code, which obviously can crash them. Just like apache modules can crash httpd worker processes. So why run an extra layer of monitoring on top of the existing one for apps like those?


I don't think that's what's being suggested.


All the software in the poll is general purpose monitoring/restarting software. It isn't software designed to be multi-process and monitor its own child processes.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: