BIND 10 #1147: Boss should give up after n unsuccessful attempts to start a module

BIND 10 Development do-not-reply at isc.org
Tue Jul 12 21:17:28 UTC 2011


#1147: Boss should give up after n unsuccessful attempts to start a module
-------------------------------------+-------------------------------------
            Reporter:  vorner        |                        Owner:
                Type:  defect        |                       Status:  new
            Priority:  minor         |                    Milestone:  New
           Component:  Boss of BIND  |  Tasks
           Sensitive:  0             |                     Keywords:
         Sub-Project:  Core          |              Defect Severity:  N/A
Estimated Difficulty:  0             |  Feature Depending on Ticket:
         Total Hours:  0             |          Add Hours to Ticket:  0
                                     |                    Internal?:  0
-------------------------------------+-------------------------------------
 If a module has a fatal problem and can not be started (eg. it crashes
 right after being started), boss keeps restarting it and is „jumping“.
 This isn't a serious problem, but it eats resources unnecessarily and is
 annoying, because it fills logs. This sometimes leads to workarounds like
 the one in #1146.

 I suggest that if the started process dies in 10s from being started, we
 count it as an unsuccessful attempt to start it. If we do 5 of them in a
 row (without a successful one), we give up more attempts to start it,
 since it's hopeless (for some value of 10 and 5).

 Then we would add a command an admin could use to ask boss to try again
 (after the admin thinks the problem was fixed) ‒ reset the count of
 unsuccessful attempts to 0 again.

-- 
Ticket URL: <http://bind10.isc.org/ticket/1147>
BIND 10 Development <http://bind10.isc.org>
BIND 10 Development


More information about the bind10-tickets mailing list