BIND 10 #1858: sockcreator doesn't die
BIND 10 Development
do-not-reply at isc.org
Thu Mar 29 01:29:22 UTC 2012
#1858: sockcreator doesn't die
-------------------------------------+-------------------------------------
Reporter: jinmei                     | Owner:
Type: defect                         | Status: new
Priority: medium                     | Milestone: Next-Sprint-Proposed
Component: Boss of BIND              | Resolution:
Keywords:                            | Sensitive: 0
Defect Severity: N/A                 | Sub-Project: Core
Feature Depending on Ticket:         | Estimated Difficulty: 0
Add Hours to Ticket: 0               | Total Hours: 0
Internal?: 0                         |
-------------------------------------+-------------------------------------
Comment (by jinmei):
This is the log from when this happened.
{{{
2012-03-27 23:12:11.780 INFO [b10-boss.boss] BIND10_CONFIGURATOR_STOP
bind10 component configurator is shutting down
2012-03-27 23:12:11.782 INFO [b10-boss.boss] BIND10_COMPONENT_STOP
component b10-auth-2 is being stopped
2012-03-27 23:12:11.782 INFO [b10-boss.boss] BIND10_STOP_PROCESS asking
b10-auth-2 to shut down
2012-03-27 23:12:11.783 ERROR [b10-boss.boss]
BIND10_CONFIGURATOR_PLAN_INTERRUPTED configurator plan interrupted, only 0
of 11 done
2012-03-27 23:12:12.800 INFO [b10-boss.boss] BIND10_SEND_SIGTERM sending
SIGTERM to Socket creator (PID 29154)
2012-03-27 23:12:12.801 WARN [b10-boss.boss] BIND10_SOCKCREATOR_KILL
killing the socket creator
...
2012-03-27 23:12:12.993 ERROR [b10-xfrout.xfrout]
XFROUT_RECEIVE_FILE_DESCRIPTOR_ERROR error receiving the file descriptor
for an XFR connection
(lots of this)
...
2012-03-27 23:12:19.347 INFO [b10-boss.boss] BIND10_SEND_SIGKILL sending
SIGKILL to Socket creator (PID 29154)
(lots of this)
...
2012-03-27 23:13:21.874 INFO [b10-boss.boss] BIND10_PROCESS_ENDED process
29154 of Socket creator ended with status 15
2012-03-27 23:13:21.875 INFO [b10-boss.boss] BIND10_SHUTDOWN_COMPLETE all
processes ended, shutdown complete
}}}
In this case I killed the socket creator by hand. I guess the log at
23:13:21.874 was due to this manual operation.
So, what apparently happened is that Boss's shutdown message to the
socket creator was somehow lost, and while Boss tried to kill it
forcefully (and of course unsuccessfully, because it didn't have the
permission), both Boss and the creator stayed alive.
The lost message is itself an issue, but the subsequent behavior is
obviously very bad (if I hadn't noticed it, it could effectively have
resulted in a busy loop). At the very least, Boss should give up
sending a signal once it fails with "permission denied", because a
retry would never succeed.
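
To illustrate the "give up on permission denied" idea, here is a minimal
sketch in Python. It is not the actual Boss code: the function names
(send_signal_once, stop_process), the status strings, and the injectable
`kill` parameter are all hypothetical and chosen only for this example.
The only real APIs used are os.kill and the errno constants.

```python
import errno
import os
import signal


def send_signal_once(pid, sig, kill=os.kill):
    """Try to send `sig` to `pid` and report what happened.

    Returns one of (hypothetical status strings):
      "sent"    - the signal was sent; the caller may escalate later
      "gone"    - no such process (ESRCH); nothing is left to stop
      "give_up" - permission denied (EPERM); a retry cannot succeed
    """
    try:
        kill(pid, sig)
    except OSError as e:
        if e.errno == errno.EPERM:
            return "give_up"
        if e.errno == errno.ESRCH:
            return "gone"
        raise  # anything else is unexpected; let the caller see it
    return "sent"


def stop_process(pid, max_attempts=3, kill=os.kill):
    """Send SIGTERM first, then SIGKILL, for a bounded number of attempts.

    Stops immediately if the process is gone or if we lack permission.
    A real implementation would also wait between attempts and check
    whether the process has exited; that part is omitted here.
    """
    for attempt in range(max_attempts):
        sig = signal.SIGTERM if attempt == 0 else signal.SIGKILL
        status = send_signal_once(pid, sig, kill)
        if status != "sent":
            return status
    return "sent"
```

With something along these lines, an EPERM from the first SIGTERM would
end the loop at once instead of repeating SIGKILL indefinitely, and the
bound on attempts avoids a busy loop even when sending succeeds.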
--
Ticket URL: <http://bind10.isc.org/ticket/1858#comment:1>
BIND 10 Development <http://bind10.isc.org>