BIND 10 #3074: kea6 fails to start after repeated start/stop
BIND 10 Development
do-not-reply at isc.org
Wed Oct 9 06:28:56 UTC 2013
#3074: kea6 fails to start after repeated start/stop
-------------------------------------+-------------------------------------
Reporter: wlodekwencel | Owner:
Type: defect | UnAssigned
Priority: very high | Status:
Component: bind-ctl | reviewing
Keywords: kea6 config | Milestone:
Sensitive: 0 | Sprint-20131015
Sub-Project: DHCP | Resolution:
Estimated Difficulty: 0 | CVSS Scoring:
Total Hours: 0 | Defect Severity: High
| Feature Depending on Ticket:
| Add Hours to Ticket: 0
| Internal?: 0
-------------------------------------+-------------------------------------
Changes (by muks):
* owner: muks => UnAssigned
* status: assigned => reviewing
Comment:
`trac3074` is now ready for review.
The problem was that Init has two sockets open to msgq subscribed to the
Init group (it has to start a first socket to track process startup) and
it does not read from both of them. So msgq keeps queuing data on the
unread socket (from its side), and after a while, times out and closes the
socket. However, Init uses this socket still to communicate to msgq, so
when the socket is closed from the other side (resulting in EPIPE)
everything collapses. The fix was simple: we unsubscribe from the Init
group on the unmonitored socket so that nothing is queued up on it from
the msgq side.
While debugging this problem, quite a bit of additional logging was
necessary, but I've only committed the ones that are absolutely essential
as otherwise the amount of trace logging will hinder performance.
Some changes were suggested (and made locally) to the DHCP lettuce ticket
#3140. It seems these have now been made in `trac3140`, so these patches
have been dropped.
DHCP team have tried this patch and have found it to fix the issue.
--
Ticket URL: <http://bind10.isc.org/ticket/3074#comment:6>
BIND 10 Development <http://bind10.isc.org>
BIND 10 Development
More information about the bind10-tickets
mailing list