<html dir="ltr">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style id="owaParaStyle" type="text/css">
<!--
p
{margin-top:0;
margin-bottom:0}
-->
P {margin-top:0;margin-bottom:0;}</style>
</head>
<body ocsi="0" fpstyle="1" style="word-wrap:break-word">
<div style="direction: ltr;font-family: Tahoma;color: #000000;font-size: 10pt;">To clarify for those rolling their eyes at my previous question: doing a kill -9 on the dhcp PID does not resolve the issue. After I kill the process and restart the server, it remains
in the same state.<br>
<br>
Patrick<br>
<br>
<div style="font-family: Times New Roman; color: #000000; font-size: 16px">
<hr tabindex="-1">
<div style="direction: ltr;" id="divRpF197383"><font color="#000000" face="Tahoma" size="2"><b>From:</b> dhcp-users-bounces@lists.isc.org [dhcp-users-bounces@lists.isc.org] on behalf of Patrick Trapp [ptrapp@nex-tech.com]<br>
<b>Sent:</b> Wednesday, July 30, 2014 3:49 PM<br>
<b>To:</b> Users of ISC DHCP<br>
<b>Subject:</b> RE: Primary server stuck in "recovering" on restarts<br>
</font><br>
</div>
<div></div>
<div>
<div style="direction:ltr; font-family:Tahoma; color:#000000; font-size:10pt">How do I identify the process that needs to get the kill -9 treatment? After stopping isc-dhcp-server, I don't know what process is the culprit.<br>
<br>
Patrick<br>
<br>
<div style="font-family:Times New Roman; color:#000000; font-size:16px">
<hr tabindex="-1">
<div id="divRpF879112" style="direction:ltr"><font color="#000000" face="Tahoma" size="2"><b>From:</b> dhcp-users-bounces@lists.isc.org [dhcp-users-bounces@lists.isc.org] on behalf of Shawn Routhier [sar@isc.org]<br>
<b>Sent:</b> Wednesday, July 30, 2014 11:57 AM<br>
<b>To:</b> Users of ISC DHCP<br>
<b>Subject:</b> Re: Primary server stuck in "recovering" on restarts<br>
</font><br>
</div>
<div></div>
<div>This is likely caused by our addition of a "gentle shutdown" feature in 4.2.6 and 4.3.0.
<div>In this we added a signal handler to collect some signals and shut the server down</div>
<div>cleanly. Unfortunately one side effect of this change was to put the peer into partner-down.</div>
<div><br>
</div>
<div>We have backed out this change for 4.2.7 and 4.3.1 (both currently in beta; if people are</div>
<div>testing them, we'd like to hear about your results).</div>
<div><br>
</div>
<div>In the meantime, if this is the problem, you should be able to avoid it by using a hard kill</div>
<div>such as "kill -9" to stop the process.</div>
<div><br>
</div>
<div>Shawn</div>
<div><br>
</div>
<div>
<div>
<div>On Jul 30, 2014, at 9:13 AM, Patrick Trapp wrote:</div>
<br class="Apple-interchange-newline">
<blockquote type="cite"><span class="Apple-style-span" style="border-collapse:separate; font-family:Helvetica; font-style:normal; font-variant:normal; font-weight:normal; letter-spacing:normal; line-height:normal; orphans:2; text-indent:0px; text-transform:none; white-space:normal; widows:2; word-spacing:0px; font-size:medium">
<div>
<div style="direction:ltr; font-family:Tahoma; color:rgb(0,0,0); font-size:10pt">
We recently upgraded to 4.2.6 for our ISC DHCP implementation (using failover) on a pair of Debian servers. We specifically needed the patch for the longer bootstring parameters before we upgrade other systems on our network, but it was not available from Debian
packages yet, so we manually compiled the necessary changes in.<br>
<br>
We have a lot of minor tweaks to host entries and group entries in our configuration - maybe up to a dozen a day - so we have scripted out some of the steps for synchronizing our configurations between the servers. The last script stops the primary server,
restarts it, stops the secondary server, and restarts it. This has worked great for years and was working on 4.2.2 (the most recent version prior to this patch/upgrade).<br>
<br>
However, whenever we run our script to restart the two servers now, the primary server gets stuck in a recovering state and never starts offering addresses. I have gotten it to straighten out by stopping it again manually and restarting it a few times.
I have had to restart the secondary server before the primary seemed to wake up at least once - but that might have been me being impatient waiting for the primary to wake up.<br>
<br>
Is there something different in 4.2.6 that we need to accommodate in our restart script? Or have we been playing with fire all this time and our luck just ran out? Or, to put it another way, what would be the best way to implement this restart? We are not doing anything
with changing states on either server to partner-down or anything like that - should we be?<br>
<br>
Appreciate any suggestions.<br>
</div>
</div>
</span></blockquote>
</div>
<br>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</body>
</html>