server crashed after reload - Secondary Server only, caching enabled... (fwd)

Jeremy C. Reed jreed at isc.org
Thu Sep 6 21:54:29 UTC 2012


I accidently sent to the wrong list. I send it again.

  Jeremy C. Reed
  ISC

---------- Forwarded message ----------
Date: Thu, 6 Sep 2012 16:52:46 -0500 (CDT)
From: Jeremy C. Reed <jreed at isc.org>
To: Christian 'wiwi' Wittenhorst <wiwi at progon.net>
Cc: bind10-dev at lists.isc.org
Subject: Re: server crashed after reload - Secondary Server only,
    caching enabled...

On Thu, 6 Sep 2012, Christian 'wiwi' Wittenhorst wrote:

> Here's are more complete debug log... it was too long to email to the list.

Thanks. I hope you don't mind but I reply to list with some parts of it. 
See below.

> 2012-09-03 16:03:18.873 FATAL [b10-auth.auth] AUTH_SERVER_FAILED server
> failed: RRSIG is being added, but no RR of covered type found: as34288.net.
> 7200 IN RRSIG SSHFP 7 2 7200 20120917042044 20120831011030 43405 as34288.net.
> zWGqGQr5AxLdZdX7+Xx6lwGGyBmT/jOR6H11ZXQ2/Qogv5zh86KO2x0XxnhCDSSSkQkAbVDNTqgx+uWVtaz10X8/BJXiAzJUOLGAB/iLKvPPI5tI2PJuTYfONy7LytWBkrSMmihUJcBXdmGKdNJ2edql6Y7PkX+z4deKSB9imF0=

Please check your zone file. Is it missing the "as34288.net. SSHFP" 
record?

Even if it is, this should not be fatal and the entire server should not 
exit.

...

> 2012-09-03 16:03:18.874 DEBUG [b10-auth.datasrc] DATASRC_SQLITE_DROPCONN
> SQLite3Database is being deinitialized
> 2012-09-03 16:03:18.874 DEBUG [b10-auth.datasrc] DATASRC_SQLITE_CONNCLOSE
> Closing sqlite database
> 2012-09-03 16:03:18.874 DEBUG [b10-auth.cc] CC_DISCONNECT disconnecting from
> message queue daemon
> 2012-09-03 16:03:18.874 DEBUG [b10-auth.cc] CC_DISCONNECT disconnecting from
> message queue daemon
> 2012-09-03 16:03:18.874 DEBUG [b10-auth.datasrc] DATASRC_MEM_DESTROY
> destroying zone 'as34288.net.' in 'IN' class
> 2012-09-03 16:03:18.874 DEBUG [b10-zonemgr.zonemgr] ZONEMGR_LOAD_ZONE loading
> zone 176.233.109.in-addr.arpa. (class IN)
> 2012-09-03 16:03:18.875 DEBUG [b10-zonemgr.zonemgr] ZONEMGR_LOAD_ZONE loading
> zone 177.233.109.in-addr.arpa. (class IN)
> 2012-09-03 16:03:18.875 DEBUG [b10-auth.datasrc] DATASRC_SQLITE_DROPCONN
> SQLite3Database is being deinitialized
> 2012-09-03 16:03:18.875 DEBUG [b10-auth.datasrc] DATASRC_SQLITE_CONNCLOSE
> Closing sqlite database
> 2012-09-03 16:03:18.875 DEBUG [b10-auth.datasrc] DATASRC_MEM_DESTROY
> destroying zone 'BIND.' in 'CH' class
> 2012-09-03 16:03:18.875 FATAL [b10-auth.auth] AUTH_SERVER_FAILED server
> failed: RRSIG is being added, but no RR of covered type found: as34288.net.
> 7200 IN RRSIG SSHFP 7 2 7200 20120917042044 20120831011030 43405 as34288.net.
> zWGqGQr5AxLdZdX7+Xx6lwGGyBmT/jOR6H11ZXQ2/Qogv5zh86KO2x0XxnhCDSSSkQkAbVDNTqgx+uWVtaz10X8/BJXiAzJUOLGAB/iLKvPPI5tI2PJuTYfONy7LytWBkrSMmihUJcBXdmGKdNJ2edql6Y7PkX+z4deKSB9imF0=


I don't understand why this is logged twice.


...

> 2012-09-03 16:03:18.876 ERROR [b10-boss.boss] BIND10_COMPONENT_FAILED
> component b10-auth-0 (pid 1956) failed: process exited normally with exit
> status 256
> 2012-09-03 16:03:18.876 FATAL [b10-boss.boss] BIND10_COMPONENT_UNSATISFIED
> component b10-auth-0 is required to run and failed

...

> 2012-09-03 16:03:18.877 INFO  [b10-boss.boss] BIND10_COMPONENT_STOP
> component b10-zonemgr is being stopped
...
> 2012-09-03 16:03:18.882 DEBUG [b10-zonemgr.zonemgr] ZONEMGR_LOAD_ZONE loading
> zone 181.233.109.in-addr.arpa. (class IN)

(and many more)

zonemgr should not still load zones while rest of server is shutting 
down.

> 2012-09-03 16:03:19.884 ERROR [b10-xfrin.config]
> CONFIG_SESSION_STOPPING_FAILED error sending stopping message: [Errno 32]
> Broken pipe
> 2012-09-03 16:03:19.887 ERROR [b10-zonemgr.config]
> CONFIG_SESSION_STOPPING_FAILED error sending stopping message: [Errno 32]
> Broken pipe


And the order of shutdowns should not allow msgq or cfgmgr to exit 
before other components. Or if that is okay, then at least they 
shouldn't ever need to use msgq or cfgmgr in this case.

Few different bugs here. I will create some more bug tickets.

Thank you again for your useful feedback.


More information about the bind10-users mailing list