[bind10-dev] transfer problems
Likun Zhang
zlkzhy at gmail.com
Fri Apr 29 03:33:43 UTC 2011
Hi Jeremy,
On Wednesday, April 27, 2011 9:37 PM, Jeremy wrote:
> I think the use is: a master contains the signed zone (it is not
> "hidden" but it also is not part of the delegation). Four NS records.
> One of these is the BIND 10 server. I think the three others may be
> configured to pull from the new BIND 10 server, but I do not know how
> they are configured.
Are you talking about how to configure master address for slave bind10 auth server? If it is, you have to configure it in xfrin's spec file.
> [b10-xfrin] transfer of 'bind10.isc.org.': AXFR started
> [b10-xfrin] Error while loading bind10.isc.org.: receive data from
> socket time out.
> [b10-xfrin] transfer of 'bind10.isc.org.': AXFR failed
>
> (This is logged over 80 times.)
>
> I don't know when the failed is logged as I don't have the very last
> one. It would be nice to have real logging with timestamps and PID of
> process.
>
> It doesn't indicate when or where. A simple dig against the master for
> AXFR does work fine on same system.
I think the problem you mentioned has been recorded in ticket 761.
> The serial number on master and the three other
> secondaries is 2011042400. The serial on the BIND 10 server is
> 2011041300.
>
> This is the second time I have seen the xfrin not working. Early last
> week it was serving wrong data. It was restarted since then.
>
> jelte provided minor patch for xfrin.py.in to also output the
> self._master_address to know where the timeout was from. I didn't use
> this yet.
I'm not sure about the reason, hopely jelte's patch can catch the problem.
> The verbose output also includes:
>
> --------------------------------------
> [b10-auth] received a message:
> ;; ->>HEADER<<- opcode: NOTIFY, status: NOERROR, id: 13575
> ;; flags: aa ; QUESTION: 1, ANSWER: 1, AUTHORITY: 0, ADDITIONAL: 0
>
> ;; QUESTION SECTION:
> ;bind10.isc.org. IN SOA
>
> ;; ANSWER SECTION:
> bind10.isc.org. 0 IN SOA ns-int.isc.org. hostmaster.isc.org. 2011042400
> 7200 3600 604800 3600
>
> [b10-auth] received a message:
> --------------------------------------
>
> The above makes no sense. Notice the "flags: aa ;"
>
> The answer serial is not the same as sent other times (2011041300). This
> specific query doesn't have any corresponding answer sent back. (I don't
> see any corresponding "sending a response" for same.)
>
> Why would it receive and log a received message that includes the answer
> section?
>
> Maybe this corresponds to it originating a SOA check, but that original
> query is not noted in the verbose output.
>
> Note that is the correct serial that all the other auth servers (except
> this one) know.
>
> I don't know if we send any response if our xfrin fails. (I didn't do a
> capture yet.)
No, this a notify message received from some master.
>
> Again the goal of this BIND 10 server is to be the master used by the
> other three public auth servers. But the Xfrout.log doesn't indicate
> that at all. No notifies received or transfers out logged since was
> restarted on April 21.
>
I don't think axfr is pulled from this bind10 server, since every transfer-out will have one log with timestamp now, how about trying axfr with dig again bind10 server?
Thanks
Likun
More information about the bind10-dev
mailing list