nanog mailing list archives

DNS pulling BGP routes?

From: Michael Thomas <mike () mtcc com>
Date: Wed, 6 Oct 2021 10:42:21 -0700

So if I understand their post correctly, their DNS servers have the ability to withdraw routes if they determine the routes are sub-optimal (fsvo). I can certainly understand the DNS servers not giving answers they think are unreachable, but there is always the problem that the servers themselves may be partitioned rather than the routes being bad. At a minimum, I would think they'd need some consensus protocol that says that it's broken across multiple servers.
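
To make the consensus idea concrete, here is a minimal sketch of what I mean, and it is purely an assumption on my part: nothing in their post describes such a mechanism. The function name, the vote format and the majority threshold are all hypothetical. Each server only counts votes it actually received, so a server that is itself partitioned cannot reach a majority on its own and keeps its route announced.

```python
import math
from typing import Mapping


def quorum_says_broken(
    votes: Mapping[str, bool],
    total_servers: int,
    quorum_fraction: float = 0.5,
) -> bool:
    """Decide whether enough servers agree that the backend is broken.

    votes maps a server name to True if that server reports the backend
    as unreachable. Servers we could not hear from are simply absent and
    therefore never count toward the quorum (hypothetical design choice).
    """
    if total_servers <= 0:
        raise ValueError("total_servers must be positive")
    # Strictly more than quorum_fraction of ALL servers must report "broken".
    needed = math.floor(total_servers * quorum_fraction) + 1
    broken_votes = sum(1 for is_broken in votes.values() if is_broken)
    return broken_votes >= needed


# A partitioned server that only has its own vote: keep announcing.
print(quorum_says_broken({"dns1": True}, total_servers=5))

# Three of five servers independently report the backend as broken.
print(quorum_says_broken(
    {"dns1": True, "dns2": True, "dns3": True, "dns4": False},
    total_servers=5,
))
```

Even with something like this, a failure that every server sees at once would still pass the vote, so it only addresses the single-partitioned-server case.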

But I just don't understand why this is a good idea at all. Network topology is not DNS's bailiwick, so using it as a trigger to withdraw routes seems really strange and fraught with unintended consequences. Why is it a good idea to withdraw the route if it doesn't seem reachable from the DNS server? Give answers that are reachable, sure, but to actually make a topology decision? Yikes. And what happens to the cached answers that still point to the supposedly dead route? They're going to fail until the TTL expires anyway, so why is it preferable to withdraw the route too?

My guess is that their post, while clearer than most, doesn't go into enough detail, but is it me or does it seem like this is a really weird thing to do?


Mike


On 10/5/21 11:56 PM, Bjørn Mork wrote:

Masataka Ohta <mohta () necom830 hpcl titech ac jp> writes:

As long as name servers with expired zone data won't serve
request from outside of facebook, whether BGP routes to the
name servers are announced or not is unimportant.

I am not convinced this is true.  You'd normally serve some semi-static
content, especially wrt stuff you need yourself to manage your network.
Removing all DNS servers at the same time is never a good idea, even in
the situation where you believe they are all failing.

The problem is of course that you can't let the servers take the
decision to withdraw from anycast if you want to prevent this
catastrophe.  The servers have no knowledge of the rest of the network.
They only know that they've lost contact with it.  So they all make the
same stupid decision.

But if the servers can't withdraw, then they will serve stale content if
the data center loses backbone access. And with a large enough network
then that is probably something which happens on a regular basis.

This is a very hard problem to solve.

Thanks a lot to facebook for making the detailed explanation available
to the public.  I'm crossing my fingers hoping they follow up with
details about the solutions they come up with.  The problem affects any
critical anycast DNS service. And it doesn't have to be as big as
facebook to be locally critical to an enterprise, ISP or whatever.



Bjørn
