nanog mailing list archives

Re: BGP Failover Question


From: Max Pierson <nmaxpierson () gmail com>
Date: Mon, 21 Feb 2011 15:44:32 -0600

Save yourself the headache and find a new provider that knows how to handle
BGP

I've had this happen with providers that do know how to handle BGP. Just
because you peer with 3356, 701, etc, doesn't mean operators can't make a
mistake. I've even seen this happen due to some wierd BGP behavior caused by
some cool new "features".

IMHO, better to plan for it and deploy it as a policy (by whatever means).

M

On Mon, Feb 21, 2011 at 3:35 PM, Seth Mattinen <sethm () rollernet us> wrote:

On 2/21/2011 13:10, Chris Wallace wrote:
I am looking for some help with an issue we recently had with one of our
BGP peers recently.  I currently have two DIA providers each terminated into
their own edge router and I am doing iBGP to exchange routes between the two
edge routers.  Last week Provider A made a policy change "somewhere" in
their network in the middle of the day causing traffic to stop routing.  Of
course this connection happens to be the preferred route for the majority of
our inbound and outbound traffic.  I never saw our physical link go down and
never saw our peer drop therefore BGP did not stop advertising routes, this
caused most of our customers traffic to go nowhere.  In order to fix the
issue I had to manually shutdown the peer till Provider A confirmed the
change they made had been reverted.  This isn't the first time we have seen
this issue with our various providers, how can I prevent issues like this
from happening in the future?



I had a provider like that a long time ago; it was an ATG T1 (which was
fine) but when they were bought by Eschelon the exact problem you're
describing would happen every other month like clockwork. The first time
was forgivable. The second time I was annoyed. After the third I was
angry, unplugged it, and told them to stuff it because apparently they
didn't know how to deal with BGP.

You can't prevent it from happening. You can only come up with band-aids
to notify you. Save yourself the headache and find a new provider that
knows how to handle BGP. What happens if the other circuit is not
available (outage, planned maintenance, etc.) at the same time the
problem one decides to black hole you? If you're facing the same
repeating problem they are obviously not the best fit for you.

~Seth




Current thread: