nanog mailing list archives

Re: shim6 @ NANOG (forwarded note from John Payne)

From: Kevin Day <toasty () dragondata com>
Date: Tue, 28 Feb 2006 19:16:02 -0600



On Feb 28, 2006, at 1:22 PM, Iljitsch van Beijnum wrote:

On 28-feb-2006, at 17:09, Kevin Day wrote:
4) Being able to do 1-3 in realtime, in one place, without waiting for DNS caching or connections to expire
How fast is real time?
And are we just talking about changing preferences here, or about what happens when there are outages?


5-30 seconds? Including already established connections.

"Oh, crap. We're going over our commit on provider C because of a traffic surge on one of our sites. We need to rebalance this before we get dinged for 95th percentile overage."

"Packet loss to AS1234 through provider A suddenly skyrocketed. We need to bypass A to that ASN until it's fixed."

"1 of the 2 lines in our trunk to provider B went down, we're at half bandwidth. We need to shed some load immediately."

We also have incredibly long TCP sessions for some of our services (streaming video/audio). We need to be able to make routing changes while those are active, without relying on a keepalive failing to make the hosts re-evaluate their path decision. If I'm a VOIP provider, I can't wait for someone to hang up a phone call for new routing policy to take effect. A VPN provider could have sessions open for days/weeks.
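To make the 95th percentile overage scenario above concrete, here is a minimal sketch of how burstable billing is commonly computed: take periodic utilization samples, discard the top 5%, and bill on the highest remaining sample. The function names, the Mbps units, and the exact rounding rule are assumptions for illustration; each provider's contract defines its own sampling interval and method.

```python
def percentile_95(samples_mbps):
    """Return the 95th percentile sample (nearest-rank method).

    Assumption: samples are utilization readings in Mbps taken at a
    fixed interval over the billing period.
    """
    if not samples_mbps:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_mbps)
    n = len(ordered)
    # Nearest-rank index: ceil(0.95 * n) - 1, done in integer arithmetic.
    index = (95 * n + 99) // 100 - 1
    return ordered[index]


def overage_mbps(samples_mbps, commit_mbps):
    """Billable traffic above the commit, or 0 if under it."""
    return max(0, percentile_95(samples_mbps) - commit_mbps)
```

With 19 samples at 100 Mbps and one burst at 900 Mbps, the burst falls into the discarded top 5% and the billable rate stays at 100. A surge that lasts for more than 5% of the billing period moves the billable rate, which is why the rebalancing has to happen within seconds or minutes rather than hours.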

We make extensive use of near-immediate routing changes on both inbound and outbound, relying on the fact that they take effect immediately. No matter where we put the routing information, how are the end nodes that are now making the routing decisions going to see the changes quickly? And how do they see changes for already established connections?

Anything done in DNS is just too slow. As an example, take a busy/popular website. Put a 5 minute TTL on the records weeks in advance. Change the IP and watch how long it takes for 100% of the traffic to stop reaching the old IP. 90% within 1-3 hours, 99% within 24 hours. You'll still get hits to the old IP days later. Too many people blatantly disregard DNS caching, or just get it wrong.

5) Being able to make routing/policy changes without having to rely on the owners/administrators of the machines/sites/domains themselves to do the right thing. (i.e. untrusted/not-maintained-by-us systems/networks on our network)
If you're a multihomed hosting company you would want to do TE for your entire POP, but you wouldn't necessarily be able to change information in the DNS for all the hosts/services that your customers run. Is that what you mean?


Exactly. More detail in my followup message.

6) Anycast?
I don't think shim6 applies to interdomain anycast. (Which is a hack anyway.)

Well, it's a hack that many people are using. If we can't do anycast after we migrate to IPv6, that again raises the bar of transitioning.

7) During what will be a very lengthy dual-stack transitional period, having to do TE in two entirely different ways. BGP+Prepending+Selective-announcements alongside Shim6 doesn't really sound like fun to me. We can't treat bits as bits, we have to consider if they're IPv4 bits or IPv6 bits, and engineer them differently, even though they're sharing the same lines and are probably going to have a 1:1 addressing relationship between IPv4 and IPv6 services.
:-)

This is a result of the transition to IPv6, regardless of shim6.

It is, but it's one more thing in the list of "We have to do things differently, and it's questionable if it's better - if not flat out worse" things about moving to IPv6. From a hosting company's standpoint:


Pros:

1) Virtually unlimited IP space

Cons:

1) Even if you qualified for PI space in IPv4, unless you're huge, you're not getting PI space in IPv6. Want to change providers? You're renumbering all of your customers.

2) If you do need to move, your new provider can't temporarily announce your space from your old provider, which is possible now.

3) No matter how easily configurable IPv6 makes renumbering, you are going to have customers leave rather than deal with readdressing. Some just won't respond/do anything at all no matter how much you harass them that they need to take an action. "Big" hosting companies who do enough connectivity sales to justify PI space get the upper hand.

4) Once you publish AAAA records, every user who has broken their IPv6 stack on their desktop (even if they don't have IPv6 connectivity at all) suddenly can't reach you.

5) The only proposal that looks like it has any traction at all to multihome (shim6) requires trust in customers to administer their boxes to our instructions a lot more closely, and/or requires control over DNS for each site we host.

6) If you do get PI space, the mantra of "Announce only/exactly what you were allocated. No more specifics. No deaggregation." requires a complete redesign of how a lot of us do things.


And now adding shim6 to the mix:

7) You can't run BGP or traffic engineer your network the way you're doing with IPv4. You now have two places you have to make routing policy decisions, and they're done in completely different ways.

8) If you're using shim6, public/private peering is probably not possible either. (And yes, there are those who participate in peering arrangements who don't provide transit to others, and wouldn't qualify for PI space)

The "migrate to IPv6" pain vs. benefit ratio for those actually running the content side of the internet is pretty poor at the moment. I don't think you'll be finding many doing it willingly at this stage, or in the foreseeable future.

And don't confuse this with laziness or some dislike of IPv6. I went into our transition attempt really wanting to make this work, and eventually dropped it because it would require too many business-model changing transitions to do so.

On top of those, even if shim6 accomplishes the failover and reliability goals, I can't see how shim6 is going to make path decisions as optimal as IPv4/BGP/etc.
Really??? The way I see it, BGP decisions today are mediocre at best. If anything, I would expect things to get better with shim6.

BGP has the benefit of each network in the middle being able to add their say into things. Each transit network can prepend/localpref/med/etc to produce an end-to-end decision. Shim6 presents both ends with multiple choices, but little in the way of information as to which one to prefer. It's also moving the decision making into LOTS of equipment, instead of the borders. Any fancy ideas we come up with to make better decisions have to be deployed everywhere, and possibly on equipment we don't control.

BGP allows information to be added to the routing decision making process that isn't visible from each end. We're making use of that now.
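As a rough illustration of how prepend/localpref/med feed into that decision, here is a heavily simplified sketch of BGP best-path selection: highest local preference first, then shortest AS path, then lowest MED. The `Route` class, the provider labels, and the example ASNs other than AS1234 are hypothetical. Real implementations apply more tie-breakers, and by default compare MED only between routes learned from the same neighboring AS.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class Route:
    next_hop: str               # hypothetical label for the upstream
    local_pref: int             # set by local policy; higher wins
    as_path: Tuple[int, ...]    # prepending makes this longer
    med: int = 0                # hint from the neighbor; lower wins


def best_route(routes: Sequence[Route]) -> Route:
    """Pick a route using only three of BGP's decision steps (simplified)."""
    if not routes:
        raise ValueError("no candidate routes")
    return min(routes, key=lambda r: (-r.local_pref, len(r.as_path), r.med))
```

For example, if the path to AS1234 via provider B carries two extra prepends, the shorter path via provider A wins at equal local preference. Raising the local preference on B's route overrides that, which is the kind of border-only change that takes effect for all traffic at once.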


-- Kevin
