Interesting People mailing list archives

Re: the undead urban myth of the LOC/EID split NOT AN EASY READ

From: David Farber <dave () farber net>
Date: Mon, 3 Nov 2008 19:03:35 -0500



Begin forwarded message:

From: Richard Bennett <richard () bennett com>
Date: November 3, 2008 6:33:21 PM EST
To: dave () farber net

Subject: Re: [IP] the undead urban myth of the LOC/EID split NOT AN EASY READ


Dave -

Feel free to share this with IP if you wish.

I read John's book this weekend, in electronic form from the Santa Clara County Library in Silicon Valley. Having read most of the books ever written on the Internet, both of the technical variety and the public policy primers, and having been involved in protocol standards from the 1980s to the present, I feel I can say with reasonable confidence that "Patterns in Network Architecture" is the most important book on network protocols in general and the Internet in particular ever written. As the passage below indicates, it's not easy going for the non-technical crowd, who will certainly find much of the discussion excessively detailed. But John places the protocols in their proper socio-historical context for the first time. Readers, even the uninitiated, should take away from the book an appreciation for the fact that network architecture is as much a political exercise as a technical one, and always has been.

At a time when public policy makers are literally inundated with opinion about the Internet's design and social implications, it's important to peel away the metaphors and analogies and take a look at how it really works, what it does, what it doesn't do, what it could do a lot better, and how it got the way it is. John Day blazes a trail to that kind of understanding. It's an excellent book, even though I may disagree with some of his analysis of the Early Wittgenstein and a few other things.

Regarding the discussion below, it may be easier to follow if we take the example of multi-homing or mobility and trace it through IP address assignment, path discovery, and transit, contrasting what we'd like to see with what we do see. In the present incarnation, we see the problem begins with IP address assignment to a MAC address, continues with DNS pointing to a location, continues with BGP advertising a route to a location, and ends with some sort of redirection. That's IP. In XNS, the process is a bit different, and that difference highlights the problem with IPv4 that is only exacerbated in IPv6.


RB

David Farber wrote:

Begin forwarded message:

From: John Day <jeanjour () comcast net>
Date: November 3, 2008 10:13:04 AM EST
To: David Farber <dave () farber net>, Jonathan Smith <jms () cis upenn edu>
Cc: day () bu edu, David Meyer <dmm () 1-4-5 net>
Subject: Re: [IP] Re: the undead urban myth of the LOC/EID split
Possibly for the IP list. O'Dell thinks this is too much an "inside" account for the list. I will let you be the judge. It is not an easy topic. There is no simple explanation, especially between loc/id split and POA/node.
Would appreciate your thoughts.

John
Let me try to explain the addressing problem. I thought all of this was common knowledge at least among the old timers.
We first realized we had a problem with naming and addressing in the ARPANET in 1972, when Tinker AFB joined the Net. They wanted redundant IMP connections. I remember Grossman coming in one morning and telling me this. My first thought was, "Right, good idea!", and 2 seconds later, "O, *&@##, that isn't going to work!"
Host addresses were IMP port numbers, so with 2 interfaces on 2 different IMPs, Tinker would look like 2 hosts to the network, not one. Tinker's host would know it had two connections, but the network would think it had one connection to each of two different hosts. This is, of course, the multihoming problem.
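As an illustration only (not part of the original message), the mismatch can be sketched in a few lines of Python. The IMP numbers, port numbers, and host names below are made up; the only thing taken from the text is that a host address was an IMP port number.

```python
# Hypothetical attachments: host name -> list of (imp_number, port) pairs.
# The numbers are invented for illustration.
attachments = {
    "tinker": [(34, 0), (41, 2)],  # one host wired to two different IMPs
    "other-site": [(7, 1)],        # an ordinary single-homed host
}

def host_view(attachments):
    """What the hosts know: one entry per real system."""
    return set(attachments)

def network_view(attachments):
    """What the network sees when the address IS the IMP port:
    every (IMP, port) pair looks like a separate host."""
    return {addr for addrs in attachments.values() for addr in addrs}

print(len(host_view(attachments)), "real hosts")
print(len(network_view(attachments)), "hosts as far as the network can tell")
```

Nothing in the address ties `(34, 0)` and `(41, 2)` together, so traffic sent to one cannot be redirected to the other when a link fails.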
Had we blown it? No, there were a lot of things we didn't do in that first attempt! We had a lot more important problems on our plate. In those days, just moving data between very different computers was a major accomplishment. We knew the naming stuff was hard and this was an experiment. We could deal with that later. Yea, yea, I know. Famous last words! ;-)
But the answer was obvious. We were all OS guys. We had seen this problem before. We needed a logical address space over the physical address space. And we also knew that we needed application names as well. Just as OSs require 3 levels of names, networks would too. This well-known socket business we had done was just a kludge so we could demonstrate the first 3 applications we had up and running. Multihoming was a symptom of a much more fundamental missing piece of the overall design. But we would get to it sooner or later. (Right, more famous last words.)
It didn't seem like a big deal. Certainly not enough to bother writing a paper on it. For some reason, it took 10 years before Jerry Saltzer wrote it up and published it, later circulated as RFC 1498. Jerry got it right except for one little piece, which hadn't happened yet. He describes three levels of names for different things at different layers in a network architecture:
Application names, which are location independent,
Node addresses, which are location dependent,
Point of attachment (POA) addresses, which may or may not be location dependent, and
mappings between them.

In general, the scope of the layers increases as you go up.
(Draw a picture or see the figures in my book. It will be easier to visualize what is coming. Don't label the layers, we don't care what they are called.)
We have called the function that maps between application names and node addresses a directory function. (Not to be confused with X.500. The terminology was in use a decade or more before that.)
The mapping of node to POA is generally part of routing. In this scheme routes are sequences of node addresses. This we had understood since 1972. I say "we" meaning people I worked around. Clearly not everyone did. This is what you get for assuming it is obvious. ;-) (BTW, for the curmudgeons, I am not claiming I came up with this before Saltzer. Quite the opposite, I am claiming that several of us saw the broad outlines of what was needed. It took Saltzer to make it concrete. Although I wish he had been a little more concrete about what a POA and node address were.)
So the problem with the ARPANET/Internet is that we name the Point of Attachment (twice), but nothing else. Why twice? The MAC address does the same thing. They both name the interface between the wire and the system. Until CIDR it was no harder to route on MAC addresses than on IP addresses, since they weren't addresses anyway, i.e. they weren't location-dependent. While we have something that is sort of an application name in URLs, it isn't really. There is too much "path" in a URL to be an application name. (More on this later.)
Around this time, we learned a few other things about the problem:
1) Addresses only had to be unambiguous within the scope of the layer in which they were used.
2) Naming the host was irrelevant to the naming and addressing problem as far as communications was concerned. A host name might be useful for network management problems but it was merely coincidental to the communications problem. For communications, one is at least naming the protocol state machine. Thinking of it as a host name implied constraints that would only get in the way.
3) Embedding a lower layer address in a higher layer address made it route dependent, which is what we needed to avoid (see below).
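Point 3 can be made concrete with a small Python sketch. It is only an illustration: the `embedded_address` helper, the `net/` prefix, and the MAC-style strings are hypothetical and do not correspond to any real address format.

```python
def embedded_address(layer_prefix, poa_address):
    """Build a higher-layer address by embedding the lower-layer one."""
    return f"{layer_prefix}/{poa_address}"

# One node with two points of attachment (made-up identifiers).
poas = ["mac-00:11", "mac-00:22"]

# Embedding: the "node" ends up with one address per interface, so each
# address really names a path through a particular interface.
embedded = {embedded_address("net", p) for p in poas}

# Independent assignment: one node address, mapped to its POAs separately.
independent_map = {"node-17": poas}

print(sorted(embedded))
print(independent_map)
```

With embedding, the higher-layer address changes whenever the interface does; with an independent node address, only the mapping changes.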
Many of us had always known that the ARPANET/Internet was incomplete. We didn't fix it with IPv4 because (I think) we felt that we didn't really have enough understanding of the whole naming and addressing problem yet (this was 1976 or so) and we didn't want to fix it the wrong way. Anyway, this was still mostly an experimental network. It wasn't meant to be in production. We could do that later.
This is why, starting around 1980, the small group in OSI that was doing connectionless insisted the network layer would name the node. It wasn't a phone company thing (clearly not!!), it was fixing something from the early ARPANET that we had not had an opportunity to fix yet. Mostly it was Internet people who understood and pushed it in OSI, not the Europeans. Several European positions wanted OSI to have well-known sockets and name the interface. I made sure it didn't creep into the Reference Model and Lyman, Oran, Piscatello, etc. made sure it wasn't in the protocol.
This, of course, was all thrown out the window by the IPng process, which insisted that we go ahead with half a naming and addressing architecture. (At the time, I don't think there were 2 dozen people in the IETF who understood naming and addressing. The failure of a University education.) I have never understood the IETF's reaction to these things. Rather than "you blew it, let us show you how to do that right," their reaction has been: if They did it, we won't, even if it means cutting off your nose to spite your face. The sociologists will probably explain it to us some day.
Once it was decided that the IPng would name the interface, we were pretty well stuck on the road to where we are today. Not to put words in O'Dell's mouth, but I always thought 8+8 was an attempt at some sort of fix, even if it was a kludge, given that they wouldn't do it right, and perhaps later we could move that closer to right. However, they wouldn't even do 8+8.
The early drafts of the OSI Model also made the error of building the (N)-address from the (N-1)-address, like embedding MAC addresses in v6. (This is one of those things that looks obvious on the surface and when you get into it, you realize is just plain wrong, a bit like Aristotelian physics: Seems like common sense until you test it.) We uncovered that problem around 82 doing the Naming and Addressing Addendum to the RM and fixed it. Why this is a problem in networks and not in OSs is also in the book. Suffice it to say here that this makes the address into a *pathname* through the stack. Path dependent just at the point it shouldn't be. Makes it into naming the interface even if you thought you weren't. (Now some of you will say, but I don't have to interpret it that way. It still will name the node. Correct. If *everyone* obeys the rules. But some hot-shot is going to assume he knows better and then complain like hell when his thing doesn't work somewhere. Best way to keep them honest is not let them be dishonest.)
The one thing you don't want in a network. It works in an OS because there is only one way to get anywhere. But in a network (even in a network stack) there may be more than one way to get somewhere. So addresses in different layers have to be completely independent to preserve path independence. Which brings us to the piece that was missing in Saltzer's analysis:
The missing piece that hadn't happened when Saltzer wrote was multi-path routing: more than one path to the next hop. This turns out to be one of those little things that opens up considerable insight. If we include this in his model, then we need the node to POA mapping for all NEAREST neighbors. So calculating a route is *logically*: calculate the route to the destination using the routing table information, find the next hop, then choose which path to get to the next hop.
Clearly you don't build it this way. You create a forwarding table and use it the way you do now. Although there is no reason one might not do a forwarding table update that just changes the node to POA mapping without recalculating routes.
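The logical steps above (route on node addresses, find the next hop, then pick a path to that hop) can be sketched in Python. This is a toy under stated assumptions, not how a router is built: the four-node topology, the interface names, and the use of breadth-first search as the route calculation are all choices made for the illustration.

```python
from collections import deque

# Hypothetical node-level topology: node address -> neighbouring node addresses.
links = {"A": ["B", "C"], "B": ["A", "D"], "C": ["A", "D"], "D": ["B", "C"]}

# Node-to-POA mapping for nearest neighbours only:
# (this node, neighbour) -> interfaces that reach that neighbour.
neighbour_poas = {("A", "B"): ["A-if0", "A-if1"], ("A", "C"): ["A-if2"]}

def next_hop(src, dst, links):
    """Steps 1 and 2: compute a route over node addresses (BFS here)
    and return the first hop on it, or None if there is none."""
    if src == dst:
        return None
    parent = {src: None}
    queue = deque([src])
    while queue:
        node = queue.popleft()
        if node == dst:
            break
        for nbr in links[node]:
            if nbr not in parent:
                parent[nbr] = node
                queue.append(nbr)
    if dst not in parent:
        return None
    hop = dst
    while parent[hop] != src:
        hop = parent[hop]
    return hop

def forward(src, dst):
    """Step 3: choose which path (POA) to use to reach the next hop."""
    hop = next_hop(src, dst, links)
    poas = neighbour_poas[(src, hop)]
    if not poas:
        raise RuntimeError(f"no working path from {src} to {hop}")
    return hop, poas[0]

print(forward("A", "D"))
# An interface fails: only the node-to-POA mapping is updated.
neighbour_poas[("A", "B")].remove("A-if0")
print(forward("A", "D"))  # same next hop, different path to it
```

The second call returns the same next hop without touching `links`, which is the point made above about updating the node to POA mapping without recalculating routes.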
But what is interesting is that this mapping (node to POA of nearest neighbors) is exactly the same as the application name to node address mapping, i.e. the directory. Those are all *nearest neighbors* at that layer too! The whole structure is relative. One layer's node address is the point of attachment for the layer above. And it repeats. (That is what AS numbers were trying to tell you.) Although not necessarily in the obvious way.
With a structure like this, mobility is nothing more than dynamic multihoming. And several other things fall out easily, again see the book.
So here we are, 15 years after IPng, and v6 doesn't solve any of these problems. No surprise. It was purposely designed not to solve any of these problems.
Some have noted that the IPv6 group thought this was just a data plane problem and ignored the so-called control plane. (Sorry, but I balk at the use of this phone company terminology, it confuses issues.) What sheer incompetence! As Radia points out in the 2nd Edition of her book, if you don't like NATs, you should have adopted CLNP. It was already in the routers. In other words, we could have spent the last 15 years on transition instead of on a monumental waste of money, time, and effort. Anyone who tried to explain these problems to the IPv6 group was simply labeled a sore loser.
Throughout the late 80s and 90s, if there was discussion of addressing, someone (usually from MIT) would say, you have to read Saltzer's paper. During the NSRG meetings in 2001-2 it was brought up frequently. Then suddenly it was dropped. Never mentioned. When I pressed Noel on it not long ago, he said "they had moved beyond it." Seemed strange since loc/id was clearly not an answer, not a step to a solution. At least Saltzer looked at the whole architecture, while loc/id only looked at Network/Transport.
It begins to seem that Loc/id split had been invented so they don't have to admit they were wrong and simply name the node and get on with it. They seem to have an inkling that they had missed something important with v6 and they were desperately trying to find a way to retrofit it before it was too late. The trouble is loc/id split isn't the whole problem. Loc/id split (as near as I can tell) still does not name the node, but some application-flow-endpoint. Whatever it is, a node address is necessary, and it will need to be location-dependent and aggregateable, and it isn't.
So what is really wrong with loc/id split? Let's look at it. If the IP address (the loc) remains a POA on which we do routing and, giving them the benefit of the doubt, the id is a node address (in some papers the "id" seems to be more an application-connection-endpoint or something similar), then the loc is the provider dependent identifier and the id is the provider independent name. But it is flat. If multihoming is widespread it is likely that several endsystems in the same area will be using the same different providers for multihoming. Aren't the routers going to want to be able to aggregate the look ups for these to figure out where to send them? Not if the id is based on a flat name. Remember the relation of POA and node is relative. What is needed for one is going to be true for the other. Using a flat id assumes that it won't be needed much. But what we are seeing is that multihoming is becoming very widespread and I don't think we have seen anything near the end of it. The thing is that the node address (id) must be aggregateable as well. In any case, to build in an identifier at this level that does not facilitate scaling seems as short-sighted as v6 was to begin with.
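The aggregation argument can be illustrated with a short Python sketch. It is a stand-in only: IPv4 prefixes and the standard `ipaddress` module are used here as a familiar example of a location-dependent, aggregateable address space, and the flat identifiers and the `next-hop-X` label are invented.

```python
import ipaddress

def aggregated_entries(prefixes):
    """Collapse adjacent prefixes into the fewest covering prefixes."""
    nets = [ipaddress.ip_network(p) for p in prefixes]
    return list(ipaddress.collapse_addresses(nets))

# Four endsystems in the same area, addressed hierarchically (made-up prefixes).
hierarchical = ["10.1.0.0/26", "10.1.0.64/26", "10.1.0.128/26", "10.1.0.192/26"]

# The same four endsystems named by flat ids (made-up values).
flat_ids = ["id-7f3a", "id-02c9", "id-e415", "id-9b60"]
flat_table = {i: "next-hop-X" for i in flat_ids}

print(aggregated_entries(hierarchical))  # one entry covers all four
print(len(flat_table))                   # flat ids: one entry each, nothing to collapse
```

Flat names carry no structure a router could use to summarize them, so the lookup table grows with the number of multihomed endsystems rather than with the number of areas.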
But now is it too late? At least for IPv6. The Internet architecture has been fundamentally flawed from the beginning. To be fair, it is a demo that never got finished. Basically this is like trying to build an OS for a huge set of applications with no virtual address space or application name space. Or as I say in the book, what we have is DOS, what we need is Multics, but we would settle for UNIX. The Internet architecture is equivalent to DOS.
I hope this helps.  The medium makes it a bit hard to explain.

Take care,
John




-------------------------------------------
Archives: https://www.listbox.com/member/archive/247/=now
RSS Feed: https://www.listbox.com/member/archive/rss/247/
Powered by Listbox: http://www.listbox.com


--
Richard Bennett




