Network Working GroupInternet Engineering Task Force (IETF) D. FarinacciInternet-DraftRequest for Comments: 6830 Cisco Systems Category: Experimental V. FullerIntended status: ExperimentalISSN: 2070-1721 D. MeyerExpires: May 17, 2013D. LewisciscoCisco SystemsNovember 13, 2012January 2013 The Locator/ID Separation Protocol (LISP)draft-ietf-lisp-24Abstract Thisdraftdocument describes anetwork layer basednetwork-layer-based protocol that enables separation of IP addresses into two new numbering spaces: Endpoint Identifiers (EIDs) and Routing Locators (RLOCs). No changes are required to either host protocol stacks or to the "core" of the Internet infrastructure.LISPThe Locator/ID Separation Protocol (LISP) can be incrementally deployed, without a "flag day", and offerstraffic engineering, multi-homing,Traffic Engineering, multihoming, and mobility benefits to early adopters, even when there are relatively few LISP-capable sites. Design and development of LISP was largely motivated by the problem statement produced by the October 2006 IAB Routing and Addressing Workshop. Status ofthisThis Memo ThisInternet-Draftdocument issubmitted in full conformance with the provisions of BCP 78not an Internet Standards Track specification; it is published for examination, experimental implementation, andBCP 79. Internet-Drafts are working documentsevaluation. This document defines an Experimental Protocol for the Internet community. This document is a product of the Internet Engineering Task Force (IETF).Note that other groups may also distribute working documents as Internet-Drafts. The listIt represents the consensus ofcurrent Internet- Drafts is at http://datatracker.ietf.org/drafts/current/. Internet-Drafts are draft documents validthe IETF community. It has received public review and has been approved for publication by the Internet Engineering Steering Group (IESG). Not all documents approved by the IESG are amaximumcandidate for any level of Internet Standard; see Section 2 of RFC 5741. Information about the current status ofsix monthsthis document, any errata, and how to provide feedback on it may beupdated, replaced, or obsoleted by other documentsobtained atany time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." This Internet-Draft will expire on May 17, 2013.http://www.rfc-editor.org/info/rfc6830. Copyright Notice Copyright (c)20122013 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License. Table of Contents 1. Introduction ....................................................3 2. Requirements Notation. . . . . . . . . . . . . . . . . . . . 5 2. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 6...........................................5 3. Definition of Terms. . . . . . . . . . . . . . . . . . . . . 8.............................................5 4. Basic Overview. . . . . . . . . . . . . . . . . . . . . . . . 14.................................................10 4.1. Packet Flow Sequence. . . . . . . . . . . . . . . . . . . 16......................................13 5. LISP Encapsulation Details. . . . . . . . . . . . . . . . . . 18.....................................15 5.1. LISP IPv4-in-IPv4 Header Format. . . . . . . . . . . . . 19...........................16 5.2. LISP IPv6-in-IPv6 Header Format. . . . . . . . . . . . . 19...........................17 5.3. Tunnel Header Field Descriptions. . . . . . . . . . . . . 21..........................18 5.4. Dealing with Large Encapsulated Packets. . . . . . . . . 25...................22 5.4.1. A Stateless Solution to MTU Handling. . . . . . . . . 25...............22 5.4.2. A Stateful Solution to MTU Handling. . . . . . . . . 26................23 5.5. Using Virtualization and Segmentation with LISP. . . . . 26...........24 6. EID-to-RLOC Mapping. . . . . . . . . . . . . . . . . . . . . 28............................................25 6.1. LISP IPv4 and IPv6Control PlaneControl-Plane Packet Formats. . . . . 28...........25 6.1.1. LISP Packet Type Allocations. . . . . . . . . . . . . 30.......................27 6.1.2. Map-Request Message Format. . . . . . . . . . . . . . 30.........................27 6.1.3. EID-to-RLOC UDP Map-Request Message. . . . . . . . . 33................30 6.1.4. Map-Reply Message Format. . . . . . . . . . . . . . . 34...........................31 6.1.5. EID-to-RLOC UDP Map-Reply Message. . . . . . . . . . 38..................35 6.1.6. Map-Register Message Format. . . . . . . . . . . . . 40........................37 6.1.7. Map-Notify Message Format. . . . . . . . . . . . . . 42..........................39 6.1.8. Encapsulated Control Message Format. . . . . . . . . 43................41 6.2. Routing Locator Selection. . . . . . . . . . . . . . . . 45.................................42 6.3. Routing Locator Reachability. . . . . . . . . . . . . . . 47..............................44 6.3.1. Echo Nonce Algorithm. . . . . . . . . . . . . . . . . 49...............................46 6.3.2.RLOC ProbingRLOC-Probing Algorithm. . . . . . . . . . . . . . . . 50.............................48 6.4. EID Reachability within a LISP Site. . . . . . . . . . . 51.......................49 6.5. Routing Locator Hashing. . . . . . . . . . . . . . . . . 52...................................49 6.6. Changing the Contents of EID-to-RLOC Mappings. . . . . . 53.............50 6.6.1. Clock Sweep. . . . . . . . . . . . . . . . . . . . . 54........................................51 6.6.2. Solicit-Map-Request (SMR). . . . . . . . . . . . . . 54..........................52 6.6.3. DatabaseMap Versioning . . . . . . . . . . . . . . . 56Map-Versioning ............................53 7. Router Performance Considerations. . . . . . . . . . . . . . 57..............................54 8. Deployment Scenarios. . . . . . . . . . . . . . . . . . . . . 58...........................................55 8.1.First-hop/Last-hopFirst-Hop/Last-Hop Tunnel Routers. . . . . . . . . . . . 59.........................56 8.2. Border/Edge Tunnel Routers. . . . . . . . . . . . . . . . 59................................56 8.3. ISPProvider-EdgeProvider Edge (PE) Tunnel Routers. . . . . . . . . . 60.....................57 8.4. LISP Functionality with Conventional NATs. . . . . . . . 60.................58 8.5. Packets Egressing a LISP Site. . . . . . . . . . . . . . 61.............................58 9. Traceroute Considerations. . . . . . . . . . . . . . . . . . 62......................................58 9.1. IPv6 Traceroute. . . . . . . . . . . . . . . . . . . . . 63...........................................59 9.2. IPv4 Traceroute. . . . . . . . . . . . . . . . . . . . . 63...........................................60 9.3. TracerouteusingUsing Mixed Locators. . . . . . . . . . . . . 63...........................60 10. Mobility Considerations. . . . . . . . . . . . . . . . . . . 65.......................................61 10.1. Site Mobility. . . . . . . . . . . . . . . . . . . . . . 65............................................61 10.2. Slow Endpoint Mobility. . . . . . . . . . . . . . . . . . 65...................................61 10.3. Fast Endpoint Mobility. . . . . . . . . . . . . . . . . . 65...................................61 10.4. Fast Network Mobility. . . . . . . . . . . . . . . . . . 67....................................63 10.5. LISP Mobile Node Mobility. . . . . . . . . . . . . . . . 67................................64 11. Multicast Considerations. . . . . . . . . . . . . . . . . . . 69......................................64 12. Security Considerations. . . . . . . . . . . . . . . . . . . 70.......................................65 13. Network Management Considerations. . . . . . . . . . . . . . 72.............................67 14. IANA Considerations. . . . . . . . . . . . . . . . . . . . . 73...........................................67 14.1. LISP ACT and Flag Fields. . . . . . . . . . . . . . . . . 73.................................67 14.2. LISP Address Type Codes. . . . . . . . . . . . . . . . . 73..................................68 14.3. LISP UDP Port Numbers. . . . . . . . . . . . . . . . . . 74....................................68 14.4. LISP Key ID Numbers. . . . . . . . . . . . . . . . . . . 74......................................68 15. Known Open Issues and Areas of Future Work. . . . . . . . . . 75....................68 16. References. . . . . . . . . . . . . . . . . . . . . . . . . . 77....................................................70 16.1. Normative References. . . . . . . . . . . . . . . . . . . 77.....................................70 16.2. Informative References. . . . . . . . . . . . . . . . . . 78...................................71 Appendix A. Acknowledgments. . . . . . . . . . . . . . . . . . . 82 Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . . 97.......................................75 1. Introduction This document describes the Locator/Identifier Separation Protocol (LISP), which provides a set of functions for routers to exchange information used to map fromnon globally routeableEndpoint Identifiers (EIDs) that are not globally routable torouteableroutable Routing Locators (RLOCs). It also defines a mechanism for these LISP routers to encapsulate IP packets addressed with EIDs for transmission acrossthe< Interneta network infrastructure that uses RLOCs for routing and forwarding. Creation of LISP was initially motivated by discussions during the IAB-sponsored Routing and Addressing Workshop held in Amsterdam inOctober,October 2006 (see [RFC4984]). A key conclusion of the workshop was that the Internet routing and addressing system was not scaling well in the face of the explosive growth of new sites; one reason for this poor scaling is the increasing number ofmulti-homedmultihomed sites and other sites that cannot be addressed as part oftopologically-topology-based orprovider-basedprovider- based aggregated prefixes. Additional work that more completelydescribeddescribes the problem statement may be found in [RADIR]. A basic observation, made many years ago in early networking research such as that documented in [CHIAPPA] and [RFC4984], is that using a single address field for both identifying a device and for determining where it is topologically located in the network requires optimization along two conflicting axes: for routing to be efficient, the address must be assigned topologically; for collections of devices to be easily and effectively managed, without the need for renumbering in response to topological change (such as that caused by adding or removing attachment points to the network or by mobility events), the address must explicitly not be tied to the topology. The approach that LISP takes to solving the routing scalability problem is to replace IP addresses with two new types of numbers: Routing Locators (RLOCs), which are topologically assigned to network attachment points (and are therefore amenable to aggregation) and used for routing and forwarding of packets through the network; and Endpoint Identifiers (EIDs), which are assigned independently from the network topology, are used for numbering devices, and are aggregated along administrative boundaries. LISP then defines functions for mapping between the two numbering spaces and for encapsulating traffic originated by devices usingnon-routeablenon-routable EIDs for transport across a network infrastructure that routes and forwards using RLOCs. Both RLOCs and EIDs aresyntactically-syntactically identical to IP addresses; it is the semantics of how they are used that differs. This document describes the protocol that implements these functions. The databasewhichthat stores the mappings between EIDs and RLOCs is explicitly a separate "module" to facilitate experimentation with a variety of approaches. One database design that is being developed for experimentation as part of the LISP working group work is[ALT].[RFC6836]. Others that have been described include [CONS], [EMACS],[NERD].and [RFC6837]. Finally,[LISP-MS],[RFC6833] documents a general-purpose service interface for accessing a mapping database; this interface is intended to make the mapping database modular so that different approaches can be tried without the need to modify installedLISPLISP- capable devices in LISP sites. This experimental specification has areas that require additional experience and measurement. It is NOT RECOMMENDED for deployment beyond experimental situations. Results of experimentation may lead to modifications and enhancements of protocol mechanisms defined in this document. See Section 15 for specific, known issues that are in need of further work during development, implementation, and experimentation. An examination of the implications of LISP on Internet traffic, applications, routers, and security is for future study. This analysis will explain what role LISP can play in scalable routing and will also look at scalability and levels of state required for encapsulation, decapsulation, liveness, and so on. 2. Requirements Notation The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in [RFC2119]. 3. Definition of TermsProvider IndependentProvider-Independent (PI) Addresses: PI addresses are an address block assigned from a pool where blocks are not associated with any particular location in the network(e.g.(e.g., from a particular serviceprovider),provider) andisare therefore not topologically aggregatable in the routing system.Provider AssignedProvider-Assigned (PA) Addresses: PA addresses are an address block assigned to a site by each service provider to which a site connects. Typically, each block is a sub-block of a service provider Classless Inter-Domain Routing (CIDR) [RFC4632] block and is aggregated into the larger block before being advertised into the global Internet. Traditionally, IP multihoming has been implemented by eachmulti-homedmultihomed site acquiring itsown, globally-own globally visible prefix. LISP uses onlytopologically-assignedtopologically assigned and aggregatable address blocks for RLOCs, eliminating this demonstrably non-scalable practice. Routing Locator (RLOC):AAn RLOC is an IPv4 [RFC0791] or IPv6 [RFC2460] address of anegress tunnel routerEgress Tunnel Router (ETR).AAn RLOC is the output of an EID-to-RLOC mapping lookup. An EID maps to one or more RLOCs. Typically, RLOCs are numbered fromtopologically-topologically aggregatable blocks that are assigned to a site at each point to which it attaches to the global Internet; where the topology is defined by the connectivity of provider networks, RLOCs can be thought of as PA addresses. Multiple RLOCs can be assigned to the same ETR device or to multiple ETR devices at a site. Endpoint ID (EID): An EID is a 32-bit (for IPv4) or 128-bit (for IPv6) value used in the source and destination address fields of the first (most inner) LISP header of a packet. The host obtains a destination EID the same way it obtainsana destination address today, forexampleexample, through a Domain Name System (DNS) [RFC1034] lookup or SessionInvitationInitiation Protocol (SIP) [RFC3261] exchange. The source EID is obtained via existing mechanisms used to set a host's "local" IP address. An EID used on the public Internet must have the same properties as any other IP address used in that manner; this means, among other things, that it must be globally unique. An EID is allocated to a host from anEID-prefixEID-Prefix block associated with the site where the host is located. An EID can be used by a host to refer to other hosts. EIDs MUST NOT be used as LISP RLOCs. Note that EID blocks MAY be assigned in a hierarchical manner, independent of the network topology, to facilitate scaling of the mapping database. In addition, an EID block assigned to a site may have site-local structure (subnetting) for routing within the site; this structure is not visible to the global routing system. In theory, the bit string that represents an EID for one device can represent an RLOC for a different device. As the architecture is realized, if a given bit string is both an RLOC and an EID, it must refer to the same entity in both cases. When used in discussions with other Locator/ID separation proposals, a LISP EID will be calledaan "LEID". Throughout this document, any references to "EID"refersrefer to an LEID.EID-prefix:EID-Prefix: AnEID-prefixEID-Prefix is a power-of-two block of EIDswhichthat are allocated to a site by an address allocation authority.EID- prefixesEID-Prefixes are associated with a set of RLOC addresseswhichthat make up a "database mapping".EID-prefixEID-Prefix allocations can be broken up into smaller blocks when an RLOC set is to be associated with the largerEID-prefixEID-Prefix block. A globally routed address block (whether PI or PA) is not inherently anEID-prefix.EID-Prefix. A globally routed address block MAY be used by its assignee as an EID block. The converse is not supported. That is, a sitewhichthat receives an explicitly allocatedEID-prefixEID-Prefix may not use thatEID-prefixEID-Prefix as a globally routed prefix. This would require coordination and cooperation with the entities managing the mapping infrastructure. Once this has been done, that block could be removed from the globally routed IP system, if other suitable transition and access mechanisms are in place. Discussion of such transition and access mechanisms can be found in[INTERWORK][RFC6832] and [LISP-DEPLOY]. End-system: An end-system is an IPv4 or IPv6 device that originates packets with a single IPv4 or IPv6 header. The end-system supplies an EID value for the destination address field of the IP header when communicating globally(i.e.(i.e., outside of its routing domain). An end-system can be a host computer, a switch or router device, or any network appliance. Ingress Tunnel Router (ITR): An ITR is a router that resides in a LISP site. Packets sent by sources inside of the LISP site to destinations outside of the site are candidates for encapsulation by the ITR. The ITR treats the IP destination address as an EID and performs an EID-to-RLOC mapping lookup. The router then prepends an "outer" IP header with one of itsglobally-routableglobally routable RLOCs in the source address field and the result of the mapping lookup in the destination address field. Note that this destination RLOC MAY be an intermediate, proxy device that has better knowledge of the EID-to-RLOC mapping closer to the destination EID. In general, an ITR receives IP packets from site end-systems on one side and sends LISP-encapsulated IP packets toward the Internet on the other side. Specifically, when a service provider prepends a LISP header for Traffic Engineering purposes, the router that does this is also regarded as an ITR. The outer RLOC the ISP ITR uses can be based on the outer destination address (the originating ITR's supplied RLOC) or the inner destination address (the originatinghostshost's supplied EID). TE-ITR: A TE-ITR is an ITR that is deployed in a service provider network that prepends an additional LISP header for Traffic Engineering purposes. Egress Tunnel Router (ETR): An ETR is a router that accepts an IP packet where the destination address in the "outer" IP header is one of its own RLOCs. The router strips the "outer" header and forwards the packet based on the next IP header found. In general, an ETR receives LISP-encapsulated IP packets from the Internet on one side and sends decapsulated IP packets to site end-systems on the other side. ETR functionality does not have to be limited to a router device. A server host can be the endpoint of a LISP tunnel as well. TE-ETR: A TE-ETR is an ETR that is deployed in a service provider network that strips an outer LISP header for Traffic Engineering purposes. xTR:AAn xTR is a reference to an ITR or ETR when direction of data flow is not part of the context description.xTR"xTR" refers to the router that is the tunnelendpoint. Usedendpoint and is used synonymously with the term "Tunnel Router". For example, "An xTR can be located at the Customer Edge (CE)router", meaningrouter" indicates both ITR and ETR functionalityisat the CE router. LISP Router: A LISP router is a router that performs the functions of any or all of the following: ITR, ETR,PITR,Proxy-ITR (PITR), orPETR.Proxy-ETR (PETR). EID-to-RLOC Cache: The EID-to-RLOCcacheCache is a short-lived,on- demandon-demand table in an ITR that stores, tracks, and is responsible fortiming-outtiming out and otherwise validating EID-to-RLOC mappings. This cache is distinct from the full "database" of EID-to-RLOCmappings,mappings; it is dynamic, local to the ITR(s), and relativelysmallsmall, while the database is distributed, relatively static, and much more global in scope. EID-to-RLOC Database: The EID-to-RLOCdatabaseDatabase is a global distributed database that contains all knownEID-prefix to RLOCEID-Prefix-to-RLOC mappings. Each potential ETR typically contains a small piece of the database: the EID-to-RLOC mappings for theEID prefixesEID-Prefixes "behind" the router. These map to one of the router'sown, globally-visible,own globally visible IP addresses. The same database mapping entries MUST be configured on all ETRs for a given site. In a steadystatestate, theEID-prefixesEID-Prefixes for the site and thelocator-setLocator-Set for eachEID-prefixEID-Prefix MUST be the same on all ETRs. Procedures to enforce and/or verify this are outside the scope of this document. Note that there MAY be transient conditions when theEID-prefixEID-Prefix for the site andlocator-setLocator-Set for eachEID-prefixEID-Prefix may not be the same on all ETRs. This has no negativeimplicationsimplications, since a partial set oflocatorsLocators can be used. Recursive Tunneling: RecursivetunnelingTunneling occurs when a packet has more than one LISP IP header. Additional layers of tunneling MAY be employed to implementtraffic engineeringTraffic Engineering or other re-routing as needed. When this is done, an additional "outer" LISP header isaddedadded, and the original RLOCs are preserved in the "inner" header. Any references to tunnels in this specificationrefersrefer to dynamic encapsulatingtunnels andtunnels; they are never statically configured.ReencapsulatingRe-encapsulating Tunnels:Reencapsulating tunnelingRe-encapsulating Tunneling occurs when an ETR removes a LISP header, then acts as an ITR to prepend another LISP header. Doing this allows a packet to be re-routed by the re-encapsulating router without adding the overhead of additional tunnel headers. Any references to tunnels in this specificationrefersrefer to dynamic encapsulatingtunnels andtunnels; they are never statically configured. When using multiple mapping database systems, care must be taken to not createreencapsulationre-encapsulation loops through misconfiguration. LISP Header: LISP header is a term used in this document to refer to the outer IPv4 or IPv6 header, a UDP header, and aLISP-specificLISP- specific 8-octet header thatfollowsfollow the UDPheader,header and that an ITR prepends or an ETR strips. Address Family Identifier (AFI): AFI is a term used to describe an address encoding in a packet. An address family currently pertains to an IPv4 or IPv6 address. See[AFI]/[AFI-REGISTRY][AFI] and [RFC3232] for details. An AFI value of 0 used in this specification indicates an unspecified encoded address where the length of the address is 0 octets following the 16-bit AFI value of 0. Negative Mapping Entry: A negative mapping entry, also known as a negative cache entry, is an EID-to-RLOC entry where anEID-prefixEID-Prefix is advertised or stored with no RLOCs. That is, thelocator-setLocator-Set for the EID-to-RLOC entry is empty or has an encodedlocatorLocator count of 0. This type of entry could be used to describe a prefix from a non-LISP site, which is explicitly not in the mapping database. There are a set ofwell definedwell-defined actions that are encoded in a Negative Map-Reply (Section 6.1.5).Data Probe:Data-Probe: Adata-probeData-Probe is a LISP-encapsulated data packet where theinner headerinner-header destination address equals theouter headerouter-header destination address used to trigger a Map-Reply by a decapsulating ETR. In addition, the original packet is decapsulated and delivered to the destination host if the destination EID is in theEID-prefixEID-Prefix range configured on the ETR. Otherwise, the packet is discarded. AData ProbeData-Probe is used in some of the mapping database designs to "probe" or request a Map-Reply from an ETR; in other cases, Map-Requests are used. See each mapping database design for details. When usingData Probes,Data-Probes, by sending Map-Requests on the underlying routing system,EID-prefixesEID-Prefixes must be advertised. However, this is discouraged if the core is to scale by having lessEID-prefixesEID-Prefixes stored in the core router's routing tables.Proxy ITRProxy-ITR (PITR): A PITR is defined and described in[INTERWORK], a[RFC6832]. A PITR acts like an ITR but does so on behalf of non-LISP siteswhichthat send packets to destinations at LISP sites.Proxy ETRProxy-ETR (PETR): A PETR is defined and described in[INTERWORK], a[RFC6832]. A PETR acts like an ETR but does so on behalf of LISP siteswhichthat send packets to destinations at non-LISP sites. Route-returnability: Route-returnability is an assumption that the underlying routing system will deliver packets to the destination. When combined with a nonce that is provided by a sender and returned by a receiver, this limits off-path data insertion. Aroute- returnabilityroute-returnability check is verified when a message is sent with a nonce, another message is returned with the same nonce, and the destination of the original message appears as the source of the returned message. LISP site: LISP site is a set of routers in an edge network that are under a single technical administration. LISP routerswhichthat reside in the edge network are the demarcation points to separate the edge network from the core network. Client-side: Client-side is a term used in this document to indicate a connection initiation attempt by an EID. The ITR(s) at the LISP site are the first to get involved in obtaining databasemap cacheMap-Cache entries by sending Map-Request messages. Server-side: Server-side is a term used in this document to indicate that a connection initiation attempt is being accepted for a destination EID. The ETR(s) at the destination LISP site are the first to sendMap- RepliesMap-Replies to the source site initiating the connection. The ETR(s) at this destination site can obtain mappings by gleaning information from Map-Requests, Data-Probes, or encapsulated packets.Locator Status BitsLocator-Status-Bits (LSBs):Locator status bitsLocator-Status-Bits are present in the LISP header. They are used by ITRs to inform ETRs about the up/ down status of all ETRs at the local site. These bits are used as a hint to convey up/down router status and not path reachability status. The LSBs can be verified by use of one of the LocatorReachability Algorithmsreachability algorithms described in Section 6.3. Anycast Address: Anycast Address is a term used in this document to refer to the same IPv4 or IPv6 address configured and used on multiple systems at the same time. An EID or RLOC can be an anycast address in each of their own address spaces. 4. Basic Overview One key concept of LISP is that end-systems (hosts) operate the same way they do today. The IP addresses that hosts use for trackingsockets,sockets and connections, and for sending and receivingpacketspackets, do not change. In LISP terminology, these IP addresses are called Endpoint Identifiers (EIDs). Routers continue to forward packets based on IP destination addresses. When a packet is LISP encapsulated, these addresses are referred to as Routing Locators (RLOCs). Most routers along a path between two hosts will not change; they continue to perform routing/ forwarding lookups on the destination addresses. For routers between the source host and the ITR as well as routers from the ETR to the destination host, the destination address is an EID. For the routers between the ITR and the ETR, the destination address is an RLOC. Another key LISP concept is the "Tunnel Router". Atunnel routerTunnel Router prepends LISP headers on host-originated packets and strips them prior to final delivery to their destination. The IP addresses in this "outer header" are RLOCs. During end-to-end packet exchange between two Internet hosts, an ITR prepends a new LISP header to eachpacketpacket, and anegress tunnel routerETR strips the new header. The ITR performs EID-to-RLOC lookups to determine the routing path to the ETR, which has the RLOC as one of its IP addresses. Some basic rules governing LISP are: o End-systems (hosts) only send to addresseswhichthat are EIDs. They don't know that addresses are EIDs versus RLOCs but assume that packets get to their intended destinations. In a system where LISP is deployed, LISP routers interceptEID addressedEID-addressed packets and assist in delivering them across the network core where EIDs cannot be routed. The procedure a host uses to send IP packets does not change. o EIDs are always IP addresses assigned to hosts. o LISP routers mostly deal with Routing Locator addresses. See detailslaterin Section 4.1 to clarify what is meant by "mostly". o RLOCs are always IP addresses assigned torouters; preferably, topologically-orientedrouters, preferably topologically oriented addresses from provider CIDR (Classless Inter-Domain Routing) blocks. o When a router originatespacketspackets, it may use as a source address either an EID or RLOC. When acting as a host(e.g.(e.g., when terminating a transport session such asSSH,Secure SHell (SSH), TELNET, orSNMP),the Simple Network Management Protocol (SNMP)), it may use an EID that is explicitly assigned for that purpose. An EID that identifies the router as a host MUST NOT be used as an RLOC; an EID is only routable within the scope of a site. A typical BGP configuration might demonstrate this "hybrid" EID/RLOC usage where a router could use its "host-like" EID to terminate iBGP sessions to other routers in a site while at the same time using RLOCs to terminate eBGP sessions to routers outside the site. o Packets with EIDs in them are not expected to be deliveredend-to- endend-to-end in the absence of an EID-to-RLOC mapping operation. They are expected to be used locally for intra-site communication or to be encapsulated for inter-site communication. oEID prefixesEID-Prefixes are likely to be hierarchically assigned in a mannerwhichthat is optimized for administrative convenience and to facilitate scaling of the EID-to-RLOC mapping database. The hierarchy is based onaan address allocation hierarchywhichthat is independent of the network topology. o EIDs may also be structured (subnetted) in a manner suitable for local routing within anautonomous system.Autonomous System (AS). An additional LISP header MAY be prepended to packets by a TE-ITR when re-routing of the path for a packet is desired. A potential use-case for this would be an ISP router that needs to performtraffic engineeringTraffic Engineering for packets flowing through its network. In such a situation, termedRecursive Tunneling,"Recursive Tunneling", an ISP transit acts as an additionalingress tunnel routerITR, and the RLOC it uses for the new prepended header would be either a TE-ETR within the ISP (along an intra-ISP traffic engineered path) or a TE-ETR within another ISP (an inter-ISP traffic engineered path, where an agreement to build such a path exists). In order to avoid excessive packet overhead as well as possible encapsulation loops, this document mandates that a maximum of two LISP headers can be prepended to a packet. For initial LISP deployments, it is assumed that two headers is sufficient, where the first prepended header is used at a site for Location/Identity separation and the second prepended header is used inside a service provider for Traffic Engineering purposes. Tunnel Routers can be placed fairly flexibly in a multi-AS topology. For example, the ITR for a particular end-to-end packet exchange might be the first-hop or default router within a site for the source host. Similarly, theegress tunnel routerETR might be the last-hop routerdirectly-connecteddirectly connected to the destination host. Another example, perhaps for a VPN serviceout-sourcedoutsourced to an ISP by a site, the ITR could be the site's border router at the service provider attachment point. Mixing and matching of site-operated, ISP-operated, and othertunnel routersTunnel Routers is allowed for maximum flexibility. See Section 8 for more details. 4.1. Packet Flow Sequence This section provides an example of the unicast packet flow with the following conditions: o Source host "host1.abc.example.com" is sending a packet to "host2.xyz.example.com", exactly what host1 would do if the site was not using LISP. o Each site ismulti-homed,multihomed, so eachtunnel routerTunnel Router has an address (RLOC) assigned from the service provider address block for each provider to which that particulartunnel routerTunnel Router is attached. o The ITR(s) and ETR(s) are directly connected to the source and destination, respectively, but the source and destination can be located anywhere in the LISP site. o Map-Requests can be sent on the underlying routing system topology, to a mapping database system, or directly over analternative topology [ALT].Alternative Logical Topology [RFC6836]. A Map-Request is sent for an external destination when the destination is not found in the forwarding table or matches a default route. o Map-Replies are sent on the underlying routing system topology. Client host1.abc.example.com wants to communicate with server host2.xyz.example.com: 1. host1.abc.example.com wants to open a TCP connection to host2.xyz.example.com. It does a DNS lookup on host2.xyz.example.com. An A/AAAA record is returned. This address is the destination EID. Thelocally-assignedlocally assigned address of host1.abc.example.com is used as the source EID. An IPv4 or IPv6 packet is built and forwarded through the LISP site as a normal IP packet until it reaches a LISP ITR. 2. The LISP ITR must be able to map the destination EID to an RLOC of one of the ETRs at the destination site. The specific method used to do this is not described in this example. See[ALT][RFC6836] or [CONS] for possible solutions. 3. The ITR will send a LISP Map-Request. Map-Requests SHOULD be rate-limited. 4. When an alternate mapping system is not in use, the Map-Request packet is routed through the underlying routing system. Otherwise, the Map-Request packet is routed on an alternate logical topology, forexampleexample, the[ALT][RFC6836] database mapping system. In either case, when the Map-Request arrives at one of the ETRs at the destination site, it will process the packet as a control message. 5. The ETR looks at the destination EID of the Map-Request and matches it against the prefixes in the ETR's configuredEID-to- RLOCEID-to-RLOC mapping database. This is the list ofEID-prefixesEID-Prefixes the ETR is supporting for the site it resides in. If there is no match, the Map-Request is dropped. Otherwise, a LISP Map-Reply is returned to the ITR. 6. The ITR receives the Map-Reply message, parses the message (to check for formatvalidity)validity), and stores the mapping information from the packet. This information is stored in the ITR'sEID-to- RLOCEID-to-RLOC mapping cache. Note that themap cachemap-cache is an on-demand cache. An ITR will manage itsmap cachemap-cache in such a way that optimizes for its resource constraints. 7. Subsequent packets from host1.abc.example.com to host2.xyz.example.com will have a LISP header prepended by the ITR using the appropriate RLOC as the LISP header destination address learned from the ETR. Note that the packet MAY be sent to a different ETR than the onewhichthat returned the Map-Reply due to the source site's hashing policy or the destination site'slocator-setLocator-Set policy. 8. The ETR receives these packets directly (since the destination address is one of its assigned IP addresses), checks the validity of the addresses, strips the LISP header, and forwards packets to the attached destination host. In order to defer the need for a mapping lookup in the reverse direction, an ETR MAY create a cache entry that maps the source EID(inner header(inner-header source IP address) to the source RLOC(outer header(outer-header source IP address) in a received LISP packet. Such a cache entry is termed a "gleaned" mapping and only contains a single RLOC for the EID in question. More complete information about additional RLOCs SHOULD be verified by sending a LISP Map-Request for that EID. Both the ITR and the ETR may also influence the decision the other makes in selecting an RLOC. See Section 6 for more details. 5. LISP Encapsulation Details Since additional tunnel headers are prepended, the packet becomes larger and can exceed the MTU of any link traversed from the ITR to the ETR. It is RECOMMENDED in IPv4 that packets do not get fragmented as they are encapsulated by the ITR. Instead, the packet is dropped and an ICMP Too Big message is returned to the source. This specification RECOMMENDS that implementations provide support for one of the proposed fragmentation and reassembly schemes. Two existing schemes are detailed in Section 5.4. Since IPv4 or IPv6 addresses can be either EIDs or RLOCs, the LISP architecture supports IPv4 EIDs with IPv6 RLOCs (where the inner header is in IPv4 packet format and theotherouter header is in IPv6 packet format) or IPv6 EIDs with IPv4 RLOCs (where the inner header is in IPv6 packet format and theotherouter header is in IPv4 packet format). The next sub-sections illustrate packet formats for the homogeneous case (IPv4-in-IPv4 andIPv6-in-IPv6)IPv6-in-IPv6), but all 4 combinations MUST be supported. 5.1. LISP IPv4-in-IPv4 Header Format 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / |Version| IHL |Type of Service| Total Length | / +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | Identification |Flags| Fragment Offset | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ OH | Time to Live | Protocol = 17 | Header Checksum | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | Source Routing Locator | \ +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ \ | Destination Routing Locator | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / | Source Port = xxxx | Dest Port = 4341 | UDP +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ \ | UDP Length | UDP Checksum | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ L |N|L|E|V|I|flags| Nonce/Map-Version | I \ +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ S / | InstanceID/Locator Status BitsID/Locator-Status-Bits | P +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / |Version| IHL |Type of Service| Total Length | / +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | Identification |Flags| Fragment Offset | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ IH | Time to Live | Protocol | Header Checksum | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | Source EID | \ +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ \ | Destination EID | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ IHL = IP-Header-Length 5.2. LISP IPv6-in-IPv6 Header Format 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / |Version| Traffic Class | Flow Label | / +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | Payload Length | Next Header=17| Hop Limit | v +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | O + + u | | t + Source Routing Locator + e | | r + + | | H +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ d | | r + + | | ^ + Destination Routing Locator + | | | \ + + \ | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / | Source Port = xxxx | Dest Port = 4341 | UDP +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ \ | UDP Length | UDP Checksum | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ L |N|L|E|V|I|flags| Nonce/Map-Version | I \ +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ S / | InstanceID/Locator Status BitsID/Locator-Status-Bits | P +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / |Version| Traffic Class | Flow Label | / +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / | Payload Length | Next Header | Hop Limit | v +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | I + + n | | n + Source EID + e | | r + + | | H +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ d | | r + + | | ^ + Destination EID + \ | | \ + + \ | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ 5.3. Tunnel Header Field Descriptions Inner Header (IH): The inner header is the header on the datagram received from the originating host. The source and destination IP addresses areEIDs, [RFC0791],EIDs [RFC0791] [RFC2460]. Outer Header: (OH) The outer header is a new header prepended by an ITR. The address fields contain RLOCs obtained from the ingress router's EID-to-RLOCcache.Cache. The IP protocol number is "UDP (17)" from [RFC0768]. The setting of theDFDon't Fragment (DF) bitFlags'Flags' field is according to rules listed inSectionSections 5.4.1 andSection5.4.2. UDP Header: The UDP header contains an ITR selected source port when encapsulating a packet. See Section 6.5 for details on the hash algorithm used to select a source port based on the 5-tuple of the inner header. The destination port MUST be set to the well-knownIANA assignedIANA-assigned port value 4341. UDP Checksum: TheUDP checksum'UDP Checksum' field SHOULD be transmitted as zero by an ITR for either IPv4 [RFC0768] or IPv6 encapsulation [UDP-TUNNELS] [UDP-ZERO]. When a packet with a zero UDP checksum is received by an ETR, the ETR MUST accept the packet for decapsulation. When an ITR transmits a non-zero value for the UDP checksum, it MUST send a correctly computed value in this field. When an ETR receives a packet with a non-zero UDP checksum, it MAY choose to verify the checksum value. If it chooses to perform such verification, and the verification fails, the packet MUST be silently dropped. If the ETR chooses not to perform the verification, or performs the verification successfully, the packet MUST be accepted for decapsulation. The handling of UDP checksums for all tunneling protocols, including LISP, is under active discussion within the IETF. When that discussion concludes, any necessary changes will be made to align LISP with the outcome of the broader discussion. UDP Length: TheUDP length'UDP Length' field is set for anIPv4 encapsulatedIPv4-encapsulated packet to be the sum of theinner headerinner-header IPv4 Total Length plus the UDP and LISP header lengths. For anIPv6 encapsulatedIPv6-encapsulated packet, theUDP length'UDP Length' field is the sum of theinner headerinner-header IPv6 Payload Length, the size of the IPv6 header (40 octets), and the size of the UDP and LISP headers. N: TheN bitN-bit is the nonce-present bit. When this bit is set to 1, the low-order24-bits24 bits of the first32-bits32 bits of the LISP headercontainscontain a Nonce. See Section 6.3.1 for details. BothNN- andV bitsV-bits MUST NOT be set in the same packet. If they are, a decapsulating ETR MUST treat the"Nonce/Map-Version"'Nonce/Map-Version' field as having a Nonce value present. L: TheL bitL-bit is theLocator Status Bits'Locator-Status-Bits' field enabled bit. When this bit is set to 1, theLocator Status BitsLocator-Status-Bits in the second32-bits32 bits of the LISP header are in use. x 1 x x 0 x x x +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ |N|L|E|V|I|flags| Nonce/Map-Version | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ |Locator Status BitsLocator-Status-Bits | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ E: TheE bitE-bit is the echo-nonce-request bit. This bit MUST be ignored and has no meaning when theN bitN-bit is set to 0. When theN bitN-bit is set to 1 and this bit is set to 1,meansan ITR is requestingforthat the nonce value in theNonce'Nonce' fieldtobe echoed back inLISPLISP- encapsulated packets when the ITR is also an ETR. See Section 6.3.1 for details. V: TheV bitV-bit is the Map-Version present bit. When this bit is set to 1, theN bitN-bit MUST be 0. Refer to Section 6.6.3 for more details. This bit indicates that the LISP header is encoded in this case as: 0 x 0 1 x x x x +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ |N|L|E|V|I|flags| Source Map-Version | Dest Map-Version | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | InstanceID/Locator Status BitsID/Locator-Status-Bits | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ I: TheI bitI-bit is the Instance ID bit. See Section 5.5 for more details. When this bit is set to 1, theLocator Status Bits'Locator-Status-Bits' field is reduced to8-bits8 bits and the high-order24-bits24 bits are used as an Instance ID. If the L-bit is set to 0, then the low-order 8 bits are transmitted as zero and ignored on receipt. The format of the LISP header would look likein this case:this: x x x x 1 x x x +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ |N|L|E|V|I|flags| Nonce/Map-Version | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Instance ID | LSBs | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ flags: Theflags'flags' field is a 3-bit fieldisreserved for future flag use. It MUST be set to 0 on transmit and MUST be ignored on receipt. LISP Nonce: The LISPnonce'Nonce' field is a 24-bit value that is randomly generated by an ITR when the N-bit is set to 1. Nonce generation algorithms are an implementation matter but are required to generate different nonces when sending to different destinations. However, the same nonce can be used for a period of time to the same destination. The nonce is also used when the E-bit is set to request the nonce value to be echoed by the other side when packets are returned. When the E-bit is clear but the N-bit is set, a remote ITR is either echoing a previously requestedecho- nonceecho-nonce or providing a random nonce. See Section 6.3.1 for more details. LISPLocator Status BitsLocator-Status-Bits (LSBs): When the L-bit is also set, thelocator status bits'Locator-Status-Bits' field in the LISP header is set by an ITR to indicate to an ETR the up/down status of the Locators in the source site. Each RLOC in a Map-Reply is assigned an ordinal value from 0 to n-1 (when there are n RLOCs in a mapping entry). TheLocator Status BitsLocator-Status-Bits are numbered from 0 to n-1 from the least significant bit of the field. The field is32-bits32 bits when the I-bit is set to 0 and is 8 bits when the I-bit is set to 1. When aLocator Status BitLocator-Status-Bit is set to 1, the ITR is indicating to the ETR that the RLOC associated with the bit ordinal has up status. See Section 6.3 for details on how an ITR can determine the status of the ETRs at the same site. When a site has multipleEID-prefixes whichEID-Prefixes that result in multiple mappings (where each could have a differentlocator- set),Locator-Set), theLocator Status BitsLocator-Status-Bits setting in an encapsulated packet MUST reflect the mapping for theEID-prefixEID-Prefix that the inner-header source EID address matches. If the LSB for an anycastlocatorLocator is set to 1, then there is at least one RLOC with thataddressaddress, and the ETR is considered 'up'. When doing ITR/PITR encapsulation: o Theouter header Timeouter-header 'Time toLiveLive' field (orHop Limit'Hop Limit' field, in the case of IPv6) SHOULD be copied from theinner header Timeinner-header 'Time toLiveLive' field. o Theouter header Typeouter-header 'Type ofServiceService' field (or theTraffic Class'Traffic Class' field, in the case of IPv6) SHOULD be copied from theinner header Typeinner-header 'Type ofServiceService' field (with oneexception,exception; see below). When doing ETR/PETR decapsulation: o Theinner header Timeinner-header 'Time toLiveLive' field (orHop Limit'Hop Limit' field, in the case of IPv6) SHOULD be copied from theouter header Timeouter-header 'Time toLiveLive' field, when the Time to Livefieldvalue of the outer header is less than the Time to Live value of the inner header. Failing to perform this check can cause the Time to Live of the inner header to increment across encapsulation/decapsulationcycle.cycles. This check is also performed when doing initialencapsulationencapsulation, when a packet comes to an ITR or PITR destined for a LISP site. o Theinner header Typeinner-header 'Type ofServiceService' field (or theTraffic Class'Traffic Class' field, in the case of IPv6) SHOULD be copied from theouter header Typeouter-header 'Type ofServiceService' field (with oneexception,exception; see below). Note that if an ETR/PETR is also an ITR/PITR andchoosechooses toreencapsulatere-encapsulate after decapsulating, the net effect of this is that the new outer header will carry the same Time to Live as the old outer header minus 1. Copying theTTLTime to Live (TTL) serves two purposes: first, it preserves the distance the host intended the packet to travel; second, and more importantly, it provides for suppression of looping packets in the event there is a loop of concatenated tunnels due to misconfiguration. See Section 9.3 for TTL exception handling for traceroute packets. TheECNExplicit Congestion Notification ('ECN') field occupies bits 6 and 7 of both the IPv4Type'Type ofServiceService' field and the IPv6Traffic Class'Traffic Class' field [RFC3168]. TheECN'ECN' field requires special treatment in order to avoid discarding indications of congestion [RFC3168]. ITR encapsulation MUST copy the 2-bitECN'ECN' field from the inner header to the outer header. Re-encapsulation MUST copy the 2-bitECN'ECN' field from the stripped outer header to the new outer header. If theECN'ECN' field contains a congestion indication codepoint (the value is '11', the Congestion Experienced (CE) codepoint), then ETR decapsulation MUST copy the 2-bitECN'ECN' field from the stripped outer header to the surviving inner header that is used to forward the packet beyond the ETR. These requirements preserveCongestion Experienced (CE)CE indications when a packet that uses ECN traverses a LISP tunnel and becomes marked with a CE indication due to congestion between the tunnel endpoints. 5.4. Dealing with Large Encapsulated Packets This section proposes two mechanisms to deal with packets that exceed the path MTU between the ITR and ETR. It is left to the implementor to decide if the stateless or stateful mechanism should be implemented. Both or neither can beusedused, since it is a local decision in the ITR regarding how to deal with MTU issues, and sites can interoperate with differing mechanisms. Both stateless and stateful mechanisms also apply toReencapsulatingRe-encapsulating and RecursiveTunneling. SoTunneling, so any actions below referring to an ITR also apply toana TE-ITR. 5.4.1. A Stateless Solution to MTU Handling An ITR stateless solution to handle MTU issues is described as follows: 1. Define H to be the size, in octets, of the outer header an ITR prepends to a packet. This includes the UDP and LISP header lengths. 2. Define L to be the size, in octets, of themaximum sizedmaximum-sized packet an ITR can send to an ETR without the need for the ITR or any intermediate routers to fragment the packet. 3. Define an architectural constant S for the maximum size of a packet, in octets, an ITR must receive so the effective MTU can be met. That is, S = L - H. When an ITR receives a packet from a site-facing interface and adds H octets worth of encapsulation to yield a packet size greater than L octets, it resolves the MTU issue by first splitting the original packet into 2 equal-sized fragments. A LISP header is then prepended to each fragment. The size of the encapsulated fragments is then (S/2 + H), which is less than the ITR's estimate of the path MTU between the ITR and its correspondent ETR. When an ETR receives encapsulated fragments, it treats them as two individually encapsulated packets. It strips the LISP headers and then forwards each fragment to the destination host of the destination site. The two fragments are reassembled at the destination host into the single IP datagram that was originated by the source host. Note that reassembly can happen at the ETR if the encapsulated packet was fragmented at or after the ITR. This behavior is performed by the ITR when the source host originates a packet with theDF'DF' field of the IP headerisset to 0. When theDF'DF' field of the IP header is set to 1, or the packet is an IPv6 packet originated by the source host, the ITR will drop the packet when the size is greater thanL,L andsendssend an ICMP Too Big message to the source with a value of S, where S is (L - H). When theouter headerouter-header encapsulation uses an IPv4 header, an implementation SHOULD set the DF bit to 1 so ETR fragment reassembly can be avoided. An implementation MAY set the DF bit in such headers to 0 if it has good reason to believe there are unresolvable path MTU issues between the sending ITR and the receiving ETR. This specification RECOMMENDS that L be defined as 1500. 5.4.2. A Stateful Solution to MTU Handling An ITR stateful solution to handle MTU issues is described as follows and was first introduced in [OPENLISP]: 1. The ITR will keep state of the effective MTU for eachlocatorLocator permapping cacheMap-Cache entry. The effective MTU is what the core network can deliver along the path between the ITR and ETR. 2. When anIPv6 encapsulated packetIPv6-encapsulated packet, or anIPv4 encapsulatedIPv4-encapsulated packet with the DF bit set to 1, exceeds what the core network can deliver, one of the intermediate routers on the path will send an ICMP Too Big message to the ITR. The ITR will parse the ICMP message to determine whichlocatorLocator is affected by the effective MTU change and then record the new effective MTU value in themapping cacheMap-Cache entry. 3. When a packet is received by the ITR from a source inside of the site and the size of the packet is greater than the effective MTU stored with themapping cacheMap-Cache entry associated with the destination EID the packet is for, the ITR will send an ICMP Too Big message back to the source. The packet size advertised by the ITR in the ICMP Too Big message is the effective MTU minus the LISP encapsulation length. Even though this mechanism is stateful, it has advantages over the stateless IP fragmentation mechanism, by not involving the destination host with reassembly of ITR fragmented packets. 5.5. Using Virtualization and Segmentation with LISP When multiple organizations inside of a LISP site are using private addresses [RFC1918] asEID-prefixes,EID-Prefixes, their address spaces MUST remain segregated due to possible address duplication. An Instance ID in the address encoding can aid in making the entireAFI basedAFI-based address unique. See IANA ConsiderationsSection 14.2(Section 14.2) for detailsforon possible address encodings. An Instance ID can be carried in aLISP encapsulatedLISP-encapsulated packet. An ITR that prepends a LISPheader,header will copy a 24-bitvalue,value used by the LISP router to uniquely identify the address space. The value is copied to theInstance ID'Instance ID' field of the LISPheaderheader, and the I-bit is set to 1. When an ETR decapsulates a packet, the Instance ID from the LISP header is used as a table identifier to locate the forwarding table to use for the inner destination EID lookup. For example,aan 802.1Q VLAN tag or VPN identifier could be used as a 24-bit Instance ID. 6. EID-to-RLOC Mapping 6.1. LISP IPv4 and IPv6Control PlaneControl-Plane Packet Formats The following UDP packet formats are used by the LISPcontrol-plane.control plane. 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ |Version| IHL |Type of Service| Total Length | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Identification |Flags| Fragment Offset | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Time to Live | Protocol = 17 | Header Checksum | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Source Routing Locator | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Destination Routing Locator | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / | Source Port | Dest Port | UDP +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ \ | UDP Length | UDP Checksum | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | | LISP Message | | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ |Version| Traffic Class | Flow Label | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Payload Length | Next Header=17| Hop Limit | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | + + | | + Source Routing Locator + | | + + | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | + + | | + Destination Routing Locator + | | + + | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / | Source Port | Dest Port | UDP +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ \ | UDP Length | UDP Checksum | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | | LISP Message | | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ The LISP UDP-based messages are the Map-Request and Map-Reply messages. When a UDP Map-Request is sent, the UDP source port is chosen by the sender and the destination UDP port number is set to 4342. When a UDP Map-Reply is sent, the source UDP port number is set to 4342 and the destination UDP port number is copied from the source port of either the Map-Request or the invoking data packet. Implementations MUST be prepared to accept packets when either the source port or destination UDP port is set to 4342 due to NATs changing port number values. TheUDP Length'UDP Length' field will reflect the length of the UDP header and the LISP Message payload. The UDPChecksumchecksum is computed and set to non-zero for Map-Request, Map-Reply,Map-RegisterMap-Register, andECMEncapsulated Control Message (ECM) control messages. It MUST be checked onreceiptreceipt, and if the checksum fails, the packet MUST be dropped. The format of control messages includes the UDP header so the checksum and length fields can be used to protect and delimit message boundaries. 6.1.1. LISP Packet Type Allocations This section will be the authoritative source for allocating LISP Type values and for defining LISP control message formats. Current allocations are: Reserved: 0 b'0000' LISP Map-Request: 1 b'0001' LISP Map-Reply: 2 b'0010' LISP Map-Register: 3 b'0011' LISP Map-Notify: 4 b'0100' LISP Encapsulated Control Message: 8 b'1000' 6.1.2. Map-Request Message Format 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ |Type=1 |A|M|P|S|p|s| Reserved | IRC | Record Count | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Nonce . . . | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | . . . Nonce | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Source-EID-AFI | Source EID Address ... | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | ITR-RLOC-AFI 1 | ITR-RLOC Address 1 ... | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | ... | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | ITR-RLOC-AFI n | ITR-RLOC Address n ... | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / | Reserved | EID mask-len |EID-prefix-AFIEID-Prefix-AFI | Rec +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ \ |EID-prefixEID-Prefix ... | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Map-Reply Record ... | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ Packet field descriptions: Type: 1 (Map-Request) A: This is an authoritative bit, which is set to 0 for UDP-basedMap- RequestsMap-Requests sent by an ITR.SetIt is set to 1 when an ITR wants the destination site to return the Map-Reply rather than the mapping database system. M: This is the map-data-presentbit, whenbit. When set, it indicates that aMap- ReplyMap-Reply Record segment is included in the Map-Request. P: This is theprobe-bitprobe-bit, which indicates that a Map-Request SHOULD be treated as alocatorLocator reachability probe. The receiver SHOULD respond with a Map-Reply with the probe-bit set, indicating that the Map-Reply is alocatorLocator reachability probe reply, with the nonce copied from the Map-Request. See Section 6.3.2 for more details. S: This is the Solicit-Map-Request (SMR) bit. See Section 6.6.2 for details. p: This is the PITR bit. This bit is set to 1 when a PITR sends a Map-Request. s: This is the SMR-invoked bit. This bit is set to 1 when an xTR is sending a Map-Request in response to a received SMR-basedMap- Request.Map-Request. Reserved:ItThis field MUST be set to 0 on transmit and MUST be ignored on receipt. IRC: This 5-bit field is the ITR-RLOCCountCount, which encodes the additional number of(ITR-RLOC-AFI, ITR-RLOC Address)('ITR-RLOC-AFI', 'ITR-RLOC Address') fields present in this message. At least one (ITR-RLOC-AFI,ITR-RLOC- Address)ITR-RLOC-Address) pair MUST be encoded. MultipleITR-RLOC Address'ITR-RLOC Address' fields areusedused, so a Map-Replier can select which destination address to use for a Map-Reply. The IRC value ranges from 0 to 31. For a value of 0, there is 1 ITR-RLOC addressencoded, andencoded; for a value of 1, there are 2 ITR-RLOC addressesencodedencoded, and so on up to3131, which encodes a total of 32 ITR-RLOC addresses. Record Count:TheThis is the number of records in this Map-Request message. A record is comprised of the portion of the packet that is labeled 'Rec' above and occurs the number of times equal to Record Count. For this version of the protocol, a receiver MUST accept and process Map-Requests that contain one or more records, but a sender MUST only send Map-Requests containing one record. Support for requesting multiple EIDs in a single Map-Request message will be specified in a future version of the protocol. Nonce:AnThis is an 8-octet random value created by the sender of theMap- Request.Map-Request. This nonce will be returned in the Map-Reply. The security of the LISP mapping protocoldependscritically depends on the strength of the nonce in the Map-Request message. The nonce SHOULD be generated by a properly seeded pseudo-random (or strong random) source. See [RFC4086] for advice on generating security- sensitive random data. Source-EID-AFI:AddressThis is the address family of the"Source'Source EIDAddress"Address' field. Source EID Address: This is the EID of the source hostwhichthat originated the packetwhich isthat caused the Map-Request. WhenMap- RequestsMap-Requests are used for refreshing amap-cacheMap-Cache entry or forRLOC- probing,RLOC-Probing, an AFI value 0 is used and this field is of zero length. ITR-RLOC-AFI:AddressThis is the address family of the"ITR-RLOC Address"'ITR-RLOC Address' field that follows this field. ITR-RLOC Address:UsedThis is used to give the ETR the option of selecting the destination address from any address family for the Map-Reply message. This address MUST be a routable RLOC address of the sender of the Map-Request message. EID mask-len:MaskThis is the mask length forEID prefix. EID-prefix-AFI: Addressthe EID-Prefix. EID-Prefix-AFI: This is the address family ofEID-prefixthe EID-Prefix according to[AFI] EID-prefix:[AFI]. EID-Prefix: This prefix is 4 octetsiffor an IPv4address-family,address family and 16 octetsiffor an IPv6address-family.address family. When a Map-Request is sent by an ITR because a data packet is received for a destination where there is no mapping entry, theEID-prefixEID-Prefix is set to the destination IP address of the datapacket. Andpacket, and the 'EID mask-len' is set to 32 or 128 for IPv4 or IPv6, respectively. When an xTR wants to query a site about the status of a mapping it already has cached, theEID- prefixEID-Prefix used in the Map-Request has the samemask-lengthmask length as theEID-prefixEID-Prefix returned from the site when it sent a Map-Reply message. Map-Reply Record: When theM bitM-bit is set, this field is the size of a single "Record" in the Map-Reply format. This Map-Reply record contains the EID-to-RLOC mapping entry associated with the Source EID. This allows the ETRwhichthat will receive this Map-Request to cache the data if it chooses to do so. 6.1.3. EID-to-RLOC UDP Map-Request Message A Map-Request is sent from an ITR when it needs a mapping for an EID, wants to test an RLOC for reachability, or wants to refresh a mapping before TTL expiration. For the initial case, the destination IP address used for the Map-Request is the data packet's destination address(i.e.(i.e., thedestination-EID) whichdestination EID) that had a mapping cache lookup failure. For the latter two cases, the destination IP address used for the Map-Request is one of the RLOC addresses from thelocator-setLocator-Set of themap cacheMap-Cache entry. The source address is either an IPv4 or IPv6 RLOCaddressaddress, dependingifon whether the Map-Request is using an IPv4versusor IPv6 header, respectively. In all cases, the UDP source port number for the Map-Request message isan ITR/PITR selecteda 16-bit value selected by the ITR/PITR, and the UDP destination port number is set to thewell-knownwell- known destination port number 4342. A successful Map-Reply, which is one that has a nonce that matches an outstanding Map-Request nonce, will update the cached set of RLOCs associated with theEID prefixEID-Prefix range. One or more Map-Request(ITR-RLOC-AFI, ITR-RLOC-Address)('ITR-RLOC-AFI', 'ITR-RLOC-Address') fields MUST be filled in by the ITR. The number of fields (minus 1) encoded MUST be placed in theIRC'IRC' field. The ITR MAY include all locally configuredlocatorsLocators in this list or just provide one locator address from each address family it supports. If the ITR erroneously provides no ITR-RLOC addresses, the Map-Replier MUST drop theMap- Request.Map-Request. Map-Requests can also be LISP encapsulated using UDP destination port 4342 with a LISPtypeType value set to "Encapsulated Control Message", when sent from an ITR to a Map-Resolver. Likewise, Map-Requests are LISP encapsulated the same way from a Map-Server to an ETR. Details onencapsulatedEncapsulated Map-Requests and Map-Resolvers can be found in[LISP-MS].[RFC6833]. Map-Requests MUST be rate-limited. It is RECOMMENDED that aMap- RequestMap-Request for the sameEID-prefixEID-Prefix be sent no more than once per second. An ITR that is configured with mapping database information(i.e.(i.e., it is also an ETR) MAY optionally include those mappings in aMap- Request.Map-Request. When an ETR configured to accept and verify such "piggybacked" mapping data receives such a Map-Request and it does not have this mapping in the map-cache, it MAY originate a "verifying Map-Request", addressed to the map-requesting ITR and the ETR MAY add amap-cacheMap-Cache entry. If the ETR has amap-cacheMap-Cache entry that matches the "piggybacked" EID and the RLOC is in thelocator-setLocator-Set for the entry, then it may send the "verifying Map-Request" directly to the originating Map-Request source. If the RLOC is not in thelocator- set,Locator-Set, then the ETR MUST send the "verifying Map-Request" to the "piggybacked" EID. Doing this forces the "verifying Map-Request" to go through the mapping database system to reach the authoritative source of information about that EID, guarding against RLOC-spoofing ininthe "piggybacked" mapping data. 6.1.4. Map-Reply Message Format 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ |Type=2 |P|E|S| Reserved | Record Count | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Nonce . . . | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | . . . Nonce | +-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | Record TTL | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ R | Locator Count | EID mask-len | ACT |A| Reserved | e +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ c | Rsvd | Map-Version Number |EID-prefix-AFIEID-Prefix-AFI | o +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ r |EID-prefixEID-Prefix | d +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | /| Priority | Weight | M Priority | M Weight | | L +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | o | Unused Flags |L|p|R| Loc-AFI | | c +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | \| Locator | +-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ Packet field descriptions: Type: 2 (Map-Reply) P: This is theprobe-bitprobe-bit, which indicates that the Map-Reply is in response to alocatorLocator reachability probe Map-Request. Thenonce'Nonce' field MUST contain a copy of the nonce value from the original Map-Request. See Section 6.3.2 for more details. E:IndicatesThis bit indicates that the ETRwhichthat sends this Map-Reply message is advertising that the site is enabled for the Echo-NoncelocatorLocator reachability algorithm. See Section 6.3.1 for more details. S: This is the Security bit. When set to11, the following authentication information will be appended to the end of theMap- Reply.Map-Reply. The detailed format of the Authentication Data Content is for further study. 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | AD Type | Authentication Data Content . . . | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ Reserved:ItThis field MUST be set to 0 on transmit and MUST be ignored on receipt. Record Count:TheThis is the number of records in this reply message. A record is comprised of that portion of the packet labeled 'Record' above and occurs the number of times equal to Recordcount.Count. Nonce:AThis is a 24-bit value set in a Data-Probepacketpacket, or a 64-bit value from the Map-Request is echoed in thisNonce'Nonce' field of theMap- Reply.Map-Reply. When a 24-bit value is supplied, it resides in thelow- orderlow-order 64 bits of thenonce'Nonce' field. Record TTL:TheThis is the time in minutes the recipient of the Map-Reply will store the mapping. If the TTL is 0, the entry SHOULD be removed from the cache immediately. If the value is 0xffffffff, the recipient can decide locally how long to store the mapping. Locator Count:TheThis is the number of Locator entries. AlocatorLocator entry comprises what is labeled above as 'Loc'. ThelocatorLocator count can be00, indicating that there are nolocatorsLocators for theEID-prefix.EID-Prefix. EID mask-len:MaskThis is the mask length forEID prefix.the EID-Prefix. ACT: This 3-bit field describesnegativeNegative Map-Reply actions. In any other message type, these bits are set to 0 and ignored on receipt. These bits are used only when the 'Locator Count' field is set to 0. The action bits are encoded only in Map-Reply messages. The actions defined are used by an ITR or PITR when a destination EID matches a negativemapping cacheMap-Cache entry. Unassigned values should cause amap-cacheMap-Cache entry to becreated and,created, and when packets match this negative cache entry, they will be dropped. The current assigned values are: (0) No-Action: The map-cache is keptalivealive, and no packet encapsulation occurs. (1) Natively-Forward: The packet is not encapsulated or dropped but natively forwarded. (2) Send-Map-Request: The packet invokes sending a Map-Request. (3) Drop: A packet that matches this map-cache entry is dropped. An ICMP Destination Unreachable message SHOULD be sent. A: The Authoritative bit, whensentsent, is always set to 1 by an ETR. When a Map-Server is proxy Map-Replying[LISP-MS][RFC6833] for a LISP site, the Authoritative bit is set to 0. This indicates to requesting ITRs that the Map-Reply was not originated by a LISP node managed at the site that owns theEID-prefix.EID-Prefix. Map-Version Number: When this 12-bit value isnon-zeronon-zero, the Map-Reply sender is informing the ITR what the version number is for theEID-recordEID record contained in the Map-Reply. The ETR can allocate this number internally but MUST coordinate this value with other ETRs for the site. When this value is 0, there is no versioning information conveyed. The Map-Version Number can be included in Map-Request and Map-Register messages. See Section 6.6.3 for more details.EID-prefix-AFI:EID-Prefix-AFI: Address family ofEID-prefixthe EID-Prefix according to [AFI].EID-prefix:EID-Prefix: This prefix is 4 octetsiffor an IPv4address-family,address family and 16 octetsiffor an IPv6address-family.address family. Priority:eachEach RLOC is assigned a unicastpriority.Priority. Lower values are more preferable. When multiple RLOCs have the samepriority,Priority, they MAY be used in a load-split fashion. A value of 255 means the RLOC MUST NOT be used for unicast forwarding. Weight:whenWhen priorities are the same for multiple RLOCs, theweightWeight indicates how to balance unicast traffic between them. Weight is encoded as a relative weight of total unicast packets that match the mapping entry. Forexampleexample, if there are 4locatorsLocators in alocator set,Locator-Set, where theweightsWeights assigned are 30, 20, 20, and 10, the firstlocatorLocator will get 37.5% of the traffic, the 2nd and 3rdlocatorsLocators will get 25% oftrafficthe traffic, and the 4thlocatorLocator will get 12.5% of the traffic. If allweightsWeights for alocator-setLocator-Set are equal, the receiver of the Map-Reply will decide how to load-split the traffic. See Section 6.5 for a suggested hash algorithm to distribute the load acrosslocatorsLocators with the samepriorityPriority and equalweightWeight values. M Priority:eachEach RLOC is assigned a multicastpriorityPriority used by an ETR in a receiver multicast site to select an ITR in a source multicast site for building multicast distribution trees. A value of 255 means the RLOC MUST NOT be used for joining a multicast distribution tree. For more details, see[MLISP].[RFC6831]. M Weight:whenWhen priorities are the same for multiple RLOCs, theweightWeight indicates how to balance building multicast distribution trees across multiple ITRs. TheweightWeight is encoded as a relative weight (similar to the unicast Weights) of the total number of trees built to the source site identified by theEID-prefix.EID-Prefix. If allweightsWeights for alocator-setLocator-Set are equal, the receiver of the Map-Reply will decide how to distribute multicast state across ITRs. For more details, see[MLISP].[RFC6831]. Unused Flags: These are set to 0 when sending and ignored on receipt. L:whenWhen this bit is set, thelocatorLocator is flagged as a locallocatorLocator to the ETR that is sending the Map-Reply. When a Map-Server is doing proxy Map-Replying[LISP-MS][RFC6833] for a LISP site, theL bitL-bit is set to 0 for alllocatorsLocators in thislocator-set.Locator-Set. p:whenWhen this bit is set, an ETR informs theRLOC-probingRLOC-Probing ITR that the locatoraddress,address for which this bit isset,set is the one beingRLOC- probedRLOC-probed and MAY be different from the source address of theMap- Reply.Map-Reply. An ITR that RLOC-probes a particularlocator,Locator MUST use thislocatorLocator for retrieving the data structure used to store the fact that thelocatorLocator is reachable. The"p" bitp-bit is set for a singlelocatorLocator in the samelocator set.Locator-Set. If an implementation sets more than one"p" bitp-bit erroneously, the receiver of the Map-Reply MUST select the firstlocator.Locator. The"p" bitp-bit MUST NOT be set forlocator-setLocator-Set records sent in Map-Request and Map-Register messages. R: This is set when the sender of a Map-Reply has a route to thelocatorLocator in thelocatorLocator data record. This receiver may find this useful to know if thelocatorLocator is up but not necessarily reachable from the receiver's point of view. See also Section 6.4 for another way the R-bit may be used. Locator: This is an IPv4 or IPv6 address (as encoded by the 'Loc-AFI' field) assigned to an ETR. Note that the destination RLOC address MAY be an anycast address. A source RLOC can be an anycast address as well. The source or destination RLOC MUST NOT be the broadcast address (255.255.255.255 or any subnet broadcast address known to therouter),router) and MUST NOT be a link-local multicast address. The source RLOC MUST NOT be a multicast address. The destination RLOC SHOULD be a multicast address if it is being mapped from a multicast destination EID. 6.1.5. EID-to-RLOC UDP Map-Reply Message A Map-Reply returns anEID-prefixEID-Prefix with a prefix length that is less than or equal to the EID being requested. The EID being requested is either from the destination field of an IP header of a Data-Probe or the EID record of a Map-Request. The RLOCs in the Map-Reply areglobally-routableglobally routable IP addresses of all ETRs for the LISP site. Each RLOC conveys status reachability but does not convey path reachability from arequestersrequester's perspective. Separate testing of path reachability isrequired,required. See Section 6.3 for details. Note that a Map-Reply may contain differentEID-prefixEID-Prefix granularity (prefix + length) than the Map-Requestwhichthat triggers it. This might occur if a Map-Request were for a prefix that had been returned by an earlier Map-Reply. In such a case, the requester updates its cache with the new prefix information and granularity. For example, a requester with two cachedEID-prefixesEID-Prefixes that are covered by aMap- ReplyMap-Reply containingone,one less-specificprefix,prefix replaces the entry with the less-specificEID-prefix.EID-Prefix. Note that the reverse, replacement of one less-specific prefix with multiple more-specific prefixes, can alsooccur butoccur, not by removing the less-specific prefix but rather by adding the more-specific prefixeswhichthat, during alookuplookup, will override the less-specific prefix. When an ETR is configured with overlappingEID-prefixes,EID-Prefixes, aMap- RequestMap-Request with an EID thatlongestbest matches anyEID-prefixEID-Prefix MUST be returned in a single Map-Reply message. For instance, if an ETR had database mapping entries forEID-prefixes:EID-Prefixes: 10.0.0.0/8 10.1.0.0/16 10.1.1.0/24 10.1.2.0/24 A Map-Request for EID 10.1.1.1 would cause a Map-Reply with a record count of 1 to be returned with a mapping recordEID-prefixEID-Prefix of 10.1.1.0/24. A Map-Request for EID10.1.5.5,10.1.5.5 would cause a Map-Reply with a record count of 3 to be returned with mapping records forEID-prefixesEID-Prefixes 10.1.0.0/16, 10.1.1.0/24, and 10.1.2.0/24. Note that not all overlappingEID-prefixesEID-Prefixes need to bereturned,returned but only themore specificsmore-specific entries (note that in the second example above 10.0.0.0/8 was not returned for requesting EID 10.1.5.5)entriesfor the matchingEID-prefixEID-Prefix of the requesting EID. When more than oneEID-prefixEID-Prefix is returned, all SHOULD use the sameTime-to-LiveTime to Live value so they can all time out at the same time. When amore specific EID-prefixmore-specific EID-Prefix is received later, itsTime-to-LiveTime to Live value in the Map-Reply record can be stored even when otherless specificsless-specific entries exist. When aless specific EID-prefixless-specific EID-Prefix is received later, its map-cache expiration time SHOULD be set to the minimum expiration time of anymore specific EID-prefixmore-specific EID-Prefix in the map-cache. This is done so the integrity of theEID-prefixEID-Prefix set is wholly maintained and so nomore-specificmore- specific entries are removed from the map-cache while keepingless-specificless- specific entries. Map-Replies SHOULD be sent for anEID-prefixEID-Prefix no more often than once per second to the same requesting router. For scalability, it is expected that aggregation of EID addresses intoEID-prefixesEID-Prefixes will allow one Map-Reply to satisfy a mapping for the EID addresses in the prefixrangerange, thereby reducing the number of Map-Request messages. Map-Reply records can have an emptylocator-set.Locator-Set. Anegative Map- ReplyNegative Map-Reply is a Map-Reply with an emptylocator-set.Locator-Set. Negative Map-Replies convey special actions by the sender to the ITR or PITRwhichthat have solicited the Map-Reply. There are two primary applications for Negative Map-Replies. The first is for a Map-Resolver to instruct an ITR or PITR when a destination is for a LISP site versus a non-LISPsite. Andsite, and the other is to source quench Map-Requestswhichthat are sent for non-allocated EIDs. For each Map-Reply record, the list oflocatorsLocators in alocator-setLocator-Set MUST appear in the same order for each ETR that originates a Map-Reply message. Thelocator-setLocator-Set MUST be sorted in order of ascending IP address where an IPv4 locator address is considered numerically 'less than' an IPv6 locator address. When sending a Map-Reply message, the destination address is copied fromtheone of theITR-RLOC'ITR-RLOC' fields from the Map-Request. The ETR can choose a locator address from one of the address families it supports. For Data-Probes, the destination address of the Map-Reply is copied from the source address of the Data-Probe messagewhichthat is invoking the reply. The source address of the Map-Reply is one of the local IP addresses chosen to allowuRPFUnicast Reverse Path Forwarding (uRPF) checks to succeed in the upstream service provider. The destination port of a Map-Reply message is copied from the source port of the Map-Request orData- ProbeData-Probe, and the source port of the Map-Reply message is set to the well-known UDP port 4342. 6.1.5.1. Traffic Redirection with Coarse EID-Prefixes When an ETR is misconfigured or compromised, it could return coarseEID-prefixesEID-Prefixes in Map-Reply messages it sends. TheEID-prefixEID-Prefix could coverEID-prefixes whichEID-Prefixes that are allocated to othersitessites, redirecting their traffic to thelocatorsLocators of the compromised site. To solve this problem, there are two basic solutions that could be used. The first is to have Map-Serversproxy-map-replyproxy Map-Reply on behalf of ETRs so their registeredEID-prefixesEID-Prefixes are the ones returned inMap- Replies.Map-Replies. Since the interaction between an ETR and Map-Server is secured withshared-keys,shared keys, it is easier for an ETR to detect misbehavior. The second solution is to have ITRs and PITRs cacheEID-prefixesEID-Prefixes withmask-lengthsmask lengths that are greater than or equal to a configured prefix length. This limits the damage to a specific width of anyEID-prefix advertised,EID-Prefix advertised but needs to be coordinated with the allocation of site prefixes. These solutions can be used independently or at the same time. At the time of this writing, other approaches are being considered and researched. 6.1.6. Map-Register Message Format The usage details of the Map-Register message can be found in specification[LISP-MS].[RFC6833]. This section solely defines the message format. The message is sent in UDP with a destination UDP port of 4342 and a randomly selected UDP source port number. The Map-Register message format is: 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ |Type=3 |P| Reserved |M| Record Count | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Nonce . . . | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | . . . Nonce | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Key ID | Authentication Data Length | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ ~ Authentication Data ~ +-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | Record TTL | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ R | Locator Count | EID mask-len | ACT |A| Reserved | e +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ c | Rsvd | Map-Version Number |EID-prefix-AFIEID-Prefix-AFI | o +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ r |EID-prefixEID-Prefix | d +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | /| Priority | Weight | M Priority | M Weight | | L +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | o | Unused Flags |L|p|R| Loc-AFI | | c +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | \| Locator | +-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ Packet field descriptions: Type: 3 (Map-Register) P: This is theproxy-map-reply bit, whenproxy Map-Reply bit. When set to11, an ETR sends aMap- RegisterMap-Register message requestingforthe Map-Server to proxy a Map-Reply. The Map-Server will send non-authoritative Map-Replies on behalf of the ETR. Details on this usage can be found in[LISP-MS].[RFC6833]. Reserved:ItThis field MUST be set to 0 on transmit and MUST be ignored on receipt. M: This is the want-map-notifybit, whenbit. When set to11, an ETR is requestingfora Map-Notify message to be returned in response to sending a Map-Register message. The Map-Notify message sent by a Map-Server is used toanacknowledge receipt of a Map-Register message. Record Count:TheThis is the number of records in this Map-Register message. A record is comprised of that portion of the packet labeled 'Record' above and occurs the number of times equal to Recordcount.Count. Nonce: This 8-octetNonce'Nonce' field is set to 0 in Map-Register messages. Since the Map-Register message is authenticated, thenonce'Nonce' field is not currently used for any security function but may be in the future as part of an anti-replay solution. Key ID:AThis is a configured ID to find the configured Message Authentication Code (MAC) algorithm and key value used for the authentication function. See Section 14.4 for codepoint assignments. Authentication Data Length:TheThis is the length in octets of theAuthentication Data'Authentication Data' field that follows this field. The length of theAuthentication Data'Authentication Data' field is dependent on theMessage Authentication Code (MAC)MAC algorithm used. The length field allows a device that doesn't know the MAC algorithm to correctly parse the packet. Authentication Data:TheThis is the message digest used from the output of theMessage Authentication Code (MAC)MAC algorithm. The entireMap- RegisterMap-Register payload is authenticated with this field preset to 0. After the MAC is computed, it is placed in this field. Implementations of this specification MUST include support for HMAC-SHA-1-96[RFC2404][RFC2404], and support for HMAC-SHA-256-128[RFC6234][RFC4868] is RECOMMENDED. The definition of the rest of the Map-Register can be found inthe Map-Reply section.Section 6.1.4. 6.1.7. Map-Notify Message Format The usage details of the Map-Notify message can be found in specification[LISP-MS].[RFC6833]. This section solely defines the message format. The message is sent inside a UDP packet with source and destination UDP ports equal to 4342. The Map-Notify message format is: 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ |Type=4 | Reserved | Record Count | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Nonce . . . | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | . . . Nonce | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Key ID | Authentication Data Length | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ ~ Authentication Data ~ +-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | Record TTL | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ R | Locator Count | EID mask-len | ACT |A| Reserved | e +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ c | Rsvd | Map-Version Number |EID-prefix-AFIEID-Prefix-AFI | o +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ r |EID-prefixEID-Prefix | d +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | /| Priority | Weight | M Priority | M Weight | | L +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | o | Unused Flags |L|p|R| Loc-AFI | | c +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | \| Locator | +-> +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ Packet field descriptions: Type: 4 (Map-Notify) The Map-Notify message has the same contents as a Map-Register message. See the Map-Register section for field descriptions. 6.1.8. Encapsulated Control Message Format An Encapsulated Control Message (ECM) is used to encapsulate control packets sent between xTRs and the mapping database system described in[LISP-MS].[RFC6833]. 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / | IPv4 or IPv6 Header | OH | (uses RLOC addresses) | \ | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / | Source Port = xxxx | Dest Port = 4342 | UDP +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ \ | UDP Length | UDP Checksum | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ LH |Type=8 |S| Reserved | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / | IPv4 or IPv6 Header | IH | (uses RLOC or EID addresses) | \ | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ / | Source Port = xxxx | Dest Port = yyyy | UDP +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ \ | UDP Length | UDP Checksum | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ LCM | LISP Control Message | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ Packet header descriptions: OH: The outer IPv4 or IPv6headerheader, which uses RLOC addresses in the source and destination header address fields. UDP: The outer UDP header with destination port 4342. The source port is randomly allocated. The checksum field MUST be non-zero. LH: Type 8 is defined to be a "LISP Encapsulated ControlMessage"Message", and what follows is either an IPv4 or IPv6 header as encoded by the first 4 bits after thereserved'Reserved' field. S: This is the Security bit. When set to11, the field following theReserved'Reserved' field will have the following format. The detailed format of the Authentication Data Content is for further study. 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | AD Type | Authentication Data Content . . . | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ IH: The inner IPv4 or IPv6headerheader, which can use either RLOC or EID addresses in the header address fields. When a Map-Request is encapsulated in this packetformatformat, the destination address in this header is an EID. UDP: The inner UDPheaderheader, where the port assignmentsdependsdepend on the control packet being encapsulated. When the control packet is a Map-Request or Map-Register, the source port isITR/PITRselected by the ITR/PITR and the destination port is 4342. When the control packet is a Map-Reply, the source port is 4342 and the destination port is assigned from the source port of the invoking Map-Request. Port number 4341 MUST NOT be assigned to either port. The checksum field MUST be non-zero. LCM: The format is one of the control message formats described in this section. At this time, only Map-Request messages are allowed to be encapsulated.And inIn the future, PIMJoin-PruneJoin/Prune messages[MLISP][RFC6831] might be allowed. Encapsulating other types of LISP control messagesareis for further study. When Map-Requests are sent forRLOC-probingRLOC-Probing purposes(i.e(i.e., the probe-bit is set), they MUST NOT be sent inside Encapsulated Control Messages. 6.2. Routing Locator Selection Both the client-side and server-side may need control over the selection of RLOCs for conversations between them. This control is achieved by manipulating thePriority'Priority' andWeight'Weight' fields in EID-to-RLOC Map-Reply messages. Alternatively, RLOC information MAY be gleaned from received tunneled packets or EID-to-RLOC Map-Request messages. The followingenumeratesare different scenarios for choosing RLOCs and the controls that are available: oServer-sideThe server-side returns one RLOC.Client-sideThe client-side can only use one RLOC.Server-sideThe server-side has complete control of the selection. oServer-sideThe server-side returns a list ofRLOCRLOCs where a subset of the list has the same bestpriority. ClientPriority. The client can only use the subset list according to the weighting assigned by the server-side. In this case, the server-side controls both the subset list andload- splittingload-splitting across its members. The client-side can use RLOCs outside of the subset list if it determines that the subset list is unreachable (unless RLOCs are set to a Priority of 255). Some sharing of control exists: the server-side determines the destination RLOC list and load distribution while the client-side has the option of using alternatives to this list if RLOCs in the list are unreachable. oServer-sideThe server-side setsweighta Weight of 0 for the RLOC subset list. In this case, the client-side can choose how the traffic load is spread across the subset list. Control is shared by theserver-sideserver- side determining the list and the client determining load distribution. Again, the client can use alternative RLOCs if the server-provided list of RLOCsareis unreachable. o Either side (more likelyonthe server-side ETR) decides not to send a Map-Request. For example, if the server-side ETR does not send Map-Requests, it gleans RLOCs from the client-side ITR, giving the client-side ITR responsibility for bidirectional RLOC reachability and preferability. Server-side ETR gleaning of the client-side ITR RLOC is done by caching theinner headerinner-header source EID and theouter headerouter-header source RLOC of received packets. The client-side ITR controls how traffic is returned and can alternate using anouterouter- header source RLOC, which then can be added to the list the server-side ETR uses to return traffic. Since no Priority or Weights are provided using this method, theserver- sideserver-side ETR MUST assume that each client-side ITR RLOC uses the same best Priority with a Weight of zero. In addition, sinceEID-prefixEID-Prefix encoding cannot be conveyed in data packets, the EID-to-RLOCcacheCache ontunnel routersTunnel Routers can grow to be very large. o A "gleaned"map-cacheMap-Cache entry, one learned from the source RLOC of a received encapsulated packet, is only stored and used for a few seconds, pending verification. Verification is performed by sending a Map-Request to the source EID (theinner headerinner-header IP source address) of the received encapsulated packet. A reply to this "verifying Map-Request" is used to fully populate themap- cacheMap-Cache entry for the "gleaned" EID and is stored and used for the time indicated from theTTL'TTL' field of a received Map-Reply. When a verifiedmap-cacheMap-Cache entry is stored, data gleaning no longer occurs for subsequent packetswhichthat have a source EID that matches theEID-prefixEID-Prefix of the verified entry. RLOCs that appear in EID-to-RLOC Map-Reply messages are assumed to be reachable when the R-bit for thelocatorLocator record is set to 1. When the R-bit is set to 0, an ITR or PITR MUST NOT encapsulate to the RLOC. Neither the information contained in a Map-Replyornor that stored in the mapping database system provides reachability information for RLOCs. Note that reachability is not part of the mapping system and is determined using one or more of the Routing LocatorReachability Algorithmsreachability algorithms described in the next section. 6.3. Routing Locator Reachability Several mechanisms for determining RLOC reachability are currently defined: 1. An ETR may examine theLocator Status BitsLocator-Status-Bits in the LISP header of an encapsulated data packet received from an ITR. If the ETR is also acting as an ITR and has traffic to return to the original ITR site, it can use this status information to help select an RLOC. 2. An ITR may receive an ICMP Network Unreachable orICMPHost Unreachable message for an RLOC it is using. This indicates that the RLOC is likely down.Note,Note that trusting ICMP messages may not bedesirabledesirable, but neither is ignoring them completely. Implementations are encouraged to follow current best practices in treating these conditions. 3. An ITRwhichthat participates in the global routing system can determine that an RLOC is down if no BGPRIBRouting Information Base (RIB) route exists that matches the RLOC IP address. 4. An ITR may receive an ICMP Port Unreachable message from a destination host. This occurs if an ITR attempts to use interworking[INTERWORK][RFC6832] and LISP-encapsulated data is sent to a non-LISP-capable site. 5. An ITR may receive a Map-Reply from an ETR in response to a previously sent Map-Request. The RLOC source of the Map-Reply is likelyupup, since the ETR was able to send the Map-Reply to the ITR. 6. When an ETR receives an encapsulated packet from an ITR, the source RLOC from the outer header of the packet is likely up. 7. An ITR/ETR pair can use the LocatorReachability Algorithmsreachability algorithms described in this section, namely Echo-Noncing or RLOC-Probing. When determining Locator up/down reachability by examining theLocator Status BitsLocator-Status-Bits from theLISP encapsulatedLISP-encapsulated data packet, an ETR will receiveup to dateup-to-date status from an encapsulating ITR about reachability for all ETRs at the site. CE-based ITRs at the source site can determine reachability relative to each other using the site IGP as follows: o Under normal circumstances, each ITR will advertise a default route into the site IGP. o If an ITR fails or if the upstream link to its PE fails, its default route will eithertime-outtime out or be withdrawn. Each ITR can thus observe the presence or lack of a default route originated by the others to determine theLocator Status BitsLocator-Status-Bits it sets for them. RLOCs listed in a Map-Reply are numbered with ordinals 0 to n-1. TheLocator Status BitsLocator-Status-Bits in aLISP encapsulatedLISP-encapsulated packet are numbered from 0 to n-1 starting with the least significant bit. For example, if an RLOC listed in the 3rd position of the Map-Reply goes down (ordinal value 2), then all ITRs at the site will clear the 3rd least significant bit (xxxx x0xx) of theLocator Status Bits'Locator-Status-Bits' field for the packets they encapsulate. When an ETR decapsulates a packet, it will check for any change in theLocator Status Bits'Locator-Status-Bits' field. When a bit goes from 1 to 0, theETRETR, if acting also as an ITR, will refrain from encapsulating packets to an RLOC that is indicated as down. It will only resume using that RLOC if the correspondingLocator Status BitLocator-Status-Bit returns to a value of 1.Locator Status BitsLocator-Status-Bits are associated with alocator-setLocator-Set perEID-prefix.EID-Prefix. Therefore, when alocatorLocator becomes unreachable, theLocator Status BitLocator-Status-Bit that corresponds to thatlocator'sLocator's position in the list returned by the last Map-Reply will be set to zero for that particularEID- prefix.EID-Prefix. When ITRs at the site are not deployed in CE routers, the IGP can still be used to determine the reachability ofLocatorsLocators, provided they are injected into the IGP. This is typically done when a /32 address is configured on a loopback interface. When ITRs receive ICMP Network Unreachable or Host Unreachable messages as a method to determine unreachability, they will refrain from using Locatorswhichthat are described in Locator lists of Map-Replies. However, using this approach is unreliable because many network operators turn off generation of ICMP Destination Unreachable messages. If an ITR does receive an ICMP Network Unreachable or Host Unreachable message, it MAY originate its own ICMP Destination Unreachable message destined for the host that originated the data packet the ITR encapsulated. Also, BGP-enabled ITRs can unilaterally examine the RIB to see if a locator address from alocator-setLocator-Set in a mapping entry matches a prefix. If it does not find one and BGP is running in theDefaultDefault- Free Zone (DFZ), it can decide to not use thelocatorLocator even though theLocator Status BitsLocator-Status-Bits indicate that thelocatorLocator is up. In this case, the path from the ITR to the ETR that is assigned thelocatorLocator is not available. More details are in [LOC-ID-ARCH]. Optionally, an ITR can send a Map-Request to aLocatorLocator, and if aMap- ReplyMap-Reply is returned, reachability of the Locator has been determined. Obviously, sending such probes increases the number of control messages originated bytunnel routersTunnel Routers for active flows, so Locators are assumed to be reachable when they are advertised. This assumption does create a dependency: Locator unreachability is detected by the receipt of ICMP Host Unreachable messages. Whenana Locator has been determined to be unreachable, it is not used for active traffic; this is the same as if it were listed in a Map-Reply withpriorityPriority 255. The ITR can test the reachability of the unreachable Locator by sending periodic Requests. Both Requests and Replies MUST be rate- limited. Locator reachability testing is never done with datapacketspackets, since that increases the risk of packet loss for end-to-end sessions. When an ETR decapsulates a packet, it knows that it is reachable from the encapsulating ITR because that is how the packet arrived. In most cases, the ETR can also reach the ITR but cannot assume this to betruetrue, due to the possibility of path asymmetry. In the presence of unidirectional traffic flow from an ITR to an ETR, the ITR SHOULD NOT use the lack of return traffic as an indication that the ETR is unreachable. Instead, it MUST use an alternatemechanismsmechanism to determine reachability. 6.3.1. Echo Nonce Algorithm When data flows bidirectionally betweenlocatorsLocators from different sites, a data-plane mechanism called "nonce echoing" can be used to determine reachability between an ITR and ETR. When an ITR wants to solicit a nonce echo, it sets theNN- andE bitsE-bits and places a 24-bit nonce [RFC4086] in the LISP header of the next encapsulated data packet. When this packet is received by the ETR, the encapsulated packet is forwarded as normal. When the ETR next sends a data packet to the ITR, it includes the nonce received earlier with theN bitN-bit set andE bitE-bit cleared. The ITR sees this "echoed nonce" and knows that the path to and from the ETR is up. The ITR will set the E-bit and N-bit for every packet it sends while in the echo-nonce-request state. The time the ITR waits to process the echoed nonce before it determines the path is unreachable is variable and is a choice left for the implementation. If the ITR is receiving packets from the ETR but does not see the nonce echoed while being in the echo-nonce-request state, then the path to the ETR is unreachable. This decision may be overridden by otherlocatorLocator reachability algorithms. Once the ITR determines that the path to the ETR isdowndown, it can switch to anotherlocatorLocator for thatEID-prefix.EID-Prefix. Note that "ITR" and "ETR" are relative terms here. Both devices MUST be implementing both ITR and ETR functionality for the echo nonce mechanism to operate. The ITR and ETR may both go into the echo-nonce-request state at the same time. The number of packets sent or the time during which echo nonce requests are sent is animplementation specificimplementation-specific setting. However, when an ITR is in the echo-nonce-request state, it can echo the ETR's nonce in the next set of packets that it encapsulates andthen subsequently,subsequently continue sending echo-nonce-request packets. This mechanism does not completely solve the forward path reachabilityproblemproblem, as traffic may be unidirectional. That is, the ETR receiving traffic at a site may not be the same device as an ITRwhichthat transmits traffic from thatsitesite, or thesite to sitesite-to-site traffic is unidirectional so there is no ITR returning traffic. The echo-nonce algorithm is bilateral. That is, if one side sets the E-bit and the other side is not enabled for echo-noncing, then the echoing of the nonce does not occur and the requesting side mayregarderroneously consider thelocator unreachable erroneously.Locator unreachable. An ITR SHOULD only set the E-bit inaan encapsulated data packet when it knows the ETR is enabled for echo-noncing. This is conveyed by the E-bit in theMap- ReplyMap-Reply message. Note that otherlocatorLocator reachability mechanisms are being researched and can be used to compliment or even override theEcho Nonce Algorithm.echo nonce algorithm. See the next section for an example of control-plane probing. 6.3.2.RLOC ProbingRLOC-Probing AlgorithmRLOC ProbingRLOC-Probing is a method that an ITR or PITR can use to determine the reachability status of one or morelocatorsLocators that it has cached in amap-cacheMap-Cache entry. The probe-bit of the Map-Request and Map-Reply messagesareis used forRLOC Probing. RLOC probingRLOC-Probing. RLOC-Probing is done in thecontrol-planecontrol plane on a timerbasisbasis, where an ITR or PITR will originate a Map-Request destined to a locator address from one of its own locator addresses. A Map-Request used as an RLOC-probe is NOT encapsulated and NOT sent to a Map-Server oronto theALT likemapping database system as one would when soliciting mapping data. The EID record encoded in the Map-Request is theEID-prefixEID-Prefix of themap-cacheMap-Cache entry cached by the ITR or PITR. The ITR may include a mapping data record for its own database mapping informationwhichthat contains the localEID-prefixesEID-Prefixes and RLOCs for its site. RLOC-probes are sent periodically using a jittered timer interval. When an ETR receives a Map-Request message with the probe-bit set, it returns a Map-Reply with the probe-bit set. The source address of the Map-Reply is set according to the procedure described in Section 6.1.5. The Map-Reply SHOULD contain mapping data for theEID-prefixEID-Prefix contained in the Map-Request. This provides the opportunity for the ITR orPITR, whichPITR that sent the RLOC-probe to get mapping updates if there were changes to the ETR's database mapping entries. There are advantages and disadvantages ofRLOC Probing.RLOC-Probing. The greatest benefit ofRLOC ProbingRLOC-Probing is that it can handle many failure scenarios allowing the ITR to determine when the path to a specificlocatorLocator is reachable or has become unreachable, thus providing a robust mechanism for switching to using anotherlocatorLocator from the cachedlocator. RLOC ProbingLocator. RLOC-Probing can also provide roughRTTRound-Trip Time (RTT) estimates between a pair oflocatorsLocators, which can be useful for network management purposes as well as for selecting low delay paths. The major disadvantage ofRLOC ProbingRLOC-Probing is in the number of control messages required and the amount of bandwidth used to obtain those benefits, especially if the requirement for failure detection timesareis very small. Continued research and testing will attempt to characterize the tradeoffs of failure detection times versus message overhead. 6.4. EID Reachability within a LISP Site A site may be multihomed using two or more ETRs. The hosts and infrastructure within a site will be addressed using one or moreEID prefixesEID-Prefixes that are mapped to the RLOCs of the relevant ETRs in the mapping system. One possible failure mode is for an ETR to lose reachability to one or more of theEID prefixesEID-Prefixes within its own site. When this occurs when the ETR sends Map-Replies, it can clear the R-bit associated with its ownlocator.Locator. And when the ETR is also an ITR, it can clear itslocator-status-bitLocator-Status-Bit in the encapsulation data header. It is recognized that there are no simple solutions to the site partitioning problem because it is hard to know which part of theEID-prefixEID-Prefix range ispartitioned. Andpartitioned and whichlocatorsLocators can reach any sub-ranges of theEID-prefixes.EID-Prefixes. This problem is under investigation with the expectation that experiments will tell us more.Note,Note that this is not a new problem introduced by the LISP architecture. The problem exists today when amulti-homedmultihomed site uses BGP to advertise its reachability upstream. 6.5. Routing Locator Hashing When an ETR provides an EID-to-RLOC mapping in a Map-Reply message to a requesting ITR, thelocator-setLocator-Set for theEID-prefixEID-Prefix may contain differentpriorityPriority values for each locator address. When more than one bestpriority locatorPriority Locator exists, the ITR can decide how toloadload- share traffic against the correspondinglocators.Locators. The following hash algorithm may be used by an ITR to select alocatorLocator for a packet destined to an EID for the EID-to-RLOC mapping: 1. Either a source and destination address hashcan be usedor the traditional 5-tuple hashwhichcan be used. The traditional 5-tuple hash includes the source and destinationaddresses,addresses; source and destination TCP, UDP, orSCTPStream Control Transmission Protocol (SCTP) portnumbersnumbers; and the IP protocol number field or IPv6next- protocolnext-protocol fields of a packet that a host originates from within a LISP site. When a packet is not a TCP, UDP, or SCTP packet, the source and destination addresses only from the header are used to compute the hash. 2. Take the hash value and divide it by the number oflocatorsLocators stored in thelocator-setLocator-Set for the EID-to-RLOC mapping. 3. The remainder will yield a value of 0 to "number oflocatorsLocators minus 1". Use the remainder to select thelocatorLocator in thelocator-set.Locator-Set. Note that when a packet is LISP encapsulated, the source port number in the outer UDP header needs to be set. Selecting a hashed value allows core routerswhichthat are attached to Link Aggregation Groups (LAGs) to load-split the encapsulated packets across member links of such LAGs. Otherwise, core routers would see a single flow, since packets have a source address of the ITR, for packetswhichthat are originated by different EIDs at the source site. A suggested setting for the source port number computed by an ITR is a 5-tuple hash function on the inner header, as described above. Many core router implementations use a 5-tuple hash to decide how to balance packet load across members of a LAG. The 5-tuple hash includes the source and destination addresses of the packet and the source and destination ports when the protocol number in the packet is TCP or UDP. For this reason, UDP encoding is used for LISP encapsulation. 6.6. Changing the Contents of EID-to-RLOC Mappings Since the LISP architecture uses a caching scheme to retrieve and store EID-to-RLOC mappings, the only way an ITR can get a more up-to- date mapping is to re-request the mapping. However, the ITRs do not know when the mappingschangechange, and the ETRs do not keep track of which ITRs requested its mappings. For scalability reasons, we want to maintain this approach but need to provide a way for ETRs to change their mappings and inform the sites that are currently communicating with the ETR site using such mappings. When adding a newlocatorLocator record in lexicographic order to the end of alocator-set,Locator-Set, it is easy to update mappings. We assume that new mappings will maintain the samelocatorLocator ordering as the old mapping but will just have newlocatorsLocators appended to the end of the list.SoSo, some ITRs can have a new mapping while other ITRs have only an old mapping that is used until they time out. When an ITR has only an old mapping but detects bits set in theloc-status-bitsLocator-Status-Bits that correspond tolocatorsLocators beyond the list it has cached, it simply ignores them. However, this can only happen for locator addresses that are lexicographically greater than the locator addresses in the existinglocator-set.Locator-Set. When alocatorLocator record is inserted in the middle of alocator-set,Locator-Set, to maintain lexicographic order, the SMR procedure in Section 6.6.2 is used to inform ITRs and PITRs of the newlocator-status-bitLocator-Status-Bit mappings. When alocatorLocator record is removed from alocator-set,Locator-Set, ITRs that have the mapping cached will not use the removedlocatorLocator because the xTRs will set theloc-status-bitLocator-Status-Bit to 0.SoSo, even if thelocatorLocator is in the list, it will not be used. For new mapping requests, the xTRs can set thelocatorLocator AFI to 0 (indicating an unspecified address), as well as setting the correspondingloc-status-bitLocator-Status-Bit to 0. This forces ITRs with old or new mappings to avoid using the removedlocator.Locator. If many changes occur to a mapping over a long period of time, one will find empty record slots in the middle of thelocator-setLocator-Set and new records appended to thelocator-set.Locator-Set. At some point, it would be useful to compact thelocator-setLocator-Set so theloc-status-bitLocator-Status-Bit settings can be efficiently packed. We propose here three approaches forlocator-set compaction,Locator-Set compaction: one operational mechanism and two protocol mechanisms. The operational approach uses a clock sweep method. The protocol approaches use the concept of Solicit-Map-Requests and Map-Versioning. 6.6.1. Clock Sweep The clock sweep approach uses planning in advance and the use of count-down TTLs to time out mappings that have already been cached. The default setting for an EID-to-RLOC mapping TTL is 24 hours.SoSo, there is a24 hour24-hour window to time out old mappings. The following clock sweep procedure is used: 1. 24 hours before a mapping change is to take effect, a network administrator configures the ETRs at a site to start the clock sweep window. 2. During the clock sweep window, ETRs continue to send Map-Reply messages with the current (unchanged) mapping records. The TTL for these mappings is set to 1 hour. 3. 24 hours later, all previous cache entries will have timed out, and any active cache entries will time out within 1 hour. During this1 hour window1-hour window, the ETRs continue to send Map-Reply messages with the current (unchanged) mapping records with the TTL set to 1 minute. 4. At the end of the1 hour1-hour window, the ETRs will send Map-Reply messages with the new (changed) mapping records.SoSo, any active caches can get the new mapping contents right away if not cached, or in 1 minute if they had the mapping cached. The new mappings are cached with atime to liveTTL equal to the TTL in the Map-Reply. 6.6.2. Solicit-Map-Request (SMR) Soliciting a Map-Request is a selective way for ETRs, at the site where mappings change, to control the rate they receive requests for Map-Reply messages. SMRs are also used to tell remote ITRs to update the mappings they have cached. Since the ETRs don't keep track of remote ITRs that have cached their mappings, they do not know which ITRs need to have their mappings updated. As a result, an ETR will solicit Map-Requests (called an SMR message) from those sites to which it has been sending encapsulated datatofor the last minute. In particular, an ETR will send an SMR to an ITR to which it has recently sent encapsulated data. An SMR message is simply a bit set in a Map-Request message. An ITR or PITR will send a Map-Request when they receive an SMR message. Both the SMR sender and the Map-Request responder MUSTrate-limitedrate-limit these messages. Rate-limiting can be implemented as a global rate- limiter or one rate-limiter per SMR destination. The following procedure shows howaan SMR exchange occurs when a site is doinglocator-setLocator-Set compaction for an EID-to-RLOC mapping: 1. When the database mappings in an ETR change, the ETRs at the site begin to send Map-Requests with the SMR bit set for eachlocatorLocator in eachmap-cacheMap-Cache entry the ETR caches. 2. A remote ITRwhichthat receives the SMR message will schedule sending a Map-Request message to the source locator address of the SMR message or to the mapping database system. A newly allocated random nonce isselectedselected, and theEID-prefixEID-Prefix used is the one copied from the SMR message. If the sourcelocatorLocator is the onlylocatorLocator in the cachedlocator-set,Locator-Set, the remote ITR SHOULD send a Map-Request to the database mapping system just in case the singlelocatorLocator has changed and may no longer be reachable to accept the Map-Request. 3. The remote ITR MUST rate-limit the Map-Request until it gets a Map-Reply while continuing to use the cached mapping. WhenMap Versioning is used,Map-Versioning as described in Section6.6.3,6.6.3 is used, an SMR sender can detect if an ITR is using the mostup to dateup-to-date database mapping. 4. The ETRs at the site with the changed mapping will reply to the Map-Request with a Map-Reply message that has a nonce from the SMR-invoked Map-Request. The Map-Reply messages SHOULD beraterate- limited. This is important to avoid Map-Reply implosion. 5. TheETRs,ETRs at the site with the changedmapping,mapping record the fact that the site that sent the Map-Request has received the new mapping data in themapping cacheMap-Cache entry for the remote site so theloc-status-bitsLocator-Status-Bits are reflective of the new mapping for packets going to the remote site. The ETR then stops sending SMR messages. Experimentation is in progress to determine the appropriate rate- limit parameters. For securityreasonsreasons, an ITR MUST NOT process unsolicited Map-Replies. To avoidmap-cacheMap-Cache entry corruption by athird-party,third party, a sender of an SMR-based Map-Request MUST be verified. If an ITR receives anSMR- basedSMR-based Map-Request and the source is not in thelocator-setLocator-Set for the storedmap-cacheMap-Cache entry, then the responding Map-Request MUST be sent with an EID destination to the mapping database system. Since the mapping database system is a more secure way to reach an authoritative ETR, it will deliver the Map-Request to the authoritative source of the mapping data. When an ITR receives an SMR-based Map-Request for which it does not have a cached mapping for the EID in the SMR message, it MAY not sendaan SMR-invoked Map-Request. This scenario can occur when an ETR sends SMR messages to alllocatorsLocators in thelocator-setLocator-Set it has stored in its map-cache but the remote ITRs that receive the SMR may not be sending packets to the site. There is no point in updating the ITRs until they need to send, in whichcase,case they will send Map-Requests to obtain amap-cacheMap-Cache entry. 6.6.3. DatabaseMap VersioningMap-Versioning When there is unidirectional packet flow between an ITR and ETR, and the EID-to-RLOC mappings change on the ETR, it needs to inform the ITR so encapsulationcan stopto a removedlocatorLocator can stop andstartcan instead be started to a newlocatorLocator in thelocator-set.Locator-Set. An ETR, when it sends Map-Reply messages, conveys its own Map-Versionnumber.Number. This is known as the Destination Map-Version Number. ITRs include the Destination Map-Version Number in packets they encapsulate to the site. When an ETR decapsulates a packet and detects that the Destination Map-Version Number is less than the current version for its mapping, the SMR procedure described in Section 6.6.2 occurs. An ITR, when it encapsulates packets to ETRs, can convey its ownMap- Version number.Map-Version Number. This is known as the Source Map-Version Number. When an ETR decapsulates a packet and detects that the Source Map-Version Number is greater than the last Map-Version Number sent in aMap- ReplyMap-Reply from the ITR's site, the ETR will send a Map-Request to one of the ETRs for the source site. A Map-Version Number is used as a sequence number perEID-prefix. SoEID-Prefix, so values that aregreater,greater are considered to be more recent. A value of 0 for the Source Map-Version Number or the Destination Map-Version Number conveys no versioninginformationinformation, and an ITR does no comparison with previously received Map-Version Numbers. A Map-Version Number can be included in Map-Register messages as well. This is a good way for the Map-Servercanto assure that all ETRs for a site registering to it will be synchronized according to Map-Versionnumber synchronized.Number. See[VERSIONING][RFC6834] for a more detailed analysis and description of DatabaseMap Versioning.Map-Versioning. 7. Router Performance Considerations LISP is designed to be veryhardware-based"hardware-based forwardingfriendly.friendly". A few implementation techniques can be used to incrementally implement LISP: o When atunnel encapsulatedtunnel-encapsulated packet is received by an ETR, the outer destination address may not be the address of the router. This makes it challenging for the control plane to get packets from the hardware. This may be mitigated by creating specialFIBForwarding Information Base (FIB) entries for theEID-prefixesEID-Prefixes of EIDs served by the ETR (those for which the router provides an RLOC translation). These FIB entries are marked with a flag indicating thatcontrol planecontrol-plane processing should be performed. The forwarding logic of testing for particular IP protocol numbervaluevalues is not necessary. There are a few proven cases where no changes to existing deployed hardware were needed to support the LISPdata-plane.data- plane. o On an ITR, prepending a new IP header consists of adding more octets to a MAC rewrite string and prepending the string as part of the outgoing encapsulation procedure. Routers that supportGREGeneric Routing Encapsulation (GRE) tunneling [RFC2784] or 6to4 tunneling [RFC3056] may already support this action. o A packet's source address or interface the packet was received on can be used to selectaVRF (Virtual Routing/Forwarding). The VRF's routing table can be used to find EID-to-RLOC mappings. For performance issues related to map-cache management, seesectionSection 12. 8. Deployment Scenarios This section will explore how and where ITRs and ETRs can be deployed and will discuss the pros and cons of each deployment scenario. For a more detailed deployment recommendation, refer to [LISP-DEPLOY]. There are two basic deploymenttrade-offstradeoffs to consider: centralized versus distributedcachescaches; and flat,recursive,Recursive, orre-encapsulating tunneling.Re-encapsulating Tunneling. When deciding on centralized versus distributed caching, the following issues should be considered: o Are thetunnel routersTunnel Routers spread out so that the caches are spread across all the memories of each router? A centralized cache is when an ITR keeps a cache for all the EIDs it is encapsulating to. The packet takes a direct path to the destinationlocator.Locator. A distributed cache is when an ITR needs help from otherre- encapsulatingre-encapsulating routers because it does not store all the cache entries for the EIDsisit is encapsulating to.SoSo, the packet takes a path through re-encapsulating routers that have a different set of cache entries. o Should management "touch points" be minimized by only choosing a fewtunnel routers,Tunnel Routers, just enough for redundancy? o In general, using more ITRs doesn't increase management load, since caches are built and stored dynamically. On the other hand, using more ETRs does require moremanagementmanagement, sinceEID-prefix-to-RLOCEID-Prefix-to- RLOC mappings need to be explicitly configured. When deciding on flat,recursive,Recursive, orre-encapsulation tunneling,Re-encapsulating Tunneling, the following issues should be considered: o Flat tunneling implements a single tunnel between the source site and destination site. This generally offers better paths between sources and destinations with a single tunnel path. o RecursivetunnelingTunneling is when tunneled traffic is again further encapsulated in another tunnel, either to implement VPNs or to perform Traffic Engineering. When doing VPN-based tunneling, the site has somecontrolcontrol, since the site is prepending a new tunnel header. In the case of TE-based tunneling, the site may have control if it is prepending a new tunnel header, but if the site's ISP is doing the TE, then the site has no control. RecursivetunnelingTunneling generally will result in suboptimal paths butatwith the benefit of steering traffic toresource availableparts of thenetwork.network that have more resources available. o The technique of re-encapsulation ensures that packets only require one tunnel header.SoSo, if a packet needs to bererouted,re-routed, it is first decapsulated by the ETR and then re-encapsulated with a new tunnel header using a new RLOC. The next sub-sections willsurveyexamine wheretunnel routersTunnel Routers can reside in the network. 8.1.First-hop/Last-hopFirst-Hop/Last-Hop Tunnel Routers By locatingtunnel routersTunnel Routers close to hosts, theEID-prefixEID-Prefix set is at the granularity of an IP subnet.SoSo, at the expense of moreEID- prefix-to-RLOCEID-Prefix-to-RLOC sets for the site, the caches in eachtunnel routerTunnel Router can remain relatively small. But caches always depend on the number of non-aggregated EID destination flows active through thesetunnel routers.Tunnel Routers. With moretunnel routersTunnel Routers doing encapsulation, the increase in control traffic grows as well: since theEID-granularityEID granularity is greater, more Map-Requests and Map-Replies are traveling between more routers. The advantage of placing the caches and databases at these stub routers is that the products deployed in this part of the network have better price-memory ratiosthenthan their core router counterparts. Memory is typically less expensive in thesedevicesdevices, and fewer routes are stored (only IGP routes). These devices tend to have excess capacity, both for forwarding and routingstate.states. LISP functionality can also be deployed in edge switches. These devices generally have layer-2 ports facing hosts and layer-3 ports facing the Internet. Spare capacity is also often available in thesedevices as well.devices. 8.2. Border/Edge Tunnel Routers Usingcustomer-edgeCustomer Edge (CE) routers for tunnel endpoints allows the EID space associated with a site to be reachable via a small set of RLOCs assigned to the CE routers for that site. This is the default behavior envisioned in the rest of this specification. This offers the opposite benefit of the first-hop/last-hoptunnel routerTunnel Router scenario: the number of mapping entries and network management touch pointsareis reduced, allowing better scaling. One disadvantage is thatless of the network'sfewer network resources are used to reach hostendpointsendpoints, thereby centralizing the point-of-failure domain and creating network choke points at the CE router. Note that more than one CE router at a site can be configured with the same IP address. In thiscasecase, an RLOC is an anycast address. This allows resilience between the CE routers. That is, if a CE router fails, traffic is automatically routed to the other routers using the same anycast address. However, this comes with the disadvantage where the site cannot control the entrance point when the anycast route is advertised out from all border routers. Another disadvantage of using anycastlocatorsLocators is the limited advertisement scope of /32 (or /128 for IPv6) routes. 8.3. ISPProvider-EdgeProvider Edge (PE) Tunnel RoutersUseThe use of ISP PE routers as tunnel endpoint routers is not the typical deployment scenario envisioned inthethis specification. This section attempts to capture some of the reasoning behind this preferenceoffor implementing LISP on CE routers.UseThe use of ISP PE routers as tunnel endpoint routers gives an ISP, rather than a site, control over the location of the egress tunnel endpoints. That is, the ISP can decideifwhether the tunnel endpoints are in the destination site (in either CE routers or last-hop routers within a site) or at other PE edges. The advantage of this case is that two tunnel headers can be avoided. By having the PE be the first router on the path to encapsulate, it can choose a TE path first, and the ETR can decapsulate and re-encapsulate for a tunnel to the destination end site. An obvious disadvantage is that the end site has no control over where its packets flow or over the RLOCs used. Other disadvantages includethedifficulty in synchronizing path liveness updates between CE and PE routers. As mentioned in earliersectionssections, a combination of these scenarios is possible at the expense of extra packet headeroverhead,overhead; if both site and provider want control, thenrecursiveRecursive orre-encapsulating tunnelsRe-encapsulating Tunnels are used. 8.4. LISP Functionality with Conventional NATs LISP routers can be deployed behind Network Address Translator (NAT) devices to provide the same set of packet services hosts have today when they are addressed out of private address space. It is important to note that a locator address in any LISP control message MUST be a globally routable address and therefore SHOULD NOT contain [RFC1918] addresses. If a LISP router is configured with private addresses, they MUST be used only in the outer IP header so the NAT device can translate properly. Otherwise, EID addresses MUST be translated before encapsulation is performed. Both NAT translation and LISP encapsulation functions could be co-located in the same device. More details on LISP address translation can be found in[INTERWORK].[RFC6832]. 8.5. Packets Egressing a LISP Site When a LISP site is using two ITRs for redundancy, the failure of one ITR will likely shift outbound traffic to the second. This second ITR's cache may notnotbe populated with the same EID-to-RLOC mapping entries as the first. If this second ITR does not have these mappings, traffic will be dropped while the mappings are retrieved from the mapping system. The retrieval of these messages may increase the load of requests being sent into the mapping system. Deployment and experimentation will determine whether this issue requires more attention. 9. Traceroute Considerations When a source host in a LISP site initiates a traceroute to a destination host in another LISP site, it is highly desirable for it to see the entire path. Since packets are encapsulated from the ITR to the ETR, the hop across the tunnel could be viewed as a single hop. However, LISP traceroute will provide the entire path so the user can see 3 distinct segments of the path from a source LISP host to a destination LISP host: Segment 1 (in source LISP site based on EIDs):source-hostsource host --->first-hopfirst hop ...next-hopnext hop ---> ITR Segment 2 (in the core network based on RLOCs): ITR --->next-hopnext hop ...next-hopnext hop ---> ETR Segment 3 (in the destination LISP site based on EIDs): ETR --->next-hopnext hop ...last-hoplast hop --->destination-hostdestination host For segment 1 of the path, ICMP Time Exceeded messages are returned in the normal manner as they are today. The ITR performs a TTL decrement andtesttests for 0 before encapsulating.SoTherefore, theITRITR's hop is seen by the traceroute sourcehasas having an EID address (the address of the site-facing interface). For segment 2 of the path, ICMP Time Exceeded messages are returned to the ITR because the TTL decrement to 0 is done on the outer header, so thedestinationdestinations of the ICMP messages aretothe ITR RLOCaddress,address and the source RLOC address of the encapsulated traceroute packet. The ITR looks inside of the ICMP payload to inspect the traceroute source so it can return the ICMP message to the address of the traceroute clientas well as retainingand also retain the core router IP address in the ICMP message. This is so the traceroute client can display the core router address (the RLOC address) in the traceroute output. The ETR returns its RLOC address and responds to the TTL decrement to0 like0, as the previous core routers did. For segment 3, the next-hop router downstream from the ETR will be decrementing the TTL for the packet that was encapsulated, sent into the core, decapsulated by the ETR, and forwarded because it isn't the final destination. If the TTL is decremented to 0, any router on the path to the destination of the traceroute, including the next-hop router or destination, will send an ICMP Time Exceeded message to the source EID of the traceroute client. The ICMP message will be encapsulated by the local ITR and sent back to the ETR in the originated traceroute source site, where the packet will be delivered to the host. 9.1. IPv6 Traceroute IPv6 traceroute follows the procedure describedaboveabove, since the entire traceroute data packet is included in the ICMP Time Exceeded message payload. Therefore, only the ITR needs to pay special attentionforto forwarding ICMP messages back to the traceroute source. 9.2. IPv4 Traceroute For IPv4 traceroute, we cannot follow the aboveprocedureprocedure, since IPv4 ICMP Time Exceeded messages only include the invoking IP header and 8 octets that follow the IP header. Therefore, when a core router sends an IPv4 Time Exceeded message to an ITR, all the ITR has in the ICMP payload is the encapsulated header itprependedprepended, followed by a UDP header. The original invoking IP header, and therefore the identity of the traceroutesourcesource, is lost. The solution we propose to solve this problem is to cache traceroute IPv4 headers in the ITR and to match them up with corresponding IPv4 Time Exceeded messages received from core routers and the ETR. The ITR will use a circular buffer for caching the IPv4 and UDP headers of traceroute packets. It will select a 16-bit number as a key to find them later when the IPv4 Time Exceeded messages are received. When an ITR encapsulates an IPv4 traceroute packet, it will use the 16-bit number as the UDP source port in the encapsulating header. When the ICMP Time Exceeded message is returned to the ITR, the UDP header of the encapsulating header is present in the ICMPpayloadpayload, thereby allowing the ITR to find the cached headers for the traceroute source. The ITR puts the cached headers in the payload and sends the ICMP Time Exceeded message to the traceroute source retaining the source address of the original ICMP Time Exceeded message (a core router or the ETR of the site of the traceroute destination). The signature of a traceroute packet comes in two forms. The first form is encoded as a UDP message where the destination port is inspected for a range of values. The second form is encoded as an ICMP message where the IP identification field is inspected for a well-known value. 9.3. TracerouteusingUsing Mixed Locators When either an IPv4 traceroute or IPv6 traceroute is originated and the ITR encapsulates it in the other address family header,youone cannot get all 3 segments of the traceroute. Segment 2 of the traceroutecan notcannot be conveyed to the traceroutesourcesource, since it is expecting addresses from intermediate hops in the same address format for the type of traceroute it originated. Therefore, in this case, segment 2 will make the tunnel look like one hop. All the ITR has to do to make this work is to not copy the inner TTL to the outer, encapsulating header's TTL when a traceroute packet is encapsulated using an RLOC from a different address family. This will cause no TTL decrement to 0 to occur in core routers between the ITR and ETR. 10. Mobility Considerations There are several kinds ofmobilitymobility, of which only some might be of concern to LISP.EssentiallyEssentially, they are as follows. 10.1. Site Mobility A site wishes to change its attachment points to the Internet, and its LISP Tunnel Routers will have new RLOCs when it changes upstream providers. Changes inEID-RLOCEID-to-RLOC mappings for sites are expected to be handled by configuration, outside ofthe LISP protocol.LISP. 10.2. Slow Endpoint Mobility An individual endpoint wishes tomove,move but is not concerned about maintaining session continuity. Renumbering is involved. LISP can help with the issues surrounding renumbering [RFC4192] [LISA96] by decoupling the address space used by a site from the address spaces used by itsISPs. [RFC4984]ISPs [RFC4984]. 10.3. Fast Endpoint Mobility Fast endpoint mobility occurs when an endpoint moves relatively rapidly, changing itsIP layerIP-layer network attachment point. Maintenance of session continuity is a goal. This is where the Mobile IPv4 [RFC5944] and Mobile IPv6 [RFC6275] [RFC4866] mechanisms areused,used and primarily where interactions with LISP need to be explored. The problem is that as an endpoint moves, it may require changes to the mapping between its EID and a set of RLOCs for its new network location. When this is added to the overhead ofmobileMobile IP binding updates, some packets might be delayed or dropped. In IPv4 mobility, when an endpoint is away from home, packets to it are encapsulated and forwarded via a home agentwhichthat resides in the home area the endpoint's address belongs to. The home agent will encapsulate and forward packets either directly to the endpoint or to a foreign agentwhichthat resides where the endpoint has moved to. Packets from the endpoint may be sent directly to the correspondent node, may be sent via the foreign agent, or may be reverse-tunneled back to the home agent for delivery to the mobile node. As the mobile node's EID or available RLOC changes, LISP EID-to-RLOC mappings are required for communication between the mobile node and the home agent, whether via the foreign agent or not. As a mobile endpoint changes networks, up to three LISP mapping changes may be required: o The mobile node moves from an old location to a new visited network location and notifies its home agent that it has done so. The Mobile IPv4 control packets the mobile node sends pass through one of the new visited network's ITRs, which needs anEID-RLOCEID-to-RLOC mapping for the home agent. o The home agent might not have theEID-RLOCEID-to-RLOC mappings for the mobile node's "care-of" address or its foreign agent in the new visited network, in which case it will need to acquire them. o When packets are sent directly to the correspondent node, it may be that no traffic has been sent from the new visited network to the correspondent node's network, and the new visited network's ITR will need to obtain anEID-RLOCEID-to-RLOC mapping for the correspondent node's site. In addition, if the IPv4 endpoint is sending packets from the new visited network using its original EID, then LISP will need to perform a route-returnability check on the newEID-RLOCEID-to-RLOC mapping for that EID. In IPv6 mobility, packets can flow directly between the mobile node and the correspondent node in either direction. The mobile node uses its "care-of" address (EID). In this case, the route-returnability check would not be needed but one more LISP mapping lookup may be required instead: o As above, three mapping changes may be needed for the mobile node to communicate with its home agent and to send packets to the correspondent node. o In addition, another mapping will be needed in the correspondent node's ITR, in order for the correspondent node to send packets to the mobile node's "care-of" address (EID) at the new network location. When both endpoints aremobilemobile, the number of potential mapping lookups increases accordingly. As a mobile nodemovesmoves, there are not only mobility state changes in the mobile node, correspondent node, and home agent, but also state changes in the ITRs and ETRs for at least someEID-prefixes.EID-Prefixes. The goal is to support rapid adaptation, with little delay or packet loss for the entire system.AlsoAlso, IP mobility can be modified to require fewer mapping changes. In order to increase overall system performance, there may be a need to reduce the optimization of one area in order to place fewer demands on another. In LISP, one possibility is to "glean" information. When a packet arrives, the ETR could examine theEID-RLOCEID-to-RLOC mapping and use that mapping for all outgoing traffic to that EID. It can do this after performing a route-returnability check, to ensure that the new network location does haveaan internal route to that endpoint. However, this does not cover the case where an ITR (the node assigned the RLOC) at the mobile-node location has been compromised. Mobile IP packet exchange is designed for an environment in which all routing information is disseminated before packets can be forwarded. In order to allow the Internet to grow to support expected future use, we are moving to an environment where some information may have to be obtained after packets are in flight. Modifications to IP mobility should be considered in order to optimize the behavior of the overall system. Anythingwhichthat decreases the number of newEID- RLOCEID-to-RLOC mappings needed when a node moves, or maintains the validity of anEID-RLOCEID-to-RLOC mapping for a longer time, is useful. 10.4. Fast Network Mobility In addition to endpoints, a network can be mobile, possibly changing xTRs. A "network" can be as small as a single router and as large as a whole site. This is different from site mobility in that it is fast and possibly short-lived, but different from endpoint mobility in that a whole prefix is changing RLOCs. However, the mechanisms are thesamesame, and there is no new overhead in LISP. A map request for any endpoint will return a binding for the entire mobile prefix. If mobile networks become a more common occurrence, it may be useful to revisit the design of the mapping service and allow for dynamic updates of the database. The issue of interactions between mobility and LISP needs to be explored further. Specific improvements to the entire system will depend on the details of mapping mechanisms. Mapping mechanisms should be evaluated on how well they support session continuity for mobile nodes. 10.5. LISP Mobile Node Mobility A mobile device can use the LISP infrastructure to achieve mobility by implementing the LISP encapsulation and decapsulation functions and acting as a simple ITR/ETR. By doing this, such a "LISP mobile node" can usetopologically-independenttopologically independent EID IP addresses that are not advertised into and do not impose a cost on the global routing system. These EIDs are maintained at the edges of the mapping system (in LISP Map-Servers and Map-Resolvers) and are provided on demand to only the correspondents of the LISP mobile node. Refer tothe LISP Mobility Architecture specification[LISP-MN] for more details. 11. Multicast Considerations A multicast group address, as defined in the original Internetarchitecturearchitecture, is an identifier of a grouping of topologically independent receiver host locations. The address encoding itself does not determine the location of the receiver(s). The multicast routing protocol, and the network-based state the protocol creates,determinesdetermine where the receivers are located. In the context of LISP, a multicast group address is both an EID and a Routing Locator. Therefore, no specific semantic or action needs to be taken for a destination address, as it would appear in an IP header. Therefore, a group address that appears in an inner IP header built by a source host will be used as the destination EID. The outer IP header (the destination Routing Locator address), prepended by a LISP router, will use the same group address as the destination Routing Locator. Having said that, only the source EID and source Routing Locatorneedsneed to be dealt with. Therefore, an ITR merely needs to put its own IP address in the sourceRouting Locator'Routing Locator' field when prepending the outer IP header. This source Routing Locator address, like any other Routing Locatoraddressaddress, MUST be globally routable. Therefore, an EID-to-RLOC mapping does not need to be performed by an ITR when a received data packet is a multicast data packet or when processing a source-specific Join (either by IGMPv3 or PIM). But the source Routing Locator is decided by the multicast routing protocol in a receiver site. That is, anEID to Routing LocatorEID-to-RLOC translation is done atcontrol-time.control time. Another approach is to have the ITR not encapsulate a multicast packet and allow thehost builtpacket built by the host to flow into the core even if the source address is allocated out of the EID namespace. If the RPF-Vector TLV [RFC5496] is used by PIM in the core, then core routers can RPF to the ITR (theLocator addresslocator address, which is injected into core routing) rather than the host source address (the EIDaddressaddress, which is not injected into core routing). To avoid any EID-based multicast state in the network core, the first approach is chosen for LISP-Multicast. Details for LISP-Multicast andInterworkinginterworking with non-LISP sitesisare described inspecification [MLISP].[RFC6831] and [RFC6832]. 12. Security Considerations It is believed that most of the security mechanisms will be part of the mapping database service when usingcontrol planecontrol-plane procedures for obtaining EID-to-RLOC mappings. Fordata plane triggereddata-plane-triggered mappings, as described in this specification, protection is provided against ETR spoofing by usingReturn-Routabilityroute-returnability (see Section 3) mechanisms evidenced by the use of a 24-bitNonce'Nonce' field in the LISP encapsulation header and a 64-bitNonce'Nonce' field in the LISP control message. The nonce, coupled with the ITR accepting only solicitedMap-RepliesMap-Replies, provides a basic level of security, in many ways similar to the security experienced in the current Internet routing system. It is hard for off-path attackers to launch attacks against these LISP mechanisms, as they do not have the nonce values. Sending a large number of packets to accidentally find the right nonce value ispossible,possible but would already by itself be a denial-of-service (DoS) attack. On-path attackers can perform far more serious attacks, but on-path attackers can launch serious attacks in the current Internet as well, including eavesdropping,blockingblocking, or redirecting traffic. See more discussion on this topic in Section 6.1.5.1. LISP does not rely on a PKI or a moreheavy weightheavyweight authentication system. These systems challengethe scalabilityone ofLISP which was athe primary designgoal.goals of LISP -- scalability. DoS attack prevention will depend on implementations rate-limiting Map-Requests and Map-Replies to the control plane as well asrate- limitingrate-limiting the number of data-triggered Map-Replies. An incorrectly implemented or malicious ITR might choose to ignore thepriorityPriority andweightsWeights provided by the ETR in its Map-Reply. Thistraffic steeringtraffic-steering would be limited to the traffic that is sent by this ITR'ssite,site and no more severe than if the site initiated a bandwidth DoS attack on (one of) the ETR's ingress links. The ITR's site would typically gain no benefit from not respecting theweights,Weights and would likelytoreceive better service by abiding by them. To deal with map-cache exhaustion attempts in an ITR/PITR, the implementation should consider putting a maximum cap on the number of entries stored with a reserve list for special or frequently accessed sites. This should be a configuration policy control set by the network administrator who manages ITRs and PITRs. When overlappingEID-prefixesEID-Prefixes occur across multiplemap-cacheMap-Cache entries, the integrity of the set must be wholly maintained.SoSo, if a more-specific entry cannot be added due to reaching the maximum cap, then none of theless specificsless-specific entries should be stored in the map-cache. Given that the ITR/PITR maintains a cache of EID-to-RLOC mappings, cache sizing and maintenanceis an issueare issues to be kept in mind during implementation. It is a good idea to have instrumentation in place to detect thrashing of the cache. Implementation experimentation will be used to determine which cache management strategies work best. In general, it is difficult to defend againstcache trashingcache-thrashing attacks. It should be noted that an undersized cache in an ITR/PITR not only causes adverseaffecteffects on the site or regionthey support,it supports but may also cause increased Map-Requestloadloads on the mapping system. "Piggybacked" mapping data as discussed in Section 6.1.3 specifies how to handle such mappings and includes the possibility for an ETR to temporarily accept such a mapping before verification when running in "trusted" environments. In such cases, there is a potential threat that a fake mapping could be inserted (even if only for a short period) into a map-cache. As noted in Section 6.1.3, an ETR MUST be specifically configured to run in such a mode and might usefully only consider some specific ITRs as also running in that same trusted environment. There is a security risk implicit in the fact that ETRs generate theEID prefixEID-Prefix to which they are responding. An ETR can claim a shorter prefix than it is actually responsible for. Various mechanisms to ameliorate or resolve this issue will be examined in thefuture,future [LISP-SEC]. Spoofing ofinner headerinner-header addresses ofLISP encapsulatedLISP-encapsulated packets ispossible likepossible, as with any tunneling mechanism. ITRs MUST verify the source address of a packet to be an EID that belongs to the site'sEID-prefixEID-Prefix range prior to encapsulation. An ETR must only decapsulate and forward datagrams with aninner headerinner-header destination that matches one of itsEID-prefixEID-Prefix ranges. If, upon receipt and decapsulation, the destination EID of a datagram does not match one of the ETR's configuredEID-prefixes,EID-Prefixes, the ETR MUST drop the datagram. If aLISP encapsulatedLISP-encapsulated packet arrives at an ETR, it SHOULD compare theinner headerinner-header source EID address and theouter headerouter-header source RLOC address with the mapping that exists in the mapping database.ThenThen, when spoofing attacks occur, theouter headerouter-header source RLOC address can be used to trace back the attack to the source site, using existing operational tools. This experimental specification does not address automated key management (AKM). BCP 107 [RFC4107] provides guidance in this area. In addition, at the time of this writing, substantial work is being undertaken to improve security of the routing system[KARP], [RPKI], [BGP-SEC],[RFC6518] [RFC6480] [BGP-SEC] [LISP-SEC]. Future work on LISP should addressBCP-107the issues discussed in BCP 107 as well as other open security considerations, which may require changes to this specification. 13. Network Management Considerations Considerations forNetwork Managementnetwork management tools exist so the LISP protocol suite can be operationally managed.TheThese mechanisms can be found in [LISP-MIB] and[LISP-LIG].[RFC6835]. 14. IANA Considerations This section provides guidance to the Internet Assigned Numbers Authority (IANA) regarding registration of values related to the LISP specification, in accordance with BCP 26and RFC 5226[RFC5226]. There are fourname spacesnamespaces (listed in the sub-sections below) in LISP thatrequire registration:have been registered. o LISP IANA registry allocations should not be made for purposes unrelated to LISP routing or transport protocols. o The following policies are used here with the meanings defined in BCP 26: "Specification Required", "IETF Review", "Experimental Use", and "First Come First Served". 14.1. LISP ACT and Flag Fields New ACT values (Section 6.1.4) can be allocated through IETF review or IESG approval. Four values have already been allocated by this specification (Section 6.1.4). In addition,theLISPprotocolhas a number of flag fields and reserved fields, such as the LISP header flags field (Section 5.3). New bits for flagscan be taken into use fromin these fieldsthroughcan be implemented after IETF review or IESG approval, but these need not be managed by IANA. 14.2. LISP Address Type Codes LISP Address [LCAF] type codes have a range from 0 to 255. New type codes MUST be allocatedconsecutivelyconsecutively, starting at 0. Type Codes0 - 1270-127 are to be assigned by IETF review or IESG approval. Type Codes128 - 255128-255 are availableon aaccording to the [RFC5226] First Come First Served policy. This registry, initially empty, is constructed forfuture-usefuture use in experimental workof LCAFrelated to LISP Canonical Address Format (LCAF) values. See [LCAF] for detailsforof other possible unapproved address encodings. The unapproved LCAF encodings are an area for further study and experimentation. 14.3. LISP UDP Port Numbers The IANA registry has allocated UDP port numbers 4341 and 4342 for lisp-data and lisp-control operation, respectively. IANAis requested to updatehas updated the description forudpUDP ports 4341 and 4342 as follows: lisp-data 4341 udp LISP Data Packets lisp-control 4342 udp LISP Control Packets 14.4. LISP Key ID Numbers The following Key ID values are defined by this specification as used in any packet type that references aKey ID'Key ID' field: Name Number Defined in ----------------------------------------------- None 0 n/a HMAC-SHA-1-96 1 [RFC2404] HMAC-SHA-256-128 2[RFC6234][RFC4868] Number values are in the range of 0 to65355.65535. The allocation of values is on a first come firstserveserved basis. 15. Known Open Issues and Areas of Future Work As an experimental specification, this work is, by definition, incomplete. Specific areas where additional experience and work are neededinclude:include the following: o At present, only[ALT][RFC6836] is defined for implementing a database of EID-to-RLOC mapping information. Additional research on other mapping database systems is strongly encouraged. o Failure and recovery of LISP site partitioning (see Section6.4),6.4) in the presence of redundant configuration (see Section 8.5) needs further research and experimentation. o The characteristics of map-cache management under exceptional conditions, such as denial-of-serviceattacksattacks, are not fully understood. Further experience is needed to determine whether current caching methods are practical or in need of further development. In particular, the performance,scalingscaling, and security characteristics of the map-cache will be discovered as part of this experiment. Performance metrics to be observed are packet reordering associated with the LISPdata probeData-Probe and loss of the first packet in a flow associated with map-caching. The impact of these upon TCP will be observed. See Section 12 for additional thoughts and considerations. o Preliminary work has been done to ensure that sites employing LISP can interconnect with the rest of the Internet. This work is documented in[INTERWORK],[RFC6832], but further experimentation and experienceisare needed. o At present, no mechanism for automated key management for message authentication is defined. Addressing automated key management is necessary before this specificationcouldcan be developed into astandards trackStandards Track RFC. See Section 12 for further details regarding security considerations. o In order to maintain security and stability, InternetProtocolsprotocols typically isolate the control and data planes. Therefore, user activity cannot causecontrol planecontrol-plane state to be created or destroyed. LISP does not maintain this separation. The degree to which the loss of separation impacts security and stability is a topic for experimental observation. o LISP allows for the use of different mapping databasesystems to be used.systems. While only one[ALT][RFC6836] is currentlywell-defined,well defined, each mapping database will likely have some impact on the security of theEID- to-RLOCEID-to-RLOC mappings. How each mapping database system's security properties impactonLISP overall is for further study. o An examination of the implications of LISP on Internet traffic, applications, routers, and security is needed. This will helptoimplementors understand the consequences for network stability, routing protocol function, routing scalability, migration and backward compatibility, and implementation scalability (as influenced by additional protocolcomponents,components; additionalstate,state; and additional processing for encapsulation, decapsulation, and liveness). o Experiments need to verify that LISP produces no significant change in the behavior of protocols run between end-systems over a LISP infrastructure versus being run directly between those same end-systems. o Experiments need to verify that the issues raised in the Critique section of [RFC6115] are either insignificant or have been addressed by updates tothe LISP protocol.LISP. Other LISP documents may also include open issues and areas for future work. 16. References 16.1. Normative References[ALT] Farinacci, D., Fuller, V., Meyer, D., and D. Lewis, "LISP Alternative Topology (LISP-ALT)", draft-ietf-lisp-alt-10.txt (work in progress). [LISP-MS] Farinacci, D. and V. Fuller, "LISP Map Server", draft-ietf-lisp-ms-16.txt (work in progress).[RFC0768] Postel, J., "User Datagram Protocol", STD 6, RFC 768, August 1980. [RFC0791] Postel, J., "Internet Protocol", STD 5, RFC 791, September 1981. [RFC1918] Rekhter, Y., Moskowitz, R., Karrenberg, D., Groot, G., and E. Lear, "Address Allocation for Private Internets", BCP 5, RFC 1918, February 1996. [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, March 1997. [RFC2404] Madson, C. and R. Glenn, "The Use of HMAC-SHA-1-96 within ESP and AH", RFC 2404, November 1998. [RFC2460] Deering, S. and R. Hinden, "Internet Protocol, Version 6 (IPv6) Specification", RFC 2460, December 1998. [RFC3168] Ramakrishnan, K., Floyd, S., and D. Black, "The Addition of Explicit Congestion Notification (ECN) to IP", RFC 3168, September 2001. [RFC3232] Reynolds, J., "Assigned Numbers: RFC 1700 is Replaced by an On-line Database", RFC 3232, January 2002. [RFC4086] Eastlake, D., Schiller, J., and S. Crocker, "Randomness Requirements for Security", BCP 106, RFC 4086, June 2005. [RFC4632] Fuller, V. and T. Li, "Classless Inter-domain Routing (CIDR): The Internet Address Assignment and Aggregation Plan", BCP 122, RFC 4632, August 2006. [RFC4868] Kelly, S. and S. Frankel, "Using HMAC-SHA-256, HMAC-SHA-384, and HMAC-SHA-512 with IPsec", RFC 4868, May 2007. [RFC5226] Narten, T. and H. Alvestrand, "Guidelines for Writing an IANA Considerations Section in RFCs", BCP 26, RFC 5226, May 2008. [RFC5496] Wijnands, IJ., Boers, A., and E. Rosen, "The Reverse Path Forwarding (RPF) Vector TLV", RFC 5496, March 2009. [RFC5944] Perkins, C., "IP Mobility Support for IPv4, Revised", RFC 5944, November 2010. [RFC6115] Li, T., "Recommendation for a Routing Architecture", RFC 6115, February 2011.[RFC6234] Eastlake, D. and T. Hansen, "US Secure Hash Algorithms (SHA and SHA-based HMAC and HKDF)", RFC 6234, May 2011.[RFC6275] Perkins, C., Johnson, D., and J. Arkko, "Mobility Support in IPv6", RFC 6275, July 2011.[VERSIONING][RFC6833] Farinacci, D. and V. Fuller, "Locator/ID Separation Protocol (LISP) Map-Server Interface", RFC 6833, January 2013. [RFC6834] Iannone, L., Saucez, D., and O. Bonaventure,"LISP Mapping Versioning", draft-ietf-lisp-map-versioning-09.txt (work in progress)."Locator/ID Separation Protocol (LISP) Map-Versioning", RFC 6834, January 2013. [RFC6836] Farinacci, D., Fuller, V., Meyer, D., and D. Lewis, "Locator/ID Separation Protocol Alternative Logical Topology (LISP+ALT)", RFC 6836, January 2013. 16.2. Informative References [AFI] IANA, "Address FamilyIndicators (AFIs)", ADDRESS FAMILY NUMBERS http://www.iana.org/assignments/address-family-numbers. [AFI-REGISTRY] IANA, "Address Family Indicators (AFIs)", ADDRESS FAMILY NUMBER registry http://www.iana.org/assignments/ address-family-numbers/ address-family-numbers.xml#address-family-numbers-1.Numbers", <http://www.iana.org/assignments/address-family-numbers>. [BGP-SEC] Lepinski,M.,M. and S. Turner, "An Overview of BGPSEC",draft-lepinski-bgpsec-overview-00.txt (workWork inprogress), March 2011.Progress, May 2012. [CHIAPPA] Chiappa, J., "Endpoints and Endpoint names: A Proposed Enhancement to the Internet Architecture",Internet- Draft http://www.chiappa.net/~jnc/tech/endpoints.txt.1999, <http://mercury.lcs.mit.edu/~jnc/tech/endpoints.txt>. [CONS] Brim, S., Chiappa, N., Farinacci, D., Fuller, V., Lewis, D., and D. Meyer, "LISP-CONS: A Content distribution Overlay Network Service for LISP",draft-meyer-lisp-cons-04.txt (workWork inprogress).Progress, April 2008. [EMACS] Brim, S., Farinacci, D., Meyer, D., and J. Curran, "EID Mappings Multicast Across Cooperating Systems for LISP",draft-curran-lisp-emacs-00.txt (work in progress). [INTERWORK] Lewis, D., Meyer, D., Farinacci, D., and V. Fuller, "Interworking LISP with IPv4 and IPv6", draft-ietf-lisp-interworking-06.txt (work in progress). [KARP] Lebovitz, G. and M. Bhatia, "Keying and Authentication for Routing Protocols (KARP)Design Guidelines", draft-ietf-karp-design-guide-06.txt (workWork inprogress), October 2011.Progress, November 2007. [LCAF] Farinacci, D., Meyer, D., and J. Snijders, "LISP Canonical AddressFormat", draft-ietf-lisp-lcaf-00.txt (workFormat (LCAF)", Work inprogress).Progress, January 2013. [LISA96] Lear, E., Tharp, D., Katinsky, J.,Coffin, J.,andD. Tharp,J. Coffin, "Renumbering: Threat or Menace?", Usenix.Tenth System Administration Conference (LISA 96), October 1996. [LISP-DEPLOY] Jakab, L., Cabellos-Aparicio, A., Coras, F., Domingo-Pascual, J., and D. Lewis, "LISP Network Element Deployment Considerations",draft-ietf-lisp-deployment-05.txt (work in progress). [LISP-LIG] Farinacci, D. and D. Meyer, "LISP Internet Groper (LIG)", draft-ietf-lisp-lig-06.txt (work in progress). [LISP-MAIN] Farinacci, D., Fuller, V., Meyer, D., and D. Lewis, "Locator/ID Separation Protocol (LISP)", draft-farinacci-lisp-12.txt (workWork inprogress).Progress, October 2012. [LISP-MIB] Schudel, G., Jain, A., and V. Moreno, "LISP MIB",draft-ietf-lisp-mib-07.txt (workWork inprogress).Progress, January 2013. [LISP-MN] Farinacci, D.,Fuller, V.,Lewis, D.,and D.Meyer, D., and C. White, "LISPMobility Architecture", draft-meyer-lisp-mn-08.txt (workMobile Node", Work inprogress).Progress, October 2012. [LISP-SEC] Maino, F.,Ermagon,Ermagan, V., Cabellos, A.,Sausez,Saucez, D., and O. Bonaventure, "LISP-Security (LISP-SEC)",draft-ietf-lisp-sec-04.txt (workWork inprogress).Progress, October 2012. [LOC-ID-ARCH] Meyer, D. and D. Lewis, "Architectural Implications of Locator/ID Separation",draft-meyer-loc-id-implications-02.txt (work in progress). [MLISP] Farinacci, D., Meyer, D., Zwiebel, J., and S. Venaas, "LISP for Multicast Environments", draft-ietf-lisp-multicast-14.txt (work in progress). [NERD] Lear, E., "NERD: A Not-so-novel EID to RLOC Database", draft-lear-lisp-nerd-08.txt (workWork inprogress).Progress, January 2009. [OPENLISP] Iannone,L.L., Saucez, D., and O. Bonaventure, "OpenLISP Implementation Report",draft-iannone-openlisp-implementation-01.txt (workWork inprogress).Progress, July 2008. [RADIR] Narten, T.,"Routing and Addressing Problem Statement", draft-narten-radir-problem-statement-05.txt (work"On the Scalability of Internet Routing", Work inprogress).Progress, February 2010. [RFC1034] Mockapetris, P., "Domain names - concepts and facilities", STD 13, RFC 1034, November 1987. [RFC2784] Farinacci, D., Li, T., Hanks, S., Meyer, D., and P. Traina, "Generic Routing Encapsulation (GRE)", RFC 2784, March 2000. [RFC3056] Carpenter, B. and K. Moore, "Connection of IPv6 Domains via IPv4 Clouds", RFC 3056, February 2001. [RFC3261] Rosenberg, J., Schulzrinne, H., Camarillo, G., Johnston, A., Peterson, J., Sparks, R., Handley, M., and E. Schooler, "SIP: Session Initiation Protocol", RFC 3261, June 2002. [RFC4107] Bellovin, S. and R. Housley, "Guidelines for Cryptographic Key Management", BCP 107, RFC 4107, June 2005. [RFC4192] Baker, F., Lear, E., and R. Droms, "Procedures for Renumbering an IPv6 Network without a Flag Day", RFC 4192, September 2005. [RFC4866] Arkko, J., Vogt, C., and W. Haddad, "Enhanced Route Optimization for Mobile IPv6", RFC 4866, May 2007. [RFC4984] Meyer, D., Zhang, L., and K. Fall, "Report from the IAB Workshop on Routing and Addressing", RFC 4984, September 2007.[RPKI][RFC6480] Lepinski,M.,M. and S. Kent, "An Infrastructure to Support Secure Internet Routing",draft-ietf-sidr-arch-13.txt (work in progress),RFC 6480, February2011. [UDP-TUNNELS] Eubanks, M. and P. Chimento, "UDP Checksums for Tunneled Packets", draft-ietf-6man-udpchecksums-05.txt (work in progress), October2012.[UDP-ZERO] Fairhurst,[RFC6518] Lebovitz, G. and M.Westerland, "IPv6 UDP Checksum Considerations", draft-ietf-6man-udpzero-07.txt (workBhatia, "Keying and Authentication for Routing Protocols (KARP) Design Guidelines", RFC 6518, February 2012. [RFC6831] Farinacci, D., Meyer, D., Zwiebel, J., and S. Venaas, "The Locator/ID Separation Protocol (LISP) for Multicast Environments", RFC 6831, January 2013. [RFC6832] Lewis, D., Meyer, D., Farinacci, D., and V. Fuller, "Interworking between Locator/ID Separation Protocol (LISP) and Non-LISP Sites", RFC 6832, January 2013. [RFC6835] Farinacci, D. and D. Meyer, "The Locator/ID Separation Protocol Internet Groper (LIG)", RFC 6835, January 2013. [RFC6837] Lear, E., "NERD: A Not-so-novel Endpoint ID (EID) to Routing Locator (RLOC) Database", RFC 6837, January 2013. [UDP-TUNNELS] Eubanks, M., Chimento, P., and M. Westerlund, "IPv6 and UDP Checksums for Tunneled Packets", Work inprogress), OctoberProgress, January 2013. [UDP-ZERO] Fairhurst, G. and M. Westerlund, "Applicability Statement for the use of IPv6 UDP Datagrams with Zero Checksums", Work in Progress, December 2012. Appendix A. Acknowledgments An initial thank you goes to Dave Oran for planting the seeds for the initial ideas for LISP. His consultation continues to provide value to the LISP authors. A special and appreciative thank you goes to Noel Chiappa for providing architectural impetus over the past decades on separation of location and identity, as well as detailedreviewreviews of the LISP architecture and documents, coupled with enthusiasm for making LISP a practical and incremental transition for the Internet. The authors would like to gratefully acknowledge many people who have contributeddiscussiondiscussions and ideas to the making of this proposal. They include Scott Brim, Andrew Partan, John Zwiebel, Jason Schiller, Lixia Zhang, Dorian Kim, Peter Schoenmaker, Vijay Gill, Geoff Huston, David Conrad, Mark Handley, Ron Bonica, Ted Seely, Mark Townsley, Chris Morrow, Brian Weis, Dave McGrew, Peter Lothberg, Dave Thaler, Eliot Lear, Shane Amante, Ved Kafle, Olivier Bonaventure, Luigi Iannone, Robin Whittle, Brian Carpenter, Joel Halpern, Terry Manderson, Roger Jorgensen, Ran Atkinson, Stig Venaas, Iljitsch van Beijnum, Roland Bless, Dana Blair, Bill Lynch, Marc Woolward, Damien Saucez, Damian Lezama, Attilla De Groot, Parantap Lahiri, David Black, Roque Gagliano, Isidor Kouvelas, Jesper Skriver, Fred Templin, Margaret Wasserman, Sam Hartman, Michael Hofling, Pedro Marques, Jari Arkko, Gregg Schudel, Srinivas Subramanian, Amit Jain, Xu Xiaohu, Dhirendra Trivedi, Yakov Rekhter, John Scudder, John Drake, Dimitri Papadimitriou, Ross Callon, Selina Heimlich, Job Snijders, Vina Ermagan, Albert Cabellos, Fabio Maino, Victor Moreno, Chris White, Clarence Filsfils, and Alia Atlas. This work originated in the Routing Research Group (RRG) of the IRTF.TheAn individual submission[LISP-MAIN]was converted intothisthe IETF LISP working groupdraft.document that became this RFC. The LISP working group would like to give a special thanks to Jari Arkko, the Internet Area AD at the time that the set of LISP documents were being prepared for IESG last call, and for his meticulousreviewreviews anddetail commentarydetailed commentaries on the 7 working group last calldraftsdocuments progressing toward experimental RFCs.Appendix B. Document Change Log B.1. Changes to draft-ietf-lisp-24.txt o Posted November 2012 for final pre-RFC version. o Move draft-ietf-6man-udpchecksums reference back to Informative References section. B.2. Changes to draft-ietf-lisp-23.txt o Posted May 2012 for final pre-RFC version. o Move only the reference draft-ietf-6man-udpzero to the Informative References section. Leave the draft-ietf-6man-udpchecksums reference in the Normative References section. After talking to many people involved with this issue at Paris IETF, all thought this would be an acceptable change. o Added text to IANA Considerations section 14.4 to reflect IANA comments about allocating Key-ID numbers. B.3. Changes to draft-ietf-lisp-22.txt o Posted February 2012 to reflect final DISCUSS comments from Adrian Farrel. B.4. Changes to draft-ietf-lisp-21.txt o Posted February 2012 to reflect DISCUSS comments from Adrian Farrel, Stewart Bryant, and Wesley Eddy. B.5. Changes to draft-ietf-lisp-20.txt o Posted January 2012 for resolution to Adrian Farrel's security comments as well as additions to the end of section 2, Elwyn Davies Gen-Art comments, and Ralph Droms' IANA and EID definition comments. B.6. Changes to draft-ietf-lisp-19.txt o Posted January 2012 for Stephen Farrell's comment resolution. B.7. Changes to draft-ietf-lisp-18.txt o Posted December 2011 after reflecting comments from IANA. o Create reference to sections 5.4.1 and 5.4.2 about DF bit setting from section 5.3. o Inserted two references for Route-Returnability and on-path attacks in Security Considerations section. B.8. Changes to draft-ietf-lisp-17.txt o Posted December 2011 after IETF last call comments. o Make Map-Notify port assignment be 4342 in both source and destination ports. This change was agreed on and put in [LISP-MS] but was not updated in this spec. B.9. Changes to draft-ietf-lisp-16.txt o Posted October 2011 after AD review by Jari. B.10. Changes to draft-ietf-lisp-15.txt o Posted July 2011. Fixing IDnits errors. o Change description on how to select a source address for RLOC- probe Map-Replies to refer to the "EID-to-RLOC Map-Reply Message" section. B.11. Changes to draft-ietf-lisp-14.txt o Post working group last call and pre-IESG last call review. o Indicate that an ICMP Unreachable message should be sent when a packet matches a drop-based negative map-cache entry. o Indicate how a map-cache set of overlapping EID-prefixes must maintain integrity when the map-cache maximum cap is reached. o Add Joel's description for the definition of an EID, that the bit string value can be an RLOC for another device in abstract but the architecture allows it to be an EID of one device and the same value as an RLOC for another device. o In the "Tunnel Encapsulation Details" section, indicate that 4 combinations of encapsulation are supported. o Add what ETR should do for a Data-Probe when received for a destination EID outside of its EID-prefix range. This was added in the Data Probe definition section. o Added text indicating that more-specific EID-prefixes must not be removed when less-specific entries stay in the map-cache. This is to preserve the integrity of the EID-prefix set. o Add clarifying text in the Security Considerations section about how an ETR must not decapsulate and forward a packet that is not for its configured EID-prefix range. B.12. Changes to draft-ietf-lisp-13.txt o Posted June 2011 to complete working group last call. o Tracker item 87. Put Yakov suggested wording in the EID-prefix definition section to reference [INTERWORK] and [LISP-DEPLOY] about discussion on transition and access mechanisms. o Change "ITRs" to "ETRs" in the Locator Status Bit definition section and data packet description section per Damien's comment. o Remove the normative reference to [LISP-SEC] when describing the S-bit in the ECM and Map-Reply headers. o Tracker item 54. Added text from John Scudder in the "Packets Egressing a LISP Site" section. o Add sentence to the "Reencapsulating Tunnel" definition about how reencapsulation loops can occur when not coordinating among multiple mapping database systems. o Remove "In theory" from a sentence in the Security Considerations section. o Remove Security Area Statement title and reword section with Eliot's provided text. The text was agreed upon by LISP-WG chairs and Security ADs. o Remove word "potential" from the over-claiming paragraph of the Security Considerations section per Stephen's request. o Wordsmithing and other editorial comments from Alia. B.13. Changes to draft-ietf-lisp-12.txt o Posted April 2011. o Tracker item 87. Provided rewording how an EID-prefix can be reused in the definition section of "EID-prefix". o Tracker item 95. Change "eliminate" to "defer" in section 4.1. o Tracker item 110. Added that the Mapping Protocol Data field in the Map-Reply message is only used when needed by the particular Mapping Database System. o Tracker item 111. Indicate that if an LSB that is associated with an anycast address, that there is at least one RLOC that is up. o Tracker item 108. Make clear the R-bit does not define RLOC path reachability. o Tracker item 107. Indicate that weights are relative to each other versus requiring an addition of up to 100%. o Tracker item 46. Add a sentence how LISP products should be sized for the appropriate demand so cache thrashing is avoided. o Change some references of RFC 5226 to [AFI] per Luigi. o Per Luigi, make reference to "EID-AFI" consistent to "EID-prefix- AFI". o Tracker item 66. Indicate that appending locators to a locator- set is done when the added locators are lexicographically greater than the previous ones in the set. o Tracker item 87. Once again reword the definition of the EID- prefix to reflect recent comments. o Tracker item 70. Added text to security section on what the implications could be if an ITR does not obey priority and weights from a Map-Reply message. o Tracker item 54. Added text to the new section titled "Packets Egressing a LISP Site" to describe the implications when two or more ITRs exist at a site where only one ITR is used for egress traffic and when there is a shift of traffic to the others, how the map-cache will need to be populated in those new egress ITRs. o Tracker item 33. Make more clear in the Routing Locator Selection section what an ITR should do when it sees an R-bit of 0 in a locator-record of a Map-Reply. o Tracker item 33. Add paragraph to the EID Reachability section indicating that site partitioning is under investigation. o Tracker item 58. Added last paragraph of Security Considerations section about how to protect inner header EID address spoofing attacks. o Add suggested Sam text to indicate that all security concerns need not be addressed for moving document to Experimental RFC status. Put this in a subsection of the Security Considerations section. B.14. Changes to draft-ietf-lisp-11.txt o Posted March 30, 2011. o Change IANA URL. The URL we had pointed to a general protocol numbers page. o Added the "s" bit to the Map-Request to allow SMR-invoked Map- Requests to be sent to a MN ETR via the map-server. o Generalize text for the definition of Reencapsuatling tunnels. o Add paragraph suggested by Joel to explain how implementation experimentation will be used to determine the proper cache management techniques. o Add Yakov provided text for the definition of "EID-to-RLOC "Database". o Add reference in Section 8, Deployment Scenarios, to the draft-jakab-lisp-deploy-02.txt draft. o Clarify sentence about no hardware changes needed to support LISP encapsulation. o Add paragraph about what is the procedure when a locator is inserted in the middle of a locator-set. o Add a definition for Locator Status Bits so we can emphasize they are used as a hint for router up/down status and not path reachability. o Change "BGP RIB" to "RIB" per Clarence's comment. o Fixed complaints by IDnits. o Add subsection to Security Considerations section indicating how EID-prefix overclaiming in Map-Replies is for further study and add a reference to LISP-SEC. B.15. Changes to draft-ietf-lisp-10.txt o Posted March 2011. o Add p-bit to Map-Request so there is documentary reasons to know when a PITR has sent a Map-Request to an ETR. o Add Map-Notify message which is used to acknowledge a Map-Register message sent to a Map-Server. o Add M-bit to the Map-Register message so an ETR that wants an acknowledgment for the Map-Register can request one. o Add S-bit to the ECM and Map-Reply messages to describe security data that can be present in each message. Then refer to [LISP-SEC] for expansive details. o Add Network Management Considerations section and point to the MIB and LIG drafts. o Remove the word "simple" per Yakov's comments. B.16. Changes to draft-ietf-lisp-09.txt o Posted October 2010. o Add to IANA Consideration section about the use of LCAF Type values that accepted and maintained by the IANA registry and not the LCAF specification. o Indicate that implementations should be able to receive LISP control messages when either UDP port is 4342, so they can be robust in the face of intervening NAT boxes. o Add paragraph to SMR section to indicate that an ITR does not need to respond to an SMR-based Map-Request when it has no map-cache entry for the SMR source's EID-prefix. B.17. Changes to draft-ietf-lisp-08.txt o Posted August 2010. o In section 6.1.6, remove statement about setting TTL to 0 in Map- Register messages. o Clarify language in section 6.1.5 about Map-Replying to Data- Probes or Map-Requests. o Indicate that outer TTL should only be copied to inner TTL when it is less than inner TTL. o Indicate a source-EID for RLOC-probes are encoded with an AFI value of 0. o Indicate that SMRs can have a global or per SMR destination rate- limiter. o Add clarifications to the SMR procedures. o Add definitions for "client-side" and 'server-side" terms used in this specification. o Clear up language in section 6.4, last paragraph. o Change ACT of value 0 to "no-action". This is so we can RLOC- probe a PETR and have it return a Map-Reply with a locator-set of size 0. The way it is spec'ed the map-cache entry has action "dropped". Drop-action is set to 3. o Add statement about normalizing locator weights. o Clarify R-bit definition in the Map-Reply locator record. o Add section on EID Reachability within a LISP site. o Clarify another disadvantage of using anycast locators. o Reworded Abstract. o Change section 2.0 Introduction to remove obsolete information such as the LISP variant definitions. o Change section 5 title from "Tunneling Details" to "LISP Encapsulation Details". o Changes to section 5 to include results of network deployment experience with MTU. Recommend that implementations use either the stateful or stateless handling. o Make clarification wordsmithing to Section 7 and 8. o Identify that if there is one locator in the locator-set of a map- cache entry, that an SMR from that locator should be responded to by sending the the SMR-invoked Map-Request to the database mapping system rather than to the RLOC itself (which may be unreachable). o When describing Unicast and Multicast Weights indicate the the values are relative weights rather than percentages. So it doesn't imply the sum of all locator weights in the locator-set need to be 100. o Do some wordsmithing on copying TTL and TOS fields. o Numerous wordsmithing changes from Dave Meyer. He fine toothed combed the spec. o Removed Section 14 "Prototype Plans and Status". We felt this type of section is no longer appropriate for a protocol specification. o Add clarification text for the IRC description per Damien's commentary. o Remove text on copying nonce from SMR to SMR-invoked Map- Request per Vina's comment about a possible DoS vector. o Clarify (S/2 + H) in the stateless MTU section. o Add text to reflect Damien's comment about the description of the "ITR-RLOC Address" field in the Map-Request. that the list of RLOC addresses are local addresses of the Map-Requester. B.18. Changes to draft-ietf-lisp-07.txt o Posted April 2010. o Added I-bit to data header so LSB field can also be used as an Instance ID field. When this occurs, the LSB field is reduced to 8-bits (from 32-bits). o Added V-bit to the data header so the 24-bit nonce field can also be used for source and destination version numbers. o Added Map-Version 12-bit value to the EID-record to be used in all of Map-Request, Map-Reply, and Map-Register messages. o Added multiple ITR-RLOC fields to the Map-Request packet so an ETR can decide what address to select for the destination of a Map- Reply. o Added L-bit (Local RLOC bit) and p-bit (Probe-Reply RLOC bit) to the Locator-Set record of an EID-record for a Map-Reply message. The L-bit indicates which RLOCs in the locator-set are local to the sender of the message. The P-bit indicates which RLOC is the source of a RLOC-probe Reply (Map-Reply) message. o Add reference to the LISP Canonical Address Format [LCAF] draft. o Made editorial and clarification changes based on comments from Dhirendra Trivedi. o Added wordsmithing comments from Joel Halpern on DF=1 setting. o Add John Zwiebel clarification to Echo Nonce Algorithm section 6.3.1. o Add John Zwiebel comment about expanding on proxy-map-reply bit for Map-Register messages. o Add NAT section per Ron Bonica comments. o Fix IDnits issues per Ron Bonica. o Added section on Virtualization and Segmentation to explain the use if the Instance ID field in the data header. o There are too many P-bits, keep their scope to the packet format description and refer to them by name every where else in the spec. o Scanned all occurrences of "should", "should not", "must" and "must not" and uppercased them. o John Zwiebel offered text for section 4.1 to modernize the example. Thanks Z! o Make it more clear in the definition of "EID-to-RLOC Database" that all ETRs need to have the same database mapping. This reflects a comment from John Scudder. o Add a definition "Route-returnability" to the Definition of Terms section. o In section 9.2, add text to describe what the signature of traceroute packets can look like. o Removed references to Data Probe for introductory example. Data- probes are still part of the LISP design but not encouraged. o Added the definition for "LISP site" to the Definition of Terms" section. B.19. Changes to draft-ietf-lisp-06.txt Editorial based changes: o Posted December 2009. o Fix typo for flags in LISP data header. Changed from "4" to "5". o Add text to indicate that Map-Register messages must contain a computed UDP checksum. o Add definitions for PITR and PETR. o Indicate an AFI value of 0 is an unspecified address. o Indicate that the TTL field of a Map-Register is not used and set to 0 by the sender. This change makes this spec consistent with [LISP-MS]. o Change "... yield a packet size of L octets" to "... yield a packet size greater than L octets". o Clarify section 6.1.5 on what addresses and ports are used in Map- Reply messages. o Clarify that LSBs that go beyond the number of locators do not to be SMRed when the locator addresses are greater lexicographically than the locator in the existing locator-set. o Add Gregg, Srini, and Amit to acknowledgment section. o Clarify in the definition of a LISP header what is following the UDP header. o Clarify "verifying Map-Request" text in section 6.1.3. o Add Xu Xiaohu to the acknowledgment section for introducing the problem of overlapping EID-prefixes among multiple sites in an RRG email message. Design based changes: o Use stronger language to have the outer IPv4 header set DF=1 so we can avoid fragment reassembly in an ETR or PETR. This will also make IPv4 and IPv6 encapsulation have consistent behavior. o Map-Requests should not be sent in ECM with the Probe bit is set. These type of Map-Requests are used as RLOC-probes and are sent directly to locator addresses in the underlying network. o Add text in section 6.1.5 about returning all EID-prefixes in a Map-Reply sent by an ETR when there are overlapping EID-prefixes configure. o Add text in a new subsection of section 6.1.5 about dealing with Map-Replies with coarse EID-prefixes. B.20. Changes to draft-ietf-lisp-05.txt o Posted September 2009. o Added this Document Change Log appendix. o Added section indicating that encapsulated Map-Requests must use destination UDP port 4342. o Don't use AH in Map-Registers. Put key-id, auth-length, and auth- data in Map-Register payload. o Added Jari to acknowledgment section. o State the source-EID is set to 0 when using Map-Requests to refresh or RLOC-probe. o Make more clear what source-RLOC should be for a Map-Request. o The LISP-CONS authors thought that the Type definitions for CONS should be removed from this specification. o Removed nonce from Map-Register message, it wasn't used so no need for it. o Clarify what to do for unspecified Action bits for negative Map- Replies. Since No Action is a drop, make value 0 Drop. B.21. Changes to draft-ietf-lisp-04.txt o Posted September 2009. o How do deal with record count greater than 1 for a Map-Request. Damien and Joel comment. Joel suggests: 1) Specify that senders compliant with the current document will always set the count to 1, and note that the count is included for future extensibility. 2) Specify what a receiver compliant with the draft should do if it receives a request with a count greater than 1. Presumably, it should send some error back? o Add Fred Templin in acknowledgment section. o Add Margaret and Sam to the acknowledgment section for their great comments. o Say more about LAGs in the UDP section per Sam Hartman's comment. o Sam wants to use MAY instead of SHOULD for ignoring checksums on ETR. From the mailing list: "You'd need to word it as an ITR MAY send a zero checksum, an ETR MUST accept a 0 checksum and MAY ignore the checksum completely. And of course we'd need to confirm that can actually be implemented. In particular, hardware that verifies UDP checksums on receive needs to be checked to make sure it permits 0 checksums." o Margaret wants a reference to http://www.ietf.org/id/draft-eubanks-chimento-6man-00.txt. o Fix description in Map-Request section. Where we describe Map- Reply Record, change "R-bit" to "M-bit". o Add the mobility bit to Map-Replies. So PITRs don't probe so often for MNs but often enough to get mapping updates. o Indicate SHA1 can be used as well for Map-Registers. o More Fred comments on MTU handling. o Isidor comment about spec'ing better periodic Map-Registers. Will be fixed in draft-ietf-lisp-ms-02.txt. o Margaret's comment on gleaning: "The current specification does not make it clear how long gleaned map entries should be retained in the cache, nor does it make it clear how/ when they will be validated. The LISP spec should, at the very least, include a (short) default lifetime for gleaned entries, require that they be validated within a short period of time, and state that a new gleaned entry should never overwrite an entry that was obtained from the mapping system. The security implications of storing "gleaned" entries should also be explored in detail." o Add section on RLOC-probing per working group feedback. o Change "loc-reach-bits" to "loc-status-bits" per comment from Noel. o Remove SMR-bit from data-plane. Dino prefers to have it in the control plane only. o Change LISP header to allow a "Research Bit" so the Nonce and LSB fields can be turned off and used for another future purpose. For Luigi et al versioning convergence. o Add a N-bit to the data header suggested by Noel. Then the nonce field could be used when N is not 1. o Clarify that when E-bit is 0, the nonce field can be an echoed nonce or a random nonce. Comment from Jesper. o Indicate when doing data-gleaning that a verifying Map-Request is sent to the source-EID of the gleaned data packet so we can avoid map-cache corruption by a 3rd party. Comment from Pedro. o Indicate that a verifying Map-Request, for accepting mapping data, should be sent over the ALT (or to the EID). o Reference IPsec RFC 4302. Comment from Sam and Brian Weis. o Put E-bit in Map-Reply to tell ITRs that the ETR supports echo- noncing. Comment by Pedro and Dino. o Jesper made a comment to loosen the language about requiring the copy of inner TTL to outer TTL since the text to get mixed-AF traceroute to work would violate the "MUST" clause. Changed from MUST to SHOULD in section 5.3. B.22. Changes to draft-ietf-lisp-03.txt o Posted July 2009. o Removed loc-reach-bits longword from control packets per Damien comment. o Clarifications in MTU text from Roque. o Added text to indicate that the locator-set be sorted by locator address from Isidor. o Clarification text from John Zwiebel in Echo-Nonce section. B.23. Changes to draft-ietf-lisp-02.txt o Posted July 2009. o Encapsulation packet format change to add E-bit and make loc- reach-bits 32-bits in length. o Added Echo-Nonce Algorithm section. o Clarification how ECN bits are copied. o Moved S-bit in Map-Request. o Added P-bit in Map-Request and Map-Reply messages to anticipate RLOC-Probe Algorithm. o Added to Mobility section to reference [LISP-MN]. B.24. Changes to draft-ietf-lisp-01.txt o Posted 2 days after draft-ietf-lisp-00.txt in May 2009. o Defined LEID to be a "LISP EID". o Indicate encapsulation use IPv4 DF=0. o Added negative Map-Reply messages with drop, native-forward, and send-map-request actions. o Added Proxy-Map-Reply bit to Map-Register. B.25. Changes to draft-ietf-lisp-00.txt o Posted May 2009. o Rename of draft-farinacci-lisp-12.txt. o Acknowledgment to RRG.Authors' Addresses Dino FarinacciciscoCisco Systems Tasman Drive San Jose, CA 95134 USAEmail: dino@cisco.comEMail: farinacci@gmail.com Vince Fullercisco Systems Tasman Drive San Jose, CA 95134 USA Email: vaf@cisco.comEMail: vaf@vaf.net Dave MeyerciscoCisco Systems 170 Tasman Drive San Jose, CA USAEmail: dmm@cisco.comEMail: dmm@1-4-5.net Darrel LewisciscoCisco Systems 170 Tasman Drive San Jose, CA USAEmail:EMail: darlewis@cisco.com