IDR Working Group W. Wang Internet-Draft A. Wang Intended status: Experimental China Telecom Expires: 23 January 2025 H. Wang Huawei Technologies G. Mishra Verizon Inc. J. Dong Huawei Technologies 22 July 2024 VPN Prefix Outbound Route Filter (VPN Prefix ORF) for BGP-4 draft-ietf-idr-vpn-prefix-orf-07 Abstract This draft defines a new Outbound Route Filter (ORF) type, called the VPN Prefix ORF. The described VPN Prefix ORF mechanism is applicable when the VPN routes from different VRFs are exchanged via one shared BGP session. Status of This Memo This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet- Drafts is at https://datatracker.ietf.org/drafts/current/. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." This Internet-Draft will expire on 23 January 2025. Copyright Notice Copyright (c) 2024 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/ license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights Wang, et al. Expires 23 January 2025 [Page 1] Internet-Draft RD-ORF July 2024 and restrictions with respect to this document. Code Components extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License. Table of Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 2 2. Conventions used in this document . . . . . . . . . . . . . . 4 3. Terminology . . . . . . . . . . . . . . . . . . . . . . . . . 4 4. Operation process of VPN Prefix ORF mechanism on sender . . . 5 4.1. Intra-domain Scenarios and Solutions . . . . . . . . . . 7 4.1.1. Scenario-1 and Solution (Unique RD, One RT) . . . . . 8 4.1.2. Scenario-2 and Solution (Unique RD, Multiple RTs) . . 9 4.1.3. Scenario-3 and Solution (Universal RD) . . . . . . . 11 5. Source PE Extended Community . . . . . . . . . . . . . . . . 13 6. VPN Prefix ORF Encoding . . . . . . . . . . . . . . . . . . . 14 6.1. Source PE TLV . . . . . . . . . . . . . . . . . . . . . . 16 6.2. Route Target TLV . . . . . . . . . . . . . . . . . . . . 16 7. Operation process of VPN Prefix ORF mechanism on receiver . . 17 8. Withdraw of VPN Prefix ORF entries . . . . . . . . . . . . . 18 9. Applicability . . . . . . . . . . . . . . . . . . . . . . . . 18 10. Implementation Considerations . . . . . . . . . . . . . . . . 20 10.1. Implementation Considerations . . . . . . . . . . . . . 20 10.2. Implementation status . . . . . . . . . . . . . . . . . 20 10.3. Experimental topology . . . . . . . . . . . . . . . . . 21 10.4. Results of Experiments . . . . . . . . . . . . . . . . . 21 11. Security Considerations . . . . . . . . . . . . . . . . . . . 21 12. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 21 13. Contributor . . . . . . . . . . . . . . . . . . . . . . . . . 22 14. Acknowledgement . . . . . . . . . . . . . . . . . . . . . . . 22 15. Normative References . . . . . . . . . . . . . . . . . . . . 22 Appendix A. Experimental topology . . . . . . . . . . . . . . . 24 Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 25 1. Introduction [I-D.wang-idr-vpn-routes-control-analysis] analyzed the scenarios and requirements for VPN routes control in the shared BGP session. This draft analyzes the existing solutions and their limitations for these scenarios, proposes the new VPN Prefix ORF solution to meet the requirements that described in section 8 of [I-D.wang-idr-vpn-routes-control-analysis]. Now, there are several solutions can be used to alleviate these problem: * Route Target Constraint (RTC) as defined in [RFC4684] Wang, et al. Expires 23 January 2025 [Page 2] Internet-Draft RD-ORF July 2024 * Address Prefix ORF as defined in [RFC5292] * CP-ORF mechanism as defined in [RFC7543] * PE-CE edge peer Maximum Prefix * Configure the Maximum Prefix for each VRF on edge nodes However, there are limitations to existing solutions: 1) Route Target Constraint RTC can only filter the VPN routes from the uninterested VRFs, if the “offending routes” come from the interested VRF, RFC mechanism can't filter them. 2) Address Prefix ORF Using Address Prefix ORF to filter VPN routes need to pre- configuration, but it is impossible to know which prefix may cause overflow in advance. 3) CP-ORF Mechanism [RFC7543] defines the Covering Prefixes ORF (CP-ORF). A BGP speaker sends a CP-ORF to a peer in order to pull routes that cover a specified host address. A prefix covers a host address if it can be used to forward traffic towards that host address. CP-ORF is applicable in Virtual Hub-and-Spoke[RFC7024] VPN and also the BGP/MPLS Ethernet VPN (EVPN)[RFC7432] networks, but its main aim is also to get the wanted VPN prefixes and can't be used to filter the overwhelmed VPN prefixes dynamically. 4) PE-CE edge peer Maximum Prefix The BGP Maximum-Prefix feature is used to control how many prefixes can be received from a neighbor. By default, this feature allows a router to bring down a peer when the number of received prefixes from that peer exceeds the configured Maximum-Prefix limit. This feature is commonly used for external BGP peers. If it is applied to internal BGP peers, for example the VPN scenarios, all the VPN routes from different VRFs will share the common fate, which is not desirable for the finer control of the VPN Prefixes advertisement. 5) Configure the Maximum Prefix for each VRF on edge nodes Wang, et al. Expires 23 January 2025 [Page 3] Internet-Draft RD-ORF July 2024 When a VRF overflows, it stops the import of routes and log the extra VPN routes into its RIB. However, PEs still need to parse the BGP updates. These processes will cost CPU cycles and further burden the overflowed PE. This draft defines a new ORF-type, called the VPN Prefix ORF. This mechanism is event-driven and does not need to be pre-configured. When a VRF of a router overflows, the router will find out the VPN prefix (RD, RT, source PE, etc.) of offending VPN routes in this VRF, and send a VPN Prefix ORF to its BGP peer that carries the relevant information. If a BGP speaker receives a VPN Prefix ORF entry from its BGP peer, it will filter the VPN routes it tends to send according to the entry. The purpose of this mechanism is to control the outrage within the minimum range and avoid churn effects when a VRF on a device in the network overflows. VPN Prefix ORF is applicable when the VPN routes from different VRFs are exchanged via one shared BGP session. 2. Conventions used in this document The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in [RFC2119] . 3. Terminology The following terms are defined in this draft: * RD: Route Distinguisher, defined in [RFC4364] * ORF: Outbound Route Filter, defined in [RFC5291] * AFI: Address Family Identifier, defined in [RFC4760] * SAFI: Subsequent Address Family Identifier, defined in [RFC4760] * EVPN: BGP/MPLS Ethernet VPN, defined in [RFC7432] * RR: Router Reflector, provides a simple solution to the problem of IBGP full mesh connection in large-scale IBGP implementation. * VRF: Virtual Routing Forwarding, a virtual routing table based on VPN instance. Wang, et al. Expires 23 January 2025 [Page 4] Internet-Draft RD-ORF July 2024 4. Operation process of VPN Prefix ORF mechanism on sender The operation of VPN Prefix ORF mechanism on each device is independent, each of them makes a local judgement to determine whether it needs to send VPN Prefix ORF to its upstream peer. Operators can set the algorithms in the devices according to their own circumstances. The general procedures of the VPN Prefix ORF mechanism on the sender are as follows: This section describes the procedures for the BGP peer(the first BGP peer) receiving the VPN routes information from another BGP peer(the second BGP peer). The VPN information includes the updated VPN routes and their corresponding VPN instance identification information. Based on the VPN instance identification information, the first BGP peer determines the newly added VPN routes, and checks whether the number of the newly added VPN route has caused the number of total VPN routes exceeding the maximum route limit of the associated VPN instance. If the route number of the VPN instance that identified by the VPN instance identification information is reached or excess its limit, it will send the instruction information to the second BGP peer, indicating that the second BGP peer stop sending the corresponding VPN routes that identified by the VPN instance identification information. The first BGP peer and the second BGP peer are iBGP peers that are within one AS. The VPN instance identification information is RD and the instruction information is sent via the route-refresh message. The instruction information that sends to the second BGP peer includes the followings information: The ORF entries that are included in the route-refresh message. Set the Action field in the ORF entries to the value that instructs to add the corresponding filter condition into outbound route filter of the second the BGP peer. Set the Match field in the ORF entries to the value that instructs to deny the VPN routes updates that matches the corresponding ORF entries. The RD value that identifies the above mentioned VPN instance is added at the type-specific part of the ORF entries. Wang, et al. Expires 23 January 2025 [Page 5] Internet-Draft RD-ORF July 2024 When multiple VRFs on a PE are receiving VPN routes for a specific RD, if one VRF exceeds its limit upon receiving routes with that RD and the PE sends a VP-ORF message, it will prevent other VRFs that have not exceeded their limits from receiving VPN routes containing that RD, leading to communication disruptions between these VRFs and the rejected VPN routes. In order to more finely control VPN routing, when the above situation occurs, VRF should generate a filter when its route limit is exceeded, to selectively filter route entries based on RD before they enter the VRF from the RIB. The PE which receives the offending VPN routes parses the received BGP update message to obtain routing information, and obtains VPN routing entries associated with the BGP path from the routing information. Then, PE should determine the target VRF based on the RT import information of the routing target entry, and configure filters for the target VRF. PE should first detect whether there is currently a filter associated with the target VRF. If yes, PE should use the filter to filter the VPN routing entries, so that VPN routing entries carrying the specified routing identifier RD are not introduced into the target VRF. If there is currently no filter associated with the target VRF, PE will directly introduce the VPN routing entry into the target VRF. The format of VPN Prefix ORF entry is shown in Section 6 When configuring a filter for the target VRF, the PE should detect whether there is currently a routing overflow issue with the target VRF. If the target VRF currently has a routing overflow issue and the reason for the routing overrun is caused by a VPN routing entry carrying the specified RD, a filter is generated for the target VRF, wherein the filter is used to filter VPN routing entries carrying the specified RD. If the target VRF currently does not have routing overflow issues, PE will delete the filter for the target VRF. The detail procedures for further subdivisions are described below: a) No quota value is set on PE On PE, each VRF has a prefix limit. When the PE receives VPN routes from its BGP peer, due to the received VPN routes may belong to different VPN and carry the corresponding RDs, the PE should extract the VPN route information from BGP UPDATE message which contains VPN routes related to BGP optimal routing. PE can determine the target VRFs of the received VPN route based on the RT of the VPN route and the RT-import of VRFs. Then, the PE should sequentially determine whether each target VRF will exceed the limit after importing the received VPN routes. If a target VRF exceeds the limit which is caused by the VPN routes carrying a certain RD and the other target VRFs have not overflow, PE should not trigger the VPN Prefix ORF Wang, et al. Expires 23 January 2025 [Page 6] Internet-Draft RD-ORF July 2024 mechanism, and only performs VPN route filtering for the target VRF and stop importing VPN routes carrying the specific RD. If a target VRF exceeds the limit and there is no other VRFs need these VPN routes, the PE should trigger the VPN Prefix ORF mechanism and send a BGP ROUTE-REFRESH message contains the corresponding VPN Prefix ORF entry to its peer, which will generate a VPN routes filtering strategy for the VRF. And if the "Offending VPN routes process method" bit is 1, the receiver of VPN Prefix ORF entry should withdraw the extra VPN routes according to the value of VRF Prefix Limit, RD, RT and information in optional TLVs in the entry, and stop sending the corresponding VPN routes to the sender. If the target VRF no longer exceeds the limit, the relevant VPN routing filtering strategy needs to be deleted. When importing VPN routes to a VRF, it is necessary to determine whether there is a VPN routes filtering strategy on the PE for that VRF. BGP peers supporting this functionality has VRFs that has maximum route limit reached should comply with the procedures defined in this document b) Quota value is set on PE On PE, each VRF has a prefix limit, and routes associated with each tuple has a pre-configured quota. Considering that the received VPN routes may belong to different VPNs and therefore carry different RDs, the PE should determine whether VRF will exceed the limit after adding the received VPN routes. * when routes associated with tuple pass the quota but the prefix limit of VRF is not exceeded, PE should send warnings to the operator, and the VPN Prefix ORF mechanism should not be triggered. * when routes associated with tuple pass the quota and the prefix limit is exceeded and there is no other VRFs on offended PE need VPN routes with this tuple, PE should trigger VPN Prefix ORF mechanism and send a BGP ROUTE- REFRESH message contains the corresponding VPN Prefix ORF entry to its peer. When the VPN Prefix ORF mechanism is triggered, the device must send an alarm information to network operators. 4.1. Intra-domain Scenarios and Solutions For intra-AS VPN deployment, there are three scenarios: Wang, et al. Expires 23 January 2025 [Page 7] Internet-Draft RD-ORF July 2024 * RD is allocated per VPN per PE, each VRF only import one RT (see Section 4.1.1). * RD is allocated per VPN per PE. Multiple RTs are associated with such VPN routes, and are imported into different VRFs in other devices(see Section 4.1.2). * RD is allocated per VPN, each VRF imports one/multiple RTs(see Section 4.1.3). The following sections will describe solutions to the above scenarios in detail. 4.1.1. Scenario-1 and Solution (Unique RD, One RT) In this scenario, PE1-PE4 and RR are iBGP peers. RD is allocated per VPN per PE. The offending VPN routes only carry one RT. We assume the network topology is shown in Figure 1. +------------------------------------------------------------------------+ | | | | | +-------+ +-------+ | | | PE1 +----------------+ +-----------------+ PE4 | | | +-------+ | | +-------+ | | VPN1(RD11,RT1) | | VPN2(RD12,RT2) | | VPN2(RD12,RT2) | | | | +-+----+-+ | | | RR | | | +-+----+-+ | | | | | | | | | | +-------+ | | +-------+ | | | PE2 +----------------+ +-----------------+ PE3 | | | +-------+ +-------+ | | VPN1(RD21,RT1) VPN1(RD31,RT1) | | VPN2(RD22,RT2,RT1) VPN2(RD32,RT2) | | | | AS 100 | | | +------------------------------------------------------------------------+ Figure 1 Network Topology of Scenario-1 When PE3 sends excessive VPN routes with RT1, while both PE1 and PE2 import VPN routes with RT1, the process of offending VPN routes will influence performance of VRFs on PEs. PEs and RR should have some mechanisms to identify and control the advertisement of offending VPN routes. Wang, et al. Expires 23 January 2025 [Page 8] Internet-Draft RD-ORF July 2024 a) PE1 If quota value is not set on PE1, and each VRF has a prefix limit on PE1. When the prefix limit of VPN1 VRF is exceeded, due to there is no other VRFs on PE1 need the VPN routes with RT1, PE1 will send VPN Prefix ORF message to RR and send warning to operator. The message will carry the prefix limit of VPN1 VRF, the RD value is set to 0 and the RT value is set to RT1. RR will withdraw the offending VPN routes and control the number of VPN routes sending to PE1. If quota value is set on PE1, each VRF has a prefix limit, and each tuple imported into VRF has a quota. When routes associated with tuple pass the quota but the prefix limit of VPN1 VRF is not exceeded, PE1 sends a warning message to the operator, and the VPN Prefix ORF mechanism should not be triggered. After the prefix limit of VPN1 VRF is exceeded, since there is no other VRFs on PE1 need the VPN routes with RT1, PE1 will generate a BGP ROUTE-REFRESH message contains a VPN Prefix ORF entry, and send the entry to RR. RR will withdraw and stop to advertise such offending VPN routes (RD31, the prefix limit of VPN1 VRF, source PE is PE3, RT is RT1) to PE1. b) PE2 If quota value is not set on PE2, when the prefix limit of VPN1 VRF is exceeded, PE2 cannot trigger VPN Prefix ORF mechanism directly, because VPN2 VRF needs the VPN routes with RT1. PE2 triggers the mechanism only when the prefix limit for both, the VPN1 and VPN2 VRF has exceeded. The VPN Prefix ORF message will carry the VRF Prefix Limit = min(prefix limit of VPN1 VRF, prefix limit of VPN2 VRF), the RD value is set to 0 and the RT value is set to RT1. RR will withdraw the offending VPN routes and control the number of VPN routes sending to PE1. If quota value is set on PE2, both VPN1 VRF and VPN2 VRF import VPN routes with RT1. If PE2 triggers VPN Prefix ORF mechanism when VPN1 VRF overflows, VPN2 VRF cannot receive VPN routes with RT1 as well. PE2 should not trigger the VPN Prefix ORF mechanism to RR (RD31, min(prefix limit of VPN1 VRF, prefix limit of VPN2 VRF), RT1, RT2, source PE is PE3) until all the VRFs that import RT1 on it overflow. 4.1.2. Scenario-2 and Solution (Unique RD, Multiple RTs) In this scenario, PE1-PE4 and RR are iBGP peers. RD is allocated per VPN per PE. Multiple RTs are associated with the offending VPN routes, and are imported into different VRFs in other devices. We assume the network topology is shown in Figure 2. Wang, et al. Expires 23 January 2025 [Page 9] Internet-Draft RD-ORF July 2024 +------------------------------------------------------------------------+ | | | | | +-------+ +-------+ | | | PE1 +----------------+ +-----------------+ PE4 | | | +-------+ | | +-------+ | | VPN1(RD11,RT1) | | VPN2(RD42,RT2) | | VPN2(RD12,RT2) | | | | +-+----+-+ | | | RR | | | +-+----+-+ | | | | | | | | | | +-------+ | | +-------+ | | | PE2 +----------------+ +-----------------+ PE3 | | | +-------+ +-------+ | | VPN1(RD21,RT1) VPN1(RD31,RT1,RT2) | | VPN2(RD32,RT2) | | | | AS 100 | | | +------------------------------------------------------------------------+ Figure 2 Network Topology of Scenario-2 When PE3 sends excessive VPN routes with RT1 and RT2, while both PE1 and PE2 import VPN routes with RT1, and PE1 also imports VPN routes with RT2. a) PE1 If quota value is not set on PE1, when the prefix limit of VPN1 VRF is exceeded, PE1 cannot trigger VPN Prefix ORF mechanism directly, because VPN2 VRF needs the VPN routes with RT1. PE1 triggers the mechanism only when the prefix limit for both, the VPN1 VRF and VPN2 VRF has exceeded. PE1 will send VPN Prefix ORF message to RR and send warning to operator. The message will carry the VRF Prefix Limit = min(prefix limit of VPN1 VRF, prefix limit of VPN2 VRF), the RD value is set to RD31, the RT value is set to RT1, RT2 and the source PE is PE3. RR will withdraw and stop advertising such offending VPN routes to PE1. If quota value is set on PE1, when routes associated with tuple pass the quota but the prefix limit of VPN1 VRF is not exceeded, PE1 sends a warning to the operator. In case where the VPN1 VRF has exceeded its prefix limit and VPN2 VRF has not yet exceeded its prefix limit, PE1 should not send out the trigger message. Otherwise, the VPN2 VRF can't receive the VPN routes too (RR will filter all the VPN prefixes that contain RT1). PE1 should Wang, et al. Expires 23 January 2025 [Page 10] Internet-Draft RD-ORF July 2024 just discard the offending VPN routes locally. PE1 should only generate a BGP ROUTE-REFRESH message contains a VPN Prefix ORF entry(RD31, min(prefix limit of VPN1 VRF, prefix limit of VPN2 VRF), RT1, RT2, comes from PE3) in case where the prefix limit has exceeded for both the VRFs. b) PE2 If quota value is not set on PE2, when the prefix limit of VPN1 VRF is exceeded, due to there is no other VRFs on PE2 need the VPN routes with RT1, PE2 will send VPN Prefix ORF message to RR and send warning to operator. The message will carry the prefix limit of VPN1 VRF, the RD value is set to RD31 and the RT value is set to RT1. RR will withdraw and stop advertising such offending VPN routes to PE2. If quota value is set on PE2, due to there is only one VRF imports VPN routes with RT1. If it overflows, it will trigger VPN Prefix ORF (RD31, the prefix limit of VPN1 VRF, RT1, comes from PE3) mechanisms. RR will withdraw and stop advertising such offending VPN routes to PE2. 4.1.3. Scenario-3 and Solution (Universal RD) In this scenario, PE1-PE4 and RR are iBGP peers. RD is allocated per VPN. One/Multiple RTs are associated with the offending VPN routes, and be imported into different VRFs in other devices. We assume the network topology is shown in Figure 3. Wang, et al. Expires 23 January 2025 [Page 11] Internet-Draft RD-ORF July 2024 +------------------------------------------------------------------------+ | | | | | +-------+ +-------+ | | | PE1 +----------------+ +-----------------+ PE4 | | | +-------+ | | +-------+ | | VPN1(RD1,RT1) | | VPN2(RD12,RT2) | | VPN2(RD12,RT2) | | | | +-+----+-+ | | | RR | | | +-+----+-+ | | | | | | | | | | +-------+ | | +-------+ | | | PE2 +----------------+ +-----------------+ PE3 | | | +-------+ +-------+ | | VPN1(RD1,RT1) VPN1(RD1,RT1,RT2) | | VPN2(RD32,RT2) | | | | AS 100 | | | +------------------------------------------------------------------------+ Figure 3 Network Topology of Scenario-3 When PE3 sends excessive VPN routes with RD1 and attached RT1 and RT2, while both PE1 and PE2 import VPN routes with RT1, the process of offending VPN routes will influence performance of VRFs on PEs. a) PE1 If quota value is not set on PE1, when the prefix limit of VPN1 VRF is exceeded, PE1 cannot trigger VPN Prefix ORF mechanism directly, because VPN2 VRF needs the VPN routes with RT2. PE1 triggers the mechanism only when the prefix limit for both, the VPN1 and VPN2 VRF has exceeded, and send warning to operator. The message will carry the VRF Prefix Limit = min(prefix limit of VPN1 VRF, prefix limit of VPN2 VRF), the RD value is set to RD1 and the RT value is set to RT1, RT2. RR will withdraw and stop advertising such offending VPN routes to PE1. If quota value is set on PE1, when routes associated with tuple pass the quota but the prefix limit of VPN1 VRF is not exceeded, PE1 sends a warning to the operator. When the prefix limit of VPN1 VRF is exceeded, if the VPN2 VRF does not reach its limit at the same time, PE1 should still not send out the trigger message, because if it does so, the VPN2 VRF can't receive the VPN routes too (RR will filter all the VPN prefixes that contain RT1). PE1 should just discard the offending VPN routes locally. PE1 should only Wang, et al. Expires 23 January 2025 [Page 12] Internet-Draft RD-ORF July 2024 generate a BGP ROUTE-REFRESH message contains a VPN Prefix ORF entry(RD1, min(prefix limit of VPN1 VRF, prefix limit of VPN2 VRF), RT1, RT2, comes from PE3) when both the VRFs that import such prefixes are overflow. b) PE2 If quota value is not set on PE2, when the prefix limit of VPN1 VRF is exceeded, due to there is no other VRFs on PE2 need the VPN routes with RT1, PE2 will send VPN Prefix ORF message to RR and send warning to operator. The message will carry the prefix limit of VPN1 VRF, the RD value is set to RD1 and the RT value is set to RT1. RR will withdraw and stop advertising such offending VPN routes to PE2. If quota value is set on PE2, due to there is only one VRF imports VPN routes with RT1. If it overflows, it will trigger VPN Prefix ORF (RD1, the prefix limit of VPN1 VRF, RT1, comes from PE3) mechanisms. RR will withdraw and stop advertising such offending VPN routes to PE2. When PE2 overflows and PE1 does not overflow. PE2 triggers the VPN Prefix ORF message (RD1, RT1, comes from PE3). Using Source PE and RD, RR will only withdraw and stop advertising VPN routes with to PE2. The communication between PE2 and PE1 for VPN1 will not be influenced. 5. Source PE Extended Community We usually use NEXT_HOP to identify the source, but it may not useful in the following scenarios: * a PE may have multiple addresses so that its BGP peer will receive several different NEXT_HOP from the same source. * In Option B inter-domain scenario, the ASBR will change the NEXT_HOP. ORIGINATOR_ID is a non-transitive attribute generated by RR to identify the source, but ORIGINATOR_ID cannot be advertised outside the local AS. To cover the above scenarios, we define a new Extended Community: Source PE Extended Community(SPE EC) to transmit the identifier of source. Its value can be set by source PE/RR/ASBR. Once set and attached with the BGP UPDATE message, its value should not be changed along the advertisement path. The format of SPE EC is shown as Figure 4. Wang, et al. Expires 23 January 2025 [Page 13] Internet-Draft RD-ORF July 2024 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | 0x0d | Autonomous System Number : +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ : AS Number (cont.) | ORIGINATOR_ID : +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ : ORIGINATOR_ID (cont.) | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ Figure 4 The format of SPE EC For the RR/ASBR, it should perform as following: * Check the existence of the SPE EC. If it exists, does not change it. * If SPE EC does not exist, check the existence of ORIGINATOR_ID. If it exists, put it into SPE EC. * If ORIGINATOR_ID does not exist, put the router-id of source PE into SPE EC. 6. VPN Prefix ORF Encoding In this section, we defined a new ORF type called VPN Prefix Outbound Route Filter (VPN Prefix ORF). The ORF entries are carried in the BGP ROUTE-REFRESH message as defined in [RFC5291]. A BGP ROUTE- REFRESH message can carry one or more ORF entries. The ROUTE-REFRESH message which carries ORF entries contains the following fields: * AFI (2 octets) * SAFI (1 octet) * When-to-refresh (1 octet): the value is IMMEDIATE * ORF Type (1 octet) * Length of ORF entries (2 octets) A VPN Prefix ORF entry contains a common part and type-specific part. The common part is encoded as follows: * Action (2 bits): the value is ADD, REMOVE or REMOVE-ALL * Match (1 bit): the value is PERMIT or DENY Wang, et al. Expires 23 January 2025 [Page 14] Internet-Draft RD-ORF July 2024 * Offending VPN routes process method (1 bit): if the value is set to 0, it means all offending VPN routes on the sender of VPN Prefix ORF message should be withdrawn; if the value is set to 1, it means the sender of VPN Prefix ORF message refuse to receive new offending VPN routes. The default value is 0. * Reserved (4 bits) VPN Prefix ORF also contains type-specific part. The encoding of the type-specific part is shown in Figure 5. +-----------------------------------------+ | | | Sequence (4 octets) | | | +-----------------------------------------+ | | | Length (2 octets) | | | +-----------------------------------------+ | | | VRF Prefix Limit (4 octets) | | | +-----------------------------------------+ | | | Route Distinguisher (8 octets) | | | +-----------------------------------------+ | | | Optional TLVs (variable) | | | +-----------------------------------------+ Figure 5: VPN Prefix ORF type-specific encoding * Sequence: identifying the order in which RD-ORF is generated. * Length: identifying the length of this VPN Prefix ORF entry. * VRF Prefix Limit: carrying the prefix limt of the overflowed VRF. * Route Distinguisher: distinguish the different user routes. The VPN Prefix ORF filters the VPN routes it tends to send based on Route Distinguisher. If RD is equal to 0, it means all VPN prefixes. * Optional TLVs: carry the potential additional information to give the extensibility of the VPN Prefix ORF mechanism. Wang, et al. Expires 23 January 2025 [Page 15] Internet-Draft RD-ORF July 2024 Note that if the Action component of an ORF entry specifies REMOVE- ALL, the ORF entry does not include the type-specific part. When the BGP ROUTE-REFRESH message carries VPN Prefix ORF entries, it must be set as follows: * The ORF-Type MUST be set to VPN Prefix ORF. * The AFI MUST be set to IPv4, IPv6, or Layer 2 VPN (L2VPN). * If the AFI is set to IPv4 or IPv6, the SAFI MUST be set to MPLS- labeled VPN address. * If the AFI is set to L2VPN, the SAFI MUST be set to BGP EVPN. * The Match field should be set to PERMIT when VRF Prefix Limit = 0xFFFF and RD=0; otherwise, the Match field should be set to DENY. 6.1. Source PE TLV Source PE TLV is defined to identify the source of the VPN routes. For the sender of VPN Prefix ORF, it will check the existence of SPE EC. If it exists, the sender will put it into Source PE TLV. Otherwise, the value of Source PE TLV should be set to local AS number and NEXT_HOP. The source PE TLV contains the following types: * Type = 1(suggested), Length = 8 octets, value = local AS number + NEXT_HOP. * Type = 2(suggested), Length = 20 octets, value = local AS number + NEXT_HOP. * Type = 3(suggested), Length = 8 octets, value = the value of AS number and ORIGINATOR_ID in Source PE Extended Community. 6.2. Route Target TLV Route Target TLV is defined to identify the RT of the offending VPN routes. RT and RD can be used together to filter VPN routes when the source VRF contains multiple RTs, and the VPN routes with different RTs may be assigned to different VRFs on the receiver. The encoding of Route Target TLV is as follow: Wang, et al. Expires 23 January 2025 [Page 16] Internet-Draft RD-ORF July 2024 Type = 4(suggested), Length = 8*n (n is the number of RTs that the offending VPN routes attached) octets, value = the RT of the offending VPN routes. If multiple RTs are included, there must be an exact match. 7. Operation process of VPN Prefix ORF mechanism on receiver The receiver of VPN Prefix ORF entries may be a RR or PE. As it receives the VPN Prefix ORF entries from the sender, it will check to find if it already existed in its ORF-Policy table. If not, the receiver will add the VPN Prefix ORF entries into its ORF-Policy table; otherwise, the receiver will discard it. The RR or PE should determine the sending device and next hop address of the stored VPN Prefix ORF entry corresponding to the source PE in the filtering condition by looking up in the FIB, including: 1) determining the port field corresponding to the stored VPN Prefix ORF entry in the export routing filter strategy table; 2) Determine the sending device of the stored VPN Prefix ORF entry based on the value of the port field corresponding to the stored VPN Prefix ORF entry. The filtering conditions for the stored VPN Prefix ORF entries contain the RD and RT of the source PE. If the sending device of the VPN Prefix ORF entry stored under the same filtering condition includes all IBGP neighbors of the RR or PE other than the device with the next hop address, the VPN Prefix ORF entry is generated, where the next hop address is the next hop address sent to the network device in the direction of the source PE in the same filtering condition. The filtering condition in the generated VPN Prefix ORF entry contain the address, RD and RT of the source PE; Then, the RR or PE should send the generated VPN Prefix ORF entry to the device at the next hop address via BGP ROUTE-REFRESH message, so that the device at the next hop address filters the VPN route sent by the source PE. Before the route update is announced, the receiver should check its ORF-Policy table with tuple of the VPN route. The Route Distinguisher information can be extracted directly from the BGP UPDATE message. The source PE information should be compared against the Source PE Extended Community if it is contained in BGP UPDATE message, or else the NEXT_HOP. If there is not a related VPN Prefix ORF entry in ORF-Policy table, the receiver will send this VPN route; otherwise, the receiver will perform the following operations: Wang, et al. Expires 23 January 2025 [Page 17] Internet-Draft RD-ORF July 2024 * If the "Offending VPN routes process method" bit is 0, the receiver should withdraw all the VPN routes identified by RD, RT and information in optional TLVs in the entry, and stop sending the corresponding VPN routes to the sender. * If the "Offending VPN routes process method" bit is 1, the receiver withdraw the extra VPN routes according to the value of VRF Prefix Limit, RD, RT and information in optional TLVs in the entry, and stop sending the corresponding VPN routes to the sender. 8. Withdraw of VPN Prefix ORF entries When the VPN Prefix ORF mechanism is triggered, the alarm information will be generated and sent to the network operators. Operators should manually configure the network to resume normal operation. Due to devices can record the VPN Prefix ORF entries sent by each VRF, operators can find the entries needs to be withdrawn, and trigger the withdraw process as described in [RFC5291] manually. After returning to normal, the device sends withdraw ORF entries to its peer who have previously received ORF entries. 9. Applicability We take the scenario in Section 4.1.1 as an example to illustrate how to determine each field when the sender generates a VPN Prefix ORF entry. We assume it is an IPv4 network. After PE1-PE4 and RR advertising the Outbound Route Filtering Capability, each of PE1-PE4 should send a VPN Prefix ORF entry that means "PERMIT-ALL" as follows: * AFI is equal to IPv4 * SAFI is equal to MPLS-labeled VPN address * When-to-refresh is equal to IMMEDIATE * ORF Type is equal to VPN Prefix ORF * Length of ORF entries is equal to 26 * Action is equal to ADD * Match is equal to PERMIT * Offending VPN routes process method is equal to 0 * Sequence is equal to 0xFFFFFFFF Wang, et al. Expires 23 January 2025 [Page 18] Internet-Draft RD-ORF July 2024 * Length is equal to 12 * VRF Prefix Limit is equal to 0xFFFF * Route Distinguisher is equal to 0 When the VPN Prefix ORF mechanism is triggered on PE1, PE1 will generate a VPN Prefix ORF contains the following information: * AFI is equal to IPv4 * SAFI is equal to MPLS-labeled VPN address * When-to-refresh is equal to IMMEDIATE * ORF Type is equal to VPN Prefix ORF * Length of ORF entries is equal to 44 * Action is equal to ADD * Match is equal to DENY * Offending VPN routes process method is equal to 0 * Sequence is equal to 1 * Length is equal to 30 * VRF Prefix Limit is equal to the prefix limit of VPN1 VRF * Route Distinguisher is equal to RD31 * Optional TLV: - Type is equal to 1 (Source PE TLV) - Length is equal to 8 - value is equal to PE3's IPv4 address - Type is equal to 4 (Route Target TLV) - Length is equal to 8 - value is equal to RT1 Wang, et al. Expires 23 January 2025 [Page 19] Internet-Draft RD-ORF July 2024 10. Implementation Considerations This draft is experimental in order to determine if the proposed mechanism could block the offending routes as expected or not, and whether it would cause other potential network failures. The first section below describes implementation considerations for the mechanism. The second section below provides a short summary of the experimental topology and the results. 10.1. Implementation Considerations Before originating a VPN Prefix ORF message, the device should compare the list of RDs carried by VPN routes signaled for filtering and the RDs imported by not affected VRF(s). Once they have intersection, the VPN Prefix ORF message MUST NOT be originated. In deployment, the quota value can be set with different granularity, such as , , etc. If the quota value is set to (VRF prefix limit/the number of PEs), whenever a new PE access to the network, the quota value should be changed. To avoid the frequently change of the quota value, the value can be set according to the following formula: Quota=MIN[(Margins coefficient)**, VRF Prefixes Limit] It should be noted that the above formula is only an example, the operators can use different formulas based on actual needs in management plane. 10.2. Implementation status Currently, H3C has implemented VPN Prefix ORF mechanism related functions as follows: * By configuring VRF Prefix limit and quota, achieve the use of RD and Source PE to control VPN routing. * Generating, transmitting and processing Type 1 and Type 2 Source PE TLV. * Using the Offending VPN routes process method to revoke all routes. Wang, et al. Expires 23 January 2025 [Page 20] Internet-Draft RD-ORF July 2024 Besides, we also implemented the following functions based on the open-source BGP implementation (FRR): * VPN Prefix ORF mechanism triggered based on VRF limit in intra- domain and inter-domain scenarios. * RD based VPN routing filtering in intra-domain and inter-domain scenarios. 10.3. Experimental topology The experiments will test whether the VPN Prefix ORF blocks the offending routes in the following scenarios: * Intra-domain as a standalone mechanism, * Inter-domain as a standalone mechanisms, * Adding the VPN Prefix ORF to existing mechanisms for intra-domain VPNs, * Adding the VPN Prefix ORF to existing mechanisms for intra-domain VPNs. 10.4. Results of Experiments [TBD] 11. Security Considerations This draft does build upon [RFC5291]. A BGP speaker will maintain the VPN Prefix ORF entries in an ORF-Policy table, this behavior consumes its memory and compute resources. To avoid the excessive consumption of resources, [RFC5291] specifies that a BGP speaker can only accept ORF entries transmitted by its interested peers. 12. IANA Considerations This document defines a new Outbound Route Filter type - VPN Prefix Outbound Route Filter (VPN Prefix ORF). The code point is from the "BGP Outbound Route Filtering (ORF) Types". It is recommended to set the code point of VPN Prefix ORF to 66. This document also define a VPN Prefix ORF TLV type under "Border Gateway Protocol (BGP) Parameters", four TLV types are defined: Wang, et al. Expires 23 January 2025 [Page 21] Internet-Draft RD-ORF July 2024 under "Border Gateway Protocol (BGP) Parameters" Registry: "VPN Prefix ORF TLV" Registration Procedure(s): First Come, First Served Value range:0-255, value 0 is reserved. +===========================+=============+===========================+ | Registry | Type | Meaning | +===========================+=============+===========================+ |IPv4 Source PE TLV | 1(suggested)|IPv4 address for source PE.| +---------------------------+-------------+---------------------------+ |IPv6 Source PE TLV | 2(suggested)|IPv6 address for source PE.| +---------------------------+-------------+---------------------------+ |Source PE Extended | 3(suggested)|Source PE Extended | |Community TLV | |Community for source PE | +---------------------------+-------------+---------------------------+ |Route Target TLV | 4(suggested)|Route Target of the | | | |offending VPN routes | +---------------------------+-------------+---------------------------+ This document also requests a new Transitive Extended Community Type. The new Transitive Extended Community Type name shall be "Source PE Extended Community". Under "BGP Transitive Extended Community Types:" Registry: "Source PE Extended Community" type 0x0d(suggested) Source PE Extended Community 13. Contributor Shunwan Zhuang Huawei Technologies Huawei Building, No.156 Beiqing Rd. Beijing Beijing, 100095 China 14. Acknowledgement Thanks Robert Raszuk, Jim Uttaro, Jakob Heitz, Jeff Tantsura, Rajiv Asati, John E Drake, Gert Doering, Shuanglong Chen, Enke Chen, Srihari Sangli and Igor Malyushkin for their valuable comments on this draft. 15. Normative References Wang, et al. Expires 23 January 2025 [Page 22] Internet-Draft RD-ORF July 2024 [I-D.ietf-bess-evpn-inter-subnet-forwarding] Sajassi, A., Salam, S., Thoria, S., Drake, J., and J. Rabadan, "Integrated Routing and Bridging in Ethernet VPN (EVPN)", Work in Progress, Internet-Draft, draft-ietf- bess-evpn-inter-subnet-forwarding-15, 26 July 2021, . [I-D.wang-idr-vpn-routes-control-analysis] Wang, A., Wang, W., Mishra, G. S., Wang, H., Zhuang, S., and J. Dong, "Analysis of VPN Routes Control in Shared BGP Session", Work in Progress, Internet-Draft, draft-wang- idr-vpn-routes-control-analysis-04, 6 September 2021, . [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, DOI 10.17487/RFC2119, March 1997, . [RFC4360] Sangli, S., Tappan, D., and Y. Rekhter, "BGP Extended Communities Attribute", RFC 4360, DOI 10.17487/RFC4360, February 2006, . [RFC4364] Rosen, E. and Y. Rekhter, "BGP/MPLS IP Virtual Private Networks (VPNs)", RFC 4364, DOI 10.17487/RFC4364, February 2006, . [RFC4684] Marques, P., Bonica, R., Fang, L., Martini, L., Raszuk, R., Patel, K., and J. Guichard, "Constrained Route Distribution for Border Gateway Protocol/MultiProtocol Label Switching (BGP/MPLS) Internet Protocol (IP) Virtual Private Networks (VPNs)", RFC 4684, DOI 10.17487/RFC4684, November 2006, . [RFC4760] Bates, T., Chandra, R., Katz, D., and Y. Rekhter, "Multiprotocol Extensions for BGP-4", RFC 4760, DOI 10.17487/RFC4760, January 2007, . [RFC5291] Chen, E. and Y. Rekhter, "Outbound Route Filtering Capability for BGP-4", RFC 5291, DOI 10.17487/RFC5291, August 2008, . [RFC5292] Chen, E. and S. Sangli, "Address-Prefix-Based Outbound Route Filter for BGP-4", RFC 5292, DOI 10.17487/RFC5292, August 2008, . Wang, et al. Expires 23 January 2025 [Page 23] Internet-Draft RD-ORF July 2024 [RFC7024] Jeng, H., Uttaro, J., Jalil, L., Decraene, B., Rekhter, Y., and R. Aggarwal, "Virtual Hub-and-Spoke in BGP/MPLS VPNs", RFC 7024, DOI 10.17487/RFC7024, October 2013, . [RFC7432] Sajassi, A., Ed., Aggarwal, R., Bitar, N., Isaac, A., Uttaro, J., Drake, J., and W. Henderickx, "BGP MPLS-Based Ethernet VPN", RFC 7432, DOI 10.17487/RFC7432, February 2015, . [RFC7543] Jeng, H., Jalil, L., Bonica, R., Patel, K., and L. Yong, "Covering Prefixes Outbound Route Filter for BGP-4", RFC 7543, DOI 10.17487/RFC7543, May 2015, . Appendix A. Experimental topology The experimental topology is shown in Figure 6. +--------------------------+ +--------------------------+ | | | | | | | | | +---------+ | | +---------+ | | | PE1 | | | | PE3 | | | +---------+ | | +---------+ | | \ | | / | | \+---------+ EBGP +---------+/ | | | | | | | | | ASBR1 |-----------| ASBR2 | | | | | | | | | +---------+ +---------+ | | / | | \ | | +---------+/ | | \+---------+ | | | PE2 | | | | PE4 | | | +---------+ | | +---------+ | | | | | | AS1 | | AS2 | +--------------------------+ +--------------------------+ Figure 6 The experimental topology This topology can be used to verify as follows: * whether the VPN Prefix ORF mechanism could block the offending routes in intra-domain scenario. * whether the VPN Prefix ORF mechanism could block the offending routes in inter-domain scenario. Wang, et al. Expires 23 January 2025 [Page 24] Internet-Draft RD-ORF July 2024 * whether the VPN Prefix ORF mechanism conflicts with the existing mechanism and cause failure. * whether the quota value leads to flapping. * TBD Authors' Addresses Wei Wang China Telecom Beiqijia Town, Changping District Beijing Beijing, 102209 China Email: weiwang94@foxmail.com Aijun Wang China Telecom Beiqijia Town, Changping District Beijing Beijing, 102209 China Email: wangaj3@chinatelecom.cn Haibo Wang Huawei Technologies Huawei Building, No.156 Beiqing Rd. Beijing Beijing, 100095 China Email: rainsword.wang@huawei.com Gyan S. Mishra Verizon Inc. 13101 Columbia Pike Silver Spring, MD 20904 United States of America Phone: 301 502-1347 Email: gyan.s.mishra@verizon.com Wang, et al. Expires 23 January 2025 [Page 25] Internet-Draft RD-ORF July 2024 Jie Dong Huawei Technologies Huawei Building, No.156 Beiqing Rd. Beijing Beijing, 100095 China Email: jie.dong@huawei.com Wang, et al. Expires 23 January 2025 [Page 26]