SRv6 uSID - Scale (Pt. 2)
Last updated
Last updated
Load top1.vpnv4v6.srv6.usid.multi.area.part2.init.cfg in the XRv9K topology in CML.
The leaking of loopbacks into L1 has been removed. Find another way to make the BGP nexthops accessible. Do not use any null0 routes or summarization from the L1/L2 routers.
Using a PCE to calculate inter-area SRv6 SID paths can be an alternative to redistributing loopbacks into the IGP to achieve BGP nexthop validity. The idea is that the headend can resolve the nexthop for the VPN route via the PCE instead of via the RIB. So the L1/L2 router no longer needs to leak the loopback prefixes into L1. This allows for even better routing scalability, and also allows for end-to-end TE paths based on latency, link-affinity, etc.
First, we setup the PCEP sessions. XR9 will act as PCE in this solution.
Because we have multiple ISIS L1 areas, we must use BGP-LS to give XR9 full topology info. In this solution, XR1 and XR3 run BGP-LS with XR9, but you can also configure XR4 as well for redundancy.
Additionally, I found that when the PCE is receiving topologies from multiple L1 areas via BGP-LS, we also need to set every node’s RID for TE purposes. It seems when only a single flat L2 topology is used, this is not required. But my ODN policies would not come up until I included this step:
We’ll now export all L3VPN routes with a color. This means that every route will use the PCE for resolution. We’ll use color 10 to mean the default “best effort” IGP path:
Now we need to export VPN routes with this color. For variety, I use a route-policy directly under the VRF, instead of applying the RPL at the BGP/VRF/AFI level.
The last step is that we need to tell BGP to use the ODN policy as a substitute for nexthop validity. If the ODN policy is up, then that should take place of IPv6-reachability to the nexthop. Additionally, we tell BGP to use the ODN policy’s metric as the IGP metric during that BGP bestpath step.
The VPN routes now show a BSID is used for nexthop reachability. Additionally, the metric from the SR-TE policy is passed to BGP:
Traffic is working end-to-end:
In summary, using ODN for BGP nexthop validation allows better scalability by removing the need to leak loopbacks into the L1 areas. Also, it allows for end-to-end SR-TE policies.
We are still leaking the locator summary prefixes into L1. Can we remove these too?
The answer is that yes, we can! This is because the PCE is pushing a SID list that includes the local L1/L2 router. So the first hop of the SID list will be resolvable via the L1. This allows even better scalability.
On XR1, XR3, and XR4, we’ll remove route leaking:
Traffic is still working end-to-end:
This is because both XR5 and XR7 have an SR-TE policy that uses their L1/L2 router as the first hop: