Beruflich Dokumente
Kultur Dokumente
Agenda
Thinking About Fast Convergence
Reactive Convergence
Proactive Convergence
Closing Remarks
BRKRST-3363
Cisco Public
Agenda
Thinking About Fast Convergence
Fast Convergence Mindset
Measuring Fast Convergence
Reactive Convergence
Proactive Convergence
Closing Remarks
BRKRST-3363
Cisco Public
Do I Need It?
Complexity vs. Return
Business Drivers
Risks
Processes
Monitoring
Applications
Everything Matters!
BRKRST-3363
Cisco Public
BRKRST-3363
Cisco Public
Agenda
Thinking About Fast Convergence
o Fast Convergence Mindset
Measuring Fast Convergence
Reactive Convergence
Proactive Convergence
Other Convergence Tools
Closing Remarks
BRKRST-3363
Cisco Public
Measuring Convergence
Convergence =
Failure Detection + Event Propagation + Routing Process + FIB Update
Neighbor Down
Tell Neighbors
B
A
D
C
BRKRST-3363
Cisco Public
B
A
D
C
BRKRST-3363
Cisco Public
10
!
A
D
C
BRKRST-3363
Cisco Public
11
Event Propagation
Spread the word
B
My Link to D is down!
D
C
BRKRST-3363
Cisco Public
12
My Link to B is down!
Routing Process
What happened?
Event Propagation
Spread the word
B
A
???
???
C
BRKRST-3363
Cisco Public
13
Routing Process
What happened?
Event Propagation
Spread the word
Reach D via C
A
No change
C
BRKRST-3363
Cisco Public
14
Routing Process
What happened?
Event Propagation
FIB Update
Make it so
B
A
D
C
BRKRST-3363
Cisco Public
15
Routing Process
What happened?
Event Propagation
FIB Update
Make it so
B
A
D
C
BRKRST-3363
Cisco Public
16
Routing Process
10+ ms
Now where do we go?
What happened?
Event Propagation 0 to 10 ms
Make it so
B
A
D
C
BRKRST-3363
Cisco Public
17
Agenda
o Thinking About Fast Convergence
Reactive Convergence
Failure Detection
Detecting Link Failures
Fast Hellos and BFD
Event Propagation
Routing Update
Forwarding Table Update
BGP Convergence
Proactive Convergence
Closing Remarks
BRKRST-3363
Cisco Public
18
Routing Process
What happened?
Event Propagation
FIB Update
Make it so
BRKRST-3363
Cisco Public
19
Routing Process
What happened?
Event Propagation
FIB Update
Make it so
BRKRST-3363
Cisco Public
20
Failure Detection
Detecting Link Failure
Link Failure -> Interface Down, Easy?
Hardware
PHY
Software
Firmware
Port
ASIC
CPU
Hardware Dependent
Polling vs Interrupt
6748-GE-TX: 20ms/port * 48 ports = 960ms (polled)
Nexus 7k, ASR9k, 6708-10GE/ES/ES+: <10ms (interrupt)
BRKRST-3363
Cisco Public
21
IOS
Failure Detection
Detecting Link Failure
Link Failure -> Interface Down, Easy?
Hardware
PHY
Software
Firmware
Port
ASIC
CPU
Debounce Timer
Throttles down notification
Switches only
BRKRST-3363
Cisco Public
22
IOS
Failure Detection
Detecting Link Failure
Link Failure -> Interface Down, Easy?
Hardware
PHY
Software
Firmware
Port
ASIC
CPU
Debounce Timer
Carrier Delay
Throttles up + down
Switches only
Routers only
BRKRST-3363
Cisco Public
23
IOS
Carrier Delay
Generally Configurable
Platform/Linecard/Media Dependent
Software Dependent
IOS/IOS-XE
7600
2 Seconds
NX-OS
100ms (SVI Only)
NX-OS
XR/ASR9k
100ms
0 ms
ASR9k
0ms
Cisco Public
24
Agenda
o Thinking About Fast Convergence
Reactive Convergence
Failure Detection
o Detecting Link Failures
Fast Hellos and BFD
Event Propagation
Routing Update
BGP Convergence
Forwarding Table Update
Proactive Convergence
Closing Remarks
BRKRST-3363
Cisco Public
26
BFD
Even Faster
~1 second detection
Process Driven
1 Hello/Protocol
50+ Bytes
BRKRST-3363
~24 bytes
Cisco Public
27
BFD
Even Faster
~1 second detection
Process Driven
1 Hello/Protocol
50+ Bytes
BRKRST-3363
~24 bytes
Cisco Public
28
Routing Process
What happened?
BFD (150 ms)
FIB Update
Event Propagation
Make it so
BRKRST-3363
Cisco Public
30
Agenda
o Thinking About Fast Convergence
Reactive Convergence
o Failure Detection
Event Propagation
Routing Update
BGP Convergence
Forwarding Table Update
Proactive Convergence
Closing Remarks
BRKRST-3363
Cisco Public
31
Routing Process
What happened?
BFD (150 ms)
FIB Update
Event Propagation
Make it so
BRKRST-3363
Cisco Public
32
EIGRP
Query
The Bad
Query Domain Size
B
EIGRP
Query
EIGRP
Query
D
EIGRP
Query
BRKRST-3363
Cisco Public
33
EIGRP
Query
The Bad
Query Domain Size
B
The Ugly
EIGRP
Query
Stuck In Active
EIGRP
Query
D
EIGRP
Query
BRKRST-3363
Cisco Public
34
EIGRP
Query
Summary
Stub
Filters
Summary
Boundary
BRKRST-3363
Cisco Public
35
EIGRP
Query
Summary
Stub
Filters
EIGRP
Reply
Summary
Boundary
BRKRST-3363
Cisco Public
36
<10s ms
Feasible Successors
BRKRST-3363
Cisco Public
38
<10s ms
Feasible Successors
Dont even ask!
No Query/Reply
B
~0 ms
BRKRST-3363
Cisco Public
39
Routing Process
What happened?
BFD (150 ms)
FIB Update
Event Propagation
Make it so
BRKRST-3363
Cisco Public
44
Cisco Public
45
Start: 0ms
Start: 5ms
Hold: 20ms
Hold: 40ms
Max: 5000ms
Max: 10000ms
BRKRST-3363
Cisco Public
46
Routing Process
What happened?
BFD (150 ms)
FIB Update
Event Propagation
Make it so
BRKRST-3363
Cisco Public
48
RP/0/RSP0/CPU0:XR# configure
RP/0/RSP0/CPU0:XR(config)# router isis CLUS
RP/0/RSP0/CPU0:XR(config-isis)# lsp-gen-interval initial-wait <time>
RP/0/RSP0/CPU0:XR(config-isis)# interface g0/3/0/0
RP/0/RSP0/CPU0:XR(config-isis-if)# lsp fast-flood threshold <num of LSPs>
BRKRST-3363
Cisco Public
49
Routing Process
What happened?
BFD (150 ms)
FIB Update
Event Propagation
Make it so
BRKRST-3363
Cisco Public
51
Agenda
o Thinking About Fast Convergence
Reactive Convergence
o Failure Detection
o Event Propagation
Routing Update
BGP Convergence
Forwarding Table Update
Proactive Convergence
Closing Remarks
BRKRST-3363
Cisco Public
52
Routing Process
What happened?
BFD (150 ms)
Event Propagation
FIB Update
BRKRST-3363
Cisco Public
Make it so
53
BRKRST-3363
Cisco Public
54
Routing Process
What happened?
BFD (150 ms)
Event Propagation
FIB Update
BRKRST-3363
Cisco Public
Make it so
55
Cisco Public
56
Cisco Public
57
BRKRST-3363
Cisco Public
58
Routing Process
What happened?
BFD (150 ms)
Event Propagation
FIB Update
BRKRST-3363
Cisco Public
Make it so
59
Agenda
o Thinking About Fast Convergence
Reactive Convergence
o Failure Detection
o Event Propagation
o Routing Update
BGP Convergence
Forwarding Table Update
Proactive Convergence
Closing Remarks
BRKRST-3363
Cisco Public
60
OSPF
OSPF
Different Goals
Lots of Data.
.means lots of CPU
.means lots of memory
.means lots of packets
iBGP
OSPF
OSPF
D
Cisco Public
61
BFD
neighbor <> fall-over bfd
Interface Tracking
Notifies BGP if interface/route down
Enabled by default
BRKRST-3363
Cisco Public
62
L3 Source
L3 Destination
TTL
Bigger packets
Source Port
Destination Port
Flags
Window
Window Size
Local TCP buffer
ACKs reduce window as it fills
Update Groups
BGP Routes
BRKRST-3363
Cisco Public
64
MTU
MSS
Cisco Public
66
192.168.1.1
10.1.2.0/24
192.168.1.1
10.1.3.0/24
192.168.1.1
Before PIC
Update per route
Convergence dependent on BGP RIB size
BRKRST-3363
Cisco Public
67
BRKRST-3363
10.1.1.0/24
192.168.1.1
10.1.2.0/24
192.168.1.1
10.1.3.0/24
192.168.1.1
Cisco Public
68
192.168.1.1
Next Hop 1
10.1.3.0/24
192.168.1.1
192.168.1.1
BRKRST-3363
Cisco Public
69
Agenda
o Thinking About Fast Convergence
Reactive Convergence
o Failure Detection
o Event Propagation
o Routing Update
o BGP Convergence
Forwarding Table Update
Proactive Convergence
Closing Remarks
BRKRST-3363
Cisco Public
70
Routing
Table
EIGRP
FIB
(Software CEF)
Hardware CEF
(TCAM)
Software
BRKRST-3363
Cisco Public
Hardware
71
Routing
Table
EIGRP
FIB
(Software CEF)
Hardware CEF
(TCAM)
Software
BRKRST-3363
Cisco Public
Hardware
72
Routing
Table
EIGRP
FIB
(Software CEF)
Hardware CEF
(TCAM)
Software
BRKRST-3363
Cisco Public
Hardware
73
Process quantum
2 Quad
Core
2.13 GHz
Single 2 Single
Core
Core
Single
1.5 GHz 1.5 GHz
Core
Single 1.5 GHz
Core
[VALUE]
MHz
Dual
Quad
Core
Core
2.66 GHz
2.27 GHz
XE only
Prefix Prioritization
Sup720
BRKRST-3363
Cisco Public
74
RSP720
RP1
RSP2
Sup2e
RSP440
RP2
Routing
Table
EIGRP
FIB
(Software CEF)
Hardware CEF
(TCAM)
Software
BRKRST-3363
Cisco Public
Hardware
77
Routing
Table
EIGRP
FIB
(Software CEF)
Hardware CEF
(TCAM)
Software
BRKRST-3363
Cisco Public
Hardware
78
Hardware Matters!
7600:
7600(config)#hw-module slot <mod> process-max-time 50
7600(config)#hw-module slot <sup> sp process-max-time 50
Cisco Public
79
Routing
Table
EIGRP
FIB
(Software CEF)
Hardware CEF
(TCAM)
Software
BRKRST-3363
Cisco Public
Hardware
80
Routing Process
What happened?
BFD (150 ms)
Event Propagation
FIB Update
153ms 5+ minutes
BRKRST-3363
Cisco Public
81
Agenda
o Thinking About Fast Convergence
o Reactive Convergence
Proactive Convergence
Loop Free Alternate (IP FRR)
BGP PIC Edge
LDP Session Protection
Closing Remarks
BRKRST-3363
Cisco Public
82
10.1.1.0/24
BRKRST-3363
Cisco Public
83
Cisco Public
84
EIGRP LFA
RouterB#show ip route 172.16.2.0
Known via "eigrp 10", distance 90, metric 1100800, type
internal
* 172.16.1.2, from 172.16.1.2, 00:00:17 ago, via Ethernet0/1
Route metric is 281600, traffic share count is 1
Repair Path: 192.168.1.1, via Ethernet0/0
RouterB#show ip cef 172.16.2.0
172.16.2.0/24
nexthop 172.16.1.2 Ethernet0/1
repair: attached-nexthop 192.168.1.1 Ethernet0/0
BRKRST-3363
Cisco Public
85
Remote LFA
D
BRKRST-3363
E
Cisco Public
87
Remote LFA
A
Traffic from D
Traffic from A
D
BRKRST-3363
E
Cisco Public
88
Remote LFA
A
Traffic from D
Microloop!
Traffic from A
D
BRKRST-3363
E
Cisco Public
89
Remote LFA
Local node runs Secondary SPF from the point of view of the remote node
Automatic MPLS TE Fast Reroute
Use TE Tunnel to get between local and remote nodes
Requires LDP in the ring
BRKRST-3363
Cisco Public
90
Remote LFA
D
BRKRST-3363
E
Cisco Public
91
Remote LFA
D
BRKRST-3363
E
Cisco Public
92
ASR 9000
ASR901
BRKRST-3363
Cisco Public
94
ASR 9000
Microloop!
E
ASR901
BRKRST-3363
Cisco Public
95
Keep Tunnel
Delay install of routes to RIB
5 Second Delay
Routing Table
BRKRST-3363
Cisco Public
96
ASR 9000
ASR901
BRKRST-3363
Cisco Public
97
ASR 9000
ASR901
BRKRST-3363
Cisco Public
98
ASR 9000
ASR901
BRKRST-3363
Cisco Public
99
Agenda
o Thinking About Fast Convergence
o Reactive Convergence
Proactive Convergence
o Loop Free Alternate
BGP PIC Edge
LDP Session Protection
Closing Remarks
BRKRST-3363
Cisco Public
100
PE1
PE3
CE
2
CE
1
PE4
PE2
BRKRST-3363
Cisco Public
101
PE1
PE3
CE
2
CE
1
bgp advertise-best-external
PE4
PE2
BRKRST-3363
Cisco Public
102
PE1
PE3
CE
2
CE
1
PE4
PE2
BRKRST-3363
Cisco Public
103
PE1
PE3
bgp additional-paths install
CE
2
CE
1
bgp advertise-best-external
PE4
PE2
BRKRST-3363
Cisco Public
104
Agenda
o Thinking About Fast Convergence
o Reactive Convergence
Proactive Convergence
o Loop Free Alternate
o BGP PIC Edge
LDP Session Protection
Closing Remarks
BRKRST-3363
Cisco Public
106
BRKRST-3363
Cisco Public
107
B
Session Protection creates targeted LDP Session
BRKRST-3363
Cisco Public
108
B
Session Protection creates targeted LDP Session
BRKRST-3363
Cisco Public
109
Agenda
o Thinking About Fast Convergence
o Reactive Convergence
o Proactive Convergence
Closing Remarks
BRKRST-3363
Cisco Public
122
Other Considerations
Punt Path
Cisco Public
123
Final Thoughts
Timers are just the beginning
Everything matters
CPU, Hardware, Software, Latency, Operating System
Culture of Engineering
Tolerance for false positive
Willing and able to work on hard problems
BRKRST-3363
Cisco Public
124
Recommended Sessions
BRKARC-2350 IOS Routing Internals
BRKDCT-2333 Data Center Network Failure Detection
BRKRST-3371 Advances in BGP
BRKRST-3007 Advanced Topics and Directions in Routing Protocols
BRKARC-3472 - NX-OS Routing Architecture and Best Practices
BRKRST-2336 (EIGRP), 2337 (OSPF), 2338 (ISIS) Deployment in Modern Networks
BRKRST-2041 - WAN Architectures and Design Principles
BRKRST-2042 Highly Available Wide Area Network Design
BRKCRS-2031 Enterprise Campus Design: Multilayer Architectures and Design Principles
BRKNMS-2518 Secrets to Achieving High Availability
BRKRST-3363
Cisco Public
125