Mpls

MPLS-based Request Routing
MPLS-based Request Routing Arup Acharya, Anees Shaikh, Renu Tewari, Dinesh Verma IBM TJ Watson Research Center
High volume Internet data centers

Web server cluster + front-end dispatcher
direct requests based on server load, requested content, client identity, etc.
dispatcher
clients
Next Generation Internet
SoSe03
240
SoSe03
241
Current dispatcher technology

Layer-4 dispatchers

MPLS-based architecture
MPLS provides a circuit-switching service over a hop-by-hop IP network Architecture components
z z
route requests based on TCP/IP headers high-performance h/w implementations functionality limited to load balancing or simple affinity use application information (e.g., HTTP headers) sophisticated functionality content-based routing, affinity, load-balancing scalability and performance limited by TCP connection termination
z
Layer-7 dispatchers

MPLS network MPLS-enabled client-side proxy z MPLS switch acting as dispatcher
clients forward proxy
servers MPLS-enabled network LSR

LS P
Application-level gateway z TCP splicing z TCP connection handoff
MPLS switch
Desired solution

sophisticated functions and flexibility high-performance

SoSe03 242 Next Generation Internet
control connection
SoSe03
243
MPLS label stacking

label stacking:

Label distribution to proxies

Persistent control connection between dispatcher and proxy

Labels typically used for expressing routing policies Use label stacking to push application-layer label Outer routing label used for switching in the network MPLS-enabled server network further improves performance
content-based routing: URLlabel mapping load balancing: labels, weights, and policy client affinity: labels and timeout, start/stop URLs service differentiation: per-service class label set (e.g., gold, silver, bronze)
MPLS-enabled network IP pkt LA IP pkt LA LR IP pkt LA LR IP pkt LA IP pkt
Dispatcher populates layer-2 label table
forward proxy
ingress LSR
egress LSR
MPLS switch
SoSe03
244
SoSe03
245
Deployment issues
Need wide MPLS deployment in core and edge
Summary
Key advantages

supported by reports from large ISPs and IP service equipment vendors (e.g., for VPNs) data center and proxies in same administrative control (ISP with hosting service) ASPs with large enterprise customers and SLAs Intranet and extranet servers limit proxy participation to high-volume client sites proxies may initiate with selected, popular sites
Why install an MPLS-enabled proxy?

leverage growth of MPLS deployment in core and edge networks removes primary bottleneck of TCP termination realization in standard off-the-shelf switch hardware implements sophisticated request routing functions assign some request-routing functionality to proxies MPLS-aware proxies at the network edges implementation of control protocol for label distribution
Requirements

Scaling to many proxies and web sites

SoSe03
246
SoSe03
247
Literature and acknowledgements

MPLS-based Request Routing (R&D Synopsis) A. Acharya, A. Shaikh, R. Tewari, and D. Verma, Proc. Int'l Workshop on Web Caching and Content Distribution (WCW '01), June 2001. Extended version published as IBM Research Report RC 22275 http://www.research.ibm.com/people/a/aashaikh/papers/rc22275.pdf In MPLS World News http://www.mplsworld.com/archi_drafts/focus/analy-ibm.htm Thanks to Anees Shaikh and Arup Acharya for providing their presentation!
Content Distribution in the WWW

Motivation & Classification Web Caching Content Distribution Networks

Techniques Performance
SoSe03
248
SoSe03
249
Content Distribution in the WWW: Motivation

WWW users use HTTP to retrieve web objects from a server Response time can be slow (World wide wait):

Content Distribution in the WWW

Content distribution refers to mechanisms for: 1. Replicating content on multiple servers in the Internet 2. Providing end systems means to determine the servers with fastest response Large industry:

Low-speed path causing low transmission delay One or more congested links cause queuing delay and packet drops Web server is overloaded
Strategy:

Replicate server content Direct client to best server
Cisco, Lucent, Inktomi, CacheFlow etc.: provide hard-and software Akamai, AT&T etc.: provide content distribution services to providers such as CNN and Yahoo Web caching Content distribution networks P2p file sharing (extra lectures)
Classification:

SoSe03
250
SoSe03
251
Web caching
A web cache (proxy server) is a network entity that satisfies HTTP requests on the behalf of an origin server Cache is both a client (to the origin server) and a server (to the clients)
Web caching: Motivation

Reduce latency by avoiding slow links between client and origin server:

low bandwidth links congested links between institutional network and regional ISP. Reduce traffic on transoceanic links. An Internet dense with caches allows a content provider to offer highperformance distribution at low cost.
z z
Reduce traffic on links

Pr eq u HT est T Pr client esp o ns e st e u req n se TP spo HT e r TP HT
HT T
Proxy server
TP HT
es t e qu
Spread load of overloaded origin server to caches.
Origin server
Inexpensive server Low-bandwidth Internet connection
client
Origin server
SoSe03 252 Next Generation Internet SoSe03 253
Design Techniques: Hierarchical Caching

Origin servers National ISP
Cooperative Caching
Multiple sibling caches within a single ISP. One or more of the siblings could contain the requested object. Cooperation:

Regional ISP
Regional ISP
ICP (Internet Cache Protocol): siblings send messages to each other to find a copy of object (Intercache communication) CARP (Cache Array Routing Protocol): URL space is partitioned (Hashbased Request Routing) Can have cooperating sibling caches in each ISP in each tier of a hierarchy.
Each ISP can have a cache. ISPs higher in hierarchy have

Local ISP
Local ISP
larger user populations higher hit rates

Web cache
SoSe03
254
SoSe03
255
Caching: Other caching terms

Reverse proxy caching:
September 11 with Web caching Web Server www.cnn.com
Caches close to the origin server. Independent of client-side proxy caching Aims at caching dynamic content, e.g. personalized content Retrieve data from remote servers in anticipation of client requests A summary of the contents of an Internet Object Caching Server Harvest: introduced the idea of hierarchical caching (mainly for FTP) Squid: extended Harvest for HTTP, introduced ICP
New Content WTC News!
Active Caching:
Content Prefetching:
Cache Digest:
request
1000,000 other hosts
Well-known cache systems:

1000,000 other hosts old content request
ISP
- Congestion / Bottleneck - Caching Proxy
Figure from J. R. Iyengar
SoSe03
256
User merlot.cis.udel.edu
Content Distribution Networks

Content distribution networks (CDNs) are a mechanism to deliver content to end users on behalf of origin web sites. CDNs consist of a collection of surrogates (non-origin servers) that attempt to offload work from origin servers by delivering content on their behalf. For each request, the CDN locates a surrogate close to the client that serves the request. Different notions of close are:

CDNs vs. Web caches

A CDN can be regarded as a set of widely-dispersed caches but there are two major differences to web caches:
Surrogates are coordinated by a mechanism that routes client requests to good surrogate Surrogates are potentially populated by other means than requests by clients
CDN distribution node Origin server in North America Surrogate in South America Surrogate in Europe Surrogate in Asia
Network proximity Bandwidth availability Availability of content Low latency, e.g. choosing a lightly loaded server (or not heavily loaded server)
SoSe03
258
SoSe03
259
September 11 with CDN Web Server www.cnn.com WA CA IL MA 1000,000 other users FL NY DE

request new content - Distribution Infrastructure - Surrogate
Figure from J. R. Iyengar
CDN Techniques
New Content WTC News!
There are two major techniques for redirecting client requests for objects served by the CDN to a particular CDN server (sometimes called Request Routing):
1.
DNS redirection
a. b.
MI
1000,000 other users
Full-site content delivery Partial-site content delivery
2.
URL rewriting
Other techniques are:

Anycast (does not consider surrogate load) Transport-layer request routing: can be used in combination with DNS redirection.
User merlot.cis.udel.edu
Next Generation Internet SoSe03
260
SoSe03
261
DNS Redirection
Normal DNS operation:
Client: checks local cache g http 199.172.136.40
DNS Redirection
Modified DNS:
Client: checks local cache g http Surrogate X
c http://www.ieee.org/
c http://www.ieee.org/
Local DNS: d IP-Adresse fr www.ieee.org ?
f www.ieee.org A 199.172.136.40
dns.ieee.org
Local DNS: d IP-Adresse fr www.ieee.org ?
f www.ieee.org Surrogate X
Modified: chooses one surrogate
e dns.ieee.org NS 199.172.136.6 RootNameserver
e dns.ieee.org NS 199.172.136.6 RootNameserver: Knows dns.ieee.org Other NS that cannot resolve name
Other NS that cannot resolve name

Next Generation Internet SoSe03 262
SoSe03
263
DNS Redirection
Advantages:

Full- and Partial-site Content Delivery

Full-site:

Simple no changes to existing protocols, clients or servers General works for all IP-based applications, independent of transport protocol used
All requests to the CDN are redirected via DNS Surrogates either serve content from their cache or forward requests to the origin server Used by Adero, Netcaching, ..
Partial-site:

Origin sites modifies the embedded URLs for objects (images) so that these URLs are resolved by the CDNs DNS server Actual syntax varies with the CDN. Speedera changes www.foo.com/bar.gif to foo.speedera.net/www.foo.com/bar.gif
SoSe03
264
SoSe03
265
DNS redirect and TTL

DNS resource records contain Time to Live field that specifies how long a client may cache a resource record. RFC 1912 recommends TTL values of 1-5 days
Problems with small TTL values

Small TTLs lead to problems:

Clients must perform DNS lookups more frequently

z
This can increase client latency It has been observed that in many cases the time between the HTTP GET request and the arrival of the first data packet accounts for 30-40% of the response time, the main reason is the bad performance of DNS
Increased load on DNS

z
Nameservers typically use a TTL of 1 day.
The DNS of CDNs have very small TTL values of 10-200 seconds
Aim: better load balancing
SoSe03
266
SoSe03
267
Effectiveness of DNS-based Server Selection

Study by A. Shaikh, R. Tewari and M. Agrawal:

URL Rewriting
URL Rewriting:

Without careful TTL tuning, client latency can increase

z
In particular, when web pages contain more embedded objects
Typical client-nameserver distance is 8 or more hops. Furthermore clients and nameservers often have disjoint paths to surrogates.
z
Latency to nameserver is poor indicator of latency to client
Origin server rewrites URL links as part of dynamically generating pages to redirect clients to different servers. At resource access time, the page is dynamically rewritten with the IP address of one of the surrogates, avoiding the need for a DNS lookup. Problem: First request must be served from origin server
Hybrid approach:
Use URL rewriting to identify a particular server that might resolve to the IP address of another surrogate
SoSe03
268
SoSe03
269
Performance of CDNs (1)

Study by Johnson et al. that evaluated performance of two CDNs (Akamai and Digital Island)
1.
Performance of CDNs (2)

Study by Krishnamurty et al.:

2.
3.
CDNs are able to succesfully provide services by avoiding significantly "bad" services as opposed to being able to pick the best ones. CDN's occasionally make bad choices in picking servers for clients that have measured latencies worse than going to the original client thereby degrading service for client rather than improving them. The use of CDN's actually does improve performance on average when considering both performance using the origin server as well as comparing the choice of server to other possible choices.
CDNs offer much better performance than origin servers Significant differences in download times between different CDNs Compared the download time for a newly obtained surrogate to a fixed and the previous surrogate (i.e. effect of low TTL values):
z
In almost all cases, the response time was better using the previous or fixed server z Indicates that even worst-case client response time is generally not improved with a DNS lookup to find a new server z Confirms the findings by Shaikh and Tewari that careful tuning of TTL values is important (and difficult)
SoSe03
270
SoSe03
271
Literature
B. Krishnamurthy, C. Wills, and Y. Zhang . On the Use and Performance of Content Distribution Networks Proceedings of SIGCOMM IMW 2001, California, November 2001. A. Shaikh, R. Tewari, M. Agrawal. On the Effectiveness of DNSbased Server Selection, Proc. IEEE INFOCOM 2001, April 2001. K. Johnson, J. Carr, M. Day, and M. F. Kaashoek. The measured performance of content distribution networks. 5th International Web Caching and Content Delivery Workshop, Lisbon, Portugal, May 2000. G. Barish and K. Obraczka. World Wide Web Caching: Trends and Techniques. IEEE Communications Magazine Internet Technology Series, May 2000.
SoSe03
272

Mpls

Hochgeladen von

Dokumentinformationen

Originalbeschreibung:

Originaltitel

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

Mpls

Hochgeladen von

Copyright:

Verfügbare Formate

MPLS-based Request Routing

High volume Internet data centers

Next Generation Internet

Next Generation Internet

Current dispatcher technology

MPLS network MPLS-enabled client-side proxy z MPLS switch acting as dispatcher

clients forward proxy

servers MPLS-enabled network LSR

Application-level gateway z TCP splicing z TCP connection handoff

sophisticated functions and flexibility high-performance

Next Generation Internet

MPLS label stacking

Label distribution to proxies

MPLS-enabled network IP pkt LA IP pkt LA LR IP pkt LA LR IP pkt LA IP pkt

Dispatcher populates layer-2 label table

Next Generation Internet

Next Generation Internet

Why install an MPLS-enabled proxy?

Scaling to many proxies and web sites

Next Generation Internet

Next Generation Internet

Literature and acknowledgements

Content Distribution in the WWW

Next Generation Internet

Next Generation Internet

Content Distribution in the WWW: Motivation

Content Distribution in the WWW

Replicate server content Direct client to best server

Next Generation Internet

Next Generation Internet

Web caching: Motivation

Reduce traffic on links

Pr eq u HT est T Pr client esp o ns e st e u req n se TP spo HT e r TP HT

Spread load of overloaded origin server to caches.

Inexpensive server Low-bandwidth Internet connection

Design Techniques: Hierarchical Caching

Each ISP can have a cache. ISPs higher in hierarchy have

larger user populations higher hit rates

Next Generation Internet

Next Generation Internet

Caching: Other caching terms

September 11 with Web caching Web Server www.cnn.com

New Content WTC News!

1000,000 other hosts

Well-known cache systems:

1000,000 other hosts old content request

Next Generation Internet

Content Distribution Networks

CDNs vs. Web caches

Next Generation Internet

Next Generation Internet

September 11 with CDN Web Server www.cnn.com WA CA IL MA 1000,000 other users FL NY DE

1000,000 other users

Full-site content delivery Partial-site content delivery

Other techniques are:

Next Generation Internet

Local DNS: d IP-Adresse fr www.ieee.org ?

Local DNS: d IP-Adresse fr www.ieee.org ?

Modified: chooses one surrogate

e dns.ieee.org NS 199.172.136.6 RootNameserver

Other NS that cannot resolve name

Next Generation Internet