Beruflich Dokumente
Kultur Dokumente
Dr. G P SAJEEV
Asst.Professor
1 / 43
Contents
Outline
1 2
3 4 5
2 / 43
Interaction with underlying network protocol: TCP Scalability and performance enhancement
Server farms Web Caching Content Distribution Network (CDN)
3 / 43
HTML
Actual content of web resource, represented in ASCII
HTTP
Protocol for client/server communication
4 / 43
Web Page
Base HTML le Referenced objects (e.g., images), Each object has its own URL
+
Straight-forward and easy to learn Simplest HTML document is a plain text le Automatically generated by authoring programs
Web Content Distribution 5 / 43
Web Access
+ +
Client program
E.g., Web browser Running on end host Requests service
+
Server program
Provides service E.g., Web server
6 / 43
Servers
Receive requests and send responses Store or generate the responses
7 / 43
Web Browser
Functions
Generating HTTP requests
User types URL, clicks a hyperlink, or selects bookmark User clicks reload, or submit on a Web page Automatic downloading of embedded images
Layout of response
Parsing HTML and rendering the Web page Invoking helper applications (e.g., Acrobat, PowerPoint)
Maintaining a cache
Storing recently-viewed objects Checking that cached objects are fresh
8 / 43
+
Caching & Replication
9 / 43
Caching
Relevance of Cache Web Cache
To keep the Web services attractive, the client-side latencies and congestion in the network are to be reduced to a tolerable limit. Caching the documents at strategic points across the network is a solution, which is termed as Web caching.
1
Reducing the cost of connecting to the Internet. Reducing the latency of todays WWW. Bandwidth will always have some cost. Non-uniform bandwidth and latencies. Bandwidth demands continue to increase.
10 / 43
Web Cache
Performance Parameters
Hit Ratio (HR) Byte Hit Ratio (BHR) Mean Response Time (MRT)
11 / 43
Caching Tasks
Replacement To decide whether an object is to be replaced/evicted from the cache store, when the cache storage is full. This is the process of nding out the best candidate for eviction. Admission To decide whether an object is to be cached or admitted to the cache store, when it arrives from the origin server. Consistency Handling To decide an action (invalidate/update/pre-fetch), when an object in the cache store becomes stale.
12 / 43
Cache: Example
Assumptions
average object size = 100,000 bits avg. request rate from institutions browser to origin serves = 15/sec delay from institutional router to any origin server and back to router = 2 sec
Network
Consequences
utilization on LAN = 15% utilization on access link = 100% total delay = Internet delay + access delay + LAN delay = 2 sec + minutes + milliseconds
Web Content Distribution 13 / 43
Cache: Example
Possible solution
increase bandwidth of access link to, say, 10 Mbps
Consequences
utilization on LAN = 15 utilization on access link = 15% Total delay = Internet delay + access delay + LAN delay = 2 sec + msecs + msecs often a costly upgrade
Web Content Distribution 14 / 43
Network
Cache: Example
Install Cache
suppose hit rate is 40 %
Consequences
40 % requests will be satised almost immediately 60% requests satised by origin server utilization of access link reduced to 60%, resulting in negligible delays (say 10 msec) total delay = Internet delay + access delay + LAN delay = 0.6*2 sec + .6*.01 secs +
Web Content Distribution 15 / 43
Web Proxy
16 / 43
Proxy Caching
The process
Client 1 requests http://www.foo.com/fun.jpg
Client sends GET fun.jpg to the proxy Proxy sends GET fun.jpg to the server Server sends response to the proxy Proxy stores the response, and forwards to client
Benets
Faster response time to the clients Lower load on the Web server Reduced bandwidth consumption inside the network
Web Content Distribution
17 / 43
Other Functions
Anonymization
Server sees requests coming from the proxy address rather than the individual user IP addresses
Transcoding
Converting data from one form to another. E.g., reducing the size of images for cell-phone browsers
Prefetching
Requesting content before the user asks for it
Filtering
Blocking access to sites, based on URL or content
18 / 43
The server and its link can be overloaded Peering points between ISPs can be congested Alternative solution: Content Distribution Networks
Geographically diverse servers serving content from many sources
CDN
19 / 43
Redirector Mechanisms
Augment DNS to return dierent server addresses Server-based redirection: based on HTTP redirect feature
20 / 43
CDN Architecture
+
The content providers are the CDN customers.
CDN company installs hundreds of CDN servers throughout Internet in lower-tier ISPs, close to users CDN replicates its customers content in CDN servers. When provider updates content, CDN updates servers
21 / 43
CDN Principle
Origin server
www.foo.com distributes HTML Replaces:
23 / 43
P2P Network
P2P
no always-on server arbitrary end systems directly communicate peers are intermittently connected and change IP addresses Three topics:
File distribution Searching for information Case Study: Skype
P2P
24 / 43
P2P
25 / 43
P2P
26 / 43
P2P
27 / 43
P2P
28 / 43
Bit Torrent
P2P: BT
29 / 43
Bit Torrent
P2P: BT
30 / 43
Bit Torrent
P2P: BT
31 / 43
Bit Torrent
P2P: BT
32 / 43
Bit Torrent
single point of failure performance bottleneck copyright infringement: target of lawsuit is obvious
Centralized Directory
Bit Torrent
Query ooding
34 / 43
Bit Torrent
Query ooding
35 / 43
Bit Torrent
Hierarchical Overlay
36 / 43
Bit Torrent
Skype
37 / 43
Overview
Literature
Cache models and architecture [?, Dolgikh2002], [?, Rodriguez2002], [?, Starobinski2001] Trac and workload characterization [?, Bai2004] [?, Breslau1999], [?, Dill2002]
38 / 43
CDN
CDNSim: http://sourceforge.net/projects/cdnsim/ Data Source :- http://cdn.novell.com/cached/video/bs08/LLK9.iso
P2P
Peersim:- http://peersim.sourceforge.net P2PSim:- http://pdos.csail.mit.edu/p2psim/ D-P2P-Sim:- http://www.ohloh.net/p/d-p2p-sim Data Source:- http://nsl.cs.sfu.ca/wiki/index.php/P2PTrac
Web Content Distribution 40 / 43
Concluding Remarks
+
Web Content Distribution has a history of more than one decade. Web Caching, CDN and P2P, all co-exist Plays an important role in scalability and performance enhancement
41 / 43
References I
42 / 43
End
Thanks
43 / 43