Beruflich Dokumente
Kultur Dokumente
in P2P Networks
Baogang Chen1 and Jinlong Hu2
1
College of Information and Management Science,
Henan Agriculture University, Zhengzhou, China, 450002
2
Communication and Computer Network Laboratory of Guangdong Province,
South China University of Technology, Guangzhou, China, 510641
Introduction
The large number of popular file downloading behavior is similar to the process of the
infectious disease spread in P2P file-sharing system, which can be described with
infectious diseases dynamics. In the field of medicine, Many infectious diseases
propagation model have been investigated for a long history, which are the effective
ways to research process of infectious diseases propagation and to predict the outbreak
of infectious diseases.
Most of existing researches on the characteristics of the files replication and
diffusion only consider the steady state performance of P2P networks, and does not
consider the unstable states in the file diffusion process [1,2], or not fully reflect the
status of nodes in the system [3,4]. In this paper, the study of infectious diseases
dynamics theories are referenced, and the various states of users nodes are
comprehensively examined in P2P file-sharing system, hereafter a new model is
proposed for file diffusion in P2P file sharing system.
In 1927, Kermack and McKendrick provided famous SIR compartment model when
they investigated the law of epidemics [5]. And later, a class of SEIR Epidemic Model
with Latent period was made on this basis. In the SEIR model, the population is divided
Z. Du (Ed.): Proceedings of the 2012 International Conference of MCSA, AISC 191, pp. 117122.
Springer-Verlag Berlin Heidelberg 2013
springerlink.com
118
B. Chen and J. Hu
into four groups. susceptible to infection denoted by type S; the infected denoted by
type I; if the infected have a period of incubation before ill, and the period of infection
is not contagious, then these people in the incubation period, denoted by E; recovered
class denoted by type R.
In 2005, Julien Arino et al proposed a disease transmission model with spatial
dynamics [6]. An SEIR epidemic model with spatial dynamics is considered for a
population consisting of S species and occupying N spatial patches. The total
population for species i in patch p is N ip and the population for species i is Ni0 > 0, a
fixed constant. At time t, the numbers of susceptible, exposed, infectious and recovered
individuals of species i in patch p at time t are denoted by Sip, Eip, Iip and Rip,
respectively. 1/ d ip > 0
1/
ip
>0
1/
ip
and infectious period for species i in patch p, respectively. The disease is assumed to be
horizontally transmitted within and between species according to standard incidence
with j ip 0 , the rate of disease transfer from species j to species i in patch p. The
dynamics for species i = 1, . . . , s in patch p = 1, . . . , n is given by the following system
of 4sn equations:
dSip
dt
dEip
dt
dIip
dt
I jp
j=1
Njp
=ijpSip
I jp
j =1
Njp
q=1
q=1
+ mipqSiq miqpSip
n
q=1
q=1
q=1
q=1
dRip
dt
q =1
q =1
(1)
(2)
(3)
(4)
In P2P file-sharing system ,the state of user node is called susceptibility before they
search for files and propose their downloading request, denoted by class W; When they
enter into downloading queue after they propose downloading request, the state of user
node is called latent period, denoted by class D; When they share file for period of time
after accomplishing their downloading task, the state of user node is called infection,
denoted by class S; When users are no longer interested in the file and delete it, or users
nodes does not share the file after they downloading the file, this status is called
restoration, denoted by Class I.
Let X be any particular type of nodes, and PX denotes the number of nodes in X
class. In system, the user node has two states: on-line and off-line, and between them
can be transformed into each other. Therefore, all nodes within the system are divided
119
into four different compartments, and each node has both online and offline status, such
as Won and Woff .
In the following, through analysis of user nodes number changes in various types,
we get the process model of the file diffusion behavior.
(1) Change rate of the node class Won
Firstly, a node of class Won changes its state from Won to Don because it searches the
download file and put forward download request. Suppose the rate of a file being
inquired, the ratio of the current file shared and the total number of nodes in the system
are proportional. Suppose the average rate of the user sending out file queries and file
download request is . According to standard incidence rate of infectious
diseases SI / N , then the class node Won will be converted to class Don with
rate PWon PSon / N P . Meanwhile, when some nodes enter into offline, the number of
nodes online will be reduced. If rate of offline is set at on off , the nodes of class Won will
transfer into class of Woff at rate on off PWon .As downloading nodes will lose
downloading sources, so they will be forced to seek new downloading source.
Therefore, these nodes will be reentering the Won class from Don class, and then set
the transfer rate of occurrence as r1 . When the node state transition from offline to
online, the number of Won class nodes will increase. Assuming that this conversion
occurs with rate off on , then the change rate of class Won can be expressed as:
dPWon
dt
(5)
120
B. Chen and J. Hu
offline to online. Set the rate from Won to Don is PWon PSon / NP and the rate from offline
to online is off on PDoff . Then the change rate of class Don can be expressed as:
dPDon
dt
(6)
dPSon
dt
(7)
D S
off
off
(8)
transition state rate from online to offline and in turn are the same. So, we have
dPWoff
dt
dPDoff
dt
dPSoff
dt
dPIoff
dt
(9)
(10)
(11)
(12)
121
The model assumes that all user nodes are initially interested in a particular file, then
we select RMVB type files downloaded rank in the top 7 in MAZE log, and file name
called A, B,, G respectively.
Due to the users are interested in download files initially, then download request
interval is equivalent to the average time interval of all users file download request. As
users of MAZE system have obvious periodicity and "day mode", so online and offline
times of nodes are set up at 12 hours. Taking into account the user log lasting for one
week, the average time of file shared can be treated as half week (84 hours). Assuming
that the number of user nodes are distributed in download queue evenly, when the user
doesnt share file, then the corresponding proportion of users are forced to choose
download source again. Therefore, the average leaving rate of downloading node
r1 equals to the average time of user nodes to share files. Parameter values in Table 1.
Table 1. Experimental parameter values
r1
1
98E-4 1
98E-4
pshare
0
122
on off
00612 0
00138
off on
0
00138
NP
100068
In order to obtain the initial number of nodes that share files, the log data is divided
into two parts: the first 12 hours log data and the remaining time of the data. The
number of nodes that share files initially are nodes that finish downloading file within
12 hours multiplied by the factor pshare and then plus number of nodes available to
share file in the beginning. The initial value of PSon and PSoff are set to the half number of
nodes sharing file initially. The initial values of PDon , PDoff , PIon , PIoff are set to 0; PWon
and PWoff are set to the half value that N P minus PSon and PSoff .The average
download rate is defined as all users download traffic divided by time between all
nodes entering into download queue and end of download. The average download rate
divided by the file size is the rate of download accomplished per unit time.
With time granularity in minutes, and using parameter values acquired within 12
hours first, we calculate the number of file downloaded completely in latter 156 hours.
The result is shown in Table 2. The comparison of user log data and model results are
shown in Figure 1, 2. As can be seen from Table 2, the differences between the top four
RMVB files of user log data and the model results are small. But at the back rank of the
files in table, the differences seem obvious. The experimental results and assumptions
are related. Suppose that in addition to users sharing file, all other users are initially
interested in and will download the file, but the reality is that files on the back list are
not very popular, and not all users want to download. So model results biased.
122
B. Chen and J. Hu
1
2
3
4
5
6
7
A
B
C
D
E
F
G
size
MB
386.
383.
441.
465.
158.
383.
734.
52
33
87
76
65
33
61
5838
5304
4368
3313
2629
1722
1399
157
146
138
114
96
33
66
log
model
2849
2678
2455
2021
1690
586
874
3106
2905
2791
2269
1997
720
1263
Conclusion
This paper analyzes the user node state changes based on dynamic model theory of
infectious diseases, and describe file diffusion model of P2P file sharing system.
Because of considering the online and offline status in user node, therefore this model
can be more close to the actual situation. Experimental analysis indicates that the model
can describe the diffusion of most popular file in P2P file sharing system very well.
How to use the model in depth analysis, and to research more P2P file sharing system
behavior and performance characteristics, is one of the main works in the future.
References
1. Lo, P.F., Giovannil, N., Giuseppe, B.: The effect of heterogeneous link capacities in
BitTorrent-like file sharing systems. In: Proceedings of the First International workshop on
Hot Topics in Peer-to-Peer Systems, Volendam, The Netherlands, pp. 4047 (2004)
2. Qiu, D.Y., Srikant, R.: Modeling and performance analysis of BitTorrrent_like peer-to-peer
networks. In: Proceedings of the ACM SIGCOMM 2004: Conference on Computer
Communications, New York, USA, pp. 367377 (2004)
3. Leibnitz, K., Hossfeld, T., Wakamiya, N., et al.: Modeling of epidemic diffusion in
Peer-to-Peer file-sharing networks. In: Proceedings of the 2nd International Workshop on
Biologically Inspired Approaches for Advanced Information Technology, Osaka, Japan, pp.
322329 (2006)
4. Ni, J., Lin, J., Harrington, S.J., et al.: Designing File Replication Schemes for Peer-to-Peer
File Sharing Systems. In: IEEE International Conference on Communications, Beijing,
China, pp. 56095612 (2008)
5. Kermack, W.O., McKendrick, A.G.: Contributions to the mathematical theory of epidemics.
In: Proceedings of the Royal Society. Series A, vol. 115, pp. 700721 (1927)
6. Arnio, J., Davis, J., Hartley, D., et al.: A multi-species epidemic model with spatial dynamics.
Mathematical Medicine and Biology 22(2), 129142 (2005)
7. Arnio, J., Jordan, R., van den Driessche, P.: Quarantine in a multi-species epidemic model
with spatial dynamics. Mathematical Biosciences 206(1), 4660 (2007)