Beruflich Dokumente
Kultur Dokumente
Robert R.
Computer
Puckett Science
Department
University of Hawaii at
Hilo Hilo, HI, USA
justpuckett@gmail.com
peer-to -peer co mputing coupled with Internet 2 will remain
Abstract – Grid Architecture coordinates the sharing o f
the most powerful supercomp uter [7].
resources and allows for collaboration of often
heterogeneous equipment that is geographically
distributed. While research grids provide needed services, 2 Grid Computing
volunteer-based grids promise greater processing power
Grid computing differs from many peer-to-peer
pro vided that the overhead, availability, and heterogeneity
distributed computing programs in that there is a central
of the resources are mana geable. JAVA-based applets
authority computer which manages the distribution and
pro vide quick installation across hetero geneo us platforms
collection of work units. To process a job o n a grid
and are well placed to speed the transition to grid
computing system, one would connect to the central
computing.
authority and request one’s job be processed.
3.3 Test 1: Ray Tracing Proof of Concept 3.5 Test 3: Monte Carlo Integration
In this experiment I performed the ray tracing demo In this program I have coded a Monte Carlo
included with the GreenTea demo package. For all tests integration estimation to the integral of one fourth of a
herein the number of lines and sphere/line is fixed at 10. circle with radius 1. This is accomplished using the Monte
The three trials, demonstrated in figure 2, show variations Carlo integration b y inscribing that quadrant within the unit
in the time not only from varying the number of nodes, but square. Note that this is equivalent to formula 2. For this I
also from varying the number of subtasks. As shown, have implemented an existing random number generator.
increasing the number of sub tasks tends to decrease the Supposing that the random number generator is sufficiently
time to complete the ray tracing operation. However, random, the Monte Carlo technique will provide an accurate
increasing the number of nodes tends to provide the estimation to the integral value at high numbers of intervals.
greatest benefit.
1
1x 2
(2)
Time Vs. # Nodes Vs. # Subtasks
# Su b ta s ks 0
60
#= No d e s
50 # Su b Tas k s = 2 *
#No d e s For the rand om number generation I chose the fourth
40
# Su b ta s ks example from Park and Miller [9 ]. This example stipulates
=
4 *# N o de s
30
that MAXINT must be smaller than 2 31 -1. Although quasi-
# No d e s
1
(3)
n
Figure 2 - Green Tea Trials
Overall the development of the random number
3.4 Test 2: Prime Numbers generating code is relatively straightforward and self-
explanatory. One point o f note is that each of the work
In this experiment I developed a prime number
nodes needed to receive its own unique seed for the random
valid atio n routine that op erates in a distributed manner
number generatio n. Without that, the pseudo-random nature
across a GreenTea network. First, the program divides the
of the random number generator would produce identical
test value in half since no factors can be larger than half the results across most of the work nodes. Thus, the main
test value. Actually, the square root of the test value might
program used its own copy of the random number
have been more appropriate here. Anyway, next the lower
generato r to produce pseudo-random seeds which were then
half is divided by the number of word no des in the
passed to and used in the work nodes.
GreenTea network. This value is provided by one of the
GreenTea classes.
Although the Monte Carlo method for integratio n
approximation does converge upon the correct answer, it
Next, the work classes are stored and subsequently
does so at an incred ibly slow rate. If high rate of accuracy is
sent through the network to the work nodes. There the work
needed , this method may be prohibitively expensive in CPU
nodes check ever value if the work unit includes values less time. However, for what Monte Carlo lacks in speed to a
than ten, or every odd value if the work unit is only
solution, it makes up for in simplicity. Of course, the
numbers greater than ten. While this causes many numbers
accuracy of the Monte Carlo method is dependent on the
to be checked whose factors have already been checked,
random number generator providing truly random numbers.
this method is very simple and it would be inapp ropriate to Additionally, owing to the nature of the random numbers,
maintain a list of the prime numbers.
the value of the estimation to the integral will not converge
in a strictly decreasing manner. This metho d may be useful
Lastly, a Boolean value representing whether the work
for functions that are difficult or cumbersome to integrate
unit’s interval is relatively prime to the test value is
analytically, as was the example above. Additionally, higher
returned to the main program via the network. The main dimension integrals can be performed in a similar manner
by enclosing the desired portion within a known weakness in using JAVA for high-performance computing
area/volume/etc. [8].
# N o de s
3.7 Future Work
Figure 3 - Monte Carlo Trials (Lower Order Trials) GreenTea Technologies has been gracious enough to
Figure 3 demonstrates that there is not always a cost provide sponsor my research with a twelve-license issue of
savings in a distrib uted solution. As the number of nod es their product. As such, future research could not only
increased, the time to complete the same task actually include groups of standalone co mputers, but also the two
increased! Add itionally, the data shows that some test clusters in develo pment by the other groups in the
subsequent, in other words larger, tests perform better than computer architecture class of the computer science
test cases where there is fewer trials. One reason is that department at the University o f Hawaii at Hilo. Though
overhead from division and distribution of work units plays using parallel execution programs as well as system
a major factor in determining the run time o f a program. monitoring software, such as Ganglia, future research can
Figure 4 demonstrates that in higher ord er trials there is demonstrate the cost and benefits of java-based volunteer
indeed an appreciable decrease in completion time for using grid computing. It will be interesting to see the breakeven
more nodes. The break even point for this algorithm appears point between communications/java overhead and the
to be around ten million trials. division of work to an increasing number of nodes.
References
D i st ri but ed M onte C a rl o
10 0, 00 0 T r ials
[1] Koch, Lewis E. 2004. A Quiet Time Bomb. The Raw
40 0 0 0 0
Sto ry. Retrieved February 23, 2006 from:
35 0 0 0 0 1, 00 0,0 00 T r i a ls
http://www.rawstory.com/exclusives/koch/vulnerable_comp
30 0 0 0 0 10 Million T ria ls
uter_grid.htm .
25 0 0 0 0
10 0 Millio n Tr ials
20 0 0 0 0
15 0 0 0 0
1 Billion T ria ls
[2] Levinson, Meridith. 200 5. Who’s Afraid of Grid
10 0 0 0 0
Computing? CIO.com. Retrieved February 25, 20 06 from:
50000 http://comment.cio.com/soundoff/042505.html .
0
123 [3] Peel, Roger M. Grid Computing. Retrieved Feb ruary
# N ode s 26, 2006 from:
http://www.computing.surrey.ac.uk/p ersonal/st/R.Peel/csm2
Figure 4 - Distributed Monte Carlo (Higher Order Trials) 3/parallel-1.pdf .
[7] Bell, Gordon & Gray, Jim. 2002. What's Next in High-
Performance Computing? Communications of the ACM,
Volume 45 Issue 2. ACM Press. February.