Beruflich Dokumente
Kultur Dokumente
I. I NTRODUCTION
As highlighted in [1], in practice, one often encounters
complex problems which have multiple interdependent modules, each of which can be called a silo. For example, in
the mine supply chain, one needs to solve the storing, production scheduling and delivery silos simultaneously, which
are interdependent with each other. A complex problem with
interdependent vehicle routing problem and location allocation
problem was proposed in [2], and a stochastic scheduling
problem with interdependent modules was proposed in [3]. In
these cases, solving each silo separately can by no means lead
to reasonable results due to the neglect of the interdependency.
In other words, when solving multi-silo problems, how to deal
with the interdependency between the silos largely determines
the goodness of the final solution to the entire problem, as
well as the single-silo solution components.
In order to investigate the effect of interdependence between
sub-problems in a complicated problem, Bonyadi et al. [4]
proposed an artificial benchmark problem called the Traveling
978-1-4799-7492-4/15/$31.00 (c)2015 IEEE
vmax vmin
,
Q
(1)
(2)
where 0 w
Q is the current total weight of the picked
items. When the knapsack is empty (w
= 0), the speed is
maximized (v = vmax ). When the knapsack is full (w
= Q),
the speed is minimized (v = vmin ). Then, the profit gained by
the thief is defined as the total value of the picked items minus
the rent of the knapsack. Let a TTP solution be represented
by a tour x and a picking plan z, where x = (x1 , . . . , xn ) is a
permutation of the cities and z = (z1 , . . . , zm ) is a 0-1 vector
of the items. zi takes 1 if item i is picked, and 0 otherwise.
According to [10], the TTP problem can be stated as follows:
(n1
)
m
1 i n,
w(i)
=
i1
w(k)
+ cw(xi ), 1 i n,
k=1
m
cw(i) =
wj zj [aj = i], 1 i n,
(4)
(5)
(6)
j=1
xi = xj , 1 i = j n,
(7)
wj zj Q,
(8)
j=1
(9)
for i = 1 popsize do
Initialize pop[i].tour with the chained LK heuristic;
Apply the item selection heuristic to obtain
pop[i].pickplan;
Evaluate pop[i];
end for
Sort pop in terms of the profit of the individuals;
while Stopping criterion is not met do
Randomly select two parents par1 and par2 from pop;
Apply the ordered crossover [14] on par1 .tour and
par2 .tour to generate the tour of the offspring o.tour;
Conduct TSP search on o.tour to improve it; the
first stage
Generate o.pickplan by the item selection heuristic
[10];
Conduct KP search on o.pickplan to improve it;
the second stage
Evaluate the offspring o, and add it into pop;
Sort pop by the decreasing order of profit, and remove
the worst individual;
end while
(10)
where
t1 (j, x) =
vmax wj
vmax
(11)
n1
(12)
i=loc(aj )
f2 (j, x, W ) = bj R t2 (j, x, W ),
t2 (j, x, W ) =
(13)
L(x, loc(aj ))
L(x, loc(aj ))
. (14)
vmax (W + wj )
vmax W
The third measure is called the expected gain under the total
weight (denoted as f3 ), which can be calculated as follows:
f3 (j, x, W ) = bj R t3 (j, x, W ),
L(x)
t3 (j, x, W ) =
g
(
)
g + h1
g + h2
log
log
h2
h1
(15)
(16)
Description
profit
weight
bdist
Q
L
R
vmax
Description
profit
weight
bdist
Q
L
R
vmax
W
6:
7:
8:
9:
10:
11:
12:
13:
14:
15:
16:
17:
fitness =
(17)
where profiti is the profit of the best known solution of the ith
TTP instance, which is obtained by running the original TSMA
once. In other words, the single-objective function implies
the average percentage deviation of the fitness of the evolved
heuristic from the best known profit. Note that the quality
of such transformation depends on that of the selected best
known solutions. However, we expected that the gap between
the best known result and the true optimum does not affect the
final results much, since the fitness values for each scenario
are already of the same scale after the transformation.
V. E XPERIMENTAL S TUDIES
A. Experimental Settings
To evaluate the proposed GP and the evolved item selection heuristics, three representative small sized TTP
benchmark instances [9], i.e., eil51 n50 bounded-stronglycorr 01, eil51 n50 uncorr 01 and eil51 n50 uncorrsimilar-weights 01 are selected as the training instances. All
the instances have 51 cities and 50 items. The ith item is
located at the (i + 1)th city. For each instance, the chained
LK heuristic is then run once on it and the resultant TSP tour
is taken as the input tour of the corresponding GP.
The GP package in the R software, which is named the
RGP package, is employed to evolve the heuristics based on
the terminal sets defined in Section IV. The population size
is set to 100, and the stopping criterion is set to a maximal
running time of two hours. The preliminary studies showed
that the stopping criterion is more than sufficient to guarantee
the convergence of GP.
During the testing phase, another subset of large scale TTP
benchmark instances are adopted to test the generalizability of
the heuristics evolved by GP. Concretely, given a test instance,
the TSMA is applied with the corresponding component replaced by the GAIN or PICKFUNC heuristic, respectively. The
(
(
))
bdist
fp = 0.93
bdist
profit
+ R profit 0.73 ( weight
W + L)
. (20)
bdist
The training performance of the picking function Eq. (20) is
fitness1 = 2411.81, fitness2 = 609.41 and fitness3 = 1338.95,
and the transformed fitness value is
fitness =
(21)
TABLE III: The performance of the compared algorithms on the large scale TTP benchmark instances with bounded strongly
correlated item weights.
Name
RLS
EA
TSMA
TSMA-GAIN
TSMA-PICKFUNC
brd14051
d15112
d18512
pla33810
rl11849
usa13509
14051
15112
18512
33810
11849
13509
140500
151110
185110
338090
118480
135080
-5.50e+7(1.22e+6)
-6.42e+7(2.52e+6)
-8.18e+7(1.27e+6)
-1.71e+8(1.49e+6)
-4.38e+7(7.58e+5)
-5.45e+7(7.57e+5)
-6.30e+7(2.52e+6)
-7.01e+7(1.63e+6)
-8.89e+7(3.98e+6)
-1.80e+8(1.64e+6)
-5.05e+7(2.76e+5)
-6.17e+7(2.95e+5)
2.66e+7(2.07e+5)
2.85e+7(5.35e+5)
3.07e+7(2.73e+5)
6.34e+7(4.59e+5)
1.97e+7(8.29e+4)
2.92e+7(2.60e+5)
2.66e+7(2.59e+5)
2.83e+7(3.40e+5)
3.07e+7(3.47e+5)
6.32e+7(6.87e+5)
1.97e+7(9.06e+4)
2.92e+7(2.55e+5)
2.66e+7(3.41e+5)
2.84e+7(4.75e+5)
3.06e+7(4.95e+5)
6.33e+7(5.11e+5)
1.97e+7(1.04e+5)
2.92e+7(1.70e+5)
TABLE IV: The performance of the compared algorithms on the large scale TTP benchmark instances with bounded uncorrelated
and dissimilar item weights.
Name
RLS
EA
TSMA
TSMA-GAIN
TSMA-PICKFUNC
brd14051
d15112
d18512
pla33810
rl11849
usa13509
14051
15112
18512
33810
11849
13509
140500
151110
185110
338090
118480
135080
-4.12e+7(7.68e+5)
-4.56e+7(7.44e+5)
-5.98e+7(6.12e+5)
-1.22e+8(8.40e+5)
-3.23e+7(1.79e+6)
-4.00e+7(5.79e+5)
-4.73e+7(1.64e+5)
-5.18e+7(1.08e+5)
-6.62e+7(8.82e+5)
-1.28e+8(1.01e+6)
-3.81e+7(2.12e+6)
-4.55e+7(1.29e+5)
1.69e+7(2.09e+5)
1.83e+7(2.43e+5)
1.94e+7(2.47e+5)
4.10e+7(2.38e+5)
1.29e+7(4.47e+4)
1.88e+7(1.69e+5)
1.70e+7(1.78e+5)
1.83e+7(1.88e+5)
1.94e+7(2.94e+5)
4.10e+7(3.37e+5)
1.29e+7(4.73e+4)
1.87e+7(4.57e+5)
1.69e+7(2.30e+5)
1.82e+7(2.31e+5)
1.94e+7(1.84e+5)
4.10e+7(2.82e+5)
1.29e+7(4.61e+4)
1.88e+7(2.94e+5)
TABLE V: The performance of the compared algorithms on the large scale TTP benchmark instances with bounded uncorrelated
but similar item weights.
Name
RLS
EA
TSMA
TSMA-GAIN
TSMA-PICKFUNC
brd14051
d15112
d18512
pla33810
rl11849
usa13509
14051
15112
18512
33810
11849
13509
140500
151110
185110
338090
118480
135080
-3.88e+7(1.86e+6)
-4.30e+7(1.43e+6)
-5.50e+7(5.86e+5)
-1.10e+8(7.47e+5)
-3.10e+7(4.54e+5)
-3.77e+7(1.18e+6)
-4.32e+7(1.82e+5)
-4.74e+7(7.51e+5)
-6.00e+7(1.09e+5)
-1.15e+8(6.16e+5)
-3.51e+7(1.70e+5)
-4.20e+7(6.38e+5)
1.31e+7(2.28e+5)
1.38e+7(2.64e+5)
1.42e+7(3.39e+5)
3.10e+7(3.38e+5)
9.15e+6(4.31e+4)
1.52e+7(1.70e+5)
1.31e+7(2.10e+5)
1.37e+7(3.29e+5)
1.42e+7(2.03e+5)
3.10e+7(3.08e+5)
9.16e+6(4.94e+4)
1.52e+7(1.87e+5)
1.30e+7(2.04e+5)
1.37e+7(3.48e+5)
1.42e+7(3.32e+5)
3.10e+7(3.97e+5)
9.17e+6(4.88e+4)
1.52e+7(1.55e+5)
[2]
[3]
[4]
[5]
[6]
VI. C ONCLUSION
This paper carries out a more systematic investigation
on the item selection heuristic in TSMA for solving TTP.
Contrary to the original manually designed item selection
heuristic, GP is adopted to evolve effective item selection
heuristics automatically. Two different types of heuristics,
namely the GAIN heuristic and the PICKFUNC heuristic
are designed to replace different components of the original
item selection heuristics. The GAIN heuristic replaces the
whole item selection heuristic, while the PICKFUNC heuristic
keeps the original main framework and replaces a lower level
component, i.e., the conditions to determine whether to select
the current item or not.
Both heuristics were trained on small scale representative
training instances, and tested on another set of large scale test
instances. The experimental results showed that even with a
naive GP without much parameter tuning, the obtained heuristics already managed to show comparable performance as the
original sophisticated designed heuristic. This demonstrates
the efficacy of using GP to find out promising item selection
heuristics for solving TTP, and it is expected that a more
structured GP such as the strong-typed GP will improve the
training and test performance of the evolved heuristics. Additionally, the generalizability of the evolved heuristics verifies
the feasibility to learn heuristics on smaller and simpler TTP
instances and generalize them on larger and more complex
instances to improve the efficiency.
The work in this paper is merely a preliminary work on this
topic. In the future, the following work will be done to further
understand the problem and improve the performance of the
algorithm: (1) Address the representativeness of the tour for
each training instance to improve the training performance of
the heuristics; (2) Employ more sophisticated GP structures
such as the strong-typed GP to restrict the huge search space
of the heuristics properly to improve the efficiency of the
GP search; and (3) Improve the selection of the final elitist
heuristics rather than simply selecting the one with the best
training performance.
R EFERENCES
[1] Z. Michalewicz, Quo vadis, evolutionary computation? on a growing
gap between theory and practice, in Advances in Computational Intel-
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
[19]
[20]
[21]
[22]
B. Zalik,
An efficient sweep-line delaunay triangulation algorithm,
Computer-Aided Design, vol. 37, no. 10, pp. 10271038, 2005.
J. Koza, Genetic programming: on the programming of computers by
means of natural selection. MIT press, 1992, vol. 1.
W. Banzhaf, P. Nordin, R. Keller, and F. Francone, Genetic programming: an introduction. Morgan Kaufmann San Francisco, 1998, vol. 1.
S. Nguyen, M. Zhang, M. Johnston, and K. Tan, A coevolution genetic
programming method to evolve scheduling policies for dynamic multiobjective job shop scheduling problems, in 2012 IEEE Congress on
Evolutionary Computation (CEC). IEEE, 2012, pp. 18.
, A computational study of representations in genetic programming
to evolve dispatching rules for the job shop scheduling problem, IEEE
Transactions on Evolutionary Computation, vol. 17, no. 5, pp. 621639,
2013.
J. Xie, Y. Mei, A. Ernst, X. Li, and A. Song, A genetic programmingbased hyper-heuristic approach for storage location assignment problem, in 2014 IEEE Congress on Evolutionary Computation (CEC).
IEEE, 2014, pp. 30003007.
, Scaling up solutions to storage location assignment problems by
genetic programming, in Simulated Evolution and Learning. Springer,
2014, pp. 691702.