Beruflich Dokumente
Kultur Dokumente
An effective foreign exchange ~forex! trading decision is usually dependent on effective forex
forecasting. In this study, an intelligent system framework integrating forex forecasting and
trading decision is first proposed. Based on this framework, an advanced intelligent decision
support system ~DSS! incorporating a back-propagation neural network ~BPNN!-based forex
forecasting subsystem and Web-based forex trading decision support subsystem is developed,
which has been used to predict the directional change of daily forex rates and provide intelli-
gent online decision support for financial institutions and individual investors. This article
describes the forex forecasting and trading decision method, the system architecture, main
functions, and operation of the developed DSS system. A comparative study is conducted
between our developed system and others commonly used in order to assess the overall per-
formance of the developed system. The assessment results show that our developed DSS
outperforms some commonly used forex forecasting and trading decision systems and can
provide intelligent e-service for forex traders to make useful trading decisions in the forex
market. 2007 Wiley Periodicals, Inc.
2. LITERATURE REVIEW
A literature review shows that there are diverse studies on forex forecasting
and trading, and there are numerous methodologies and models for forex forecast-
ing with artificial neural networks.119 Refenes et al.2 applied multilayer forward
neural network ~MLFNN! models to forecast foreign exchange prices. Kuan and
Liu 3 provided a comparative evaluation on MLFNNs performance and a recur-
rent network for the prediction of commonly traded exchange rates. Poddig and
Rehkugler 4 developed a clustering neural network to predict the direction of each
rate movements of the U.S. dollar against the German mark ~USD/DEM!. Gen-
cay 5 found that although the artificial neural network ~ANN! model and nearest-
neighbor regression were both nonparametric models, nearest-neighbor regression
was a local procedure used to approximate an unknown function whereas the ANN
model was a global approximation approach. El Shazly and El Shazly 6,7 used a
genetically evolved neural network ~GENN! to predict the 3-month spot rate of
exchange for four currencies: British pound ~GBP!, German mark ~DEM!, Japa-
nese yen ~JPY!, and Swiss franc ~CHF!. They reported that the GENN networks
forecasts outperformed the ones made by both the forward and futures rates in
terms of accuracy and correctness. In a more recent study by Leung et al.,8 the
forecasting accuracy of MLFNN was compared to the general regression neural
International Journal of Intelligent Systems DOI 10.1002/int
DEVELOPING A FOREX FORECASTING AND TRADING DSS 477
network ~GRNN! model. The study showed that the GRNN had a higher forecast-
ing accuracy than MLFNN with respect to a variety of currency exchanges. Simi-
larly, Chen and Leung 9 proposed a new method with error correction to predict
exchange rates in 2004. Empirical results showed that the new method could sig-
nificantly improve the prediction performance. Yao and Tan 10 used a neural net-
work to perform technical forecasting of foreign exchange rates and achieved good
forecasting performance.
Recently, some forecasting models have been developed integrating neural
network techniques with conventional and burgeoning artificial intelligent ~AI!
techniques to improve forecasting performance. Li et al.11 constructed a hybrid
fuzzy neural network model incorporating a fuzzy system and a neural network to
predict the U.S. dollar against the Singapore dollar ~USD/SGD!, and experimen-
tal results showed that the fuzzy neural network had a potential to improve fore-
casting performance. Zhang 12 presented a hybrid methodology that combined both
autoregressive integrated moving average and ANN models using the unique
strengths of linear and nonlinear modeling for time series forecasting. Empirical
results with real data sets indicated that a hybrid model could provide an effective
way to improve forecasting accuracy. Similarly, Yu et al.1 proposed a novel non-
linear ensemble forecasting model integrating generalized linear autoregression
model ~GLAR! with ANN for three currencies: DEM, GBP, and JPY. The empiri-
cal results revealed that the prediction using the nonlinear ensemble model was
generally better than those using the other forecasting models ~i.e., single GLAR
and single ANN models!, in terms of the normalized mean square error and direc-
tion accuracy statistics. For a more comprehensive survey on forex forecasting
please refers to Ref. 19.
to a dynamic situation. ~5! The inference engine ~IE! and ~6! explanation mecha-
nism ~EM! provide automatic criteria comparison and interpretation of rules. In
this way, traders and investors can make a decision to buy, sell, or hold foreign
currencies. ~7! The knowledge management and verification ~KMV! has the fol-
lowing functions: it deletes existing rules, adjusts existing rules, adds new rules to
the knowledge base using a knowledge acquisition tool, and verifies the knowl-
edge base to check consistency, completeness, and redundancy. There are hun-
dreds of rules in the knowledge base that represent experts heuristics and
experience. Using a knowledge acquisition tool, domain experts specify their rules
for the knowledge base and describe the rules with the format IF . . . THEN. . . .
Then the knowledge acquisition tool automatically converts the rules into an inner
encoded format. After new rules are added, the knowledge base verifier checks for
any inconsistency, incompleteness, or redundancy that might have arisen. ~8! The
tuning engine ~TE! adjusts knowledge after analyzing recent forex trends from the
database. In a dynamic situation, knowledge tuning is needed because old knowl-
edge can no longer be useful over time.
The conceptual framework of the intelligent forex rolling forecasting and
trading decision support system ~FRFTDSS!, as shown in Figure 1, consists
of two subsystems: one is the BPNN-based forex rolling forecasting subsystem
~BPNNFRFS!, which is an online training and forecasting subsystem; the other is
the Web-based forex trading decision support subsystem ~WFTDSS!, which is the
daily online trading decision subsystem. BPNNFRFS, using burgeoning AI tech-
nology ~e.g., BPNN here!, provides forecasting results for some main international
currencies to the WFTDSS, while WFTDSS is developed using decision models
and Web technology to provide some specific trading suggestions for investors.
Therefore, the former can provide data support for the latter and the latter feeds
back forecasting performance information to the former for continuous improve-
ment in forecasting.
The developed forex forecasting system functions include one-day-ahead
forecasts, three-day-ahead forecasts, and five-day-ahead forecasts. Forex trading
International Journal of Intelligent Systems DOI 10.1002/int
DEVELOPING A FOREX FORECASTING AND TRADING DSS 479
decision suggestions for the three types of forecasting are generated by the
WFTDSS subsystem. In addition, some auxiliary functions are also designed, which
include forex real-time quotes, online forex e-service, financial tools, and forex
market information. The implementation of these functions will be described in
Section 5.
This phase includes three main tasks: initialization, sample training, and sam-
ple validation. It is the core process of a BPNN model.
1 N
Ed ( f ~d ! ~ yt y[ t ! 2 ~1!
N t1
where N is sample size, yt and y[ t are actual and prediction values, respectively, and
f ~d ! is a indicator function, that is,
f ~d ! p1 ,
p2 ,
~ yt yt1 !~ y[ t yt1 ! 0
otherwise
~2!
4.1.3.2. Sample Training. After the initialization step, we can start sample
learning or training. The sample learning or training step is equivalent to the param-
eter estimation phase in conventional statistical models. In fact, the process of
sample learning is to find a global optimal solution to what is typically a highly
nonlinear optimization problem. This optimization problem can be carried out using
local or global methods. Local methods fall into two major categories: first-order
and second-order methods. First-order methods are based on a linear model ~gra-
dient descent! whereas second-order models are based on a quadratic model ~e.g.,
Newtons method!. However, local optimal methods are easy to trap into the local
International Journal of Intelligent Systems DOI 10.1002/int
484 YU ET AL.
minima. Therefore, global methods, which have the ability to escape local minima
in the error surface and thus are able to find optimal or near optimal weight con-
figurations, are generated. The common global methods include the simulated
annealing algorithm and genetic algorithms ~GAs!. However, these algorithms are
the time-consuming approaches in network training and learning.
Generally speaking, the steepest gradient descent algorithm is widely used.
Furthermore, this algorithm has good performance while away from a local mini-
mum. However, it requires many iterations to converge when close to the mini-
mum, and thus the speed of training and convergence is very slow. Thus, in our
subsystem, the LevenbergMarquardt method is introduced as a kind of quick and
effective algorithm for improving the speed of convergence. One characteristic of
the LevenbergMarquardt method is that the networks matrix of weights is updated
based on the Jacobian matrix, J, collecting the partial derivatives of the network
error e with respect to the weights. In other words, the matrix DW collecting the
corrections of the weights in matrix W is computed according to
DW ~J T J mI !1 J T e ~3!
4.1.3.3. Sample Validation. This step is also important. Once the training
phase has been completed, the performance of the trained network has to be vali-
dated on an independent data set using the criteria chosen. If the difference in the
error obtained using the validation set is markedly different than that obtained
using the training data, it is likely that the two data sets are not representative of
the same population or that the model has been overfitted.23 Therefore, it is vital
that the validation data should not have been used as part of the training process in
any capacity. In our study, we use a cross-validation method with the leave one
out procedure to avoid overfitting problem.
After completing the several previous steps, we can extrapolate the training
data by simulation and perform a forecast. Here the prediction results can be
obtained when the forecasting horizon is set.
From the previous discussion, we can see that BPNN modeling is a relatively
complex process that relates to many professional terms about neural networks.
However, easy operating is an important measurement of a software system. To
facilitate this system for users, we develop this easy-use BPNN-based forex fore-
casting subsystem as a component of FRFTDSS, as showed below. Figure 2 presents
the main user interface and training process interface of BPNNFRFS.
International Journal of Intelligent Systems DOI 10.1002/int
DEVELOPING A FOREX FORECASTING AND TRADING DSS 485
Figure 2. The main user interface and training process interface of BPNNFRFS.
It is worth noting that use of this rule has no human participation ~e.g., param-
eter specification!, but the forex trading strategies can still be generated by the
subsystem automatically.
Here transaction costs are not included because it is noninstallment and the quan-
tity is much larger than the difference between yt and y[ t . In this criterion, decision
International Journal of Intelligent Systems DOI 10.1002/int
DEVELOPING A FOREX FORECASTING AND TRADING DSS 487
makers can input the amount of funding costs according to the corresponding reg-
ulations and personal preferences.
That is to say, the investor should buy if the trading probability Bt1~max!
exceeds a buying threshold uB .
In the same way, if the trading probability St1~ j! of holding forex j is larger
than a selling threshold uS , the investor should sell.
International Journal of Intelligent Systems DOI 10.1002/int
488 YU ET AL.
In this criterion, once the users or decision makers specify a certain thresh-
old, the optimal trading strategies will be presented explicitly.
Rule I: If
ln
y[ t1
yt
~r * r!~1 g! and ~r * r!~1 g! 0
ln
y[ t1
yt
~r * r!~1 g!
2
and ~r * r!~1 g! 0
ln y[ t1
yt
~r * r!~1 g!
2
and ln
y[ t1
yt
0
ln
y[ t1
yt
~r * r!~1 g! and ln
y[ t1
yt
0
Compared with the previous three criteria, this criterion seems to be rela-
tively complex. However, once the values of r *, r, and g are specified by users or
decision makers, the optimal trading strategies will also be generated automati-
cally regardless of considering the meaning of optimal conditions.
To summarize, different trading rules may result in different trading strat-
egies. It is hard to say which one is better for the four criteria because of different
perspectives. From our experiment evaluations, the simple price comparison judg-
ment criterion is preferred by most business users because it is simple and easy to
understand.
Figure 4 shows the basic architecture of the WFTDSS subsystem and the data
flows between user interface, knowledge base ~KB!, model base ~MB!, and data-
base ~DB!. The components of the subsystem are described below.
According to the above WFTDSS architecture, we develop the Web-based
forex trading decision subsystem using latest Web technology and client/server
~C/S! mode, which contains a three-tier structure. The basic construction of the
three-tier structure is shown in Figure 5.
forecasting accuracy for all currencies. ~3! Of the five currencies, GBPs forecast-
ing results are the best in all evaluation categories, whereas JPY is the worst. Pos-
sible reasons for this could be that ~a! uncertainty increases with the increase of
forecasting horizon in forex market; ~b! the volatility of JPY is generally higher
than that of GBP and other currencies during the time period; ~c! from the point of
input variables, appropriate exogenous variables often increase the forecasting accu-
racy. In the Section 1 of the article, we selected five external factors ~GDP, exports
and imports, CPI, and interest rates! as input variables for prediction. Actually, due
to the daily data availability of these macroeconomic variables, these explanatory
variables are used as dumb variables to adjust the neural network forecasting model.
Second, for providing more convincing evidence that the proposed forecast-
ing method outperforms the classical forecasting methods, some classical models,
such as HoltWinter, ARIMA, polynomial regression and, nave random walk mod-
els, are selected as the comparable models. For simplicity, two main evaluation
criteria, RMSE and Dstat , are chosen. Accordingly, the comparison results between
FRFTDSS and four classical models based on the 1 years data are shown in
Table V.
From Table V, several conclusions can be made in the following. ~1! Focus-
ing on the RMSE indicator, the proposed FRFTDSS performs the best, followed
by the HoltWinter model, ARIMA, nave random walk, and polynomial regres-
sion model from a rough view. ~2! From the Dstat point of view, the FRFTDSS
model really outperforms the comparable classical models. The previous two prop-
ositions will lead to the final conclusion: ~3! in general, the proposed FRFTDSS
model clearly performs better than the four classical methods in terms of the same
evaluation criteria.
International Journal of Intelligent Systems DOI 10.1002/int
DEVELOPING A FOREX FORECASTING AND TRADING DSS 493
Table V. The comparison results between FRFTDSS and four
classical models.
Leading Dstat
Models Objects days RMSE ~%!
~1! Sample selection. The tasks of this step are to select comparative indices and collect
corresponding data.
~2! Data preprocessing. In original data, there may be some positive indices ~the larger
the index value, the better! and some inverted indices ~the smaller the index value,
the better!. Furthermore, original data may have different dimensions and charac-
teristics. It is, therefore, necessary to process these original data. In general, as for
inverted indices, we use reciprocals of the indices as evaluation indices. As for differ-
ent dimensions and characteristics of data, standardization is an effective way to elim-
inate differences:
x i xS i
x i* ~8!
si
where x i* is the standardized data, x i is the original data, xS i is the mean of the ith index,
and si is standard deviation of the ith index.
~3! PCA computation. After preprocessing the data, the principal component analysis can
be performed and corresponding principal components identified. Assuming that there
are n individual models and systems and that every model and system contains m
evaluation indices, the transformed evaluation matrix ~X ! can be represented as
x 11 x 12 J x 1m
x 21 x 22 J x 2m
X ~9!
I I L I
x n1 x n2 J x nm
where x ij is the jth index value with the ith model or system.
Next, we deal with the evaluation matrix using the PCA technique. First, eigen-
values ~l 1 , l 2 , . . . , l n ! and corresponding eigenvectors C ~c1 ,c2 , . . . ,cn ! can be gen-
erated from the evaluation matrix. The individual contribution rate and cumulative
contribution can then be obtained:
li
ai n
~10!
( li
i1
( li
i1
a mn n
~11!
( li
i1
where a i is the ith contribution rate, and a mn is the cumulative variance contributions
of the principal factors x 1 to x m . When a mn is sufficiently large or a mn satisfies a
certain cutoff, we can choose several principal factors to construct a comprehensive
evaluation function. In general, a mn is larger than 0.90 in practice.
~4! Constructing a comprehensive evaluation function. According to the previous compu-
tation and analysis, the comprehensive evaluation function of the principal factors can
be expressed as
k
Yi ( CX ( cj x i ~i 1, 2, . . . , n! ~12!
j1
where k represents the k-first principal axes related to the cutoff obtained from the
previous step.
~5! Calculating all values of comprehensive evaluation function for every comparative
model and system and ranking corresponding assessments.
Acknowledgments
This research was partially supported by grants from NSFC and CAS ~China! and SRG
~Hong Kong!.
References
1. Yu LA, Wang SY, Lai KK. A novel nonlinear ensemble forecasting model incorporating
GLAR and ANN for foreign exchange rates. Comput Oper Res 2005;32:25232541.
2. Refenes AN, Azema-Barac M, Chen L, Karoussos SA. Currency exchange rate prediction
and neural network design strategies. Neural Comput Appl 1993;1:4658.
3. Kuan CM, Liu T. Forecasting exchange rates using feedforward and recurrent neural net-
works. J Appl Econometrics 1995;10:347364.
4. Poddig T, Rehkugler H. A world model of integrated financial markets using artificial
neural networks. Neurocomputing 1996;10:251273.
5. Gencay R. Linear, nonlinear and essential foreign exchange rate prediction with simple
technical trading rules. J Int Econ 1999;47:91107.
6. El Shazly MR, El Shazly HE. Comparing the forecasting performance of neural networks
and forward exchange rates. J Multinat Finan Manag 1997;7:345356.
7. El Shazly MR, El Shazly HE. Forecasting currency prices using a genetically evolved
neural network architecture. Int Rev Finan Anal 1999;8:6782.
8. Leung MT, Chen AS, Daouk H. Forecasting exchange rates using general regression neu-
ral networks. Comput Oper Res 2000;27:10931110.
9. Chen AS, Leung MT. Regression network for error correction in foreign exchange fore-
casting and trading. Comput Oper Res 2004;31:10491068.
10. Yao JT, Tan CL. A case study on using neural networks to perform technical forecasting of
forex. Neurocomputing 2000;34:7998.
11. Li X, Ang CL, Gray R. An intelligent business forecaster for strategic business planning. J
Forecasting 1999;18:181204.
12. Zhang GP. Time series forecasting using a hybrid ARIMA and neural network model. Neuro-
computing 2003;50:159175.