Beruflich Dokumente
Kultur Dokumente
Post-Predictor Z Scores
We are interested in characterizing the post-predictor time period using a scaleindependent method that allows comparisons between financial series. The Z
Score is the most appropriate measure for this job. A Z-Score is the measure of
how many standard deviations price has moved away from its price at the
prediction event, assuming that the probability of either an up or down move is
random at 50%.
By measuring local volatility at the prediction event, a normal probability
distribution can be drawn going forward in time that acts as a roadmap for
subsequent price moves.
prediction event.
velocities, which are proportional to 1/Time, representing the region where the
future price is most likely to be found. An example of such a probability map is
shown below for the stock Home Depot on Feb 3, 2005:
75
Viewed from the side at two time steps, diffusion acts to spread out the region
where we might find the stock price as time elapses:
Diffusion causes
expected prices to
spread out over
time
At each time step, there is a larger standard deviation and the same mean. If we
represent the actual price achieved at each step in terms of that standard
deviation, we produce a series of Z-Scores. For example, if the price at a
prediction event is $5, and then moves to $7 on day ten with a standard deviation
of $1.60, then the Z-Score on day ten would be (7-5)/1.6 = 1.25. This means
that price moved 1.25 standard deviations above the price at the prediction event.
Since the standard deviation continues to increase, in order to maintain the same
Z, price would have to increase by the same relative amount.
76
If all post-predictor prices for all financial series are converted to Z-Scores, and if
the predictors do not work, and if markets are random, then at every time step, a
histogram of all the Z-Scores would be expected to result in a standardized
normal distribution, with mean of zero and a standard deviation of one N(0,1) as
shown below:
background distribution based on randomly selected dates across our entire 3000
day test period, and for all of the 2500 stocks being considered. The figure below
77
To illustrate this another way, we could ask what percentage of buy predictions
are winners (Z>0) if the timing is random, and plot this percentage each day
following the randomly selected purchase dates. Normally this would be 50%,
but since the background distributions is positively biased, the figure below
shows the percent winners for randomly selected buys climbs steadily over time.
The reverse is of course true for randomly selected sells (mirror image).
78
On a weekly scale the effect is even more pronounced, as shown in the next
figure. I guess this could be called the dartboard effect, in that even a random
selection of stocks during this period showed a 59% win rate at 30 weeks after
purchase, and shorting was decidedly unprofitable.
79
We will use these curves as benchmarks for our predictors over this test period
and stock set. In order to have a non-random buy predictor then, we need to see
a win rate in excess of 59% at 30 weeks for example, and sells would need a win
rate better than 41%.
So far we have examined the background distribution of Z-Scores and the
percentage of winners. What we still need to know is how much gain is possible
from randomly selected buys and sells, so that each predictors excess gain profile
can be evaluated. The pictures below show the background performance for
randomly selected buys and sells:
80
81
Parallax Predictors
Using the randomly selected background distribution we found in the last section,
we will now statistically evaluate our main stable of predictors using a chi-squared
test. The Chi-squared test is designed for comparing distributions and a p<0.01
constitutes a rejection of our predictor distributions as random. We will also look
at the excess %winners and excess %gain versus the random benchmarks shown
in the last figure. The following is a list and a qualitative description of each
predictor:
Predictor
Description
Duration
ExtremeHurst
ExtremeHurst
Extension (high)
Extension (low)
ExtremeHurst
Compression
Price Wizard
(Undervalued)
Price Wizard
(Overvalued)
Precision Turn
(high)
Precision Turn
(low)
82
Estimated at 6
months to a year
Estimated at 6
days
Estimated at 6
days
83
The ExtremeHurst sell predictor is extremely fast, showing 56% winners at only
4 days from the signal, with a Chi squared test p-value = 6.7x10-124. This means
that these are not random signals, and that market behavior leading up to the
prediction event significantly conditioned what happened in the postprediction period.
84
The way to interpret this picture is that we gain a significant advantage over
random for both buy and sell predictors for at least 30 days. Extension buy
signals yield the best percentage of winning predictions at 8% more than the
random background after 22 days. This tells us that if we had a large sample of
trades that were triggered by this predictor, we could expect that at 22 days, a
total of 55%+8% = 63% of the trades would be ahead. Remember that our
random trade sample had 55% winners at 22 days, so we would beat it by 8%.
This chart also tells us that we can hold trades for a relatively long time. Despite
the win rate staying constant, average portfolio returns would climb steadily.
85
The sell signal is different and faster. We immediately achieve a 7% edge above
random at 3 days and that edge diminishes steadily over 30 days. This might
make us design a faster sell trading strategy than what we would use for the long
side.
So far we have examined the overall distribution of Z-Scores for ExtremeHurst,
and the percentage of times that a buy signal goes up or a sell goes down. The
win rate is only part of the story however. What we still need to know is how
much gain is possible from these buys and sells in excess of the background
market performance. The pictures below show a gain analysis for ExtremeHurst
buys and sells. Each plot shows the median percentage gain with error bars at the
40th and 60th percentiles and the average of all returns. Note that the average is
not as good a measure of performance because of the overemphasis on outlier
signals. The count of signals is shown in parenthesis.
86
87
88
89
..
90
91
92
93
94
95
96