Beruflich Dokumente
Kultur Dokumente
Through these identities, any formula is Humanity's intellectual quest to decipher putational power can surpass that of the
eauivalent to one where all the negations are nature and to tnaster it has led to numerous Turing model. The proposed model builds
on variahles. Assuming that our formulas are efforts to build machines-endowed with on a particular chaotic dynarnical system
so restructured. I huild a contact network artificial intelligence-that
" simulate the (6); hy applying the system to computer
that simulates the formula inductively. If the world or communicate with it (1-4). The science, a "super-Turing" tnodel can be de-
formula is a variable or its negation, then cotnputational power and dynamic behav- veloped (7).
there is a single-edge contact network that is ior of such cotnputers is a central question Demonstrating the existence of an ide-
equivalent. For example, the formula i? is for mathematicians, computer scientists, ally realizable super-Turing model has
equivalent to the network with an edge from and physicists. Computer models are ulti- practical and theoretical implications.
s to t with the label T. mately based on idealized physical systems, Theoretically, it could open the way for
In the general case, the formula is equal called "realizable" or "natural" models. theories of computation that go beyond
to either E v F or E A F, where E and F are Since 1936, the standard accepted model of the Turing model. O n a practical level,
sim~lerformulas. Assuming that G is the universal computation has heen the Turing computers designed and built on the hasis
netkork for E and that H Is the one for F, machine (5),which forms the basis of tnod- of super-Turing theories should be capable
the network for E v F is constructed by ern cotnputer science. The Church-Turing of tnodeling phenomena that existing
placing G and H in parallel (Fig. 3A). thesis, the prevailing paradigm in computer computers are not powerf~ll enough to
Clearlv, there is a connected oath from s to science, states that no realizable comnutine
L L,
tnodel well.
t provibed that there is either'a path from s device can be more powerful (aside frotn In cotnputer science, tnachines are clas-
to t through G or through H. The network relative polynomial speedups that are a re- sified according to the classes of tasks thev
%
,
for E A F is constructed hy placing thetn in sult of richer instruction sets or parallel can execute or the functions they can per-
series (Fie. 3B). In this case, there is a computation) than a Turing tnachine (5). form. The most popular model is the Turing
connected path from s to t provided there is This report questions that assumption, pro- machine, introduced by the English math
one through both G and H. Dosing an alternative model of cotnDuta-
A u
etnatician Alan Turing in 1936. The Turing
It is quite simple to show how DNA tion, possibly realizable as well, whose com- machine (5) allows for unbounded "exter-
experiments can be used to solve the SAT nal" storage (tapes) in addition to the finite
Department of Informaton Systems Engneerlng, Faculty
orohlem for anv contact network. Associate Techn~on,H a~fa32000,Israel.
of Industrial Eng~neer~ng, information represented hy the current "in-
a test tube P, with each node v in the E-mall: iehava@~e.technon.ac.il ternal" state (control) of the system. At
SCIENCE VOL. 268 28 APRIL 1995
every step, the machine reads the tape sym- Fig. 1. [A related model is the model of
bol cx (where cx E 10, 1, blank}) from under com~utationover the real numbers 112). . .
an access head, checks the state of the This interesting model, however, is incom-
controls (where s E {I, 2, . . .IS/}),and exe- parable with the modern theory of comput- ,P'
m e - t
cutes three operations: ability, as it does not comply with the fi-
1) Writes a new binary symbol under niteness of input and output.] The ARNN
the head ( P E (0, 1)). consists of a finite number of neurons. The
2) Moves the head one letter to either activation of each processor i = 1, . . . , N
right or left (m E (L, R}). is updated by the equation
3 ) Changes the state of the control (st E
{I, 2, . . .lSi}).
The transition of the machine can be
summarized by a function
Fig. 1. The analog recurrent neural network
(ARNN). (A) Graphic view, (B) Engineering view.
When the control reaches a special state, where N is the number of neurons, M
called the "halting state," the machine stops. is the number of external input signals, xj
The input-output map (or the function) are the activations of the neurons, u, are realizable dynamical system that computa-
computed by a Turing machine is defined by the external inputs, and aij, bij, and c, are tionally is as strong as the analog model-
the pair of finite binary sequences on its tape the real coefficients, also called constants the "analog shift map."
before and after the computation, respective- or weights (the name "analog" indicates In the literature of dynamical systems,
ly. The finiteness of the input and output is real, rather than rational, coefficients). chaos is commonly exemplified by the
a crucial requirement in computer science, T h e function u is the simplest possible "shift map" [such as the Baker's map (17) or
assuring that the ability to compute depends "sigmoid," namely the saturated-linear the horseshoe map (18); Fig. 21 over a set of
purely on the inherent computational power function bi-infinite dotted sequences. Let E be a
of the model rather than on a larger amount finite alphabet; a dotted sequence over E
of external information. (denoted by E) is a sequence of letters
There are many variants of the basic where exactly one is the. dot. sign ( . ) and
model that yield the same power, such as the rest are all in E. The dotted sequences
adding tapes, access heads, and tape dimen- can be finite, (one-side) infinite, or bi-
sions. Other variants may result in more (but various other sigmoidal functions may infinite over E. Let k E N be an integer, the
powerful, though nonrealizable, models. substitute it). A subset of the N neurons is shift map
Nonuniform Turing machines exemplify
such models 18):
, , The machines receive on
singled out to communicate the output of
the network to the environment. I n ~ u t s
s'~: E + E : (a), + (a),+,<
their tape, in addition to the input, another and outputs are streams of letters, and com- shifts the dot k places, where negative val-
sequence w, to assist in the computation. putability is defined under the convention ues cause a shift to the left and positive ones
For all possible inputs of the same length n, that is sometimes used in practical commu- a shift to the right.
the machine receives the same advice se- nication networks: There are two binary I define the "analog shift" as follows:
quence w,, but different advice is provided input channels, where one is used to carry A dotted substring is replaced with anoth-
for input sequences of different lengths. We the binary input signal and the other one er dotted substring (of equal length) ac-
are focusing on the class of nonuniform indicates when the input is active. A similar cording to a function G ; then, this new
machines that compute in polynomial time convention is applied to the output. sequence is shifted a n integer number of
(and use a polynomial long advice), denot- The ARNN computes the super-Turing places left or right according to a function
ed by P/poly (8). For example, a super- class P/poly in polynomial time and all bi- F. Formally, the analog shift is the map
Turing function that appears in P/poly is narv functions in exuonential time (9).
the "unary halting" function (5): Given a his fact is connectedLtoclassical comput-
unary encoding of a computer program f ability by the observation that when the where the function
and a sequence x E {I}*, the function deter- real weights are constrained to be rational
-
mines whether the Drogram terminates
when acting on the sequence x. If both
numbers, the network has Turing power
only (10, 13). [Follow-up generalizations describes the modification of the sequence,
advice and time are exponentially long appear in (14-1 6)]. Furthermore, determin- and the function
[that is, O(Zn)],the advice can be used to istic and probabilistic ARNNs are compu-
indicate the desired resDonse for each of the tationally equivalent (16).
2" possible input strings of length n and thus Analogous to the Church-Turing thesis, indicates the amount of shifting of the dot.
to compute all functions f: (0, I}* + (0, 11, the ARNN was offered as a basic analog Both F and G have a finite domain of
including noncomputable ones. computation model, with the conjecture dependence (DoD)-that is, F and G de-
Siegelmann and Sontag (9, 10) noticed that "any realizable analog computer will pend only on a finite dotted substring of the
that such nonuniform classes are indeed have no more power (up to polynomial sequence on which they act. The domain of
very natural for analog computation mod- time) than the analog recurrent networks" effect (DOE)of G may be finite, infinite, or
els. They introduced the first model of com- (9). Here, I assert that although the bi-infinite. Finally, the operation @ is de-
putation that is uniform but yet has non- Church-Turing thesis is indeed a funda- fined by
uniform super-Turing capabilities; this mental observation for the large class of
model is the classical analog recurrent neu- discrete computing devices, it may not
ral network (ARNN), which is popular in provide the whole picture, and the analog
practice as a machine with automatic learn- computation thesis is thus required. T o where E is the empty element not contained
ing and adaptation capabilities (1 1); see support this assertion, I present a chaotic, in E.
Fig. 2. Various physical shift maps, (A) Smale's was constrained to systems that start with tape, and infinitely many 0's that encode
horseshoe. (B) Smale's horseshoe abbreviated to finite dotted sequences and that stop with the empty part of the tape. Note the asym-
the unit square. (C)The Baker's map. either finite or left infinite dotted se- etry of the left and right empty parts of the
quences only. Even under these con- taDe: this is a result of the asvmetric defini-
straints, the analog shift computes richer tion'of the output of the analog shift. The
T o illustrate, let us assume the analog maps than the Turing machines. initial dotted string is
shift is defined by Let AS(k) denote the class of functions
computed by the analog shift (AS) map in
time k, NN(k) the class of functions com-
puted by the ARNN in time k, and poly(k) where x is the finite input string. The G
the class of polynomials in k; my theorem function substitutes the left side of the dot
states that with the infinite string
Theorem 1. Let F be a function so that
F(n) r n. Then,
AS(F(n)) C NN(poly(F(n))), and which in polynomially many steps becomes
NN(F(n)) C AS(poly (F(n))). the associated advice (along with lots of
Here, 5 denotes the left infinite string [Although the result is mathematically garbage)
. . . 51413 in base 2 rather than in base 10. profound, a detailed proof is not provided
This table is a short description of the dy- here. For rigorous mathematical proof, see
namical system, in which the next step de- (21 1.1
pends on the first letter to the right of the Sketch of proof. Assume, without loss of From this point, each step of the machine is
dot and the one to its left. If these letters generality, that the finite alphabet E is simulated simply; the details are left to the
(that is, the DoD) are 0.0, then (according binary; that is, E = {0, 11. reader.
to the first row of the table) the left side of 1) AS(F(n)) C NN(Poly(F(n))): The appeal of the analog shift map is not
the dot is substituted by 5 and the dot moves Given a dotted sequence a = only as an almost classical, chaotic dynam-
one place to the right. If the DoD is instead . . .aL,aL,aLl . ala,a,. . ., it is map- ical system that is associated with analog
0.1 (as in the second row of the table), the ped into the two sequences computation models. It is also a mathemat-
second letter to the right of the dot becomes ical formulation that seems to describe ide-
0 and there is a right shift, and so on. alized physical phenomena. The ideiiliza-
The dynamic evolving from T h e analog shift map is redefined as tion allows for model assumptions such as
any convenient scale to describe the system,
noise-free environment, and physics of con-
is as follows: here, the DoD is 1.1,hence (by tinuous medium. Some of the physical mod-
the fourth row of the table) the letter to the els, previously used to simulate Turing ma-
right of the dot becomes 0 and the dot is where dl and dr are the left and right chines, turn out to be exactly as powerful as
shifted right: parts, respectively, of the DoD. The the analog models (both to simulate and to
value of al, as well as a,, is encoded in be simulated by).
a neuron, using a Cantor set represen- This assertion can be demonstrated, for
Now. the DoD is 0.0: tation (10). The next operation is example, with the system introduced by
decided by the first few bits of both Moore (20).
. , This is a "tov model" describing
neurons (DoD). The operation in- the motion of a particle' in a three-dimen-
volves either substitution of the first sional potential, such as a billiard ball or a
finitely many bits (when G is finite to particle bouncing among parabolic mirrors.
this side) or loading of the neuron (Note that this system describes a topologi-