Beruflich Dokumente
Kultur Dokumente
Noam Chomskys work has had enormous impact, not only within linguistics,
but also upon the humanities, social sciences, and sciences more generally. It has
certainly been of especial importance for philosophy because it has revivified the
stalled debate between Rationalists and Empiricists as to whether our knowledge is
innate or acquired with its proposal that our capacity to understand and produce our
own language and to learn others too is grounded in the fact that we are born in
possession of the principles governing all possible grammatical sentences.1
The discipline of art history has remained almost completely impervious to
Chomskys work, however. Instead it has preferred of late to draw its understanding
of what is language-like about pictures from structuralism,2 or from critical revisions
of this body of theory that nevertheless preserve its foundational assumption that
both signifiers and their associated signifieds are conventional and arbitrary. Applied
uncritically to pictures, this would imply that their signs simply cannot be iconic, in
the Peircean sense of resembling what they stand for, which would beg the question
why so many depictions do indeed look convincing. The other main theory of
representation to draw analogies between pictures and language (under the broader
rubric of the language of art) that still enjoys some currency among art historians
is Gombrichs. But this provides no more satisfying a resolution to the matter of the
arbitrariness or iconicity of depiction than its rival. Instead as Richard Wollheim
observed it contradicts its own thesis that pictorial signs can be illusionistic (or
strongly iconic) by maintaining at the same time that they are always conventional,
in the sense of subject to arbitrary rules of interpretation.3 (Gombrich also regards
painting as a code of sorts.) Given this impasse, not the least advantage of applying
Chomskian principles to the analysis of pictures is that it generates an account that
makes it possible to accommodate iconicity and conventionality together amenably.
In a career spanning forty years, the psychologist (and trained engineer and artist)
John Willats developed an account of precisely this kind, largely on the basis of the
standard theory Chomsky elaborated in his first two books.4 It is therefore relatively
unsophisticated compared with the more recent versions of Chomskys theory,5
which can specify the operations of linguistic grammar in minute detail in terms
that embrace syntax and semantics (or structure and sense) simultaneously, and at
the same time explain how we can in practice implement grammar economically in
our speech.6 Willatss account is none the less serviceable for pictures since it rests on
a conception of their operations that derives from those concepts most fundamental
to Chomskys account, and which have endured through the several reforms it has
563
Pictorial Grammar
undergone. It also turns out that pictures possess only a restricted set of semantic
features organized by a relatively small number of syntactical rules. So, rather as
Chomskys work could reconfigure our understanding of the role of nature and
nurture in language acquisition, Willatss account is powerful enough to reconfigure
our understanding of what is natural and what is conventional about depiction. It is
capable, in other words, not only of transcending the limitations of semiotic theory,
but also those of philosophical theories (unfamiliar to most art historians) which
regard pictures as seamless and transparent icons that resemble what they represent
naturally (as opposed to conventionally).
In short, what Willats argues on this score is that pictures are neither wholly
conventional nor naturally iconic. Instead he proposes that pictures have a natural
basis, and non-conventional characteristics, because they issue from our innate
capacity to map percepts (or mental representations) of real scenes and their
components according to rules that preserve many of their objective properties. At
the same time, however, he also argues that convention plays a substantial role in
depiction, since it is this that decides which of the manifold possible variants of these
rules are employed by particular groups and cultures.
Willats acknowledged his debt to various aspects of Chomskys thinking in
several places. In his magnum opus, Art and Representation of 1997,7 he recognized both
the impact that Chomskys theory of syntax made on his understanding of pictorial
structure, and his indebtedness to the pioneering research into picture grammars
undertaken in the 1960s by David Huffman, Max Clowes, and Adolfo Guzman
under the impact of Chomskys theories.8 And later, in Making Sense of Childrens Drawings
of 2005, Willats declared that his explanation of how children learn to draw was
premised upon Chomskys ideas about our innate capacity for producing speech.9
Willats also recognized Chomskys more indirect influence in both books by stating
that the other main source of his general theory was David Marrs Vision of 1980,10 a
work that explicitly modelled its use of computational principles to explain how the
brain converted raw data into conscious percepts upon Chomskys use of similar
principles to explain how speech transformed its own base content.11
Notwithstanding this, fleshing out Willatss account will involve analysing a
good deal more explicitly than he does himself how his theory applies Chomskian
principles since he only rarely cites specific passages of Chomskys writings, and is
reluctant to pursue theoretical generalizations. More specifically, demonstrating
that pictures have a grammar similar in kind to the grammar Chomsky discerned in
language will involve demonstrating a series of more basic facts: first, that pictures
have parts which can be segmented out of their larger structures; second, that syntax
operates on their component parts; third, that they map shapes and spatial relations
in a grammatical way; fourth, that the grammatical rules operating in any picture
are innately grounded; and, finally, that these rules map, or transform, a deeper
perceptual content.
It must be acknowledged that the type of account to be developed solely by
extrapolating from Willats must necessarily be restricted to characterizing how
pictures perform their most basic cognitive function, namely that of rendering
objects in forms that allow them to be recognized. I nevertheless hope to show
that developing a theory of this kind (or of pictorial grammar pure and simple) is
worth the effort. One justification for doing so is that the reality of such a grammar
is vigorously contested.12 Another is that the analytical categories provided by
Willats greatly enhance our understanding of how pictures work. What is more,
even a rudimentary account of pictorial grammar is capable of making sense of the
Association of Art Historians 2011
564
Paul Smith
complex meanings pictures sometimes achieve when they bend, or subvert, its rules.
And although it would be unwise to attach too much weight to any further claims
about the potential of a theory of pictorial grammar, I will none the less tentatively
suggest in my conclusion that it may be capable of explaining aspects of depiction
other than shape or spatial relations, and may also contribute towards a better
understanding of the relationship between pictures and language. A general theory
of pictorial grammar is, in other words, a worthwhile aspiration for art historians as
for others.
Convention, Iconicity, and Mapping
A convenient route into the differences between semiotic and natural resemblance
theories of depiction is provided by the arguments advanced by Yve-Alain Bois in
his essay, The semiology of cubism, of 1992,13 and by Richard Wollheim in his
response to it in his essay, Formalism and its Kinds, of 1995.14 In the principles laid out at
the outset of his essay, Bois argues that the pictorial sign in cubist collage from 1912
is used in the same way either as the arbitrary signifier (described by Saussure) or
the (Peircean) index, and that its iconic use is all but entirely suspended.15 An index
is simple enough in that it is a sign that refers to its referent by grace of a causal or
existential bond of some kind. Signifiers are more complex. In the first place, they do
not refer to things and the like, but signify their own signifieds. More specifically,
because signifiers are distinguished from one another within any system, or
language, by the play of phonic differences alone,16 the signifieds that system can
generate must also be a function of these differences, and hence must be as arbitrary
as their associated signifiers. It is also important to emphasize that the signifiers
actually present in any utterance do not signify by virtue of their relationship to each
another alone, but by virtue of their relationship to their absent relatives as well.17
Bois extends these arguments by analogy to propose that the meanings of pictorial
signifiers are a function of the whole system they constitute (although he is unclear
about how this category should be applied), and are therefore arbitrary in a strong
sense. Signs thus conceived are evidently incapable of reference,18 at any event, and
only mean what they do because convention assigns them their meaning.
Because Bois sees no role for iconicity in cubist collage, he does not allow that the
games at work in some cubist collages often flagged by the words jouer or jeu
or suchlike depend on playing with, or undermining, the function that iconic signs
fulfil when used more straightforwardly. In Picassos Bowl with Fruit, Violin and Wineglass
of 1912 (plate 1), for instance, it is ambiguous whether the piece of paper representing
a wooden violin is painted to look like wood and is therefore a straightforwardly
iconic sign or whether it is painted to look like faux-bois wallpaper and is therefore
a sign that imitates a sign that routinely employs iconicity for the purposes of posing
as indexical. But even while the ambiguity of such signs calls their own authority
into question, their ability to do so is ultimately parasitic upon the reliability of the
overwhelming majority of iconic signs to refer.19 So even though Picasso cut out the
illustrations of fruit in the top left-hand corner in rough angular shapes that place
them in the pictorial equivalent of quotation marks, the joke only works because they
still manage to resemble their referents.
Although Bois essay does not purport to be a general theory of depiction,
Wollheim nevertheless argues in Formalism and its Kinds that no theory of the kind
it implies could ever be tenable.20 Wollheims aversion to Bois way of thinking
arises from his commitment to the idea that seeing-in and iconicity are both
fundamental to depiction generally,21 even overtly non-figurative painting; from
Association of Art Historians 2011
565
Pictorial Grammar
566
Paul Smith
which it follows that any theory maintaining the possibility that a paintings
content can be non-representational falsifies what is most central to it. What
underlies Wollheims opposition to Bois, in other words, is what he construes as
the latent formalism betrayed by his concern for structure. Rather than tackling
this commitment head on, however, Wollheim prefers to criticize it indirectly by
countering Bois more general (and, to be fair, largely implicit) claim that pictures
are language-like in structure. And he does this by identifying a series of conditions
drawn from Chomskys thinking that representational pictures ought to be able to
satisfy were they language-like, but which they cannot because (in his view) they
are not.22 These all require that pictures possess grammatical features of a kind that
Bois cannot discern in them (any more than structuralism can in language).23 But
Wollheims critique goes further because it purports to show that there is no such
thing as pictorial grammar tout court, which if right must mean that Willatss
account of depiction collapses along with Bois.
Wollheims most fundamental objection to the idea of pictorial grammar is that
pictures cannot involve syntactical rules because, unlike sentences, they cannot be
segmented into smaller units or ultimately basic units upon which syntax operates.
Wollheim denies, in short, that pictures consist of lexical (and other kinds of) items
ordered by syntax. This claim can be understood better by grasping how it relates
to Flint Schiers criticism of the notion of pictorial grammar in Deeper into Pictures of
1986, a work Wollheim regarded highly.24 This contends that pictures are weakly
decomposable or compositional but not strongly so like sentences, and hence
contain nothing which plays the role of a word (or morpheme) in them.25 Schier
consequently claims that in order to understand a picture we have no need to grasp
anything akin to the syntax operating in a sentence, or the relevant grammatical
rules governing [the] composition of its parts.26 Rather, he believes that there
are no grammatical rules governing the composition of pictures, and so sees no
place for a grammar or syntax of pictures at all.27 Indeed, Schier sums up Wollheims
position along with his own when he asserts that There is no need for a grammar in
Chomskys sense for iconic systems,28 since a picture is just built up iconically out
of its parts. Our manifest ability to recognize a whole icon is, in other words, simply
a function of our ability to recognize the objects and states of affairs represented by
[its] parts.29 What this seems to suggest is that pictures iconify by grace of natural
resemblance alone, or in virtue of being somehow or in some unspecified sense
isomorphic to their referents.
One of the more important implications of such a view is that pictures resemble
things without making recourse to any independent syntax of their own which
might complicate their relationship to their referents. So, even though Wollheim
eschews the notion of pictorial grammar altogether, his arguments about the
naturalness of iconicity nevertheless accord with the Port-Royal grammarians view
that there was nothing in the grammar of language that exceeded the requirement
that it should be able to reflect the natural order of things. This view is highly
reductive, however, as Chomsky famously demonstrated when he showed that the
surface structure of a sentence maps or transforms a deep structure according
to rules proper to itself.30 It follows that inasmuch as he embraces a closely analogous
conception of pictorial grammar, Willats maintains that pictorial structure is to some
extent sui generis too.
At the same time as challenging Wollheims and Schiers account of
representation for being too strongly naturalistic, Willats implicitly contests Bois
view for being too strongly conventionalist. This is because Willats envisages pictorial
567
Pictorial Grammar
Paul Smith
569
Pictorial Grammar
as a) or another (such as d), nothing of the sort can be said of the components of
pictures.37 Instead, pictures have syntactically dense systems that lack clear internal
differentiation,38 from which it follows that they cannot be ordered by syntax.
Goodmans views were contested almost as soon as they appeared by Curtis
Carter, who argued that shapes are indeed disjoint components of pictures that
exhibit character class membership,39 and hence that they are readily seen as
constitutive elements of larger units akin to phonemes, morphemes, and spoken
words.40 His argument falls, however, because it does not clearly establish what the
defining limits of a shape are. In a similar vein, Willatss predecessors, Huffman and
Clowes, argued that structures such as line junctions and vertices, being composed
of more simple elements in the form of lines, thereby imply the operation of pictorial
syntax on disjoint units. It is a weakness of Huffmans theory, however, that it does
not closely specify the character of these units, and of Clowess and Guzmans that
they characterized lines somewhat vaguely as the boundaries of regions.41
In the light of these partial successes, it was an important part of Willatss
achievement that he clearly identified pictorial units of the kind Goodman and
Wollheim regarded as indiscernible by developing the analytical category of the
picture primitive.42 These items come in a small number of simple forms: lines
denoting edges, points indicating luminosity, and colours referring to hues and tones
simultaneously. Put this way, the argument seems trite. But it actually represented
a major advance over earlier attempts to identify basic pictorial units because it
involved distinguishing a pictorial semantic unit from the marks carrying it. A line,
in other words, is no more identical to a pencil or pen mark, or a set of brushstrokes,
or the interstices between tesserae, than a word is the same as the sound in which it is
expressed.43 Nor is a point the same as a blob, or a colour the same as a brush mark.
Once this is appreciated, it starts to appear as though Wollheims objection to pictorial
segmentability involves conflating pictorial marks, which are not always readily
segmentable, with the primitives they carry, which are. Or to put it the other way
round, a picture decomposes straightforwardly once its units are correctly specified,
just as a sentence does.
What Willats achieved by identifying the semantic units of the picture in this
manner is thus closely comparable to what Chomsky achieved by inventing the
rewrite rule,44 that is, the technique of characterizing the components of phrases
generatively in terms of explicit lexical categories such as noun, verb, adjective and
the like.45 And, indeed, Willats not only regards picture primitives as elements of the
pictures denotation system, where they play a role akin to the constituents of the
lexicon in language,46 but casts them as lexical units on account of how they refer
to scene primitives, or to the basic perceptual units of the scene they depict. Lighter
and darker points, for example, refer to the degrees of luminosity we extract from the
luminous array, and lines refer to the edges we extract from it. This means that lines
are indeed genuine units of sense, or that they are components of larger structures
that stand for things rather like nouns. Furthermore, Willats supports this inference
by showing that childrens drawings sometimes use what he calls shape modifiers,
or marks appended to a line, to specify particular aspects of its shape almost as
adjectives illustrating his claim with a drawing of a cube (plate 2), where these marks
stipulate several such properties.47
Being primitive a line can evidently be decomposed out of the larger semantic
unit of a vertex, rather as a word can be isolated from a phrase. A vertex, moreover,
can be decomposed out of the larger unit of a surface, which can itself be extracted
from a whole shape, which can in turn be isolated from the whole scene depicted,
570
Paul Smith
571
Pictorial Grammar
5 Stages of drawing a
rectangular object, figure 5
from John Willats, The third
domain: The role of pictorial
images in picture perception
and production, Axiomathes,
13: 1, March 2002, 115.
Photo: With kind permission
from Springer Science and
Business Media.
6 Stages of drawing a
rectangular object, figure 6
from John Willats, The third
domain: The role of pictorial
images in picture perception
and production, Axiomathes,
13: 1, March 2002, 115.
Photo: With kind permission
from Springer Science and
Business Media.
just as phrases can be decomposed out of larger phrases and ultimately whole
sentences. (The caveat is that it is nevertheless misleading to think that the units of
sense to be identified at different levels of pictorial structure have equivalent valences
to the units present at different levels of linguistic structure.)
There is an initially plausible objection to the idea of pictorial segmentability,
however, which is that pictures do not have units that we see in isolation in the same
way as we (seem to) hear words individually. It might be claimed, for example,
that when we fixate on a particular line in a drawing we cannot isolate it from
the lines adjacent to, or conjoint with, it, whereas we hear words as having sharp
boundaries. The fact is, however, that our impression that we hear individual words
independently of the larger structures they form is retrospective. That is, we extract
individual words from the larger sequences in which they are rolled out only after
the fact, even if this is not how it seems to us.48 There is therefore a genuine analogy
to be made between the phenomenology of parsing a sentence into its constituents
and what happens when we identify a pictures units only in the context of the
complete work, or of a large section of it. The area behind the screen of trees in the
centre of Czannes Provencal Landscape with a Red Roof of the mid-1870s (plate 3) is a good
example of this last phenomenon at work, since it is almost impossible to identify its
constituent primitives, or shapes, when seen in isolation (from close-to), but these
are readily identified when a sufficient distance is attained for the whole painting to
be visible.
Czannes mature paintings in general tend to exhibit compositionality overtly,
since they make little attempt to hide the fact that the objects in them are made up
of smaller components. In Woman with a Coffee-Pot of c. 1895 (plate 4), for example, it
is difficult to resist the temptation to see the upper and lower halves of the sitter as
joined together in the same way as the two sections of the coffee-pot to her right,
as both objects readily parse into roughly conical or cylindrical components
of the kind Irving Biederman calls geons, or viewpoint invariant, volumetric
primitive[s] analogous to phoneme[s].49 It remains a problem, however, that many
paintings appear so fluent that it can seem as though Wollheims argument against
compositionality must prevail. We may nevertheless be aware of their component
572
Paul Smith
lo que en el lenguaje
son unidades, en el arte
son complejos
problemas isomrficos,
el hecho de que los
nios abandonen el
dibujo justo cuando
comienza a manejarse
bien con la lengua me
parece que expresa bien
la eficiencia que el
lenguaje tiene respecto
a los dibujos, como
instrumento de
comunicacin
pero si yo
quisiera
pintar el
en el
sermn en la
lenguaje slo
montaa de
basta con
Lorena,
decir, perro,
tendra que
para
construir,
representar
dibujar, cada
un perro, en
elemento
lo que el
isomrficame
dibujo
nte, no me
necesita
bastara con
construir
juntar
isomrficame
elementos
nte un perro,
prestablecido
aqu se estan
s como
confundiendo
NOMBRES
reglas
PROPIOS,
sintacticas
sustantivos,
con reglas
rboles, y
gramaticales
verbos,
acciones,
Association of Art Historians 2011
tales como,
estar sentado
parts as such subliminally, since one part of our perceptual mechanism is devoted
to picking out areas of high contrast like lines, even though what that mechanism
detects is amalgamated with lower frequency information in conscious seeing (rather
as edges are when we see real scenes).50 Compositionality is thus a real feature of our
perception of pictures, even when it is not obviously so.
One symptom of compositionality is that pictures and speech are both rolled out
in units. With speech, it is obvious enough that we fabricate sense in ever-larger units
consisting of words, whole and embedded phrases, and whole sentences. But Willats
shows that we do the same kind of thing in pictures too, when, in an argument about
occlusion in pictures, he contrasts the natural sequence in which we draw an object
with a possible sequence we almost never use.51 The natural drawing (plate 5) shows
that we set down all the lines making up the front face of the object first, then repeat
the process to form the adjacent face, and finally draw its partially occluded faces
one after the other. The second (plate 6) shows how it is possible to begin drawing
the same object by drawing some of its various edges without joining them up into
faces. The fact, however, that we almost never use this sequence strongly implies that
we are predisposed to assemble pictures out of whole units of the very kind whose
existence Wollheim contests, at the earliest possible stage of the picture-making or
picture-perceiving process.52
2. Syntax
Two of the key sources of Willatss ideas about pictorial syntax were Huffmans
article, Impossible Objects as Nonsense Sentences,53 and Clowess On Seeing
Things, both of 1971. Both of these used line drawings of anomalous objects such
as an impossible polyhedron (plate 7) to reveal the rules governing the combination
of edges in properly-formed objects and pictures, in imitation of Chomskys
technique of using ambiguous and nonsense sentences to reveal the rules governing
well-formed sentences and their component strings.54 By this account, the sentence
colourless green ideas sleep furiously could appear to be a well-formed string
inasmuch as the adjectives, noun, verb and adverb are all in the right place, but
it is anomalous nevertheless because it violates (and hence reveals) the sectional
rule that only certain kinds of lexical items can be placed together if a sentence
is to makes sense of itself. By contrast, there is no such problem with the sentence
revolutionary new ideas appear infrequently, for all that it is superficially identical
in structure.55
One of the more notable discoveries of Huffman and Clowes was the principle
that the rules governing the lawful combination of edges in a rectilinear object can
be expressed as variations of four basic categories of line junction, the V-, W-, Y-, and
T-junctions (plate 8), to use Huffmans nomenclature; or the Ell, Arrow, Fork, and Tee
junctions, to use Clowess.56 This made it possible to at least envisage the possibility
that pictorial grammar could be described systematically in terms of clear categories
like those Chomsky had isolated. It also provided strong evidence for the existence of
something closely analogous to the kind of well-formed structure, or string, that
Chomksy had discerned in grammatical sentences.
At the same time, Huffman and Clowes developed systems of labelling their
diagrams which showed the convexity or concavity of an edge clearly (a technique
they compared to parsing a sentence),57 but which also made it possible readily
to identify forbidden, or ungrammatical,58 combinations of edges as well. They
were consequently able to specify a number of rules that pictures could not infringe
without being ill-formed, which included the rule that the same surface cannot exist
573
Pictorial Grammar
on two sides of the same edge, and the rule that the
character of an edge must be identical on both sides of
the line describing it (viz. either convex or concave).59
They also make it clear that all the junctions in an
object must be susceptible of being resolved coherently
together, just as the elements of a sentence must, if it is
to be well- and not ill-formed. In this way Huffman and
Clowes gave grounds for thinking that Wollheim was
wrong to contest the reality of pictorial grammar on the
basis that there is nothing analogous in pictures to the
well-formed strings of grammatical sentences because
it cannot be said of them, as it can of language, that the
meaning [of a well-formed] string is determined by
its syntactical form plus [its] vocabulary, or lexicon.
A further distinctive feature of Huffmans and
Clowess arguments is that they maintained that
we interpret shapes from the bottom-up, first of all
by identifying simple structures like line junctions
which are governed by only a few rules, then by
identifying the larger, coherent structures these create,
and ultimately by identifying the whole scene such
structures form together. Interpreting shapes this way
is clearly akin to the way that Chomsky suggests we
interpret sentences, by first identifying their smaller
units and then combining these into larger and more
complex structures. Chomsky notoriously developed
two methods of representing the consequences of this
process, bracketing sentences and representing them in
the form of trees (plate 9), both of which show clearly
how sentences are formed out of nesting structures,
or several levels of sense. Huffmans and Clowess
574
Paul Smith
575
Pictorial Grammar
13 Pictorial patterns for scene A and picture F, figure 1.1 from King
Sun Fu, Syntactic Pattern Recognition and Applications, first edn,
Englewood Cliffs, NJ, 1982, 2. Prentice Hall, 1982. Photo:
Reprinted by kind permission of Pearson Education Inc., Upper
Saddle River, NJ.
14 Hierarchical structural descriptions of scene A and picture F,
figure 1.2 from King Sun Fu, Syntactic Pattern Recognition and
Applications, first edn, Englewood Cliffs, NJ, 1982, 3. Prentice
Hall, 1982. Photo: Reprinted by kind permission of Pearson
Education Inc., Upper Saddle River, NJ.
15 Picture-faces from Ludwig Wittgenstein, Lectures and
Conversations on Aesthetics, Psychology and Religious Belief, ed.
Cyril Barrett, Oxford, 1966, section 10, 4. Photo: By permission
of John Wiley & Sons.
Paul Smith
577
Pictorial Grammar
consequence that they look radically different in this setting from how they look
outside it. This idea would certainly seem to explain why the same feature looks
different on different faces, and why people find it hard to recognize a person from
one facial feature alone.
If the grammar of picture-faces does in fact mimic that of face perception
it would help to explain several aspects of their peculiarity. It makes sense, for
instance, of the way that even very small changes to a schematic picture-face will
normally result in significant changes to its expression, as Wittgenstein observed
(plate 15).74 It would also explain why the round regions denoting the breasts of
578
Paul Smith
the figure in Matisses Blue Nude (plate 16) seem unusually compelling, without quite
turning into eyes, when viewed within the context provided by her upraised arms
and torso. The peculiarity of this kind of grammar would, in addition, account
for the fact that there is what Andrew Harrison has called a minimal syntax to a
picture-face,75 or that it can only work when it organizes a sufficient number of
features within a sufficiently replete structure. As Harrison was fond of observing,
a vivid demonstration of minimal syntax is that it is all but impossible to draw the
Cheshire cat disappearing.76 Too much information, as in Tenniels illustration, will
yield a cat obscured by foliage; but too little will produce a grin without a cat
something Alice declared to be the most curious thing I ever saw in my life!
The peculiar, holistic, grammar of the face also poses a problem for artists,
inasmuch as it means that faces will look different in kind from other objects in the
picture. Most artists successfully dissemble any problems of this kind, but Czanne
sometimes overcame it, as Merleau-Ponty pointed out, by painting [the] face as an
object, or a mere assemblage of shapes.77 His reasons for doing so were undoubtedly
complex, but the fact that he took such drastic steps in portraits like Woman with a
Coffee-Pot demonstrates how pictorial syntax was and is an inescapable reality.
Another way of describing what happens when we see a face as a meaningful
whole as opposed to a mere collection of shapes is that the syntax allows us to
perceive the relation between its parts synchronically rather than piecemeal. Pictorial
syntax also allows us to see a whole picture as a single coherent unit the meaning of
which is present all at once, just as linguistic syntax makes the sense of a complete
sentence present as such. The synchronicity of sense in both kinds of output is
explained better by the idea that syntax subsumes the meanings of the constituent
parts of a picture or a sentence within the meaning of the whole. The effect is, of
course, normally involuntary and recent research shows that unconscious impulse
towards synchronic coherence results in unexpected similarities between the ways
in which we attend to both pictures and language. Robert Solso has demonstrated,
for example, that a spectator familiar with pictorial conventions will move her
eyes around a painting in an attempt to discover thematic patterns in it,78 rather
as Alan Kennedy has shown that the eye movements we deploy when reading are
not linear, but involve both retrospective movements designed to check the sense
we have already acquired and proleptic movements used to anticipate the sense we
think we are about to encounter.79 At all events, such findings strongly suggest that
it is simplistic to assert that a painting is perceived instantaneously while a piece of
poetry is heard sequentially.80 Rather, it would seem that syntax bestows a degree of
synchronicity upon both.
3. Grammaticality
579
Pictorial Grammar
17 Drawing of a rectilinear
object in fold-out projection,
from figure 13.2 from
John Willats, Art and
Representation: New Principles
in the Analysis of Pictures,
Princeton, NJ, 1997.
Princeton University Press,
1997. Photo: Reprinted by
Permission of Princeton
University Press.
18 Paul Czanne, Victor
Chocquet Seated, c. 1877.
Oil on canvas, 46 38 cm.
Columbus, OH: Columbus
Museum of Art (Howald Fund
Purchase, 1950.024). Photo:
Columbus Museum of Art.
Paul Smith
581
Pictorial Grammar
deployed in drawings projected by enclosure. So, for example, when they experiment
with fold-out projection in an attempt to capture how the sides of a rectilinear
object appear to join up (plate 17), they still use regions to denote the objective shape
of these sides, with the result that their drawings look very odd indeed.85 Willats also
identifies several other uncomfortable combinations of projection and denotation
system,86 which further support the claim that a grammatical relation between the
two kinds of systems is normative. Ironically enough, so does Schiers conference dot
diagram, since it is an ill-formed icon of the very kind whose possibility he denies.
The more specific problem with it is that both its drawing system and its denotation
system are ambiguous. So it could be that it uses orthogonal projection to render a
birds-eye view of the contours of the table and the surrounding delegates, but it can
just as easily be seen as employing enclosure to articulate the same items in the form
of regions. It is unclear in any case whether we should interpret it according to the
rules of view-based or object-based systems.
Schiers diagram is fit for purpose nonetheless. One reason for this is that we
do not require diagrams to be consistent throughout in how they represent things.
But, according to Willats, we also find images of this kind acceptable because we are
accustomed to the fact that there are degrees of grammaticalness in pictures, just as
there are in normal sentences according to Chomsky.87 Willats also argues further
that we do not simply put up with ill-formed icons, but sometimes relish the way
they bend the rules, just as we enjoy grammatical play in poetry.88
Willats gives numerous examples of such creative play in pictures. He shows, for
example, how Klee creates a range of meanings by flouting Huffmans rule that a line
indicating a leading edge along one section of its course must not represent the fleeing
edge of the same surface along another.89 Willats also gives numerous examples of
the playful use of the anomaly known as accidental or false attachment,90 which
describes what happens when lines belonging to objects located at different depths
are allowed to join up, or to run into one another, with the result that they appear
to lie in the same plane. Czanne exploited false attachment to considerable effect
in his Victor Chocquet Seated of 1877 (plate 18), where a number of lines converge upon
the sitters wrist, making it look as though the planes to which they correspond are
pinned together at this point.91 It needs emphasizing, however, that pictures of this
sort are not completely ungrammatical, as they can only perform their tricks if they
are grammatical for the most part.92
A more radical subversion of pictorial grammar employed by Czanne and
the cubists is effected by the device known as passage, which describes their use
of marks interposed between or across occluding contours. Passage is particularly
evident in the area immediately to the right of the right edge of the pear in the lower
centre of Pot of Primroses and Fruit of c. 188890 (plate 19), where it elides the transition
between the base of the pot and the tabletop. Passage is significant because it is
unclear what kind of device it is. It might at a pinch be seen as lexical constituent
of the painting that Czanne used in order to register the vacillation edges undergo
when fixated;93 but it can just as easily be regarded as a functional constituent of
the work that Czanne used more synthetically to play down the breaks in depth
that contours normally create. This means that, if so, it only has meaning in so far
as it intervenes in the relationships between the pictures lexical items, just as the
functional constituents of a sentence (conjunctions, prepositions, determiners like
the and a, and complementizers, including auxiliaries and modals that modify
verbs) only contribute to its sense by specifying relations between its lexical parts.
But although words that are ambiguous between lexical and functional constituents
Association of Art Historians 2011
582
Paul Smith
are largely absent from language, this does nothing to threaten the idea that passage
is a form of creative ungrammaticality. It merely points to the fact that pictorial
grammar is different from its linguistic counterpart.
Czanne also exploited higher-level grammatical anomalies by employing
more than one projection system in the same work. Willats argues that Czanne
did this in his Still Life with Commode of 188788 (plate 20) by employing one form of
projection approximating to vertical oblique for the table at the front and another
approximating to horizontal oblique for the commode behind it. The two systems
are not massively anomalous, however, especially since Czanne used neither strictly,
and may in fact have been using a disguised form of oblique projection for the
commode.94 So the picture is not just an ill-formed icon but a work that warps space
for expressive effect.95
Adults pictures can couple projection and denotation systems anomalously, just
as childrens do. Willats shows, for example, that van Gogh resorted to combinations
of this kind when he used vanishing-point perspective in conjunction with regions.96
Arguably, however, Czanne used more complex and ambiguous combinations,
employing forms of parallel projection that render reasonably convincing views
while preserving aspects of the objective character of shapes, and employing
primitives in the form of areas of contrasting tone and colour that serve equally well
583
Pictorial Grammar
Paul Smith
a number of classes and can perform a wide variety of roles, most of which are
impossible for the narrow range of constituents to be found in pictures.98 Language
also possesses multifarious functional constituents, which can generate complex
syntactic relationships between its lexical parts, whereas nothing of the sort applies to
pictures. Wollheim is therefore on firm ground when he claims that pictures cannot
be segmented into parts that can be categorised according to the contribution
they make to the meaning of the whole,99 or that we have no need to classify the
basic units of pictures into general categories equivalent to noun, verb, adverb
etc. But such objections do nothing to undermine the idea of pictorial grammar
per se since parsing a picture only ever need involve distinguishing a few kinds of
primitives. And even these would seem to have some lexical diversity, and perhaps also
at least a suggestion of functional capacity.
4. Innateness
585
Pictorial Grammar
GRAMTI
CA DE
ESQUEM
AS
in contours (plate 21). And he even demonstrates that many of the drawings that
researchers elicited from children of the Jiri valley in Papua New Guinea, who were
wholly unused to the practice of drawing, exhibit a similar morphology (plate 22).
The fact that childrens ability to understand speech and drawings greatly
outstrips their ability to produce them may, by contrast, seem to undermine the
argument that innate grammar underlies both kinds of competence. It simply goes to
show, however, that producing output in either form is complicated by the need to
co-ordinate grammatical competence with performative, and predominantly motor,
skills.109 Indeed, Harrison has argued that the capacity of most people to recognize
pictures of far greater sophistication than they can produce can only be explained by
the existence of a generative grammar of the pictorial that we all share.110
The fact that children characteristically make errors in both their speech
and their drawings could also seem to contradict the idea of innate grammatical
competence. But as far as speech goes, it is clear that
many of their errors are grammatical in the sense that
they occur when a child applies a rule that works
perfectly well for one situation to another for which
it is inappropriate, as for instance when they add ed
to the root of an irregular verb to turn it into the past
tense.111 Willats argues in a similar vein that some of
the errors in childrens drawings can be attributed to
misapplications of grammatical rules.112 An example
might be the transparency error, which children
commit when they draw the line denoting the edge of
a solid object which is occluded as happens in plate
23, where the line denoting the top edge of the table
top is visible through the box in front of it.113 Such
errors can at any rate be seen as the result of a failure
on the part of the child to grasp how the particular
task in hand demands a modification to the rules for
drawing shapes she had previously acquired. What is
586
Paul Smith
more, the fact that children often learn to correct such mistakes themselves without
having to be taught the error of their ways suggests that they possess an innate sense
of what can count as grammatical drawing, just as they can work out what constitutes
grammatical speech.114 And indeed, in this drawing, the child spontaneously added
shading to obscure the illegitimate contour. Even the almost complete immunity
children exhibit towards adults attempts to correct their errors can be attributed to
their developing command of grammar. They will, in other words, normally only
elect to use the conventional norms when their competence has reached the point
where they are ready to do so.115
The broader significance of the argument that competence is innately grounded
is that it makes sense of facts that naturalists like Schier acknowledge but cannot
explain. So, for example, Schier acknowledges that our ability to recognize a wide
range of pictures is generative, in the sense of productive, and that our capacity to
produce a wide variety of icons exhibits generativity as well,116 rather as Wollheim
seems to,117 but denies that these competences are grounded in any innate grammar.
This means that his account offers no explanation of what makes generativity
possible. But Willatss argument that we are born with the capacity to map shapes
and their relations fills the explanatory gap perfectly.
5. Transformation
It is apparent from the general drift of Willatss account that his conception of the
mapping rules implemented in pictures is indebted to Chomksys thinking. His use
of the word transformations118 to describe the forms in which pictures render their
deep perceptual content also makes a clear allusion to Chomskys early idea that
a sentence transforms a deep grammatical structure into a surface structure. To all
intents and purposes, therefore, Willats would seem to regard pictorial grammar as
analogous to the universal grammar that Chomsky envisaged in the form of a set of
algorithms determining the transformations speech makes out of its underlying
contents. Admittedly, this conception of grammar begs some intractable questions
about what exactly the substrate to speech consists of.119 But if it is reasonable to
assume that language maps some pre-articulate linguistic equivalent of thought,
or even thought itself, into a public form,120 then by analogy it seems reasonable to
assume that pictures transform shape percepts into a form that makes them publicly
available.121 This idea or its principle finds support in the fact that computer
programs capable of mapping shape descriptions into visual form are readily
written.122
A more specific similarity between the transformational work done by pictures
and language is that both employ forms of deletion when they convert their deep
content into surface output. Hence, the surface structure of the sentence, A wise
man is honest, is more concise than the deep structure it corresponds to (plate
24), since, typically, it eliminates syntactic constituents that its counterpart makes
explicit.123 Notwithstanding, deletion of this kind has no serious deleterious effect
on the sense of most sentences. Something similar, although not identical, is true of
deletion in line drawings, which only inform us about the edges of objects and their
relations at the cost of excluding semantic information about the shape of objects
provided by shading. They can, as a result, be a little unclear about the contiguity of,
or distance between, objects, compared with fully elaborated drawings employing
cast shadows.124 But most of the time, line drawings provide perfectly good working
representations of objects and scenes, perhaps because they map what Marr calls the
2D sketch, or the skeletal preview of a scene that we construct for the purposes
587
Pictorial Grammar
Any language must be at least potentially accessible to its users, since a logically
private language is not a language at all.130 One way of explaining how all languages
Association of Art Historians 2011
588
Paul Smith
Pictorial Grammar
cultures, whereas the different languages are relatively opaque to non-native speakers
irrespective of the question of unfamiliar vocabulary.
Conclusions
It is fair to say then, that one major achievement of Willatss account of representation,
suitably modified, is that by furnishing a conception of pictorial grammar as
something to which nature and convention both contribute it can relieve the profound
antinomy between pictorial syntax and iconicity that dogged earlier debates about
representation. But perhaps the most general conclusion to be drawn from Willatss
work is that grammatical structure is central to how, and what, pictures articulate.
This basic idea has many possibilities. One is that a more developed theory of
pictorial grammar might allow us to specify more closely just what pictures can, and
cannot, achieve by pictorial means alone, apart from brute depiction. For example, it
still needs to be decided whether a picture can represent a particular object, or kind
of object, by showing a cluster of visible properties alone, or whether it can only do
so by recruiting concepts from language. By corollary, the same sort of analysis might
make it clearer how pictures are dependent on the services of concepts when they
show particular aspects of things,137 and on more complex alien structures imported
from language when they make statements,138 or tell stories.139
Closer attention to the constraints imposed by grammatical structure might
also produce a different, or at any rate a more complete, understanding of why
some pictorial forms evolved as they did than is presently available. So, for instance,
although the composite forms used to articulate the human figure in antique Greek
vase painting are readily described as conventional schemata, it might enrich our
understanding of them to appreciate how they are determined by the orthogonal
projection system demanded by curved, rotatable, surfaces that do not readily tolerate
more sophisticated drawing systems dependent on a fixed viewpoint.
Another possibility of a theory of pictorial grammar is that it might reveal how
mapping rules apply to colour, as well as to drawing, in painting. More specifically,
it might be capable of expressing what kind of regularities must obtain between
colours in a representational painting if they are to stand successfully for colours
in the world, which they cannot do punctually because chromatic effects like
contrast,140 and chromostereopsis, operate differently on a flat surface from how
they operate in depth.
Willatss radical idea that the grammar of projection makes all pictorial space sui
generis might be extended, as well, into an analysis of the space produced by artists
individual styles of drawing, particularly if combined with his idea of how artists
disguise their drawing systems and play with the rules of concatenation. An analysis
of this sort might reveal, for example, how Czannes peculiarly elastic space lends
itself to the expression of our tendency inside acts of seeing to probe the world
before us for the meanings it has for our potentially grasping physical hands and
mobile body.
Willatss more specific ideas about the more basic semantic constituents of
pictures have a similar potential inasmuch as they might explain how an artists
individual style is closely dependent on such things as the particular manner in
which it articulates pictorial features like edges. More particularly, the notion taken
from Frdo Durand that pictorial marks bestow attributes, including colour, tone,
transparency, texture, thickness, wiggling and orientation upon lines and other
pictures primitives, might start to explain why some artists way of using marks gives
the objects and spatial relations in their pictures a characteristic look of their own.141
Association of Art Historians 2011
590
Paul Smith
Notes
This article owes much to the writings of and conversation with
several colleagues and friends, especially John Willats, Jason
Gaiger, Andrew Harrison, Michael Podro, and Richard Wollheim.
I am also indebted to the J. Paul Getty Trust for a scholar grant
which allowed me to pursue the research it presents.
17
18
1
2
3
4
5
7
8
9
10
11
12
13
14
15
16
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
See Chomsky, Theory of Syntax, 8. Cf. Steven Pinker, The Language Instinct:
The New Science of Language and Mind, London, 1995, 84.
Bois, The semiology of cubism, 174. Cf. Kraus, The motivation of
the sign, 2623, and Rosalind Kraus, In the Name of Picasso [1981],
in The Originality of the Avant-Garde and Other Modernist Myths, Cambridge,
MA, 1996, 347.
Cf. Andrew Harrison, A minimal syntax for the pictorial: The
pictorial and the linguistic analogies and disanalogies, in Salim
Kemal and Iwan Gaskell, eds, The Language of Art History, Cambridge,
1991, 213, which argues that any account of linguistic or pictorial
meaning that takes no account of reference is absurd.
Ludwig Wittgenstein, On Certainty, Oxford, 1969, 160, and Anthony
Grayling, Wittgenstein, Oxford, 1996, 956.
Wollheim also attacks Kraus here. See Wollheim, Formalism and its Kinds,
28 and 38, note 12. Cf. the argument that cubism is not about the
nature of the sign in Flint Schier, Painting after art? Comments on
Wollheim, in Norman Bryson, Michael Ann Holly, and Keith Moxey,
eds, Visual Theory: Painting and Interpretation, Cambridge, 1991, 154.
Richard Wollheim, Painting as an Art, London, 1987, 43100. For a
succinct analysis of Wollheims position, see Harrison, Minimal
syntax, 2204.
Wollheim, Formalism and its Kinds, 267. All subsequent quotations
from Wollheim are from this section of text. The essay is reprinted,
in an abbreviated and modified form, as On formalism and pictorial
organisation, Journal of Aesthetics and Art Criticism, 59: 2, Spring 2001,
12737.
Cf. Flint Schier, Deeper into Pictures: An Essay on Pictorial Representation,
Cambridge, 1986, 150, which paraphrases Roger Scrutons criticism
of Barthes semiotics for its inability to adduce a grammatical
rule connecting the meaning of a systems significant parts to the
meaning of the ensemble.
Wollheim, Formalism and its Kinds, 1995, 37, note 10.
Schier, Deeper into Pictures, 678.
Schier, Deeper into Pictures, 67.
Schier, Deeper into Pictures, 656.
Schier, Deeper into Pictures, 66.
Schier, Deeper into Pictures, 667.
Chomsky, Theory of Syntax, vi. Cf. Max Clowes, On seeing things,
Artificial Intelligence, 2: 1, Spring 1971, 79116, esp. 80.
Harrison, Minimal syntax, 21617 and 2257. For the view
that pictures cannot map appearances constituted by Gestalt and
constancy effects (inter al.), see E. H. Gombrich in Mirror and Map
[1975], in The Image and the Eye, Oxford, 1982, 172214.
Cited in Noam Chomsky, On Cognitive Capacity [1975], reprinted in
On Language, 335, esp. 8.
Michael Podro, Formal elements and theories of modern art, British
Journal of Aesthetics, 6: 4, 1966, 32938, esp. 335.
Richard Wollheim, Form, elements, modernity: Reply to Michael
Podro, The British Journal of Aesthetics, 6: 4, 1966, 33945, esp. 344.
Richard Wollheim, Giovanni Morelli and the origins of scientific
connoisseurship, On Art and the Mind: Essays and Lectures, London, 1973,
2001.
Goodman, Languages of Art: An Approach to a Theory of Symbols, Indianapolis,
IN, 1976, 133.
Goodman, Languages of Art, 13141.
Goodman, Languages of Art, 22532.
Curtis Carter, Painting and language: A pictorial syntax of shapes,
Leonardo, 9: 2, Spring 1976, 11118, esp. 11215.
591
Pictorial Grammar
69
70
71
72
73
592
107 Willats, Childrens Drawings, 147, 171, 177, 216, and 235. Children, such
as Stephen Wiltshire, who draw in perspective spontaneously are
exceptional. See Oliver Sacks, An Anthropologist on Mars, New York, 1995,
1857.
108 Willats, Childrens Drawings, 159, and Art and Representation, 289 and
31115.
109 Cf. Willats, Art and Representation, 165, and Childrens Drawings, 379, 601,
and 767.
110 Harrison, Minimal syntax, 229.
111 Cf. Willats, Art and Representation, 175.
112 Willats, Childrens Drawings, 89.
113 Willats, Childrens Drawings, 17980.
114 Willats, Childrens Drawings, 170, 181, 201, and 229.
115 Willats, Childrens Drawings, 172. Cf. Carnie, Syntax, 21.
116 Schier, Deeper into Pictures, 55, 66, note 1, and 151. Cf. Harrison,
Minimal syntax, 221, which argues that Wollheim clearly does
think that the pictorial is in some central sense generative despite
his opposition to semiotic theories of it.
117 Harrison, Minimal syntax, 221.
118 Willats, Art and Representation, 1712 and 289.
119 See Chomsky, Theory of Syntax, 1718 and 634, and The Minimalist
Programme, 223. Cf. Smith, Chomsky, 434.
120 Cf. Ludwig Wittgenstein, Philosophical Investigations, Oxford, 1953,
319, 320, 329, and 335, and Pinker, The Language Instinct, 5582;
121 Cf. Wittgenstein, Philosophical Investigations, 198e.
122 See Willats, Art and Representation, 1712.
123 On the condition of deletion, see Chomsky, Theory of Syntax, 17984,
and Language and Mind, 502. Cf. Cook and Newson, Chomskys Universal
Grammar, 280.
124 Willats, Art and Representation, 13841.
125 Willats, Art and Representation, 112 and 1523. Marr, Vision, 26894.
126 Wollheim, Painting as an Art, 23 and 359, note 16.
127 Irving Biederman, Recognition-by-components: A theory of human
image understanding, Psychological Review, 94: 2, 1987, 11547.
128 Smith, Chomsky, 608, and Cook and Newson, Chomskys Universal
Grammar, 203 and 325.
129 Harrison, Minimal syntax, 229.
130 See Marie McGinn, Wittgenstein and the Philosophical Investigations, London,
1997, 11342.
131 Willats, Art and Representation, 146, 364, note 6, and Childrens Drawings,
1314.
132 Willats, Art and Representation, 3313.
133 Pinker, The Language Instinct, 2034.
134 Chomksy, Theory of Syntax, 6.
135 Cf. Willats, Art and Representation, 34 and 3534, which emphasizes the
function and purpose to which the different systems are put.
136 Willats, Art and Representation, 341.
137 See Victor Burgin, Seeing sense, in The End of Art Theory: Criticism and
Postmodernity, Basingstoke and London, 1986, 5170, and Wittgenstein,
Philosophical Investigations, 193e214e.
138 Schier, Deeper into Pictures, 1205. Cf. Harrison, Minimal syntax, esp.
219. Cf. Wittgenstein, Philosophical Investigations, 244, and Hans Sluga,
Whose house is that?: Wittgenstein on the self, in Hans Sluga and
David Stern, eds, The Cambridge Companion to Wittgenstein, Cambridge,
1996, 340.
139 Wollheim, Painting as an Art, 1889.
140 Cf. Jean-Dsir Rgnier, De la lumire et de la couleur chez les grands matres
anciens, Paris, 1865, esp. 318.
141 John Willats and Frdo Durand, Defi ning pictorial style: Lessons
from linguistics and computer graphics, Axiomathes, 15: 3, September
2005, 127, esp. 37, and 1221.
593