Postures of listening
An ontology of sonic percepts from an anthropological perspective
Publisher
Association Terrain
Electronic version
URL: http://terrain.revues.org/16418
ISSN: 1777-5450
Brought to you by Centre national de la recherche scientifique (CNRS)
Electronic reference
Victor A. Stoichita and Bernd Brabec de Mori, « Postures of listening », Terrain [Online], Symposia and
Debates, Online since 14 November 2017, connection on 24 November 2017.
URL: http://terrain.revues.org/16418
Terrain is made available under the terms of the Creative Commons Attribution -
NonCommercial - NoDerivatives 4.0 International License.
auditory perception in terms of a massive error theory”. Massive error theories may not
be a problem in physics, but they are hardly ever satisfactory in anthropology.
3 If we are to understand what people experience through their ears, we then need to leave
aside the fact that ears sense pressure waves. Just as with the eyes perceiving light rays,
this is not untrue from a biological point of view, but it is not of immediate interest for
understanding cultural representations and social interactions. What we need is a more
specific description of the kind of things people sense with their ears, and the general
interactions these things afford.
4 After providing more background to auditory perception in the following section, we will
present the main thesis of this paper, which is a model of three alternative modes of
listening that can be adopted by any listener. The modes are akin to postures a listener
occupies when paying attention to a sonic event. We call these postures of listening: (A)
indexical listening; (B) structural listening; and (C) enchanted listening. These will be
described in detail. During discussions with colleagues, we found that some specific
questions were repeatedly raised.1 After proposing the model of listening postures, we
include a brief section for answering these frequently asked questions (FAQs). Finally, we
conclude with a section dealing with some of the consequences of our model, specifically
for the case of “music” as a form of enchanted listening.
meant to bypass any “change deafness”). The distinction between attention and
awareness helps to understand where auditory processes stop being universally
predictable. In fieldwork, anthropologists and ethnomusicologists gather data relevant to
people’s aware perceptions. But the allocation of attention is probably the earliest stage
where perception can be modulated by cultural preferences.
7 Beyond low-level processes of auditory stream segregation, most cognitive operations
involved in hearing arguably depend on learned systems of knowledge and meaning. This
was shown in Steven Feld’s work (1990, 2000) about hearing and meaning among the
Papuan Kaluli, and even more specifically by Rafael José de Menezes Bastos (1999, 2013).
This author shows how the analysis of native terminologies and axionomies leads us to
observe that, for example, among the Xinguano indigenous people, “sound is as material
as stones are for us” (de Menezes Bastos 2013: 292), or that in the same society, sound is
not merely perceived, but “sound is actively sought and captured by the ears” (de
Menezes Bastos 2013: 292). The very base of auditory perception acquires a different
range of meaning and agentive power when conceptualized in such a way. Building upon
this, Menezes Bastos’s former students Mello and de C. Piedade (2005) also demonstrate
that the construction of space based on auditory perception among the Central Brazilian
Wauja shows particularities that are different from an alleged Western classical tradition.
They assume that the ontological characteristics of sound depend on culture, so that
although psychophysical basic processes may be similar, the characteristics of the things
heard are particular.
8 Ethnography shows that anthropologists cannot really presume what kind of things
people hear. The internals of the biological ear (an organ which clearly senses pressure
waves) do not tell much about the things that actually constitute people’s auditory
realms. This applies to “sounds” but also to “language” or “music”. Firstly, these concepts
are rather cumbersome in cross-cultural comparisons. “Language”, for example, can have
different extensions in various societies, which may or may not include the sounds of animals, rivers
and so on. Likewise, “music” does not even exist as a category in most non-European
languages (sounds for healing, hunting or having fun are not necessarily linked by an
overarching concept). More importantly perhaps, “language”, “music”, or “the acoustic
environment” refer us to phenomena. We propose not to take phenomena for granted,
and to start by describing the modes of awareness which allow for auditory phenomena
to occur. This is one significant difference from most existing theories of listening: we do
not ask how many ways there are to hear “music”, or how listening to “music” compares
with listening to “language”. We propose to start our inquiry earlier, at a stage where we
do not know at all what people listen to.3 From there, we can simply observe how humans
interact with their surroundings through their ears: what they do, how they react, what
they say about it. On this basis, we try to give “functional specifications to the structures
that must be present” (Hutchins 1995: 131) beyond the reach of our methods of
investigation.
9 This approach is not meant to reveal especially new things about how people listen to
“sounds”, “music” or “language” (although it might serve to clarify a few points along the
way). The expected benefit would rather be to understand how they listen to things like
“gods-as-vibrations”. Beings such as these are not unusual encounters in ethnography,
but they remain critically absent from typologies of listening (when they are allowed
entrance, it is only as “sounds” or as “music”; see for instance Becker 2004). Another
objective of our inquiry is to determine whether some listening modes are found in all
human societies. On the one hand, ethnographies show great variability in the way
systems of auditory knowledge are built. On the other hand, humans engage in
remarkably cross-cultural activities: they listen for prey when they go hunting, they talk
to each other (sometimes to animals and rivers too), they dance … Beyond the general
ability to pay attention to sounds, are all modes of listening culturally variable? Or are
there some modes which are shared by all human beings? By investigating this question,
we might ultimately even be able to say something about temptingly universal
phenomena such as “language” and “music”.
10 Here we will deal exclusively with listening, leaving aside sound production. Listening, as
opposed to hearing, implies awareness. There is, however, more to listening than
awareness. In many cases, the sound one hears affords a certain reaction to the hearing
individual. Gibson suggested that the perception of affordances is embedded at a very low
level in our appraisal of the world (Gibson 1977, 1986).4 As Martin Clayton puts it for
sounds, “we do not passively perceive and subsequently decode sonic information, so
much as actively scan sound energy for patterns of which we can make sense” (Clayton
2001: 11). Cognitive and acoustic experiments have shown that this “scanning” requires a
mixture of short-term memory and anticipation (for a recent overview see Castellengo
2015: 146 sqq.). This means that auditory objects like tunes, words or familiar noises are
never “sensed” literally. The present, through our eardrums, brings us only a short
glimpse of them. In Husserl’s terms, “the objectivity of the sound that lasts is constituted
in the ‘continuum’ of an action that is in part remembrance, in a very short, punctual
part, perception, and in a larger part, expectation” (Husserl 1964: 36–37, quoted in
Castellengo 2015: 487, our translation). By “listening” we refer to this complex of action,
memory and imagination.
11 We define alternatives of listening as distinct ways of using a given item of sensory
information by the same being to construct different kinds of listening objects. These
objects are of different kinds when their affordances for that being differ. In a given
acoustic environment, the listener adopts one of these alternative listening postures, and
thereby perceives a specific set of entities, which opens the way to specific relational
possibilities. As their name implies, alternatives yield incompatible results: the same
auditory field can be apprehended in one way or another, but probably not in two ways at
the same time (see FAQ section for discussion).
12 We propose that three alternative postures can be identified in all human societies. We do
not claim that these listening modes are the only ones which humans have developed. We
do think, however, that these modes of listening are used and applied among all humans,
and they are probably the only ones with such universal character. Such an assertion can
of course not be proved. The best one can do to address something that might be common
to all humans is to make one’s claim refutable (in the sense of Popper 1959). To that
effect, we try to describe each alternative and the kind of auditory world it constructs in a
precise way, while also setting out its most obvious empirical consequences.
14 For such a hypothesis to be obtained, sounds are treated as indexes in the sense of Peirce
(see also Peirce 1992; 1998: 13). Through this way of listening – moving upstream to the
physical cause of the sound heard – animals endowed with hearing attain three different
kinds of knowledge.
15 Firstly, they discover and locate other entities acting in their surroundings (“something is
walking there”). From the sounds heard, they assume the mere existence of “something”.
16 Secondly, they can form a hypothesis about the thing or being which they hear. Acoustic
signatures allow us, for example, to infer that what is there in the branches is a fruit
dove, or that the person on the phone is Aïcha. This is a hypothesis about the entity’s
identity. Although it may occur simultaneously with the first inference, existence and
identity are distinct. We can indeed be wrong about the species of the bird in the tree or
the identity of the person on the phone, but something or someone is definitely there.
Both inferences are indexical, because they are built on causal associations. These
associations are grounded in observed recurrences: this kind of sound is usually produced
by a fruit dove, that one by Aïcha, etc. (for a perspective from cognitive psychology, see
e.g. Keller & Stevens 2004).
17 There is a third kind of indexical inference in sound. It enables listeners to build a
hypothesis about the interior state of other beings. This is what occurs, for example,
when we think that a person’s voice “betrays” the speaker’s thoughts or feelings. The
feelings may not be apparent in the words uttered by that person, but we still think that
we sense fear, happiness or anger in the tone of the voice. We rely for this on an indexical
interpretation of vocal prosodic features (intonation, rhythm, timbre, etc.) that we
understand causally as being produced by specific interior states. Note how this is
different from intentional communication: when we say that a voice “betrays” the
speaker’s inner feelings, the inference cannot be said to rely on the deliberate use of a
shared symbolic system. The “betraying” indexes are in fact not symbols at all in the
sense of Peirce. There is no convention stating how they should be interpreted. The
listener can only guess Aïcha’s interior state through the observance of previous
regularities (e.g. usually, when Aïcha sounds like this, she is in an angry state of mind),
and the inference is affected by gradual variations in sound (Aïcha can sound more or less
angry depending on how salient the corresponding prosodic features are in her voice).
18 Of course, indexical interpretations can be wrong, and indexes can also be faked. Let’s
consider the first inference about existence. It is usually accurate in ecological
environments: if you hear something there, something is probably there. But illusions can
be built on this inferential process with special tools like recording and playback
equipment. Listen to a recording of your favourite string quartet for example.
19 How many things do you hear? If you listen in stereo, the direct indexical answer should
be two. Most Europeans, however, will feel that they are listening to “the quartet” – two
violins, a viola and a cello, that is four – not to the loudspeakers. In such a case, the
recording and playback apparatus function as an extension of the situation in which four
actual sound sources were recorded and mapped onto the stereo panorama.5
20 The second kind of indexical inference – identification – can also be faked. The fruit dove,
for example, can be tricked into believing that it hears the calls of a potential mate, when
in fact a hunter lies hidden in the bushes. It is worth noting that tricks of this kind work
precisely because they are exceptional. Humans too rely on voice recognition in many
daily interactions, because voices are difficult to imitate.
21 Lastly, interior states can also be falsely attributed. By using vocal “icons of crying”
(Urban 1988), professional mourners can convey feelings of sadness while mourning for a
family they hardly know (cf. Amy de la Bretèque 2013: 89). Here again, it takes some
special skills to get the imitation right and make it convincing. Stage actors also need to
train their voices in order to acquire the special ability to embody various states of mind
on demand. Because such skills remain uncommon, listeners tend to infer the interior
states of other beings from the way they sound, even when they are well aware that the
feelings are enacted and not spontaneously felt.
22 One of the reasons why acoustic indexes are hard to fake is that the listener’s
inferences are very sensitive to infinitesimal variations. In indexical listening, it makes a
difference whether the voice we hear is a little higher or lower pitched, whether
consonants receive a bit more or less stress, whether the steps we hear sound a bit
further or closer. As shall be seen, many of these infinitesimal variations are stripped
away in structural listening, which will be outlined hereafter. Some of them are relevant,
again, in enchanted listening, but there they pertain to a totally different ontology.
23 Another specific characteristic of indexical listening is the way it maps sounds in space.
When we hear them as indexes, sounds share the same location as their physical source. If
we hear the fruit dove on that tree, for example, we expect to also find it there physically.
In indexical listening, hearing, sight and touch should map a similar world. We will see
that this contrasts with the two other kinds of listening to be described.
can be uttered by various voices, with various pitches, various intonations, louder or
softer, or at various speeds.6 And they can also be written down, flashed over the sea in
Morse code, or transmitted as binary data over computer networks. Such operations
imply an abstract level, where the “value” of the different phenomena, notwithstanding
their different material forms, remains the same.
28 This is how Saussure reached his conclusion that “it is impossible for sound, which is
material, to belong by itself to language” (de Saussure 1916: 164, our translation). In a
broader definition of language, which would include notably prosodic factors, gestures
and contextual implications, structural listening is only a small part of what we do in
linguistic communication. But when we listen for structures, we set aside most acoustic
features of sound, and the resulting auditory object is hardly acoustic at all. Whereas
infinitesimal variations are important in both indexical and enchanted listening postures,
here only a few oppositions matter.
29 Structural listening is not limited to linguistic communication. It can be triggered at will
by the listener, on any kind of sound. For example, bird calls can be “understood” in a
structural sense, when the way in which the bird vocalizes transmits an omen (cf. Walker
2010). In entirely different contexts, data sonification is often used to convey information
that is clearly abstracted from the actual sounds (see e.g. Supper 2014). Musical notations
are yet another achievement of structural listening. In order to write music down
(whether we compose or transcribe it from what we hear), we need an abstraction layer,
where sound and graphic signs share some common structure. The mundane ability to
consider that, say, a xylophone and a flute play “the same” melody points to a similar
abstraction. To assert their equivalence, the listener must retain only the structural
relations of the pitches, discarding the obvious differences in the sound spectra of these
instruments.
30 Structural listening contrasts with both the indexical and enchanted postures in several
ways. We have already mentioned its insensitivity to many variations of sound. This
extends to spatial location. Physical space is important in indexical listening. As we shall
see, enchanted listening constructs a space of its own. But space is abolished altogether in
structural listening. Whether the person telling you Cinderella’s story is sitting to your
right or your left does not change the representation you form of the narrative. A
conventional musical transcription will be similarly unaffected by the position of the
sound source being transcribed.
“Virtual causality” and “animation”, for example. A listener may sense entities which
“move”, relate to each other in various ways, and possibly embody an agency of their
own. She might also perceive them as “coloured”, “shaped” or “textured”. Our listener
can be well aware that colours cannot normally be heard (even true synaesthetes do not
actually “see” colours in sound7). She hears them nevertheless, and they have nothing to
do with the colours of the sound sources. She may also check that these properties vanish
away when she adopts another manner of listening. She can “step back” and listen
indexically, for example, to the physical source: where is it, what is it, does it move, etc.
In that world, there are no more lines, movements or textures, just some sound coming
from some being or object. She can then step in once more, whereupon only the same
original movements, colours and textures will exist again.
33 If she uses a European language, our listener will probably label what she perceives in this
listening as “music”. Ethnomusicology has shown, however, that this concept is not found
in many other cultures. In the Amazon, for example, indigenous people often use terms
that refer to specific ways of singing, at the same time excluding the sounds of
instruments, which, on the other hand, are understood as voices of spirits (Brabec de
Mori 2012; de Menezes Bastos 2013; Piedade 2013). Even in those societies where the
concept of “music” is used, its extension is ambiguous. In many Muslim societies, for
example, what is called musīqī (with this loanword from Greek) explicitly excludes calls to
prayer (adhān) and recitation of the Qur’an, both of which are nevertheless definitely
perceived as “music” by the average European, and as peculiarly agentive vocal
productions by the pious Muslim (Hirschkind 2004; Shiloah 1997). For this reason, it is
better to stick to the listening ability itself. We will refer to it as “enchanted listening”.
34 We understand enchantment in reference to the “technologies of enchantment”
discussed by Alfred Gell (1988, 1992, 1996, 2006). Gell’s proposal stemmed initially from an
analysis of the concept of technology (Gell 1988). He observed that some technologies are
meant to modify the world, while others target instead the way the world is perceived.8
The latter he named “technologies of enchantment”. Enchantment remained, however,
an obscure concept throughout his work, which focused primarily on the “technological”
side of the proposal. Some years later, on a different path, Philippe Descola pointed out
that the beings and things which constitute people’s experiential worlds are first
recognized and categorized through a set of low-level cognitive processes. He called these
processes “schemas”, in the general sense of “abstract structures that organize
understanding and practical action without mobilizing mental images or any knowledge
conveyed in declarative statements” (Descola 2005: 149, translation from 2013: 59).
Descola proposed a cross-cultural survey of collective schemas which govern
identification and relationships. He compared in particular the distribution of
“interiorities” and “physicalities” in different ethnographies, and arrived at the
conclusion that “these principles of identification define four major types of ontology,
that is to say systems of the properties of existing beings” (Descola 2005: 176, translation
from 2013: 64). Now auditory experience is normally not an autonomous ontological
realm. Most of the time human beings do not consider it under a distinct “system of the
properties of existing beings”. Indexical listening points to non-auditory causes.
Structural listening points to non-sonic structures. On occasion, however, human
audition can materialize specific systems of sonic beings which then display particular
sets of sensory and relational properties. People typically describe them in terms of
unhearable dimensions which are not linked to any physical causes. Quite often too, sonic
beings are endowed with autonomous agencies, meaning the capacity to initiate actions
by themselves.9 We call enchanted listening the fact of experiencing a properly auditory
ontology.
35 This ability appears to be shared by all human beings. In all human societies there exist
interactions that rely on sounds being apprehended as a world of their own. In such a
world, sound events are related to each other rather than to their physical sources. They
obey specific intrasonic causalities. For example, people may feel “tensions” and
“releases” in sound, and react to them emotionally, as well as bodily through dance. This
is a classic way of experiencing tonal music. The building of “tension” and “release” is
taught in composition classes, and is also used as a basic analytical framework by many
musicologists.
36 “Tension” and “release” refer to a world where sounds have the ability to build up in an
equilibristic pile, oppose “their own” antagonistic energies, lose “their own” momentum,
leading “by themselves” to new sound events which resolve the instability and release the
tensions accumulated. None of this happens in a world where sounds occur because a
musician pressed some buttons on her instrument. It happens in a world where sounds
obey their own causal rules, a world where sounds occur because of other sounds.10
37 “Tension” and “release” are merely examples that apply to specific kinds of music. They
are by no means universal concepts, and other representations can be shown to fulfil
similar roles elsewhere. In Papua New Guinea, for instance, the Kaluli use (or at least used
in the 1980s) a wide range of water-derived terms to describe positions and movement
tendencies in the sound realm. According to Feld (1981), in the Kaluli language, sa is a
standalone term for a waterfall. It can also be used as a prefix for many things related to
water, and also to song. For example, a sa-we:l refers to “the ledge or upper place from
which the waterfall drops”, which in song corresponds to “the leading pitch in a line or
phrase from which the melody descends”. Hence one can correct someone’s singing using
such a sentence as “the waterfall ledge is too long before the fall”. Feld gives numerous
other examples outlining a consistent use of hydraulic representations by the Kaluli
people in commenting on their songs.
38 In Kaluli aesthetics, references to water flows play a role similar to the mechanics of
“tension” and “release” in Western tonal music. Listening to something like a water flow
or an equilibrium of energies demanding resolution are both enchanted experiences,
because sounds then interact with each other in a suspended realm, according to rules of
their own.
is a purely auditory object, a sonic being. The Kaluli experience of ilib drumming bears
the mark of enchanted listening, because it causes, or results from, ontological shifts and
the reframing of human/non-human agencies.
44 Enchanted listening does not actually require articulate cosmologies. “Movement”, for
instance, is a basic metaphor for sound processes throughout Western music (e.g. “rise”
and “fall” in a tune, rhythmic “swing”, “walking” bass).
“Buildup” and “breakdown” in a trance music track, analyzed in Butler 2006: 315.
Abbreviations: bass drum (BD), riff 1 (R1), snare drum 2 (SD2), snare drum 3b (SD3b), riff 2 (R2).
Musical excerpt from Communication (Somebody Answer The Phone) by Mario Più, 1999, Incentive – CENT2T.
of context, the reader could well believe that Swann encounters an actual female person.
This is intentional, and over the course of the novel, the petite phrase, which always
appears personified, becomes a sort of emblem for Swann’s relation with Odette de Crécy.
48 It probably takes Proust’s mastery to verbalize sound impressions with such delicacy. On
the other hand, that most readers understand his description points to the fact that this
kind of interaction is not completely foreign to them. Watt and Ash (1998) have shown
that British listeners are prone to relate musical excerpts to traits of personality such as
age, gender and emotional states. Once you admit that auditory objects can be “female”,
“move” and have affects of “their own”, it is indeed easy to figure as well how they might
“charm” a listener like Swann. Or, if you admit that a drum can “speak” like a bird, and
that a bird can reflect a dead spirit calling “father” (which is admittedly more
complicated for European minds), it becomes understandable that one could relate
emotionally to the pulsating sound of the Kaluli ilib drum.
49 Enchanted listening implements collective auditory schemes pertaining to distinctly
auditory ontologies. The ontologies are collective in the sense that they are shared by a
significant number of people, allowing for interactions and mutual understandings. Only
the listening mode however – the fact of switching to a specifically auditory ontology –
can be considered universal.
50 To conclude this section, let us summarize how enchanted listening differs from the other
two modes. Indexical listening is the “default” mode. Its objects are auditory aspects of
non-auditory beings or events. The objects of structural listening are hardly auditory at
all: the whole process is oriented towards abstract patterns. Enchanted listening is the
only mode where auditory objects stand by themselves. This entitles them to a distinct
mode of identification (neither indexes nor structures will do) and a distinct mode of
relation (auditory objects relate directly to other auditory objects): in other words, an
ontology which exists only in audition.
51 One of the interesting properties of such an ontology is that it brings new beings to social
interactions. Contrary to indexes or abstract structures, the objects of enchanted
listening do not refer us to something else. They instantiate other beings here and now.
Such beings materialize into sound and literally “possess” the acoustic vibrations (see
Sartre 1940: 165 for an equivalent in images). Western musical theory also has a
competing interpretation of a process similar to what we call enchantment. Following
Pierre Schaeffer, the severing of indexical ties, and the disregard for abstract
meanings, has been described as a “reduced” form of listening (Chion 1983: 32 sqq., 1994;
Schaeffer 1966, Chap. 15 and 20). In Schaeffer’s terms, the listening is “reduced” because
it puts the world into “parenthesis”. This is supposed to lead to an objectivation of sound
“for its own sake” (Chion 1983: 33). We cannot deny that this might be what some
listeners are seeking. But what Schaeffer seems to have missed is that when auditory
experience is apprehended “for itself”, the outcome is often a transient irruption of new
things and beings into human interactions. All over the world, ethnography shows the
actual opposite of a “reduction”: enchanted listening practised as a path to augmented
social realities.
set human languages apart from other sound systems like musical scales. Nevertheless,
this is a special property of an object. It doesn’t imply that the listening mode which
apprehends it is special too. In our view, parsing acoustic data for oppositional units is
a distinct listening mode, whether the underlying system is articulated at one or two
levels.
Q: Are you sure that A/B/C modes of listening are alternatives? Could they not be used in
conjunction too?
→ The question is whether a person can focus on the same auditory streams in
different ways at the same time. There are indeed sound activities which are defined,
culturally, as having an interest for several modes of listening. “Songs”, for instance,
are vocal productions whereby Western listeners deem it interesting to focus on the
voice and its melody just as much as on the lyrics. Cognitive experiments have yielded
conflicting evidence: in the perception of sung words, pitch and semantic
content were treated either independently of each other (e.g. Besson et al. 1998; Bonnel
et al. 2001), or as an integrated percept (Gordon et al. 2010; discussion in Schön et al.
2005). The main problem here is that these studies assess only how listeners identify
words, pitch patterns and semantic or syntactic incongruities. These are all instances of
the same listening mode (i.e. structural). To answer our question, an experiment should
investigate, for example, the interplay of structural listening with the particular
ontologies of the enchanted mode. To the best of our knowledge this has not yet been
tried. There are some indications, however, regarding indexical vs structural
processing. Vitevitch (2003) asked people to repeat a list of words spoken by a
recorded voice; the recorded speaker was changed midway through the list, but only
half of the participants noticed the change. In a more realistic simulation of a phone
conversation, Fenn et al. (2011) found even lower detection rates (only 6% of their participants
noticed that the interlocutors at the other end of the line had changed). People could of
course reliably detect the changes if cued to do so, or if the differences between the
voices were overtly salient (Fenn et al. 2011). These findings show that it is certainly
possible to use indexical and structural listening in a conversation, but that the normal
mode of attending to someone speaking is structural. This sends to the background
significant acoustic features which would have been relevant in the indexical mode.
Such clues prompted us to present the listening modes as alternatives. This is in line
with our general definition of listening, which implies selective focus. However, since
attentional processes can, to some extent, be allocated in parallel (Bonnel et al. 2001;
Cohen et al. 2012; Demany et al. 2015), we do not make a strong claim about an essential
incompatibility between the three modes.
Q: What about the sound producers? Where do they fit into your system?
→ We set aside the logic and postures of sound production as a different topic. In our
view, sound producers can adopt any of the three listening postures mentioned above.
For example, in a logic of “music” production, the producers are simultaneously
primary listeners of their own sound. They can listen in an enchanted or a structural
posture, depending on the moment and the kind of music (whether they improvise or
play from a score, for example). In “speech”, on the other hand, people hardly ever
listen to their own voice. Instead, they usually concentrate on the meanings they wish
to convey. These examples illustrate that listening postures are only loosely correlated
with sound production postures. The latter would deserve a study of their own.
Q: What exactly do we gain from this theory? After all, we already knew that daily sounds,
language and music were different. Is your demonstration not just a sophisticated way of
reinventing the wheel?
→ Our search for listening postures started a few years ago, precisely from our
repeated frustrations with concepts such as “sound”, “language” and “music”. We
found that these concepts were used inconsistently in current anthropology and
ethnomusicology alike. Definitions were seldom given, or, when given, seldom followed
(see Ingold’s remark on the “sound” in “soundscape”, for example). We realized that
adding yet another definition for these concepts was not the way to go. Instead, we
looked for what people did with sounds, and what that revealed about their auditory
experience. One crucial fact, which we tried to keep central throughout our proposal,
is that human audition is never entirely constrained by the outside world.
The same vibrations become alternative kinds of things in audition, depending on the
posture adopted by the listener. The differences are ontological. They affect which
things exist in audition and what properties they have, including their interactional
affordances. We believe that this is an interesting and possibly new way of framing
auditory experiences in anthropology. One of the problems our distinction could
address is the relevance and the extension of the concept of “music”.
can also be apprehended through the other two listening modes. But, if there is anything
specific to it, it is probably due to its privileged link with auditory enchantment.
61 We believe that the general properties we described for this way of listening – the
ontological shift and the mapping of agency onto the sound realm – can account for many
effects attributed to “music”. Specific agencies operate in “music” because the things it is
made of have particular ontological properties. But we can intentionally switch back and
forth between this and other ways of listening. In other words, the enchanted alternative
is always (just) an option. As with the other alternatives, it is adopted by an individual,
often according to culturally formed suggestions. One of these suggestions could be the
ontological category of the sound producer – enchanted listening is linked to human
sound productions in some societies – although it cannot be considered a universal
condition. The same holds for criteria such as organization. We do not see these as
constituting a specific kind of object or activity (“music”). We see them as cultural
determinants that tend to orientate the listeners in given contexts towards specific ways
of listening.
62 Music is not universal, in any sense of the word. But enchanted listening is, as a capacity
to consider a distinct realm where sounds interact primarily with each other. If this is
true, we should also question the implicit assumption that what people describe as
colours, movements or beings in sound are “in the end” frequencies, amplitudes and
spectral components of air waves. It should be possible to take people seriously and give a
positive empirical status to the enchanted things and beings that appear at times in their
auditory experiences.
BIBLIOGRAPHY
ALAIN CLAUDE, STEPHEN R. ARNOTT and others, 2000.
“Selectively attending to auditory objects,” Frontiers in Bioscience no. 5, D202–D212.
BONNEL ANNE-MARIE, FRÉDÉRIQUE FAITA, ISABELLE PERETZ & MIREILLE BESSON, 2001.
“Divided attention between lyrics and tunes of operatic songs: Evidence for independent
processing,” Perception & Psychophysics no. 63/7, pp. 1201–13.
COHEN MICHAEL A., PATRICK CAVANAGH, MARVIN M. CHUN & KEN NAKAYAMA, 2012.
“The attentional requirements of consciousness,” Trends in Cognitive Sciences no. 16/9, pp. 411–7.
FENN KIMBERLY M., HADAS SHINTEL, ALEXANDRA S. ATKINS and others, 2011.
“When less is heard than meets the ear: Change deafness in a telephone conversation,” The
Quarterly Journal of Experimental Psychology, no. 64/7, pp. 1442–56.
GORDON REYNA L., DANIELE SCHÖN, CYRILLE MAGNE and others, 2010.
“Words and Melody Are Intertwined in Perception of Sung Words: EEG and Behavioral Evidence,”
PLoS ONE, no. 5, e9889.
IRSIK VANESSA C., CHRISTINA M. VANDEN BOSCH DER NEDERLANDEN & JOEL S. SNYDER, 2016.
“Broad attention to multiple individual objects may facilitate change detection with complex
auditory scenes,” Journal of Experimental Psychology: Human Perception and Performance no. 42,
pp. 1806–17.
MARTINELLI DARIO, 2009. Of birds, whales, and other musicians: An introduction to zoomusicology,
Scranton/Chicago, IL, University of Scranton Press.
Cross (eds.), Representing Musical Structure, London/San Diego/New York, Academic Press,
pp. 129–159.
PILCHER JUNE J., KRISTEN S. JENNINGS, GINGER E. PHILLIPS & JAMES A. MCCUBBIN, 2016.
“Auditory Attention and Comprehension During a Simulated Night Shift: Effects of Task
Characteristics,” Human Factors, no. 58/7, pp. 1031–43.
SNYDER JOEL S., MELISSA K. GREGG, DAVID M. WEINTRAUB & CLAUDE ALAIN, 2012.
“Attention, Awareness, and the Perception of Auditory Scenes,” Frontiers in Psychology, no. 3.
Available online: ncbi.nlm.nih.gov/pmc/articles/PMC3273855/, last accessed November 2017.
NOTES
1. We are particularly grateful to the participants in the workshop “Sonic beings? The
ontologies of musical agency”, which we convened at the EASA Conference 2012. At the
Research Centre for Ethnomusicology in Nanterre (CREM-LESC/CNRS/UMR 7186) and the
Institute of Ethnomusicology in Graz (University of Music and Performing Arts), many
colleagues and students helped us shape our argument over the years. We received
important intellectual contributions from Estelle Amy de la Bretèque, Emmanuel de
Vienne, Matei Candea, Malik Sharif and Thibaud Aimard-Kesraoui, who reviewed in detail
and discussed with us preliminary versions of this text. We are also grateful to the
anonymous reviewers who expressed helpful comments on our proposal.
2. Attention plays a crucial role at later stages of auditory scene analysis. It actually also
modulates “from the top down” some very early processes of stream segregation
(Caporello Bluvas & Gentner 2013; Zobel et al. 2015).
3. This may seem similar to (and is probably inspired by) Pierre Schaeffer’s discussion of
preobjective modes of listening (Schaeffer 1966: 113 sqq.). Our study differs in method
and ethnographic coverage, but the most important distinction will perhaps appear in
relation to “enchanted listening”. In our analysis, the suspension of indexical and
structural/semantic interpretations is not a “reduced” listening, as Schaeffer posits, but
rather the “augmented” experience of a new auditory realm. We agree with Solomos
(1999) in his argument that Schaeffer did not actually consider the dissolution of the
“sound object” into distinct ontologies dependent on the listener’s system of knowledge
and intentions.
4. “The affordances of the environment are what it offers the animal, what it provides or
furnishes, either for good or ill” (Gibson 1986: 127). As summarized by Gibson himself, the
core of his thesis is that “the composition and layout of surfaces constitute what they
afford. If so, to perceive them is to perceive what they afford.” In other words, appraising
action possibilities does not occur after perception but right within it. A liquid surface, for
example, is “sink-into-able” for heavy mammals but “stand-on-able” for water bugs.
Mammals and bugs never actually perceive the same surface.
5. This is true as long as the extension is within the range of possibility: compare the
string quartet recording with a progressive rock album, where “stereo effects” are
employed, so that, for example, the guitar solo circles around the listener or the drums
jump from right to left: in that case, the upstream inference of indexical listening would
invoke a space with flying guitarists and teleporting drums. This space is not possible,
because it cannot exist without changing the ontological properties of reality (on
possibility as a mode of existence, see Souriau 2009: 134 sqq.). If such ontological shifts in
space occur, we are confronted with another kind of auditory space that is explored in
detail in the section about enchanted listening.
6. This competence probably needs learning. Babies, for example, are initially more
sensitive to vocal pitch than adults. They must learn to lose some of this sensitivity in
order to acquire language (Sacks 2007: 138; Saffran & Griepentrog 2001).
7. Sacks (2007: 182) relates the following discussion with Michael Torke, a “true
synaesthete” (who happens to be a composer). Torke explained to Sacks that he vividly
saw the colour blue when he heard a D-major chord. Sacks asked Torke what would
happen if he listened to D-major when looking at a yellow wall. Would he see green?
Torke’s answer was negative: both the musical and the visual colours were “true” colours
for him, but they would not mix together. This indicates that even for “true
synaesthetes”, auditory colours remain distinct from optical ones.
8. To illustrate: “a flute, no less than an axe, is a tool, an element in a technical sequence;
but its purpose is to control and modify human psychological responses in social settings,
rather than to dismember the bodies of animals” (Gell 1988: 6).
ABSTRACTS
This essay identifies and describes three ways of listening that are available to all human beings.
Beforehand, we argue that the concept of “sound”, as borrowed from acoustics and commonly
used in anthropology, is too vague and too limited. In order to be able to understand the full
range of human auditory experiences as found in ethnography, as well as the social interactions
which they afford, we propose a distinction of at least three postures of listening. We define
these as “indexical”, “structural” and “enchanted”, by contrasting their interactional salience in
various settings. The auditory “things” that exist for each of the three stances (their ontologies)
are also shown to be different. This trichotomy provides a promising theoretical framework for
some longstanding problems in anthropology. After discussing some critical questions and
possible shortcomings of our model, we conclude by looking closely at one of these issues: the
definition of “music” and its ethnographic relevance throughout the world.
INDEX
Keywords: audition, sound, ontology, language, music, enchantment, agency