Christopher R. Fetsch,¹ Suhrud M. Rajguru,¹ Anuk Karunaratne,¹,² Yong Gu,¹ Dora E. Angelaki,¹,²,* and Gregory C. DeAngelis³,*
¹Departments of Anatomy and Neurobiology and ²Biomedical Engineering, Washington University School of Medicine, St. Louis, Missouri; and ³Department of Brain and Cognitive Sciences, Center for Visual Science, University of Rochester, Rochester, New York
Submitted 21 November 2008; accepted in final form 10 July 2010
Fetsch CR, Rajguru SM, Karunaratne A, Gu Y, Angelaki DE, DeAngelis GC. Spatiotemporal properties of vestibular responses in area MSTd. J Neurophysiol 104: 1506–1522, 2010. First published July 14, 2010; doi:10.1152/jn.91247.2008. Recent studies have shown that many neurons in the primate dorsal medial superior temporal area (MSTd) show spatial tuning during inertial motion and that these responses are vestibular in origin. Given their well-studied role in processing visual self-motion cues (i.e., optic flow), these neurons may be involved in the integration of visual and vestibular signals to facilitate robust perception of self-motion. However, the temporal structure of vestibular responses in MSTd has not been characterized in detail. Specifically, it is not known whether MSTd neurons encode velocity, acceleration, or some combination of motion parameters not explicitly encoded by vestibular afferents. In this study, we have applied a frequency-domain analysis to single-unit responses during translation in three dimensions (3D). The analysis quantifies the stimulus-driven temporal modulation of each response as well as the degree to which this modulation reflects the velocity and/or acceleration profile of the stimulus. We show that MSTd neurons signal a combination of velocity and acceleration components with the velocity component being stronger for most neurons. These two components can exist both within and across motion directions, although their spatial tuning did not show a systematic relationship across the population. From these results, vestibular responses in MSTd appear to show characteristic features of spatiotemporal convergence, similar to previous findings in the brain stem and thalamus. The predominance of velocity encoding in this region may reflect the suitability of these signals to be integrated with visual signals regarding self-motion perception.

INTRODUCTION
The dorsal subdivision of the medial superior temporal area (MSTd) is well known as an area that processes complex optic flow patterns, such as those experienced during self-motion (Duffy and Wurtz 1995, 1991; Tanaka and Saito 1989; Tanaka et al. 1986; Warren 2003). Previous work has also shown that MSTd neurons are selective for the direction of physical motion in darkness (Bremmer et al. 1999; Duffy 1998; Page and Duffy 2003), suggesting that this area may contribute to the integration of visual and nonvisual self-motion cues. More recently, the responses of MSTd neurons to translation and rotation have been characterized in 3D (Gu et al. 2006; Takahashi et al. 2007). Labyrinthectomy experiments (Gu et al. 2007; Takahashi et al. 2007) strongly suggest that MSTd responses during physical motion originate from the vestibular labyrinths, and these responses have been linked to perceptual performance in monkeys trained to report their direction of self-motion (Gu et al. 2007, 2008, 2010). However, previous studies have quantified vestibular responses in MSTd based only on mean firing rate (MFR) within a particular stimulus interval (Duffy 1998; Fetsch et al. 2007; Gu et al. 2006; Page and Duffy 2003; Takahashi et al. 2007). A more complete understanding of the nature of vestibular signals in MSTd requires a detailed analysis of their spatiotemporal dynamics.
Primary otolith afferents encode linear acceleration of the head in space (Fernandez and Goldberg 1976; Loe et al. 1973; Si et al. 1997). In contrast, central neurons as early as the brain stem vestibular nuclei (VN) and vestibulo-cerebellum show a diversity of response dynamics during translation. The majority of these neurons appear to encode velocity or combinations of acceleration and velocity (Angelaki and Dickman 2000; Dickman and Angelaki 2002; Musallam and Tomlinson 2002; Shaikh et al. 2005; Yakusheva et al. 2008). Response dynamics have also been shown to vary with the direction of motion (Angelaki and Dickman 2000; Musallam and Tomlinson 2002). Using position-transient stimuli, Musallam and Tomlinson (2002) observed spatially asymmetric temporal integration in a population of VN neurons. These neurons showed velocity-like responses to translation in one direction, but acceleration-like responses during motion in the opposite direction. The authors propose a cellular mechanism to explain this asymmetry, but its function remains unknown. Angelaki and Dickman (2000) have reported a similar variation of temporal dynamics across motion directions for VN neurons, suggesting a form of spatiotemporal filtering via convergence of afferents with different spatial and temporal response properties (Angelaki 1993a,b).
Beyond the VN, very little is known about the full range of vestibular response dynamics present in the brain and even less about the functional roles of these diverse signals for self-motion perception and spatial orientation. An important step toward addressing these questions is to characterize response dynamics at multiple sites along the vestibular sensory pathway. To this end, Meng et al. (2007) recorded translation responses in the primate thalamus, a probable source of vestibular inputs to cortical regions such as MSTd. Similar to the VN and cerebellar nuclei, thalamic neurons showed a broad distribution of response phases during motion in their preferred direction (Meng et al. 2007). A goal of the present study was to continue tracking the evolution of otolith-driven signals into the cortex, specifically in area MSTd.
Whether MSTd neurons encode velocity, acceleration, or a combination of kinematic variables remains unclear. Gu and colleagues (2006) reported that >90% of MSTd responses to translation in the preferred direction were better correlated with

* D. E. Angelaki and G. C. DeAngelis contributed equally to this work.
Address for reprint requests and other correspondence: G. C. DeAngelis, Meliora 315, Dept. of Brain and Cognitive Sciences, University of Rochester, Rochester, NY 14627-0268 (E-mail: gdeangelis@cvs.rochester.edu).
1506 0022-3077/10 Copyright © 2010 The American Physiological Society www.jn.org
VESTIBULAR RESPONSES IN AREA MSTd 1507
METHODS
DFTR was defined as the ratio of the average amplitude from 0.5 to 2.5 Hz to the average amplitude from 3 to 25 Hz

$$\mathrm{DFTR} = \frac{\frac{1}{5}\sum_{i=2}^{6} \mathrm{amp}_i}{\frac{1}{44}\sum_{i=7}^{50} \mathrm{amp}_i} \tag{1}$$

Here, amp_i is the amplitude of the ith frequency component, ranging from 0 to 24.5 Hz in increments of 0.5 Hz [a 2-s window with a sampling rate of 50 Hz (20-ms bins) gives a 100-point DFT and a Nyquist frequency of 25 Hz, yielding 50 components at 0.5-Hz intervals]. We chose these particular frequency ranges because most of the amplitude in the stimulus was below 3 Hz (99.7% for the velocity profile and 99.1% for the acceleration profile; Fig. 2, D–F). It is thus reasonable to assume that for a given response, any frequency component >2.5 Hz was not driven by the stimulus. Accordingly, the DFTR provides a measure of the stimulus-driven temporal modulation (low-frequency components) relative to the noise (high-frequency components) present in the response. In principle this ratio can range from 0 (pure high-frequency noise) to an arbitrarily high number (a noiseless low-frequency response), but in practice, the range for our data set was 0.29–7.80.
The statistical significance of the DFTR was assessed for each neuron and each stimulus direction using a permutation test based on 1,000 random reshufflings of binned spike counts. From visual inspection of hundreds of individual PSTHs from our sample (blind to the results of the permutation test), a significance criterion of P < 0.01
FIG. 2. Example temporal response profiles and discrete Fourier transform ratio (DFTR) analysis. A–C: example PSTHs for a velocity-like response (A), an acceleration-like response (B), and a weakly modulated response (C) with temporal envelopes (red traces) superimposed. The delay of each response relative to the stimulus was calculated as the center of mass of the temporal envelope, and the 2-s window centered on this delay was selected for the DFTR analysis (black bars). Acceleration (cyan) and velocity (green) traces are shown (normalized, vertically offset, and shifted horizontally by the delay) for display purposes, to illustrate their similarity (or lack thereof) with the neural response. D–F: Fourier amplitude spectra (black lines) for the responses shown in A–C, respectively. Amplitude spectra for stimulus velocity (green) and acceleration (cyan) profiles are shown for comparison (normalized to the peak value of the response amplitude spectrum in each plot). The DFTR is defined as the ratio A/B, where A is the mean amplitude of the frequency components below and including 2.5 Hz (gray area), and B is the mean amplitude of the components >2.5 Hz (yellow area). G–I: phase spectra for the responses shown in A–C for low-frequency components only (0.5–2.5 Hz). Phase spectra for the velocity (green) and acceleration (cyan) stimulus profiles are also shown.
was chosen for this test because it produces a classification of significant temporal responses that generally agrees well with qualitative classification by eye. In addition, DFTR values were only considered significant if they were greater than the DFTR computed from “null” trials in which the motion platform remained stationary while the monkey was required to fixate the target. This additional criterion was designed to reject a small proportion of cases in which responses to inertial motion showed some structure but did not exceed the temporal modulation observed during fixation alone, for instance due to lingering visual responses to the fixation point or responses time locked to the onset of ocular fixation. Out of 10,270 total directions tested (395 cells × 26 directions), 4,196 responses (40.9%) passed the permutation test, and 179 (1.7%) were eliminated by the “greater-than-null” criterion.
To quantify the degree to which stimulus velocity and/or acceleration were reflected in the neural responses, we computed velocity and acceleration DFTR “components” (DFTRvel and DFTRacc) as follows. The amplitude of each low-frequency component (0.5–2.5 Hz) was rescaled by multiplication with the negative cosine (velocity) or sine (acceleration) of its phase angle (φ). The velocity and acceleration DFTR components were then computed from the rescaled amplitudes

$$\mathrm{DFTR}_{vel} = \frac{\frac{1}{5}\sum_{i=2}^{6} -\cos(\varphi_i) \cdot \mathrm{amp}_i}{\frac{1}{44}\sum_{i=7}^{50} \mathrm{amp}_i} \tag{2}$$

$$\mathrm{DFTR}_{acc} = \frac{\frac{1}{5}\sum_{i=2}^{6} \sin(\varphi_i) \cdot \mathrm{amp}_i}{\frac{1}{44}\sum_{i=7}^{50} \mathrm{amp}_i} \tag{3}$$

The logic behind scaling the amplitudes by the −cosine or sine of the phase angle can be summarized as follows. Note that the dominant phase of the Gaussian stimulus velocity profile is 180°, whereas the dominant phase of the acceleration profile is 90° (Fig. 1A). Thus frequency components that reflect stimulus velocity will have a phase near 180°. Such components will contribute near-maximally to the DFTRvel [−cos(180°) = 1] but contribute very little to the DFTRacc [sin(180°) = 0], and vice versa for components that reflect acceleration (phase near 90°). Using these definitions, negative DFTRvel or DFTRacc values will arise naturally when a cell shows an inhibitory (suppressed) response in correspondence with the velocity [phase near 0° (or 360°)] or acceleration [phase near −90° (or 270°)] stimulus profiles. Motion directions were considered to have statistically significant velocity or acceleration DFTR components if 1) their corresponding “total” DFTR was significant (see preceding criteria) and 2) they passed a separate permutation test at a threshold of P < 0.01.
Phase analysis is complicated by the fact that neuronal responses are always delayed to some degree with respect to the stimulus (i.e., due to sensory transduction, synaptic and conduction delays, and local processing time). If we had used a fixed analysis window (for instance, corresponding to the 2-s stimulus duration), phase angles computed for each response would be affected by response delays, which vary across neurons and even across stimulus directions. Thus to dissociate delay from actual response dynamics, the analysis window for the DFT was shifted such that it was centered on the midpoint of the response profile, defined as the center of mass of the temporal envelope. In signal processing, the envelope of a real-valued function (time series) is the magnitude of its analytic signal, or equivalently its amplitude-modulating waveform. To compute the envelope, we used a standard approach (Bracewell 1978; Gabor 1946) that has previously been applied to the analysis of visual receptive fields (DeAngelis et al. 1993; Field and Tolhurst 1986). Briefly, we first subtracted the baseline response (mean firing rate in a 200-ms window just prior to stimulus onset) and smoothed each PSTH using a 60-ms (3-bin) boxcar function. From the smoothed, baseline-subtracted version of the PSTH, denoted R(t), we computed the Hilbert transform H[R(t)] using the Matlab function hilbert (version 6.5, Signal Processing Toolbox), which shifts all frequency components by 90°. The temporal envelope E(t) was then computed as

$$E(t) = \sqrt{R(t)^2 + H[R(t)]^2} \tag{4}$$

To illustrate the procedure in more detail, consider the stimulus velocity and acceleration profiles themselves (Fig. 1A) as example “signals” from which we wish to extract the temporal envelope. In Fig. 1B, the velocity profile V(t) (dashed curve) is re-plotted along with its Hilbert transform H[V(t)] (dotted curve). These two functions can be considered as a pair of orthogonal vectors (known as a quadrature pair), where the angle between them represents the 90° phase shift due to the Hilbert transform. The magnitude of their vector sum (i.e., the square root of the sum of squares; Eq. 4) is the envelope E(t), depicted as the solid curve. Similarly, the acceleration profile of the stimulus and its Hilbert transform (Fig. 1C, dashed and dotted curves, respectively) form a quadrature pair whose vector sum comprises the temporal envelope of acceleration (solid curve, Fig. 1C). Note that the temporal envelopes of the velocity and acceleration stimulus profiles are very similar in shape: the envelope peak coincides with the maximum velocity and the zero-crossing of acceleration, both at time t = 1.0 s. Thus centering the analysis window using the temporal envelope serves to capture the relevant temporal modulation in the response, while being largely unbiased with respect to velocity versus acceleration components.
For a given neuronal response profile (PSTH), the difference (in units of time) between the envelope center of mass and time t = 1.0 was defined as the delay. This procedure was applied to each PSTH separately (e.g., see Fig. 2, A–C), allowing the responses to different motion directions to have independent delays. It is important to note that delay, by this definition, does not correspond to neuronal latency per se. Traditional measures of latency are difficult to compute in the case of a gradual stimulus onset (i.e., the early tail of our Gaussian velocity profile), and in any case such a measure would not be useful for our purposes. Rather, the DFTR method required us to quantify the delay of the overall response profile (temporal modulation of firing rate) relative to the stimulus, without assuming a particular shape or frequency content of the response. The Hilbert transform approach is well suited for this purpose.
As mentioned in the preceding text, significance of the DFTR velocity and acceleration components was determined using a permutation test in which the temporal delay derived from the Hilbert transform was constant across permutations. We also tested significance using a bootstrap test in which the temporal delay was determined for each resampling of the data. This produced similar proportions of PSTHs with significant velocity and acceleration components, and all of the results were similar when responses were classified in this fashion.

Spatial tuning
Spatial tuning maps were generated for each cell by plotting the mean response as a function of the azimuth and elevation of the motion trajectory. Here, “response” could be defined as MFR, DFTRvel, or DFTRacc. MFR was computed within a fixed 2-s window beginning with stimulus onset. Computing MFR in the 2-s window used for the DFT (i.e., centered on the temporal envelope) did not appreciably change the pattern of results. The statistical significance of spatial tuning was quantified by a one-way ANOVA performed on the MFR/DFTR values derived from each individual trial. Computing DFTR based on single-trial PSTHs (rather than across-trial averaged PSTHs as described in the preceding text) was necessary to provide single-trial metrics for the ANOVA. Note that when computing single-trial versions of DFTRvel and DFTRacc, we shifted the single-trial PSTHs by the delay computed from the original temporal envelope (from the averaged PSTH) to reduce uncertainty in the delay estimate.
A preferred direction was computed for all four response metrics (MFR, DFTR, DFTRvel, and DFTRacc) as follows. Each MFR/DFTR value was considered to represent the magnitude of a 3D vector the direction of which was defined by the azimuth and elevation of the corresponding motion stimulus. These vectors were summed, and the direction of the resultant vector was taken as the preferred direction. Tuning maps were also compared for each cell across response metrics (e.g., DFTRvel vs. MFR, DFTRacc vs. MFR, and DFTRvel vs. DFTRacc) via a simple correlation coefficient.
To quantify spatial tuning strength, we calculated a direction discrimination index (DDI) (Nguyenkim and DeAngelis 2003; Prince et al. 2002; Takahashi et al. 2007; Uka and DeAngelis 2003), which measures peak-to-trough response modulation (due to changes in motion direction) relative to response variability

$$\mathrm{DDI} = \frac{R_{max} - R_{min}}{R_{max} - R_{min} + 2\sqrt{SSE/(N - M)}} \tag{5}$$

DDI was computed separately for the four response metrics (MFR, DFTR, DFTRvel, or DFTRacc). In each case, Rmax and Rmin refer to the maximum and minimum MFR or DFTR values, respectively, using data averaged across repetitions. N is the total number of trials, M is the number of motion directions tested (26), and SSE is the total sum-squared error around the mean responses for all motion directions. Thus computing DDI based on DFTR values also required the analysis of single-trial PSTHs, as was the case for the spatial tuning ANOVA (see preceding text).

Validation of the DFTR method using simulated data
We evaluated the accuracy and robustness of the DFTR method for estimating velocity and acceleration components by testing it with simulated neuronal response profiles for which the true mixture of velocity and acceleration components was known. These simulated profiles, denoted R(t), were computed as the weighted sum of a normalized Gaussian (mimicking the velocity profile of the stimulus) and its derivative (the biphasic acceleration profile)

$$\begin{aligned}
r_{vel}(t) &= e^{-\frac{(t - \mu)^2}{2\sigma^2}} \\
r_{acc}(t) &= \frac{d}{dt}\, r_{vel}(t) \\
R(t) &= \left[ w_{vel}\, r_{vel}(t) + w_{acc}\, r_{acc}(t) + DC + c\nu \right]_+
\end{aligned} \tag{6}$$

The Gaussian velocity component, rvel(t), was assigned a mean (μ = 1.0 s) and SD (σ = 0.2 s) approximating the stimulus velocity profile (see Fig. 1A). The values of rvel(t) ranged from 0 to 1, and its derivative, racc(t), was normalized to a range from −1 to 1. The weighted sum, R(t), of these two normalized components was then computed. The weights wvel and wacc also ranged from −1 to 1, with negative weights representing inhibitory responses and positive weights representing excitatory responses (relative to baseline). The two weights were allowed to have different signs, but to maintain proper normalization the sum of their absolute values was constrained to be 1. The DC offset term represents the baseline response (i.e., spontaneous activity), ranging from 0 to 1. The final term represents Gaussian noise (ν) drawn from a standard normal distribution, multiplied by a scaling factor c. Varying this scaling factor had little effect on the main results of the simulation. For the results presented here, we fixed the value of c at 0.14, which typifies the noise level observed for a typical well-modulated neuron from our sample (e.g., Fig. 2, A–C) as quantified by the denominator of the DFTR (see Eq. 1). Last, the subscript “+” indicates half-wave rectification, simulating the “clipping” effect at zero firing rate.
We varied wvel (and consequently wacc) in fine steps from −1 to 1 and subjected each simulated response profile to the DFTR analysis described in the preceding text (Eqs. 1–3). We then compared the relative magnitudes of the resulting DFTR components (DFTRvel and DFTRacc; Eqs. 2 and 3) to the corresponding weights that were used to produce each simulated profile. This comparison can be thought of as the input-output relationship of the DFTR method: the “inputs” are wvel and wacc from Eq. 6, and the “outputs” are the normalized velocity and acceleration DFTR components, defined as follows

$$\begin{aligned}
\mathrm{DFTR}_{vel(norm)} &= \frac{\mathrm{DFTR}_{vel}}{|\mathrm{DFTR}_{vel}| + |\mathrm{DFTR}_{acc}|} \\
\mathrm{DFTR}_{acc(norm)} &= \frac{\mathrm{DFTR}_{acc}}{|\mathrm{DFTR}_{vel}| + |\mathrm{DFTR}_{acc}|}
\end{aligned} \tag{7}$$

Note that these normalized DFTR components should be comparable to wvel and wacc from Eq. 6 because they also range from −1 to 1 and their absolute values sum to 1. Thus if the DFTR analysis perfectly measures the relative strength of velocity and acceleration components in the response, then the weights from Eq. 6 should match the normalized DFTR values from Eq. 7. This comparison will be discussed in the following text in RESULTS (Fig. 10).

Curve fitting analysis
As a complement to the DFTR approach, we performed a curve fitting analysis in which each PSTH was modeled as a weighted sum of velocity and acceleration components. The fitting procedure minimized the mean-squared difference between the data and the values of the following equation

$$R(t) = \left\{ A \cdot \left[ w_{vel}\, r_{vel}(t) + \mathrm{sign}_{acc} \cdot w_{acc}\, r_{acc}(t) \right] + DC \right\}_+ \tag{8}$$

This equation is similar to Eq. 6 but with a few key differences. First, the mean and SD of the Gaussian velocity component rvel (μ and σ in Eq. 6, respectively) were free parameters rather than being fixed to the values of the velocity profile itself. The noise term cν was removed, and an amplitude parameter A was added to scale the responses appropriately (PSTHs were not normalized prior to fitting). The velocity weight (wvel) varied from −1 to 1, wacc was constrained to be equal to (1 − |wvel|), and the parameter signacc was used to generate fits with positive or negative acceleration components. We fit two versions of the model, one with signacc = 1 and the other with signacc = −1, and the version with the smaller error was taken as the best fit for each PSTH. In preliminary testing, we found that a model in which both weights ranged from −1 to 1 was too poorly constrained for many PSTHs, causing convergence problems to arise during the fitting procedure (i.e., frequent local minima and/or large correlations between parameters).

RESULTS
Temporal response properties
The analyses presented here were performed on responses from 395 MSTd neurons recorded from three monkeys. We recorded from any well-isolated neuron in MSTd that was spontaneously active or responded to a large flickering field of random dots (for details, see Fetsch et al. 2007; Gu et al. 2006; Takahashi et al. 2007); neurons were not prescreened for vestibular responsiveness before inclusion in the current data set. For each motion direction, we computed an average PSTH using 20-ms bins. Figure 2, A–C, shows three example PSTHs: a velocity-like response (A), an acceleration-like response (B), and a poorly modulated (nonsignificant) response profile (C).
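To make the distinction between velocity-like and acceleration-like profiles concrete, the DFTR analysis of Eqs. 1–3 can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' Matlab code: the function names (`dft`, `dftr_components`, `synthetic_profile`) and the synthetic test signals are hypothetical, and the phase convention (phase referenced to the center of the 2-s window, with φ = 180° minus the DFT angle, so that a centered Gaussian has φ = 180° and its derivative φ = 90° at every low frequency) is one reading that reproduces the dominant phases quoted in METHODS. How the original analysis handled the higher harmonics is not stated in the text.

```python
import cmath
import math
import random

N_BINS = 100      # 2-s analysis window in 20-ms bins
BIN_S = 0.02      # bin width (s)


def dft(x, n_freqs):
    """Direct DFT for the first n_freqs frequencies (standard library only)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n_freqs)]


def dftr_components(profile):
    """Return (DFTR, DFTRvel, DFTRacc) following Eqs. 1-3."""
    n = len(profile)
    spec = dft(profile, n // 2)                      # 0 to 24.5 Hz in 0.5-Hz steps
    # ASSUMPTION: phase is referenced to the window centre and defined as
    # phi = 180 deg - angle, so a centred Gaussian has phi = 180 deg and its
    # derivative has phi = 90 deg at every low frequency.
    centred = [c * cmath.exp(2j * math.pi * k * (n // 2) / n)
               for k, c in enumerate(spec)]
    low = centred[1:6]       # i = 2..6 in Eq. 1  (0.5-2.5 Hz)
    high = centred[6:50]     # i = 7..50 in Eq. 1 (3.0-24.5 Hz)
    noise = sum(abs(c) for c in high) / len(high)
    phis = [math.pi - cmath.phase(c) for c in low]
    total = sum(abs(c) for c in low) / len(low) / noise
    vel = sum(-math.cos(p) * abs(c) for p, c in zip(phis, low)) / len(low) / noise
    acc = sum(math.sin(p) * abs(c) for p, c in zip(phis, low)) / len(low) / noise
    return total, vel, acc


def synthetic_profile(kind, noise_sd=0.14, seed=0):
    """Hypothetical test signal: a Gaussian (velocity-like) or its derivative."""
    rng = random.Random(seed)
    out = []
    for i in range(N_BINS):
        z = (i * BIN_S - 1.0) / 0.2          # mean 1.0 s, SD 0.2 s
        g = math.exp(-0.5 * z * z)
        signal = g if kind == "velocity" else -z * g * math.exp(0.5)
        out.append(signal + rng.gauss(0.0, noise_sd))
    return out


vel_total, vel_v, vel_a = dftr_components(synthetic_profile("velocity"))
acc_total, acc_v, acc_a = dftr_components(synthetic_profile("acceleration"))
print(f"velocity-like:     DFTR={vel_total:.2f} vel={vel_v:.2f} acc={vel_a:.2f}")
print(f"acceleration-like: DFTR={acc_total:.2f} vel={acc_v:.2f} acc={acc_a:.2f}")
```

Under this convention the velocity component is the part of the low-frequency response that is even about the window center and the acceleration component is the odd part, which is why the Gaussian test signal loads almost entirely on DFTRvel and its derivative on DFTRacc.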
J Neurophysiol • VOL 104 • SEPTEMBER 2010 • www.jn.org
37.5 21.2 (56.7) 11.6 (30.9) 15.5 (41.4) 5.9 (15.6) 5.7 (15.2) 10.4 (27.7)
Percent of all motion directions (n = 395 cells × 26 directions = 10,270) passing the discrete Fourier transform ratio (DFTR) significance criteria in each possible category. ‘Only’ refers to directions with exclusively one component (velocity or acceleration) significant and not the other. Numbers in parentheses indicate percentages computed as a fraction of directions with a significant DFTR, and hence the percentages in the last four columns add up to 100.
and 0.29 for velocity and acceleration, respectively. Similarly, the mean range (max–min) across cells was 0.88 (velocity) and 0.85 (acceleration). This degree of within-cell variation in the velocity and acceleration DFTR components indicates that the temporal dynamics of MSTd vestibular responses cannot adequately be classified based on one or even a few motion directions.
The preceding analysis raises several questions: is there a pattern to the spatial variation in temporal dynamics, and more specifically, what is the relationship between the temporal and spatial information contained in these responses? To address these questions, we compared the 3D spatial tuning of velocity and acceleration components to that of mean firing rate (MFR). Figure 6 shows the spatial tuning of two example neurons (A–D and E–H). The contour plots show response strength as a function of azimuth and elevation, where response strength is defined by (from top to bottom): MFR (A and E), DFTRvel (B and F), or DFTRacc (C and G). Figure 6, D and H, illustrates average PSTHs for the cells’ responses to translation in the horizontal plane, which corresponds to a cross section through the 3D tuning profiles at zero elevation. As in Fig. 2, stimulus velocity and acceleration profiles are overlaid for visual comparison.
Example cell 1 (Fig. 6, A–D) shows a strong velocity-like (Gaussian) response in one direction (azimuth 45°), but a more acceleration-like (biphasic) response in the opposite direction (azimuth 225°). This opposite directional tuning for velocity and acceleration response components can be observed in the PSTHs (Fig. 6D) and is clearly seen in the 3D tuning maps (compare Fig. 6, B and C). Also note that the directional tuning for mean firing rate (Fig. 6A) was well matched to that of the velocity component but not the acceleration component.
For example cell 2 (Fig. 6, E–H), all three spatial tuning maps (MFR, velocity, acceleration) are fairly well aligned. This can occur when the response to a given direction of motion carries both velocity and acceleration signals. The PSTHs (Fig. 6H) for 0 and 45° azimuth can be described as mixtures of velocity and acceleration, showing an initial peak followed by a modest suppression below baseline. Indeed, the DFTRvel and DFTRacc of the 0° response were 1.7 and 1.4, respectively (P < 0.001 for both), and for the 45° response they were 2.0 and 1.8 (P < 0.001 for both). Moreover, responses in the opposite directions (180 and 225°) resemble an inverted acceleration-like profile, resulting in a negative DFTRacc (−1.6 and −2.0, P < 0.001 for both). DFTRvel for the 180° response was also negative (−0.8, P = 0.01). In summary, this cell shows positive and negative velocity/accel-
FIG. 6. Spatial tuning of different temporal response components. Two example cells illustrating the relationship between 3-dimensional (3D) spatial tuning for the velocity and acceleration components of response, as well as for mean firing rate (MFR). 3D spatial tuning maps for each cell were calculated using MFR (A and E), DFTRvel (B and F), and DFTRacc (C and G). D and H: average peristimulus time histograms (PSTHs) for responses to translation in the horizontal plane, corresponding to a slice through the contour maps at elevation = 0. Stimulus velocity (solid black) and acceleration (dashed black) profiles are overlaid as in Fig. 2 (i.e., not fits to the data, but shifted by the measured response delay). [Axes: azimuth (deg) and elevation (deg) for the contour maps in A–C and E–G; time (s) and firing rate (sp/s) for the PSTHs in D and H.]
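Tuning maps such as those in Fig. 6 are summarized in METHODS by two quantities: a preferred direction (the direction of the vector sum of responses) and the direction discrimination index (Eq. 5). The sketch below illustrates both. It rests on assumptions that are not stated in this text: the 26 directions are taken to be 8 azimuths at elevations of −45°, 0°, and 45° plus the two poles, the azimuth/elevation-to-Cartesian mapping is one conventional choice, and the function names and example data are hypothetical.

```python
import math


def direction_grid():
    """26 directions: 8 azimuths x 3 elevations plus the two poles (assumed grid)."""
    dirs = [(az, el) for el in (-45, 0, 45) for az in range(0, 360, 45)]
    return dirs + [(0, -90), (0, 90)]


def unit_vector(az_deg, el_deg):
    """Assumed convention: x = cos(el)cos(az), y = cos(el)sin(az), z = sin(el)."""
    az, el = math.radians(az_deg), math.radians(el_deg)
    return (math.cos(el) * math.cos(az), math.cos(el) * math.sin(az), math.sin(el))


def preferred_direction(responses):
    """Vector sum of responses; `responses` maps (az, el) -> MFR or DFTR value."""
    sx = sy = sz = 0.0
    for (az, el), r in responses.items():
        ux, uy, uz = unit_vector(az, el)
        sx, sy, sz = sx + r * ux, sy + r * uy, sz + r * uz
    az = math.degrees(math.atan2(sy, sx)) % 360.0
    el = math.degrees(math.atan2(sz, math.hypot(sx, sy)))
    return az, el


def ddi(trials):
    """Eq. 5; `trials` maps each direction to its list of single-trial responses."""
    means = {d: sum(v) / len(v) for d, v in trials.items()}
    sse = sum((x - means[d]) ** 2 for d, v in trials.items() for x in v)
    n_trials = sum(len(v) for v in trials.values())      # N in Eq. 5
    spread = max(means.values()) - min(means.values())   # Rmax - Rmin
    return spread / (spread + 2.0 * math.sqrt(sse / (n_trials - len(trials))))


# Hypothetical cosine-tuned cell that prefers azimuth 45 deg, elevation 0 deg.
target = unit_vector(45, 0)
tuning = {}
for az, el in direction_grid():
    u = unit_vector(az, el)
    tuning[(az, el)] = 1.0 + sum(a * b for a, b in zip(u, target))

pref_az, pref_el = preferred_direction(tuning)
example_ddi = ddi({"dir_a": [1.0, 3.0], "dir_b": [5.0, 7.0]})
print(pref_az, pref_el, example_ddi)
```

Because the DDI denominator uses single-trial variability (SSE), the same function applies to MFR or to single-trial DFTR values, consistent with the requirement in METHODS that DFTR-based DDI be computed from single-trial PSTHs.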
and neither groups, the number of directions with at least one significant component was used. These distributions are plotted in Fig. 8B, showing that cells with significant spatial tuning for acceleration only (gray filled bars) had relatively few responses with significant DFTRacc values, compared with cells that showed significant spatial tuning for velocity only (black filled bars). The mean number of significant directions for the velocity-only category was 5.9, compared with 2.9 for the acceleration-only category (medians = 4 and 3, respectively; Mann-Whitney U test, P = 0.005). These data suggest that MSTd neurons encode stimulus acceleration over a narrower range of motion directions compared with the encoding of stimulus velocity. Note also that cells with both velocity and acceleration components tend to have the largest number of directions with significant response modulation (mean = 9.8, median = 8).

Limitations of the frequency-domain analysis method: analysis of simulated data
The frequency-domain analysis presented here represents one possible approach for estimating spatiotemporal properties of neuronal responses, and like any method, it is not without shortcomings. Perhaps the greatest pitfall when using phase information to estimate the relative strength of velocity and acceleration components is the potential for misestimating delay, and hence phase, due to nonlinearities in the response profiles. For example, if the baseline activity is relatively low, the trough of an acceleration-like response may be “clipped” at zero, causing it to appear more velocity-like.
To assess the accuracy of the DFTR method and its robustness to response clipping, we performed the temporal analyses (Eqs. 1–4) on simulated response profiles that were designed to reflect the diversity of temporal dynamics we observed in area MSTd (see METHODS). The simulated responses were modeled (Eq. 6) as weighted sums of normalized velocity and acceleration profiles, plus a DC offset term (representing baseline activity) and a Gaussian noise term. Some examples of simulated temporal responses are shown in Fig. 9, where each row corresponds to a different set of weights (wvel and wacc), and each column a different value of the DC offset term, resulting in various degrees of response clipping. For all combinations
The first result to take away from this analysis is that the DFTR method is nearly perfect at estimating the relative strength of velocity and acceleration components (i.e., the weights) when there is little or no clipping in the responses (leftmost column of Fig. 9; red and orange traces in Fig. 10). Interestingly, this accurate estimation of weights occurs despite variations in the estimated response delay that are not actually present in the simulated data (e.g., −44 ms when wvel = wacc = 0.5; Fig. 9). The simulated response profiles are assigned an actual delay of zero (i.e., the response for wvel = 1.0 peaks at the same time, t = 1.0 s, as the stimulus velocity profile), and hence one might expect that this discrepancy in the actual versus estimated delays would cause a misestimation of the velocity and acceleration components. In fact, the accuracy of the DFTR method depends on these “misestimated” delays: artificially fixing the delay at zero, rather than using the temporal envelope method, causes systematic errors in the DFTR analysis even in the absence of clipping (data not shown).
Second, even for more substantial amounts of response clipping (e.g., Fig. 9, middle), the DFTR method remains fairly accurate (Fig. 10, green trace). For example, a simulated acceleration-like response profile (wvel = 0, wacc = 1) with a DC offset of 0.5 results in clipping of 50% of the trough of the response, yet the relative magnitude of the DFTR components still indicates a largely acceleration-like response [DFTRvel(norm) = 0.15, DFTRacc(norm) = 0.85]. In contrast, for the case of 100% clipping (DC offset = 0), the DFTR method estimates this same pure acceleration response as nearly a 50/50 mixture of velocity and acceleration components [DFTRvel(norm) = 0.47, DFTRacc(norm) = 0.53]. This error arises from a misestimation of the delay based on the temporal envelope, which is shifted from the true value (0) by −109 ms due to clipping. Shifting the window for the DFT by this value amounts to a phase shift of ~20° at 0.5 Hz, ~40° at 1.0 Hz, etc. This phase shift of the low-frequency components drives DFTRvel upward and DFTRacc downward (see Eqs. 2 and 3), resulting in a substantial discrepancy between the true weights of velocity and acceleration in the synthetic data and the estimated DFTR components. However, even this extreme case is somewhat encouraging: in the near-complete absence of a trough in the response profile (due to clipping), the acceleration DFTR component remains slightly larger than the velocity component (0.53 vs. 0.47).
of wvel and wacc, we computed the DFTR components accord- Figure 10 summarizes the effect of response clipping across
ing to Eqs. 2 and 3, then normalized them according to Eq. 7 all values of wvel and wacc, showing that the DFTR method
to enable a direct comparison with the true weights that were becomes progressively less accurate as the DC offset parameter
assigned to the velocity and acceleration components in the decreases (i.e., potential clipping increases). Note that devia-
simulated responses. The inset text in each panel of Fig. 9 gives tions from the diagonal in Fig. 10A signify different things in
the delay in milliseconds (estimated from the temporal re- different quadrants: in the upper-right quadrant, moving up-
sponse envelope, Eq. 4) and the normalized velocity and ward off the diagonal indicates a velocity bias for positive-
acceleration components estimated using the DFTR method going (excitatory) responses, whereas in the lower-left quad-
(Eq. 7; abbreviated vel and acc in Fig. 9). A quantitative rant, the same deviation indicates an acceleration bias for
comparison between the estimated DFTR components and the negative-going (inhibitory) responses. Deviations in the upper-
actual weights used to create the synthesized data are shown in left quadrant—including the point of greatest departure from
Fig. 10 for velocity (A) and acceleration (B). Separate curves the diagonal (wvel ⫽ ⫺0.3)—mainly indicate a sign reversal of
are plotted for each level of the DC offset parameter in Eq. 6 the velocity component. Negative-going responses that are
(i.e., the amount of potential clipping). For simplicity of largely (but not completely) acceleration-like, when subjected
display, Figs. 9 and 10, A and B, do not show the symmetric to clipping, are interpreted by the DFTR as a mixture of a
combinations of negative velocity components with positive negative-going acceleration response with a positive-going
acceleration components, or vice versa; these results were velocity response, resulting in a (correctly) negative accelera-
similar and are shown in C. tion DFTR component but an (incorrectly) positive velocity
FIG. 9. Example simulated response profiles used to test DFTR method. Each row represents a particular combination of velocity and acceleration weights
used to synthesize the data (wvel and wacc, see Eq. 6), and each column represents different values of the DC offset parameter (i.e., different degrees of potential
response clipping at 0). In each panel, the text above and/or below the response profile indicates the estimated delay (ms) and the normalized DFTRvel and
DFTRacc values (abbreviated vel and acc) from the frequency-domain analysis method.
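As an illustrative sketch only (Eq. 6 itself is not reproduced in this section), simulated profiles of the kind described in the Fig. 9 caption could be generated along the following lines. The weighted-sum structure, the DC offset, the Gaussian noise term, clipping at zero, and the velocity peak at t = 1.0 s come from the text. The function name `simulate_response`, the Gaussian width `sigma`, the sampling step `dt`, the noise level `noise_sd`, and the choice to normalize each profile to unit peak are assumptions made here for illustration.

```python
import numpy as np

def simulate_response(w_vel, w_acc, dc_offset, noise_sd=0.0,
                      duration=2.0, dt=0.001, sigma=0.25, seed=None):
    """Hypothetical stand-in for the simulated responses described in the text.

    Returns the time axis, the clipped (non-negative) response, and the
    noise-free response before clipping.
    """
    rng = np.random.default_rng(seed)
    t = np.arange(0.0, duration, dt)
    t_peak = duration / 2.0                            # velocity peaks at t = 1.0 s

    # Gaussian velocity profile with unit peak (sigma is an assumed value).
    vel = np.exp(-0.5 * ((t - t_peak) / sigma) ** 2)

    # Acceleration is the time derivative of velocity, scaled to unit peak
    # (assumed normalization).
    acc = np.gradient(vel, dt)
    acc = acc / np.max(np.abs(acc))

    # Weighted sum plus DC offset (baseline activity).
    clean = w_vel * vel + w_acc * acc + dc_offset

    # Add Gaussian noise, then clip at zero because firing rates cannot be negative.
    noisy = clean + rng.normal(0.0, noise_sd, size=t.size)
    clipped = np.maximum(noisy, 0.0)
    return t, clipped, clean
```

Under these assumptions, a pure acceleration profile (w_vel = 0, w_acc = 1) with a DC offset of 0.5 has a noise-free trough at −0.5 before clipping, so half of the trough depth is lost, which corresponds to the 50% clipping case discussed in the text.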
DFTR component. This can be seen more explicitly in Fig. 10B, which plots the acceleration weights in the same format (see the corresponding point at wacc = −0.7, where the black trace crosses the unity diagonal). Overall, the same pattern of results is evident in Fig. 10, both A and B. For example, the velocity bias in the upper-right quadrant of Fig. 10A is seen as a downward deviation from the diagonal in B. Figure 10C displays the same results in a different format, where the abscissa represents wvel or DFTRvel(norm), the ordinate wacc or DFTRacc(norm), and the difference between the two methods (error) is represented by drawing a vector connecting the corresponding values. For clarity, only the three largest amounts of clipping (50, 75, and 100%) are shown, as the smaller two amounts are nearly error-free. This format permits displaying the symmetric combinations of positive velocity weights with negative acceleration weights (bottom-right quadrant), and vice versa (top-left quadrant).

The simulation results of Figs. 9 and 10 show that the DFTR method systematically misestimates the velocity and acceleration components when a substantial portion of the temporal
FIG. 10. Comparison of velocity and acceleration components estimated by DFTR and the true weights used to generate the simulated data. A: the normalized
DFTR velocity component [DFTRvel(norm), Eq. 7] is plotted against the true velocity weight of the simulated data (wvel in Eq. 6), averaged across 20 simulations
for each value of wvel. Colored traces indicate different values of the DC offset parameter in Eq. 6 (i.e., different potential amounts of response clipping). The
unity-slope diagonal represents a perfect input-output relationship, i.e., perfect estimation of the weights in the simulated profiles by the DFTR method. The
progressive deviation from this diagonal at decreasing DC offset values is caused by the loss of the downward lobe of acceleration-like responses (due to clipping)
and thus is large when wvel is near zero (true acceleration weight near 1 or −1). B: same comparison as in A but for the true acceleration weight (wacc in Eq.
6) and the normalized DFTR acceleration component. C: the same data as in A and B (although now including combinations of weights with opposite sign),
plotted as an error vector plot. The origin of each vector represents the [x, y] coordinate [wvel, wacc] and the termination (arrowhead) represents [DFTRvel(norm),
DFTRacc(norm)]. Thus the length and orientation of each vector indicates the difference (error) between the true weights and the weights estimated with the
DFTR, in velocity-acceleration space. Vectors for each clipping level are offset slightly for clarity, and only the 50, 75, and 100% clipping levels (0.5, 0.25, and
0 DC offset values; green, blue, and black, respectively) are shown. The 0 and 25% clipping data showed very little error (vectors with near-0 length).
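The phase shifts quoted in the text for a misestimated delay (about 20° at 0.5 Hz and about 40° at 1.0 Hz for a 109 ms error) follow from the standard relation between a time shift and the phase of a frequency component. The worked equation below is a sketch of that arithmetic; the symbols Δt and Δφ are introduced here for illustration, and only magnitudes are shown because the sign depends on the DFT convention used.

```latex
\Delta\phi(f) = 360^\circ \cdot f \cdot \Delta t, \qquad \Delta t = 0.109\ \mathrm{s}

\Delta\phi(0.5\ \mathrm{Hz}) = 360^\circ \times 0.5 \times 0.109 \approx 19.6^\circ

\Delta\phi(1.0\ \mathrm{Hz}) = 360^\circ \times 1.0 \times 0.109 \approx 39.2^\circ
```

Because the shift grows linearly with frequency, a fixed delay error rotates each frequency component by a different angle, which is how a delay error translates into a change in the estimated balance of velocity and acceleration components.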
response is clipped at zero. How might this have affected our MSTd results and their interpretation? Unlike in the simulations, we do not have access to the true shape of an apparently clipped response, and thus the underlying velocity and acceleration components in such a response profile cannot be known with certainty. However, our simulations suggest that typical degrees of clipping should only modestly affect the estimation of the components using DFTR, except for neurons with very low levels of spontaneous activity relative to maximum evoked activity. Such neurons were relatively rare in our sample. One way to estimate the potential for clipping in the real data is to compute the ratio of the baseline firing rate (measured during the 600 ms prior to stimulus onset) to the peak evoked firing rate, which we term the baseline-to-peak ratio (BPR). Higher values of BPR suggest less potential for clipping: for example, the minimum BPR for the simulated data in the leftmost column of Fig. 9 is 0.5 (~1.0/2.0), whereas it decreases to 0.33 (~0.5/1.5) for the middle column, and is near zero for the rightmost column.

The distribution of BPR values for all responses with significant DFTR values is shown in Fig. 11A. The median BPR was 0.36, slightly higher than the minimum BPR for the 50% cutoff scenario in the simulations (Fig. 9, middle; Fig. 10, green curve). To examine the relationship between BPR and the relative strength of velocity and acceleration components, we separated the data shown in Fig. 5B (absolute value of DFTR components) depending on whether BPR was less than or greater than 0.33. The medians of these separate distributions are plotted in Fig. 11B, alongside the medians from the full data set (reproduced from Fig. 5B). The results indicate that the median |DFTRvel| was lower for responses with greater BPR, although it remained significantly greater than the median |DFTRacc| across all BPR categories. This reduction in |DFTRvel| may not be solely due to clipping; it could also simply reflect lower peak activity of responses with high BPR or perhaps a saturation effect when baseline activity is high. However, even if we assume that clipping accounts for most of the effect, velocity components in MSTd are still significantly stronger than acceleration components for the largest BPR category (Wilcoxon matched pairs test, P < 0.0001). Importantly, the median |DFTRacc| remained stable across BPR categories, suggesting that clipping did not greatly influence the strength of estimated acceleration components. Furthermore, as discussed in the preceding text, clipping does not always bias the component estimates toward velocity: for negative-going velocity and acceleration components, the bias can be in the direction of acceleration instead. Thus our main conclusion, the relative prevalence of velocity signals in MSTd, is unlikely to be strongly influenced by the effect of clipping on measured temporal responses.
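The baseline-to-peak ratio described above can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function name `baseline_to_peak_ratio`, its arguments, and the choice to take the peak as the largest PSTH bin after stimulus onset are assumptions. The 600 ms pre-stimulus baseline window and the definition as baseline rate divided by peak evoked rate come from the text.

```python
import numpy as np

def baseline_to_peak_ratio(psth, t, stim_onset, baseline_window=0.6):
    """Hypothetical helper: baseline firing rate divided by peak evoked rate.

    psth            firing rate per time bin (spikes/s)
    t               time of each bin (s)
    stim_onset      stimulus onset time (s)
    baseline_window length of the pre-stimulus baseline window (s); 0.6 s per the text
    """
    psth = np.asarray(psth, dtype=float)
    t = np.asarray(t, dtype=float)

    # Baseline: mean rate in the window immediately before stimulus onset.
    pre = (t >= stim_onset - baseline_window) & (t < stim_onset)
    baseline = psth[pre].mean()

    # Peak evoked rate: largest bin at or after stimulus onset (assumed definition).
    peak = psth[t >= stim_onset].max()

    # A cell that never fires after onset has no defined ratio.
    return baseline / peak if peak > 0 else float("nan")
```

For a response with a baseline of 0.5 and a unit-height velocity-like peak, this returns a value close to 0.33 (0.5/1.5), matching the middle-column example given in the text. Lower values flag responses with more potential for clipping.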
sponse dynamics across directions of motion. These results suggest that the diversity of response dynamics observed in brain stem, cerebellum, and thalamus is preserved to some extent in cortex, supporting the hypothesis that this diversity is functionally relevant for perception and behavior (Klam and Graf 2003).

To our knowledge, this is the first study to use frequency analysis to quantify temporal response properties for transient vestibular stimuli. Our method complements previous studies that utilized traditional sinusoidal motion stimuli (Chen-Huang and Peterson 2006; Dickman and Angelaki 2002; Liu and Angelaki 2009; Meng et al. 2007; Shaikh et al. 2005; Yakushin et al. 2006; Zhou et al. 2006). Indeed, a recent study (Liu and Angelaki 2009) showed that MSTd responses exhibit predominantly velocity-like dynamics during sinusoidal translations, consistent with our findings. However, quantifying response dynamics using steady-state sinusoidal stimuli is complicated by the fact that central translation responses exhibit nonminimum phase behavior (i.e., phase spectra that exhibit larger or smaller frequency-dependent variation than would be expected from the dependence of amplitude on frequency) (Angelaki and Dickman 2000; Dickman and Angelaki 2002; Shaikh et al. 2005). As a result, response phase during sinusoidal stimulation cannot be used to conclusively determine whether cells encode velocity or position/acceleration. Using position transients (Gaussian velocity profiles) instead of sinusoids allows for a more direct evaluation of the temporal dynamics of translation responses in the CNS.

The use of the Hilbert transform to compute the temporal envelope allowed us to distinguish response phase from response delay (e.g., due to neuronal latency). Consequently, the phase information from the DFT provides a relatively unbiased way to measure the relative strength of velocity and acceleration signals in the observed MSTd responses. An advantage of the DFTR method is that it involves no free parameters and is model-independent, unlike curve-fitting approaches. However, a potential drawback to the DFTR approach is that the response delay may not be estimated correctly if there is substantial response clipping at zero firing rate (e.g., Fig. 6H, PSTHs at 135 and 180° azimuth). Such a misestimation of delay has implications for the phase of the DFT frequency components, and thus the DFTRvel and DFTRacc values computed from Eqs. 2 and 3. However, based on the analysis of simulated data (Figs. 9 and 10) and the separation of the data set based on baseline-to-peak ratio (Fig. 11), this problem is unlikely to have substantially altered our main conclusions. Indeed, the dominance of velocity coding in MSTd was confirmed by application of a model-based curve-fitting approach to a subset of the data (Fig. 14A). The curve-fitting approach is less biased by clipping (Fig. 14C) but is also less robust to noise (Fig. 13B) and therefore not well suited to single-trial analyses. Thus both methods have strengths and weaknesses, but they yield the same basic conclusions regarding vestibular responses in MSTd.

In a similar study, Klam and Graf (2003) developed a random stimulation method to analyze response kinematics in the medial and ventral intraparietal areas (MIP and VIP) during yaw rotation. Because head position, velocity, and acceleration were only weakly correlated during their random motion paradigm, they were able to plot firing rate in a 2D (position vs. velocity) or 3D (position, velocity, and acceleration) space. The resulting “functional vestibular receptive fields” showed that most MIP and VIP neurons encoded multiple kinematic variables simultaneously, similar to our findings in MSTd. Note that the temporal response modulation in our analyses was only decomposed into velocity and acceleration components because we did not observe clear indications of higher-order derivatives (e.g., jerk) in well-modulated responses. However, for a few motion directions the “baseline” firing rates immediately before and after the stimulus were clearly different (e.g., see 45 and 90° azimuth PSTHs in example cell 2, Fig. 6H). Because our stimuli were position transients, this suggests a possible influence of position for some motion directions. The possibility of position coding in MSTd deserves further study, especially in light of a previous report of “path” and “place” responses in this region (Froehler and Duffy 2002).

Spatial variation in temporal dynamics

One of our main findings is that response dynamics in MSTd are not constant for a given cell but instead vary depending on the direction of motion. This is consistent with findings in the vestibular nuclei and cerebellum (Angelaki and Dickman 2000; Musallam and Tomlinson 2002; Shaikh et al. 2005; Yakushin et al. 2006) but not with the behavior of otolith afferents (Angelaki and Dickman 2000; Fernandez and Goldberg 1976; Si et al. 1997). Otolith afferents are characterized by a single response vector, corresponding to the direction of maximal sensitivity. In contrast, the spatial tuning of many MSTd neurons could be described as having multiple response vectors. For example, some neurons clearly encoded velocity and acceleration along different directions of motion (e.g., example cell 1 in Fig. 6, A–D; note the gradual transition from velocity to acceleration going from 45 to 225° azimuth in Fig. 6D).

What then is the origin of this spatial variation in temporal response dynamics, and what function(s) could it serve? Previous theoretical (Angelaki 1992, 1993b; Angelaki et al. 1992, 1993) and neurophysiological (Angelaki and Dickman 2000; Chan et al. 2002; Chen-Huang and Peterson 2006; Dickman and Angelaki 2002; Yakushin et al. 2006) studies have proposed that VN neurons receive convergent inputs from otolith afferents that differ in both spatial and temporal properties. This convergence could include afferents from both labyrinths or across the striola within a given labyrinth (Uchino et al. 2005). This process, known as spatiotemporal convergence (STC), results in a type of spatiotemporal filtering that is consistent with the dependence of dynamics on motion direction that we observed in MSTd. Evidence for STC has been found as early as the second-order neurons in the VN (Angelaki and Dickman 2000) as well as in the vestibulocerebellum (Shaikh et al. 2005) and thalamus (Meng et al. 2007). Although differences in the stimuli preclude a direct comparison, it seems likely that the diverse spatial and temporal properties conferred by STC at lower processing stages have been carried forward to MSTd.

The functional significance of spatiotemporal convergence is unknown, but such a broad range of temporal dynamics within and across neurons may reflect the diversity of roles that linear acceleration signals play in perception and motor control. These roles include reflexive eye movements (translational VOR) (Angelaki 2004; McHenry and Angelaki 2000; Paige and Tomko 1991; Schwarz and Miles 1991; Schwarz et al. 1989), spatial updating (Klier et al. 2008), perception of head orientation with respect to gravity (Angelaki et al. 2004; Green
and Angelaki 2004, 2007; Lewis et al. 2005; Merfeld and Zupan 2002), and perception of self-motion direction or heading (Gu et al. 2007, 2008; Telford et al. 1995). One intriguing speculation is that having several kinematic variables multiplexed within the same population is computationally advantageous (Klam and Graf 2003), perhaps analogous to the idea that a mixture or “hybrid” spatial representation confers flexibility in sensorimotor coordinate transformations (Pouget et al. 2002).

However, this diversity can be a double-edged sword, as such a representation may complicate the readout of signals by downstream regions. For instance, MSTd is well known as a strong candidate for the neural basis of heading perception, both from visual and vestibular cues (Britten and van Wezel 1998; Gu et al. 2007, 2008; Page and Duffy 2003). According to some models of perceptual discrimination, optimal performance of a particular task is achieved by preferentially “reading out” (pooling) the most sensitive neurons in a population (Parker and Newsome 1998; Purushothaman and Bradley 2005; but see Shadlen et al. 1996). In this context, a neuron is most sensitive if the base stimulus value (e.g., motion direction) around which the discrimination is performed lies on the steep part of the cell’s tuning curve. Thus the sensitivity of a given cell depends on its tuning properties, such as its preferred direction and tuning width. However, these properties themselves can depend strongly on the chosen metric of response strength. Assuming that the temporal information in MSTd spiking activity is retained to some extent in downstream areas, the present results suggest that these areas must contend with the fact that a given cell can have different preferred directions and/or tuning widths (and hence different regions of maximal sensitivity) depending on whether the important variable is velocity, acceleration, or some other metric. Thus models of heading discrimination that posit any kind of selective readout or pooling may need to account for the variation in temporal structure found in MSTd responses.

It is important to point out that, unlike brain stem and cerebellar cells, MSTd neurons encode primarily velocity along most motion directions. Thus it is not surprising that spatial tuning for velocity was strongly correlated with spatial tuning calculated using MFR (Fig. 7A) and that preferred directions computed from these two metrics were well aligned (D). This suggests that previous studies that computed spatial tuning from MFR (Duffy 1998; Fetsch et al. 2007; Gu et al. 2006; Page and Duffy 2003; Takahashi et al. 2007) were mainly capturing the velocity response of MSTd neurons. In contrast, MFR analysis does not appropriately capture the properties of the acceleration component of the responses. This may not pose a major problem for understanding the function of a velocity-dominant area such as MSTd, but it could constitute a problem for other areas with stronger acceleration tuning [e.g., parieto-insular vestibular cortex (PIVC)] (Chen et al. 2010).

Interestingly, a small number of neurons had significant temporal modulation for the majority of motion directions (Fig. 4A, tail of the distribution). This means that even neurons that are not spatially selective may have strong temporal modulation to inertial motion. The possible role of this type of neuron is unclear, but some evidence indicates that a larger proportion of neurons in PIVC have this property as well (Chen et al. 2010).

Suitability for integration with visual (optic flow) signals

The velocity-like Gaussian shape of the most common MSTd vestibular responses appears similar to the dynamics of visual (i.e., optic flow) responses in this area (Gu et al. 2006) as well as visual motion responses in area MT (Lisberger and Movshon 1999; Rodman and Albright 1987), a primary source of input to MSTd. Similar response dynamics for both vestibular and optic flow responses in MST might be useful for the integration of these two complementary self-motion cues, and indeed there is evidence that MSTd neurons may be involved in this sensory integration (Gu et al. 2008). For integration to occur, one might assume that responses to different cues would signal motion with similar temporal dynamics. Interestingly, the spatial variation in temporal dynamics that we have observed in the present study suggests that visual and vestibular signals in many MSTd neurons may only be temporally matched for some directions of motion. Future studies will be needed to assess whether visual and vestibular dynamics are matched on a cell-by-cell and direction-specific basis, and whether the degree of temporal similarity predicts other signatures of sensory integration such as improved sensitivity under combined visual-vestibular stimulation (Gu et al. 2008).

ACKNOWLEDGMENTS

We thank K. Kocher, E. White, and A. Turner for animal care and training. Special thanks to Dr. Katsumasa Takahashi for assisting with data collection.

GRANTS

The work was supported by National Institutes of Health grants R01 EY-017866 and EY-019087 to D. E. Angelaki and EY-016178 to G. C. DeAngelis.

DISCLOSURES

No conflicts of interest, financial or otherwise, are declared by the author(s).

REFERENCES

Angelaki DE. Spatio-temporal convergence (STC) in otolith neurons. Biol Cybern 67: 83–96, 1992.
Angelaki DE. Generation of two-dimensional spatial and temporal properties through spatiotemporal convergence between one-dimensional neurons. IEEE Trans Biomed Eng 40: 686–692, 1993a.
Angelaki DE. Spatial and temporal coding in single neurons. Biol Cybern 69: 147–154, 1993b.
Angelaki DE. Eyes on target: what neurons must do for the vestibuloocular reflex during linear motion. J Neurophysiol 92: 20–35, 2004.
Angelaki DE, Bush GA, Perachio AA. A model for the characterization of the spatial properties in vestibular neurons. Biol Cybern 66: 231–240, 1992.
Angelaki DE, Bush GA, Perachio AA. Two-dimensional spatiotemporal coding of linear acceleration in vestibular nuclei neurons. J Neurosci 13: 1403–1417, 1993.
Angelaki DE, Dickman JD. Spatiotemporal processing of linear acceleration: primary afferent and central vestibular neuron responses. J Neurophysiol 84: 2113–2132, 2000.
Angelaki DE, Shaikh AG, Green AM, Dickman JD. Neurons compute internal models of the physical laws of motion. Nature 430: 560–564, 2004.
Bracewell RN. The Fourier Transform and Its Applications. New York: McGraw-Hill, 1978.
Bremmer F, Kubischik M, Pekel M, Lappe M, Hoffmann KP. Linear vestibular self-motion signals in monkey medial superior temporal area. Ann NY Acad Sci 871: 272–281, 1999.
Britten KH, van Wezel RJ. Electrical microstimulation of cortical area MST biases heading perception in monkeys. Nat Neurosci 1: 59–63, 1998.
Chan YS, Lai CH, Shum DK. Bilateral otolith contribution to spatial coding in the vestibular system. J Biomed Sci 9: 574–586, 2002.
Chen A, DeAngelis GC, Angelaki DE. Macaque parieto-insular vestibular cortex: responses to self-motion and optic flow. J Neurosci 30: 3022–3042, 2010.
Chen-Huang C, Peterson BW. Three dimensional spatial-temporal convergence of otolith related signals in vestibular only neurons in squirrel monkeys. Exp Brain Res 168: 410–426, 2006.
DeAngelis GC, Ohzawa I, Freeman RD. Spatiotemporal organization of simple-cell receptive fields in the cat’s striate cortex. II. Linearity of temporal and spatial summation. J Neurophysiol 69: 1118–1135, 1993.
Dickman JD, Angelaki DE. Vestibular convergence patterns in vestibular nuclei neurons of alert primates. J Neurophysiol 88: 3518–3533, 2002.
Duffy CJ. MST neurons respond to optic flow and translational movement. J Neurophysiol 80: 1816–1827, 1998.
Duffy CJ, Wurtz RH. Sensitivity of MST neurons to optic flow stimuli. II. Mechanisms of response selectivity revealed by small-field stimuli. J Neurophysiol 65: 1346–1359, 1991.
Duffy CJ, Wurtz RH. Response of monkey MST neurons to optic flow stimuli with shifted centers of motion. J Neurosci 15: 5192–5208, 1995.
Fernandez C, Goldberg JM. Physiology of peripheral neurons innervating otolith organs of the squirrel monkey. III. Response dynamics. J Neurophysiol 39: 996–1008, 1976.
Fetsch CR, Wang S, Gu Y, DeAngelis GC, Angelaki DE. Spatial reference frames of visual, vestibular, and multimodal heading signals in the dorsal subdivision of the medial superior temporal area. J Neurosci 27: 700–712, 2007.
Field DJ, Tolhurst DJ. The structure and symmetry of simple-cell receptive-field profiles in the cat’s visual cortex. Proc R Soc Lond B Biol Sci 228: 379–400, 1986.
Froehler MT, Duffy CJ. Cortical neurons encoding path and place: where you go is where you are. Science 295: 2462–2465, 2002.
Gabor D. Theory of communication. J Inst Electr Eng 93: 429–457, 1946.
Green AM, Angelaki DE. An integrative neural network for detecting inertial motion and head orientation. J Neurophysiol 92: 905–925, 2004.
Green AM, Angelaki DE. Coordinate transformations and sensory integration in the detection of spatial orientation and self-motion: from models to experiments. Prog Brain Res 165: 155–180, 2007.
Gu Y, Angelaki DE, DeAngelis GC. Neural correlates of multisensory cue integration in macaque MSTd. Nat Neurosci 11: 1201–1210, 2008.
Gu Y, DeAngelis GC, Angelaki DE. A functional link between area MSTd and heading perception based on vestibular signals. Nat Neurosci 10: 1038–1047, 2007.
Gu Y, Fetsch CR, Adeyemo B, DeAngelis GC, Angelaki DE. Decoding of MSTd population activity accounts for variations in the precision of heading perception. Neuron 66: 596–609, 2010.
Gu Y, Watkins PV, Angelaki DE, DeAngelis GC. Visual and nonvisual contributions to three-dimensional heading selectivity in the medial superior temporal area. J Neurosci 26: 73–85, 2006.
Hartigan JA, Hartigan PM. The dip test of unimodality. Ann Statist 13: 70–84, 1985.
Hartigan PM. Algorithm AS 217: computation of the dip statistic to test for unimodality. Appl Statist 34: 320–325, 1985.
Judge SJ, Richmond BJ, Chu FC. Implantation of magnetic search coils for measurement of eye position: an improved method. Vision Res 20: 535–538, 1980.
Klam F, Graf W. Vestibular response kinematics in posterior parietal cortex neurons of macaque monkeys. Eur J Neurosci 18: 995–1010, 2003.
Klier EM, Hess BJ, Angelaki DE. Human visuospatial updating after passive translations in three-dimensional space. J Neurophysiol 99: 1799–1809, 2008.
Lewis RF, Haburcakova C, Merfeld DM. Tilt psychophysics measured in nonhuman primates. Ann NY Acad Sci 1039: 294–305, 2005.
Lisberger SG, Movshon JA. Visual motion analysis for pursuit eye movements in area MT of macaque monkeys. J Neurosci 19: 2224–2246, 1999.
Liu S, Angelaki DE. Vestibular signals in macaque extrastriate visual cortex are functionally appropriate for heading perception. J Neurosci 29: 8936–8945, 2009.
Loe PR, Tomko DL, Werner G. The neural signal of angular head position in primary afferent vestibular nerve axons. J Physiol 230: 29–50, 1973.
Meng H, May PJ, Dickman JD, Angelaki DE. Vestibular signals in primate thalamus: properties and origins. J Neurosci 27: 13590–13602, 2007.
Merfeld DM, Zupan LH. Neural processing of gravitoinertial cues in humans. III. Modeling tilt and translation responses. J Neurophysiol 87: 819–833, 2002.
Musallam S, Tomlinson RD. Asymmetric integration recorded from vestibular-only cells in response to position transients. J Neurophysiol 88: 2104–2113, 2002.
Nguyenkim JD, DeAngelis GC. Disparity-based coding of three-dimensional surface orientation by macaque middle temporal neurons. J Neurosci 23: 7117–7128, 2003.
Page WK, Duffy CJ. Heading representation in MST: sensory interactions and population encoding. J Neurophysiol 89: 1994–2013, 2003.
Paige GD, Tomko DL. Eye movement responses to linear head motion in the squirrel monkey. I. Basic characteristics. J Neurophysiol 65: 1170–1182, 1991.
Parker AJ, Newsome WT. Sense and the single neuron: probing the physiology of perception. Annu Rev Neurosci 21: 227–277, 1998.
Pouget A, Deneve S, Duhamel JR. A computational perspective on the neural basis of multisensory spatial representations. Nat Rev Neurosci 3: 741–747, 2002.
Prince SJ, Pointon AD, Cumming BG, Parker AJ. Quantitative analysis of the responses of V1 neurons to horizontal disparity in dynamic random-dot stereograms. J Neurophysiol 87: 191–208, 2002.
Purushothaman G, Bradley DC. Neural population code for fine perceptual decisions in area MT. Nat Neurosci 8: 99–106, 2005.
Rodman HR, Albright TD. Coding of visual stimulus velocity in area MT of the macaque. Vision Res 27: 2035–2048, 1987.
Schwarz U, Busettini C, Miles FA. Ocular responses to linear motion are inversely proportional to viewing distance. Science 245: 1394–1396, 1989.
Schwarz U, Miles FA. Ocular responses to translation and their dependence on viewing distance. I. Motion of the observer. J Neurophysiol 66: 851–864, 1991.
Shadlen MN, Britten KH, Newsome WT, Movshon JA. A computational analysis of the relationship between neuronal and behavioral responses to visual motion. J Neurosci 16: 1486–1510, 1996.
Shaikh AG, Ghasia FF, Dickman JD, Angelaki DE. Properties of cerebellar fastigial neurons during translation, rotation, and eye movements. J Neurophysiol 93: 853–863, 2005.
Si X, Angelaki DE, Dickman JD. Response properties of pigeon otolith afferents to linear acceleration. Exp Brain Res 117: 242–250, 1997.
Takahashi K, Gu Y, May PJ, Newlands SD, DeAngelis GC, Angelaki DE. Multimodal coding of three-dimensional rotation and translation in area MSTd: comparison of visual and vestibular selectivity. J Neurosci 27: 9742–9756, 2007.
Tanaka K, Hikosaka K, Saito H, Yukie M, Fukada Y, Iwai E. Analysis of local and wide-field movements in the superior temporal visual areas of the macaque monkey. J Neurosci 6: 134–144, 1986.
Tanaka K, Saito H. Analysis of motion of the visual field by direction, expansion/contraction, and rotation cells clustered in the dorsal part of the medial superior temporal area of the macaque monkey. J Neurophysiol 62: 626–641, 1989.
Telford L, Howard IP, Ohmi M. Heading judgments during active and passive self-motion. Exp Brain Res 104: 502–510, 1995.
Uchino Y, Sasaki M, Sato H, Bai R, Kawamoto E. Otolith and canal integration on single vestibular neurons in cats. Exp Brain Res 164: 271–285, 2005.
Uka T, DeAngelis GC. Contribution of middle temporal area to coarse depth discrimination: comparison of neuronal and psychophysical sensitivity. J Neurosci 23: 3515–3530, 2003.
Warren WH. Optic flow. In: The Visual Neurosciences, edited by Chalupa LM, Werner JS. Cambridge, MA: MIT Press, 2003, p. 1247–1259.
Yakusheva T, Blazquez PM, Angelaki DE. Frequency-selective coding of translation and tilt in macaque cerebellar nodulus and uvula. J Neurosci 28: 9997–10009, 2008.
Yakushin SB, Raphan T, Cohen B. Spatial properties of central vestibular neurons. J Neurophysiol 95: 464–478, 2006.
McHenry MQ, Angelaki DE. Primate translational vestibuloocular reflexes. Zhou W, Tang BF, Newlands SD, King WM. Responses of monkey
II. Version and vergence responses to fore-aft motion. J Neurophysiol 83: vestibular-only neurons to translation and angular rotation. J Neurophysiol
1648 –1661, 2000. 96: 2915–2930, 2006.