1704 01971

The quasiprobability behind the out-of-time-ordered correlator
Nicole Yunger Halpern,1 Brian Swingle,2, 3, 4 and Justin Dressel5, 6

1
Institute for Quantum Information and Matter, Caltech, Pasadena, CA 91125, USA
2
Department of Physics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
3
Department of Physics, Harvard University, Cambridge, MA 02138, USA
4
Department of Physics, Brandeis University, Waltham, MA 02453, USA
5
Institute for Quantum Studies, Chapman University, Orange, CA 92866, USA
6
Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA
(Dated: April 10, 2017)
Two topics, evolving rapidly in separate fields, were combined recently: The out-of-time-ordered
correlator (OTOC) signals quantum-information scrambling in many-body systems. The Kirkwood-
Dirac (KD) quasiprobability represents operators in quantum optics. The OTOC was shown to equal
a moment of a summed quasiprobability [Yunger Halpern, Phys. Rev. A 95, 012120 (2017)]. That
arXiv:1704.01971v1 [quant-ph] 6 Apr 2017
quasiprobability, we argue, is an extension of the KD distribution. We explore the quasiprobabil-

itys structure from experimental, numerical, and theoretical perspectives. First, we simplify and
analyze Yunger Halperns weak-measurement and interference protocols for measuring the OTOC
and its quasiprobability. We decrease, exponentially in system size, the number of trials required
to infer the OTOC from weak measurements. We also construct a circuit for implementing the
weak-measurement scheme. Next, we calculate the quasiprobability (after coarse-graining) numer-
ically and analytically: We simulate a transverse-field Ising model first. Then, we calculate the
quasiprobability averaged over random circuits, which model chaotic dynamics. The quasiproba-
bility, we find, distinguishes chaotic from integrable regimes. We observe nonclassical behaviors:
The quasiprobability typically has negative components. It becomes nonreal in some regimes. The
onset of scrambling breaks a symmetry that bifurcates the quasiprobability, as in classical-chaos
pitchforks. Finally, we present mathematical properties. The quasiprobability obeys a Bayes-type
theorem, for example, that exponentially decreases the memory required to calculate weak values, in
certain cases. A time-ordered correlator analogous to the OTOC, insensitive to quantum-information
scrambling, depends on a quasiprobability closer to a classical probability. This work not only il-
luminates the OTOCs underpinnings, but also generalizes quasiprobability theory and motivates
immediate-future weak-measurement challenges.
Two topics have been flourishing independently: A :

the out-of-time-ordered correlator (OTOC) and the
Kirkwood-Dirac (KD) quasiprobability distribution. The 2 D (W + 0 W 0 ) E
F (t) = e . (1)
OTOC signals chaos, and the dispersal of informa- 0
, 0 =0
tion through entanglement, in quantum many-body
systems [16]. Quasiprobabilities represent quantum W and W 0 denote measurable random variables analo-
states as phase-space distributions represent statistical- gous to thermodynamic work; and , 0 R. The aver-
mechanical states [7]. Classical phase-space distributions age h.i is with respect to a sum of quasiprobability val-
are restricted to positive values; quasiprobabilities are ues A (.). Equation (1) resembles Jarzynskis Equality,
not. The best-known quasiprobability is the Wigner a fluctuation relation in nonequilibrium statistical me-
function. The Wigner function can become negative; the chanics [38]. Jarzynski cast a useful, difficult-to-measure
KD quasiprobability, negative and nonreal [814]. Non- free-energy difference F in terms of the characteristic
classical values flag contextuality, a resource underlying function of a probability. Equation (1) casts the useful,
quantum-computation speedups [1420]. Hence the KD difficult-to-measure OTOC in terms of the characteristic
quasiprobability, like the OTOC, reflects nonclassicality. function of a summed quasiprobability. The OTOC has
Yet disparate communities use these tools: The OTOC recently been linked to thermodynamics also in [39, 40].
F (t) features in quantum information theory, high-energy A , we argue, is an extension of the KD quasiprob-
physics, and condensed matter. Contexts include black ability. Weak-measurement tools used to infer KD
holes within AdS/CFT duality [1, 2123], weakly inter- quasiprobabilities can be applied to infer A from ex-
acting field theories [2427], spin models [1, 28], and periments [37]. Upon measuring A , one can recover
the Sachdev-Ye-Kitaev model [29, 30]. The KD dis- the OTOC. Alternative OTOC-measurement proposals
tribution features in quantum optics. Experimentalists rely on Lochshmidt echoes [41], interferometry [37, 41
have inferred the quasiprobability from weak measure- 43], clocks [44], particle-number measurements of ultra-
ments of photons [1013, 3134] and superconducting cold atoms [43, 45, 46], and two-point measurements [39].
qubits [35, 36]. Initial experiments have begun the push toward charac-
The two tools were united in [37]. The OTOC was terizing many-body scrambling: OTOCs of an infinite-
shown to equal a moment of a summed quasiprobability, temperature four-site NMR system have been mea-
2
sured [47]. OTOCs of symmetric observables have been Sections III and IV feature calculations of A . First,
measured with infinite-temperature trapped ions [48] and we numerically simulate a transverse-field Ising model.
in nuclear spin chains [49]. Weak measurements offer a A changes significantly, we find, over time scales rele-
distinct toolkit, opening new platforms and regimes to vant to the OTOC. The quasiprobabilitys behavior dis-
OTOC measurements. The weak-measurement scheme tinguishes nonintegrable from integrable Hamiltonians.
in [37] is expected to provide a near-term challenge for The quasiprobabilitys negativity and nonreality remains
superconducting qubits [35, 5055], trapped ions [5662], robust with respect to substantial quantum interference.
ultracold atoms [63], cavity quantum electrodynamics We then calculate an average, over Brownian circuits,
(QED) [64, 65], and perhaps NMR [66, 67]. of A . Brownian circuits model chaotic dynamics: The
We investigate the quasiprobability A that lies be- system is assumed to evolve, at each time step, under
hind the OTOC. The study consists of three branches: random two-qubit couplings [6871].
We discuss experimental measurements, calculate (a
A final theory section concerns mathematical prop-
coarse-grained) A , and explore mathematical properties. erties and physical interpretations of A . A shares some,
Not only does quasiprobability theory shed new light on though not all, of its properties with the KD distribu-
the OTOC. The OTOC also inspires questions about tion. The OTOC motivates a generalization of a Bayes-
quasiprobabilities and motivates weak-measurement ex- type theorem obeyed by the KD distribution [14, 7275].
perimental challenges. The generalization exponentially shrinks the memory re-
The paper is organized as follows. In a technical intro- quired to compute weak values, in certain cases. The
duction, we review the KD quasiprobability, the OTOC, OTOC also motivates a generalization of decompositions
the OTOC quasiprobability A , and schemes for measur- of quantum states . This decomposition property may
ing A . We also introduce our set-up and notation. help experimentalists assess how accurately they pre-
Next, we discuss experimental measurements. We in- pared the desired initial state when measuring F (t). A
troduce a coarse-graining A of A . The coarse-graining time-ordered correlator FTOC (t) analogous to F (t), we
involves a projection trick that decreases, exponen- show next, depends on a quasiprobability that can reduce
tially in system size, the number of trials required to to a probability. The OTOC quasiprobability lies farther
infer F (t) from weak measurements. We evaluate pros from classical probabilities than the TOC quasiprobabil-
and cons of the quasiprobability-measurement schemes ity, as the OTOC registers quantum-information scram-
in [37]. We also compare our schemes with alternative bling that FTOC (t) does not. Finally, we recall that the
F (t)-measurement schemes [41, 42, 44]. We then present OTOC encodes three time reversals. OTOCs that en-
a circuit for weakly measuring a qubit systems A . Fi- code more equal moments of longer quasiprobabilities.
nally, we show how to infer the coarse-grained A from We conclude with theoretical and experimental opportu-
alternative OTOC-measurement schemes (e.g., [41]). nities.
CONTENTS
I. Technical introduction 3
A. The KD quasiprobability in quantum optics 3
1. Phase-space representations, alternative quasiprobabilities, and history 4
2. Bayes-type theorem and retrodiction with the KD quasiprobability 4
3. Decomposing operators in terms of KD-quasiprobability coefficients 5
4. Properties of the KD quasiprobability 6
B. Set-up 6
C. The out-of-time-ordered correlator 7
D. Introducing the quasiprobability A behind the OTOC 8
1. Quantum probability amplitude A 8
2. The OTOC quasiprobability A 9
3. Complex distribution P (W, W 0 ) 10
4. Weak-measurement and interference schemes for inferring A 10
II. Experimentally measuring A and the coarse-grained A 12

A. The coarse-grained OTOC quasiprobability A and a projection trick 12
B. Analysis of the quasiprobability-measurement schemes and comparison with other OTOC-measurement
schemes 13
C. Circuit for inferring A from weak measurements 16
1. Partial-projection operators 16
2. Notation 16
3
3. Weak-measurement subcircuit 16
4. Full circuit for weak-measurement scheme 17
D. How to infer A from other OTOC-measurement schemes 17
III. Numerical simulations 18

A. Thermal states 18
B. Random states 19
C. Product states 21
D. Summary 22
IV. Calculation of A averaged over Brownian circuits 22

A. Infinite-temperature thermal state 1/2N 23
B. General state 24
1. Special case: 2z eigenstate 25
C. Summary 25
V. Theoretical study of A 26
A. Mathematical properties of A 26
B. Bayes-type theorem and retrodiction with A 28
C. A (.) values as coefficients in an operator decomposition 32
D. Relationship between out-of-time ordering and quasiprobabilities 34
1. Time-ordered correlator FTOC (t) 34
2. TOC probability amplitude ATOC 34
3. TOC quasiprobability ATOC 35
0
4. Complex TOC distribution PTOC (WTOC , WTOC ) 36
5. Jarzynski-like equality for the TOC 36
E. Higher-order OTOCs as moments of longer (summed) quasiprobabilities 36
VI. Outlook 37
A. Experimental opportunities 37
B. Opportunities motivated by calculations 38
C. Fundamental-theory opportunities 38
Acknowledgements 39
Appendix A. Mathematical properties of P (W, W 0 ) 39
:= i(K . . . A A . . . K)
Appendix B. Retrodiction about the symmetrized composite observable 40
References 41
I. TECHNICAL INTRODUCTION A. The KD quasiprobability in quantum optics
The Kirkwood-Dirac quasiprobability is defined as fol-

lows. Let S denote a quantum system associated with a
This review consists of three parts. In Sec. I A, we Hilbert space H. Let {|ai} and {|f i} denote orthonor-
overview the KD quasiprobability. Section I B introduces mal bases for H. Let B(H) denote the set of bounded
our set-up and notation. In Sec. I C, we review the OTOC operators defined on H, and let O B(H). The KD
and its quasiprobability A . We overview also the weak- quasiprobability
measurement and interference schemes for measuring A (1)
AO (a, f ) := hf |aiha|O|f i , (2)
and F (t).
regarded as a function of a and f , contains all the in-
formation in O. Density operators O = are often fo-
The quasiprobability section (I A) provides back- cused on in the literature and in this paper. This sec-
ground for quantum-information, high-energy, and tion concerns the context, structure, and applications of
(1)
condensed-matter readers. The OTOC section (I C) AO (a, f ).
targets quasiprobability and weak-measurement readers. We set the stage with phase-space representations
We encourage all readers to study the set-up (I B), as of quantum mechanics, alternative quasiprobabilities,
well as A and the schemes for measuring A (I D). and historical background. Equation (2) facilitates
4
retrodiction, or inference about the past, reviewed in The KD quasiprobability resembles a little brother of
Sec. I A 2. How to decompose an operator O in terms theirs, whom hardly anyone has heard of [77]. Kirkwood
of KD-quasiprobability values appears in Sec. I A 3. The and Dirac defined the quasiprobability independently in
quasiprobability has mathematical properties reviewed in 1933 [8] and 1945 [9]. Their finds remained under the
Sec. I A 4. radar for decades. Rihaczek rediscovered the distribu-
Much of this section parallels Sec. V, our theoretical tion in 1968, in classical-signal processing [78, 79]. (The
investigation of the OTOC quasiprobability. More back- KD quasiprobability is sometimes called the Kirkwood-
ground appears in [14]. Rihaczek distribution.) The quantum communitys at-
tention has revived recently. Reasons include experimen-
tal measurements, mathematical properties, and applica-
1. Phase-space representations, alternative tions to retrodiction and state decompositions.
quasiprobabilities, and history
Phase-space distributions form a mathematical toolkit 2. Bayes-type theorem and retrodiction with the KD
applied in Liouville mechanics [76]. Let S denote a sys- quasiprobability
tem of 6N degrees of freedom (DOFs). An example sys-
tem consists of N particles, lacking internal DOFs, in a
Prediction is inference about the future. Retrodiction
three-dimensional space. We index the particles with i
is inference about the past. One uses the KD quasiproba-
and let = x, y, z. The th component qi of particle
bility to infer about a time t0 , using information about an
is position is conjugate to the th component p i of the event that occurred before t0 and information about an
particles momentum. The variables qi and p label the
i event that occurred after t0 . This forward-and-backward
axes of phase space.
propagation evokes the OTOCs out-of-time ordering.
Suppose that the system contains many DOFs: N
We borrow notation from, and condense the explana-
1. Tracking all the DOFs is difficult. Which phase-
tion in, [14]. Let S denote a discrete quantum system.
space point S occupies, at any instant, may be un-
Consider preparing S in a state |ii at time t = 0. Sup-
known. The probability that, at time t, S occupies an
pose that S evolves under a time-independent Hamilto-
infinitesimal volume element localized at (q1x , . . . , pzN ) is
nian that generates the family Ut of unitaries. Let F
({qi }, {p
i }; t) d
3N
q d3N p. The phase-space distribution
denotePan observable measured at time t00 > 0. Let
({qi }, {pi }; t) is a probability density.
qi and p
F = f f |f ihf | be the eigendecomposition, and let f
i seem absent from quantum mechanics (QM), denote the outcome.
prima facie. Most introductions to QM cast quantum P
states in terms of operators, Dirac kets |i, and wave Let A = a a|aiha| be the eigendecomposition of an
functions (x). Classical variables are relegated to mea- observable that fails to commute with F . Let t0 denote
surement outcomes and to the classical limit. Wigner, a time in (0, t00 ). Which value can we most reasonably
Moyal, and others represented QM in terms of phase attribute to the systems time-t0 A, knowing that S was
space [7]. These representations are used most in quan- prepared in |ii and that the final measurement yielded
tum optics. f?
In such a representation, a quasiprobability density re- Propagating the initial state forward to time t0 yields
0
places the statistical-mechanical probability density .1 |i i := Ut0 |ii. Propagating the final state backward yields
Yet quasiprobabilities violate axioms of probability [16]. |f 0 i := Ut00 t0 |f i. Our best guess about A is the weak
Probabilities are nonnegative, for example. Quasiproba- value [36, 7375, 8082]
bilities can assume negative values, associated with non-
classical physics such as contextuality [1418, 20], and hf 0 |A|i0 i
Aweak (i, f ) := < . (3)
nonreal values. Relaxing different axioms leads to differ- hf 0 |i0 i
ent quasiprobabilities. Different quasiprobabilities cor-
respond also to different orderings of noncommutative The real part of a complex number z is denoted by <(z).
operators [9]. The best-known quasiprobabilities include The guesss accuracy is quantified with a distance metric
the Wigner function, the Glauber-Sudarshan P represen- (Sec. V B) and with comparisons to weak-measurement
tation, and the Husimi Q function [7]. data.
Aharonov et al. discovered weak values in 1988 [72].
Weak values be anomalous, or strange: Aweak can ex-
ceed the greatest eigenvalue amax of A and can dip be-
1 We will focus on discrete quantum systems, motivated by a spin- low the least eigenvalue amin . Anomalous weak values
chain example. Discrete systems are governed by quasiprobabil- concur with negative quasiprobabilities and nonclassical
ities, which resemble probabilities. Continuous systems are gov-
erned by quasiprobability densities, which resemble probability
physics [14, 17, 18, 83, 84]. Debate has surrounded weak
densities. Our quasiprobabilities can be replaced with quasiprob- values role in quantum mechanics [8591].
ability densities, and our sums can be replaced with integrals, in, The weak value Aweak , we will show, depends on the
e.g., quantum field theory. KD quasiprobability. We replace the A in Eq. (3) with its
5
eigendecomposition. Factoring out the eigenvalues yields 3. Decomposing operators in terms of KD-quasiprobability
X 0 coefficients
hf |aiha|i0 i
Aweak (i, f ) = a< . (4)
hf 0 |i0 i
a The KD distribution can be interpreted not only in
The weight <(.) is a conditional quasiprobability. It re- terms of retrodiction, but also in terms of operation de-
sembles a conditional probabilitythe likelihood that, if compositions [10, 11]. Quantum-information scientists
|ii was prepared and the measurement yielded f , a is the decompose qubit states in terms of Pauli operators. Let
value most reasonably attributable to A. Multiplying = x x + y y
+ z z
denote a vector of the one-qubit
and dividing the argument by hi0 |f 0 i yields Paulis. Let n R3 denote a unit vector. Let denote
any state of a qubit, a two-level quantum system. can
< (hf 0 |aiha|i0 ihi0 |f 0 i) be expressed as = 21 (1 + n ) . The identity operator
p(a|i, f ) := . (5)
|hf 0 |i0 i|2 is denoted by 1. The n components n` constitute decom-
position coefficients. The KD quasiprobability consists of
Substituting into Eq. (4) yields coefficients in a more general decomposition.
X
Aweak (i, f ) = a p(a|i, f ) . (6) Let S denote a discrete quantum system associated
a
with a Hilbert space H. Let {|f i} and {|ai} denote or-
thonormal bases for H. Let O B(H) denote a bounded
Equation (6) illustrates why negative quasiprobabili- operator defined on H. Consider operating on each side
ties concur with anomalous weak values. Suppose that of O with a resolution of unity:
Aweak > amax . Some large eigenvalue a = alarge must
!
correspond to a high quasiprobability value Pp(alarge |i, f ). X X
The quasiprobability values sum to one: ap(a|i, f ) = O = 1O 1 = |aiha| O |f ihf | (11)
1. Hence another quasiprobability value must compen- a f
sate for p(alarge |i, f ). Some p(asmall |i, f ), associated with X
a smaller eigenvalue asmall , must be negative. = |aihf | ha|O|f i . (12)
The numerator of Eq. (5) is the Terletsky-Margenau- a,f
Hill (TMH) quasiprobability [73, 9294]. The TMH dis-
tribution is the real part of a complex number. That Suppose that every element of {|ai} has a nonzero overlap
complex generalization, with every element of {|f i}:
hf 0 |aiha|i0 ihi0 |f 0 i , (7) hf |ai =

6 0 a, f . (13)
is the KD quasiprobability. Each term in Eq. (12) can be multiplied and divided by
We can generalize the retrodiction argument to arbi- the inner product:
trary states [95]. Let D(H) denote the set of density op-
erators (unit-trace linearP positive-semidefinite operators) X |aihf |
defined on H. Let = i pi |iihi| D(H) be a density O= hf |aiha|O|f i . (14)
hf |ai
operators eigendecomposition. Let 0 := Ut0 Ut0 . The a,f
weak value Eq. (3) becomes n o

|aihf |
0 Under condition (13), hf |ai forms an orthonormal
hf |A0 |f 0 i
Aweak (, f ) := < . (8) basis for B(H) . [The orthonormality is with respect to
hf 0 |0 |f 0 i
the Hilbert-Schmidt inner product. Let O1 , O2 B(H).
P
Let us eigendecompose A and factor out a a. The
The operators have the Hilbert-Schmidt inner prod-
eigenvalues are weighted by the conditional quasiprob- uct (O1 , O2 ) = Tr(O1 O2 ).] The KD quasiprobability
ability hf |aiha|O|f i consists of the decomposition coefficients.
Condition (13) is usually assumed to hold [10, 11, 34].
< (hf 0 |aiha|0 |f 0 i)
p(a|, f ) = . (9) In [10, 11], for example, {|aiha|} and {|f ihf |} manifest as
hf 0 |0 |f 0 i the position and momentum eigenbases {|xi} and {|pi}.
The numerator is the TMH quasiprobability for . The Let |i denote a pure state. Let (x) and (p) represent
complex generalization |i relative to the positive and momentum eigenbases.
The KD quasiprobability for = |ih| has the form
A(1) 0 0 0
(a, f ) = hf |aiha| |f i (10)
(1)
is the KD quasiprobability for .2 We rederive (10), via A|ih| (p, x) = hx|pihp|ih|xi (15)
an operator decomposition, next. e ixp/~
= (x) .
(p) (16)
2~
2 should not be confused with
The A in the quasiprobability A The OTOC motivates a violation of condition (13)
the observable A. (Sec. V C).
6
4. Properties of the KD quasiprobability Equation (17) can be expressed in terms of jointly con-
ditional distributions. Let p(a|i, f ) denote the proba-
The KD quasiprobability shares some, but not all, of bility that an event a will occur, given that an event i
its properties with other quasiprobabilities. The notation occurred and that f occurred subsequently. p(a, f |i) is
below is defined as it has been throughout Sec. I A. defined similarly. What is the joint probability p(i, f, a)
(1)
that i, f , and a will occur? We can construct two ex-
Property 1. The KD quasiprobability AO (a, f ) maps pressions:
B(H) {a} {f } to C . The domain is a composition
of the set B(H) of bounded operators and two sets of real p(i, f, a) = p(a|i, f ) p(i, f ) = p(a, f |i) p(i) . (18)
numbers. The range is the set C of complex numbers, not
The joint probability p(i, f ) equals p(f |i) p(i). This p(i)
necessarily the set R of real numbers.
cancels with the p(i) on the right-hand side of Eq. (18).
The Wigner function assumes only real values. Only Solving for p(a|i, f ) yields Bayes Theorem for jointly
by dipping below zero can the Wigner function deviate conditional probabilities,
from classical probabilistic behavior. The KD distribu-
tions negativity has the following physical significance: p(a, f |i)
p(a|i, f ) = . (19)
Imagine measuring two (commuting) observables, A and p(f |i)
B, simultaneously. The measurement has some probabil-
Equation (5) echoes Eq. (19). The KD quasiproba-
ity p(a; b) of yielding the values a and b. Now, suppose
bilitys Bayesian behavior [12, 100] has been applied to
that A does not commute with B. No joint probability
quantum state tomography [10, 11, 13, 101104] and to
distribution p(a; b) exists. Infinitely precise values can-
quantum foundations [98].
not be ascribed to noncommuting observables simultane-
Having reviewed the KD quasiprobability, we approach
ously. Negative quasiprobability values are not observed
the extended KD quasiprobability behind the OTOC.
directly: Observable phenomena are modeled by aver-
We begin by concretizing our set-up, then reviewing the
ages over quasiprobability values. Negative values are
OTOC.
visible only on scales smaller than the physical coarse-
graining scale. But negativity causes observable effects,
visible in sequential measurements. Example effects in- B. Set-up
clude anomalous weak values [14, 17, 18, 72, 83, 84] and
violations of Leggett-Garg inequalities [96, 97].
Unlike the Wigner function, the KD distribution can This section concerns the set-up and notation used
assume nonreal values. Consider measuring two non- throughout the rest of this paper. Our framework is mo-
commuting observables sequentially. How much does the tivated by the OTOC, which describes quantum many-
first measurement affect the second measurements out- body systems. Examples include black holes [1, 30], the
come? This disturbance is encoded in the KD distribu- Sachdev-Ye-Kitaev model [29, 30], other holographic sys-
tions imaginary component [98101]. tems [2123] and spin chains. We consider a system S
associated with a Hilbert space H of dimensionality d.
(1)
Property 2. Summing A (a, f ) over a yields a proba- The system evolves under a Hamiltonian H that might
(1)
bility distribution. So does summing A (a, f ) over f . be nonintegrable or integrable. H generates the time-
evolution operator U := eiHt .
Consider substituting O = into Eq. (2). Summing over We will have to sum or integrate over spectra. For
a yields hf ||f i. This inner product equals a probability, concreteness, we sum, supposing that H is discrete. A
by Borns Rule. spin-chain example, discussed next, motivates our choice.
Property 3. The KD quasiprobability is defined as in Our sums can be replaced with integrals unless, e.g., we
Eq. (2) regardless of whether {a} and {f } are discrete. evoke spin chains explicitly.
We will often illustrate with a one-dimensional (1D)
The KD distribution and the Wigner function were de- chain of spin- 12 degrees of freedom. Figure 1 illustrates
fined originally for continuous systems. Discretizing the the chain, simulated numerically in Sec. III. Let N denote
Wigner function is less straightforward [16, 20]. the number of spins. This systems H has dimensionality
Property 4. The KD quasiprobability obeys an analog d = 2N .
of Bayes Theorem, Eq. (5). We will often suppose that S occupies, or is initialized
to, a state
Bayes Theorem governs the conditional probability X
p(f |i) that an event f will occur, given that an event = pj |jihj| D(H) . (20)
i has occurred. p(f |i) is expressed in terms of the con- j
ditional probability p(i|f ) and the absolute probabilities
p(i) and p(f ): The set of density operators defined on H is denoted by
D(H), as in Sec. I A. Orthonormal eigenstates are in-
p(i|f ) p(f ) dexed by j; eigenvalues are denoted by pj . Much litera-
p(f |i) = . (17)
p(i) ture focuses on temperature-T thermal states eH/T /Z.
7
1 2 N C. The out-of-time-ordered correlator
Given two unitary operators W and V , the out-of-time-

ordered correlator is defined as
F (t) := hW (t)V W(t)V i Tr(W (t)V W(t)V ) .

W= z 1(N 1)
V = 1(N 1)
x
(23)
FIG. 1: Spin-chain example: A spin chain exemplifies This object reflects the degree of noncommutativity of V
the quantum many-body systems characterized by the and the Heisenberg operator W(t). More precisely, the
out-of-time-ordered correlator (OTOC). We illustrate with a
OTOC appears in the expectation value of the squared
one-dimensional chain of N spin- 21 degrees of freedom. The
vertical red bars mark the sites. The dotted red arrows magnitude of the commutator [W(t), V ],
illustrate how spins can point in arbitrary directions. The
OTOC is defined in terms of local unitary or Hermitian C(t) := h[W(t), V ] [W(t), V ]i = 2 2<(F (t)) . (24)
operators W and V . Example operators include single-qubit
Paulis x and z that act nontrivially on opposite sides of Even if W and V commute, the Heisenberg operator W(t)
the chain. generically does not commute with V at sufficiently late
times.
An analogous definition involves Hermitian W and V .
(The partition function Z normalizes the state.) We leave The commutators square magnitude becomes
the form of general, as in [37].
The OTOC is defined in terms of local operators W C(t) = h[W(t), V ]2 i. (25)
and V . In the literature, W and V are assumed to be uni-
tary and/or Hermitian. Unitarity suffices for deriving the This squared commutator involves TOC (time-ordered-
results in [37], as does Hermiticity. Unitarity and Her- correlator) and OTOC terms. The TOC terms take the
miticity are assumed there, and here, for convenience.3 forms hV W(t)W(t)V i and hW(t)V V W(t)i. [Technically,
In our spin-chain example, the operators manifest as one- hV W(t)W(t)V i is time-ordered. hW(t)V V W(t)i behaves
qubit Paulis that act nontrivially on opposite sides of the similarly.]
chain, e.g., W = z 1(N 1) , and V = 1(N 1) x . In The basic physical process reflected by the OTOC is
the Heisenberg Picture, W evolves as W(t) := U WU . the growth of Heisenberg operators with time. Imagine
The operators eigendecompose as starting with a simple W, e.g., an operator acting non-
X trivially on just one spin in a many-spin system. Time-
W= w` |w` , w` ihw` , w` | (21) evolving yields W(t). The operator has grown if W(t)
w` ,w` acts nontrivially on more spins than W does. The op-
erator V functions as a probe for testing whether the
and action of W(t) has spread to the spin on which V acts
X nontrivially.
V = v` |v` , v` ihv` , v` | . (22) Suppose W and V are unitary and commute. At
v` ,v` early times, W(t) and V approximately commute. Hence
F (t) 1, and C(t) 0. Depending on the dynamics, at
The eigenvalues are denoted by w` and v` . The degener-
later times, W(t) may significantly fail to commute with
acy parameters are denoted by w` and v` . Recall that
V . In a chaotic quantum system, W(t) and V generically
W and V are local. In our example, W acts nontrivially
do not commute at late times, for most choices of W and
on just one of N 1 qubits. Hence W and V are expo-
V.
nentially degenerate in N . The degeneracy parameters
The analogous statement for Hermitian W and V is
can be measured: Some nondegenerate Hermitian oper-
has eigenvalues in a one-to-one correspondence that F (t) approximately equals the TOC terms at early
ator W
outputs a times. At late times, depending on the dynamics, the
with the w` s. A measurement of W and W
commutator can grow large. The time required for the
tuple (w` , w` ). We refer to such a measurement as a
measurement, for conciseness. Analogous statements TOC terms to approach their equilibrium values is called
W the dissipation time td . This time parallels the time re-
concern V and a Hermitian operator V . Section II A in- quired for a system to reach local thermal equilibrium.
troduces a trick that frees us from bothering with degen- The time scale on which the commutator grows to be
eracies. order-one is called the scrambling time t . The scram-
bling time parallels the time over which a drop of ink
spreads across a container of water.
3 Measurements of W and V are discussed in [37] and here. Her- Why consider the commutators square modulus? The
mitian operators GW and GV generate W and V . If W and V simpler object h[W(t), V ]i often vanishes at late times,
are not Hermitian, GW and GV are measured instead of W and due to cancellations between states in the expectation
V. value. Physically, the vanishing of h[W(t), V ]i signifies
8
that perturbing the system with V does not significantly D. Introducing the quasiprobability A behind the
change the expectation value of W(t). This physics is OTOC
expected for a chaotic system, which effectively loses
its memory of its initial conditions. In contrast, the F (t) was shown, in [37], to equal a moment of a
magnitude-squared commutator C(t) is the expectation summed quasiprobability. We review this result, estab-
value of a positive operator. The cancellations that zero lished in four steps: A quantum probability amplitude
out h[W(t), V ]i cannot occur. Mathematically, though A is reviewed in Sec. I D 1 . Amplitudes are combined
the diagonal matrix elements of [W(t), V ] may be small, to form the quasiprobability A in Sec. I D 2. Summing
the operator can be large.
A (.) values, with constraints, yields a complex distri-
We can gain intuition about the manifestation of chaos
bution P (W, W 0 ) in Sec. I D 3. Differentiating P (W, W 0 )
in F (t) from a simple quantum system that has a chaotic
yields the OTOC. A can be inferred experimentally from
semiclassical limit. Let W = q and V = p for some
a weak-measurement scheme and from interference. We
position q and momentum p:
review these schemes in Sec. I D 4.
C(t) = h[q(t), p]2 i ~2 e2L t . (26)
This L is a classical Lyapunov exponent. The final 1. Quantum probability amplitude A
expression follows from the Correspondence Principle:
Commutators are replaced with i~ times the correspond- The OTOC quasiprobability A is defined in terms of
ing Poisson bracket. The Poisson bracket of q(t) with p probability amplitudes A . The A s are defined in terms
equals the derivative of the final position with respect to of the following process, PA :
the initial position. This derivative reflects the butterfly
effect in classical chaos, i.e., sensitivity to initial condi- (1) Prepare .
tions. The growth of C(t), and the deviation of F (t) from
the TOC terms, provide a quantum generalization of the (2) Measure the eigenbasis, {|jihj|}.
butterfly effect.
Within this simple quantum system, the analog of the (3) Evolve S forward in time under U .
dissipation time may be regarded as td 1 L . The ana-
(4) Measure W.
log of the scrambling time is t 1 L ln
~ . The de-
notes some measure of the accessible phase-space volume. (5) Evolve S backward under U .
Suppose that the phase space is large in units of ~. The
scrambling time is much longer than the dissipation time: (6) Measure V .
t td . Such a parametric separation between the time
scales characterizes the systems that interest us most. (7) Evolve S forward under U .
In more general chaotic systems, the value of t de-
pends on whether the interactions are geometrically local
(8) Measure W.
and on W and V . Consider, as an example, a spin chain
governed by a local Hamiltonian. Suppose that W and Suppose that the measurements yield the outcomes j,
V are local operators that act nontrivially on spins sepa- (w1 , w1 ), (v1 , v1 ), and (w2 , w2 ). Figure 2a illustrates
rated by a distance `. The scrambling time is generically this process. The process corresponds to the probability
proportional to `. For this class of local models, `/t de- amplitude4
fines a velocity vB called the butterfly velocity. Roughly,
the butterfly velocity reflects how quickly initially local A (j; w1 , w1 ; v1 , v1 ; w2 , w2 ) := hw2 , w2 |U |v1 , v1 i

Heisenberg operators grow in space. hv1 , v1 |U |w1 , w1 ihw1 , w1 |U |ji pj . (27)
Consider a system in which td is separated parametri-
cally from t . The rate of change of F (t) [rather, a reg- We do not advocate for performing PA in any exper-
ulated variation on F (t)] was shown to obey a nontrivial iment. PA is used to define A and to interpret A
bound. Parameterize the OTOC as F (t) TOC eL t . physically. Instances of A are combined into A . A
The parameter 1 encodes the separation of scales. weak-measurement protocol can be used to measure A
The exponent L obeys L 2kB T in thermal equi- experimentally. An interference protocol can be used to
librium at temperature T [6]. kB denotes Boltzmanns measure A (and so A ) experimentally.
constant. Black holes in the AdS/CFT duality saturate
this bound, exhibiting maximal chaos [1, 30].
More generally, L and vB control the operators
growth and the spread of chaos. The OTOC has thus at- 4 We order the arguments of A differently than in [37]. Our
tracted attention for a variety of reasons, including (but ordering here parallels our later ordering of the quasiprobabil-
not limited to) the possibilities of nontrivial bounds on itys argument. Weak-measurement experiments motivate the
quantum dynamics, a new probe of quantum chaos, and quasiprobability arguments ordering. This motivation is de-
a signature of black holes in AdS/CFT. tailed in Footnote 6.
9
illustrates this realization. The initial and final measure-

(w1 , w1 ) (w2 , w2 )
ments yield the same outcomes as in the (27) realization.
We multiply the complex conjugate of the second real-

Measure W.
Measure W. izations amplitude by the first realizations probability
amplitude. Then, we sum over j and (w1 , w1 ):5,6
-t
A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 )
U U
U X
Experiment := A (j; w3 , w3 ; v2 , v2 ; w2 , w2 )
time j,(w1 ,w1 )
0
A (j; w1 , w1 ; v1 , v1 ; w2 , w2 ) . (28)
Measure Measure Equation (28) resembles a probability but differs due
Prepare
{|jihj|}. j V . (v1 , v1 )
to the noncommutation of W(t) and V . We illustrate
. this relationship in two ways.
Consider a 1D quantum system, e.g., a particle on a
(a) line. We represent the systems state with a wave func-
tion (x). The probability density at point x equals
(w3 , w3 ) (w2 , w2 ) (x) (x). The A A in Eq. (28) echoes . But
the argument of the equals the argument of the .

Measure W.
Measure W. The argument of the A differs from the argument of the
A , because W(t) and V fail to commute.
-t Substituting into Eq. (28) from Eq. (27) yields
U U U A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 )
Experiment
time
= hw3 , w3 |U |v2 , v2 ihv2 , v2 |U |w2 , w2 i
0
hw2 , w2 |U |v1 , v1 ihv1 , v1 |U |w3 , w3 i . (29)
Measure Measure A simple example illustrates how A nearly equals a
{|jihj|}. j V . (v2 , v2 ) probability. Suppose that an eigenbasis of coincides
Prepare
. with {|v` , v` ihv` , v` |} or with U |w` , w` ihw` , w` |U .
Suppose, for example, that
(b) X
= V := pv` ,v` |v` , v` ihv` , v` | . (30)
FIG. 2: Quantum processes described by the v` ,v`
probability amplitudes A in the out-of-time-ordered One such is the infinite-temperature Gibbs state
correlator (OTOC): These figures, and parts of this
1/d. Another example is easier to prepare: Sup-
caption, appear in [37]. The OTOC quasiprobability A x
pose that S consists of N spins and that V = N .
results from summing products A (.)A (.). Each A (.) x
denotes a probability amplitude [Eq. (27)], so each product One V equals a product of N eigenstates. Let
resembles a probability. But the amplitudes arguments (v2 , v2 ) = (v1 , v1 ). [An analogous argument follows
differthe amplitudes correspond to different quantum from (w3 , w3 ) = (w2 , w2 ).] Equation (29) reduces to
processesbecause the OTOC operators W(t) and V fail to
|hw2 , w2 |U |v1 , v1 i|2 |hw3 , w3 |U |v1 , v1 i|2 pv1 ,v1 .
commute, typically. Figure 2a illustrates the process
described by the A (.); and Fig. 2b, the process described by (31)
the A (.). Time, as measured by a laboratory clock,
increases from left to right. Each
P process begins with the
preparation of the state = j pj |jihj| and a measurement 5 Familiarity with tensors might incline one to sum over the
of the states eigenbasis. Three evolutions (U , U , and U ) (w2 , w2 ) shared by the trajectories. But we are not invok-
then alternate with three measurements of observables (W, ing tensors. More importantly, summing over (w2 , w2 ) intro-

V , and W). Figures 2a and 2b are used to define A , rather duces a v1 v2 v1 v2 that eliminates one (v` , v` ) degree of free-
dom. The resulting quasiprobability would not lie behind the
than showing protocols for measuring A .
OTOC. One could, rather than summing over (w1 , w1 ), sum
over (w3 , w3 ). Either way, one sums over one trajectorys first
W outcome. We sum over (w1 , w ) to maintain consistency
1
with [37].
2. The OTOC quasiprobability A 6 In [37], the left-hand sides arguments are ordered differently and
are condensed into the shorthand (w, v, w , v ). Experiments
motivate our reordering: Consider inferring A (a, b, c, d) from
The quasiprobabilitys definition is constructed as fol- experimental measurements. In each trial, one (loosely speaking)
lows. Consider a realization of PA that yields the out- weakly measures a, then b, then c; and then measures d strongly.
comes j, (w3 , w3 ), (v2 , v2 ), and (w2 , w2 ). Figure 2b As the measurements are ordered, so are the arguments.
10
Each square modulus equals a conditional probability. couplings are represented by constraints like W (w3 v2 )
pv1 ,v1 equals the probability that, if is measured with and W 0 (w2 v1 ) . Suppose that the detectors measure the
respect to {|v` , v` ihv` , v` |}, outcome (v1 , v1 ) obtains. systems Hamiltonian. Subtracting the measurements
In this simple case, some of the quasiprobabil- outcomes yields the work performed during the protocol.
itys valuesthe quasiprobability evaluated on certain The distribution over possible work values is a quasiprob-
argumentsreduce to products of probabilities. Not all ability. Their quasiprobability is a Husimi Q-function,
the quasiprobabilitys values reduce so. All the values of whereas the OTOC quasiprobability is a KD distribu-
the quasiprobability behind time-ordered correlators can tion [108]. Related frameworks appear in [109111]. The
reduce to probabilities, we show in Sec. V D. relationship between those thermodynamics frameworks
and our thermodynamically motivated OTOC framework
merits exploration.
3. Complex distribution P (W, W 0 )
A is summed, in [37], to form a complex distribu- 4. Weak-measurement and interference schemes for
tion P (W, W 0 ). Let W := w3 v2 and W 0 := w2 v1 inferring A
denote random variables calculable from measurement
outcomes. If W and V are Paulis, (W, W 0 ) can equal
(1, 1), (1, 1), (1, 1), or (1, 1). Consider fixing the A can be inferred from weak measurements and from
value of (W, W 0 ). For example, let (W, W 0 ) = (1, 1). interference. We focus mostly on weak measurements.
Consider the octuples (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) First, we briefly review the interference scheme.
that satisfy the constraints W = w3 v2 and W 0 = w2 v1 . The interference scheme in [37] differs from other in-
Each octuple corresponds to a quasiprobability value terference schemes for measuring F (t) [4143]: From
A (.). Summing these quasiprobability values yields the [37] interference scheme, one can infer not only F (t),
X but also A . Time need not be inverted (H need not
P (W, W 0 ) := (32) be negated) in any trial. The scheme is detailed in
(v1 ,v1 ),(w2 ,w2 ),(v2 ,v2 ),(w3 ,w3 ) Appendix B of [37]. The system is coupled to an an-
cilla prepared in a superposition 12 (|0i + |1i). A uni-
A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) W (w3 v2 ) W 0 (w2 v1 ) .
tary, conditioned on the ancilla, rotates the systems
The Kronecker delta is represented by ab . P (W, W 0 ) state. The ancilla and system are measured projectively.
functions analogously to the probability distribution, in From many trials measurement data, one infers ha|U|bi,
the fluctuation-relation paper [38], over values of ther- wherein U = U or U and a, b = (w` , w` ), (vm , vm ).
modynamic work. These inner products are multiplied together to form
The OTOC equals a moment of P (W, W 0 ) [Eq. (1)], A [Eq. (29)]. If shares neither the V nor the W(t)

which equals a constrained sum over A [37]. Hence eigenbasis, quantum-state tomography is needed to infer
our labeling of A as the quasiprobability behind the hv1 , v1 |U |w3 , w3 i.
OTOC. Equation (32) expresses the useful, difficult-to- The weak-measurement scheme is introduced in Sec.
measure F (t) in terms of a characteristic function of a II B 3 of [37]. A simple case, in which = 1/d, is
(summed) quasiprobability, as Jarzynski [38] expresses detailed in Appendix A of [37]. Recent weak measure-
a useful, difficult-to-measure free-energy difference F ments [1013, 3135], some used to infer KD distribu-
in terms of a characteristic function of a probability. tions, inspired our weak A -measurement proposal. We
Quasiprobabilities reflect nonclassicality (contextuality) review weak measurements, a Kraus-operator model for
as probabilities do not; so, too, does F (t) reflect nonclas- measurements, and the A -measurement scheme.
sicality (noncommutation) as F does not. Review of weak measurements: Measurements can al-
The definition of P involves arbitrariness: The measur- ter quantum systems states. A weak measurement
able random variables, and P , may be defined differently. barely disturbs the measured systems state. In ex-
We use alternative definitions in Sec. V E. Those alter- change, the measurement provides little information
natives facilitate the construction of OTOCs that encode about the system. Yet one can infer much by performing
more time reversals. All possible definitions share two many trials and processing the outcome statistics.
properties: (i) The arguments W , etc. denote random Extreme disturbances result from strong measure-
variables inferable from measurement outcomes. (ii) P ments [112]. The measured systems state collapses onto
results from summing A (.) values subject to constraints a subspace.PFor example, let denote the initial state.
ab . Let A = a a|aiha| denote the measured observables
P (W, W 0 ) resembles a work distribution constructed eigendecomposition. A strong measurement has a prob-
by Solinas and Gasparinetti (S&G) [105, 106]. They ability ha||ai of projecting onto |ai.
study fluctuation-relation contexts, rather than the One canPimplement a measurement with an ancilla.
OTOC. S&G propose a definition for the work performed Let X = x x|xihx| denote an ancilla observable. One
on a quantum system [107, 108]. The system is coupled correlates A with X via an interaction unitary. Von Neu-
weakly to detectors at a protocols start and end. The mann modeled such unitaries with Vint := eig AX [14,
11
113]. The parameter g signifies the interaction strength.7 under 1) and a probability |g(x)|2 of projecting onto
An ancilla observablee.g., Xis measured strongly. |v` , v` ihv` , v` |.
The greater the g, the stronger the correlation between Weak-measurement scheme for inferring the OTOC
A and X. A is measured strongly if it is correlated with quasiprobability A : Weak measurements have been used
X maximally, if a one-to-one mapping interrelates the to measure KD quasiprobabilities [1013, 31, 32, 34, 35].
xs and the as. Suppose that the X measurement yields These experiments techniques can be applied to infer A
x. We say that an A measurement has yielded some and, from A , the OTOC. Our scheme involves three se-
outcome ax . quential weak measurements per trial (if is arbitrary)
Suppose that g is small. A is correlated imperfectly or two [if shares the V or the W(t)
eigenbasis, e.g., if
with X. The X-measurement outcome, x, provides in- = 1/d]. The weak measurements alternate with time
complete information about A. The value most reason- evolutions and precede a strong measurement.
ably attributable to A remains ax . But a subsequent We review the general and simple-case protocols. A
measurement of A would not necessarily yield ax . In projection trick, introduced in Sec. II A, reduces expo-
exchange for forfeiting information about A, we barely nentially the number of trials required to infer about A
disturb the systems initial state. We can learn more and F (t). The weak-measurement and interference pro-
about A by measuring A weakly in each of many trials, tocols are analyzed in Sec. II B. A circuit for implement-
then processing measurement statistics. ing the weak-measurement scheme appears in Sec. II C.
Kraus-operator model for measurement: Kraus oper- Suppose that does not share the V or the W(t)
eigen-
ators [112] model the system-of-interest evolution in- basis. One implements the following protocol, P:
duced by a weak P measurement. Let us choose P for VA
to equal V = v` ,v v |v
` ` , v` ihv ` , v` | = v ` v ` v ` . (1) Prepare .
`
Vv` projects onto the v` eigenspace. Let denote the
(2) Measure V weakly. (Couple the systems V weakly
systems initial state, and let |Di denote the detectors
to some observable X of a clean ancilla. Measure
initial state.
X strongly.)
Suppose that the X measurement yields x. The sys-
tems state evolves under the Kraus operator (3) Evolve the system forward in time under U .
Mx = hx|Vint |Di (33) weakly. (Couple the systems W
(4) Measure W
= hx|Di 1 weakly to some observable Y of a clean ancilla.

+ hx| eigX 1 |Di |v` , v` ihv` , v` | (34) Measure Y strongly.)
Mx Mx (5) Evolve the system backward under U .

as 7 Tr(Mx Mx )
. The second equation follows from
Taylor-expanding the exponential, then replacing the (6) Measure V weakly. (Couple the systems V weakly
projectors square with
p the projector.8 We relabel to some observable Z of a clean ancilla. Measure
the coefficients as p(x) := hx|Di ei and g(x) := Z strongly.)
hx| eigX 1 |Di ei . An unimportant global phase is
(7) Evolve the system forward under U .
denoted by ei . To remove this phase, we redefine Mx as
p strongly.
(8) Measure W
Mx = p(x) 1 + g(x) |v` , v` ihv` , v` | . (36)
X, Y , and Z do not necessarily denote Pauli operators.
The coefficients have the following significances. Sup-
Each trial yields three ancilla eigenvalues (x, y, and z)
pose that the ancilla did not couple to the system. The eigenvalue (w3 , w ). One implements P
X measurement would have a baseline probability p(x) and one W 3
of outputting x. The dimensionless parameter g(x) C many times. From the measurement statistics, one in-
is derived from g. We can roughly interpret Mx statis- fers the probability Pweak (x; y; z; w3 , w3 ) that any given
tically: In any given trial, the coupling has a probabil- trial will yield the outcome quadruple (x; y; z; w3 , w3 ).
ity p(x) of failing to disturb the system (of evolving From this probability, one infers the quasiprobability
A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ). The probability has
the form
7 A and X are dimensionless: To form them, we multiply dimen- Pweak (x; y; z; w3 , w3 ) = hw3 , w3 |U Mz U My U Mx
sionful observables by natural scales of the subsystems. These
scales are incorporated into g. Mx U My U Mz U |w3 , w3 i . (37)
8 Experimentalists might prefer measuring Pauli operators to
measuring projectors. Measuring Paulis can suffice, as discussed We integrate over x, y, and z, to take advantage of
in Sec. II. Paulis square to the identity, rather than to themselves: all measurement statistics. We substitute in for the
( )2 = 1, for = x, y, z. Equation (34) becomes
Kraus operators from Eq. (36), then multiply out. Two
g X) |Di 1 ihx| sin (
hx| cos ( g X) |Di . (35)
terms combine into <(A (.)). The other terms form in-
dependently measurable background terms. To infer
12
=(A (.)), one performs P many more times, using differ- one, on interference. We simplify, evaluate, and augment
ent couplings. Details appear in Appendix A of [37]. these schemes.
W and V are local. Their degeneracies therefore scale First, we introduce a projection trick: Sum-
with the system size. If S consists of N spin- 12 de- ming over degeneracies turns one-dimensional projectors
grees of freedom, |w` |, |v` | 2N . Exponentially many (e.g., |w` , w` ihw` , w` |) into projectors onto degener-
A (.) values must be inferred. Exponentially many tri- ate eigenspaces (e.g., W w` ). The coarse-grained OTOC
als must be performed. We sidestep this exponential- quasiprobability A results. This trick decreases ex-
ity in Sec. II A: One measures eigenprojectors of the ponentially the number of trials required to infer the
degenerate W and V , rather than of the nondegener- OTOC from weak measurements.9 Section II B concerns
ate W and V . The one-dimensional |v` , v ihv` , v | of pros and cons of the weak-measurement and interference
` `
Eq. (34) is replaced with Vv` . From the weak mea- schemes for measuring A and F (t). We also compare
surements,
P one infers the coarse-grained quasiprobabil- those schemes with alternative schemes for measuring
ity degeneracies A (.) =: A (.). Summing A (.) values F (t). Section II C illustrates a circuit for implementing
yields the OTOC. the weak-measurement scheme. Section II D shows how
Suppose that shares the V or the W(t) eigenba- to infer A not only from the measurement schemes in
sis. The number of weak measurements reduces to two. Sec. I D 4, but also with alternative OTOC-measurement
For example, suppose that is the infinite-temperature proposals (e.g., [41]) (if the eigenvalues of W and V are
Gibbs state 1/d. The protocol P becomes 1).
eigenstate |w3 , w i.
(1) Prepare a W 3
(2) Evolve the system backward under U .

A. The coarse-grained OTOC quasiprobability A
(3) Measure V weakly. and a projection trick
W and V are local. They manifest, in our spin-chain
weakly.
(5) Measure W example, as one-qubit Paulis that nontrivially trans-
form opposite ends of the chain. The operators de-
(6) Evolve the system backward under U .
generacies grows exponentially with the system size N :
(7) Measure V strongly. |w` |, |vm | 2N . Hence the number of A (.) values
grows exponentially. One must measure exponentially
In many recent experiments, only one weak measure- many numbers to calculate F (t) precisely via A . We cir-
ment is performed per trial [10, 12, 31]. A probabil- cumvent this inconvenience by summing over the degen-
ity Pweak must be approximated to first order in the eracies in A (.), forming the coarse-grained quasiproba-
coupling constant g(x). Measuring A requires two or bility A (.). A (.) can be measured in numerical simu-
three weak measurements per trial. We must approx- lations, experimentally via weak measurements, and (if
imate Pweak to second or third order. The more weak the eigenvalues of W and V are 1) experimentally with
measurements performed sequentially, the more demand- other F (t)-measurement set-ups (e.g., [41]).
ing the experiment. Yet sequential weak measurements The coarse-grained OTOC quasiprobability results
have been performed recently [3234]. The experimental- from marginalizing A (.) over its degeneracies:
ists aimed to reconstruct density matrices and to measure
non-Hermitian operators. The OTOC measurement pro- X
vides new applications for their techniques. A (v1 , w2 , v2 , w3 ) :=
v1 ,w2 ,v2 ,w3
II. EXPERIMENTALLY MEASURING A AND

A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) . (38)
THE COARSE-GRAINED A
Equation (38) reduces to a more practical form. Con-
Multiple reasons motivate measurements of the OTOC sider substituting into Eq. (38) for A (.) from Eq. (29).
quasiprobability A . A is more fundamental than the The right-hand side of Eq. (29) equals a trace. Due to
OTOC F (t), F (t) results from combining values of A . the traces cyclicality, the three rightmost factors can be
A exhibits behaviors not immediately visible in F (t), as
shown in Sections III and IV. A therefore holds interest
in its own right. Additionally, A suggests new schemes
for measuring the OTOC. One measures the possible val- 9 The summation preserves interesting properties of the
ues of A (.), then combines the values to form F (t). Two quasiprobabilitynonclassical negativity and nonreality, as well
measurement schemes are detailed in [37] and reviewed as intrinsic time scales. We confirm this preservation via numer-
in Sec. I D 4. One scheme relies on weak measurements; ical simulation in Sec. III.
13
shifted leftward: Nor does A equal an expectation value. Expectation

X values have the form Tr(A), wherein A denotes a Hermi-
A (v1 , w2 , v2 , w3 ) = Tr U |w3 , w3 ihw3 , w3 |U tian operator. The operator rightward of the in Eq. (44)
v1 ,w2 , is not Hermitian. Hence A lacks two symmetries of fa-
v2 ,w3
miliar quantum objects: the symmetric conjugation in
|v2 , v2 ihv2 , v2 |U |w2 , w2 ihw2 , w2 |U |v1 , v1 ihv1 , v1 | . Borns Rule and the invariance, under Hermitian conju-
gation, of the observable A in an expectation value.
(39)
The right-hand side of Eq. (44) can be measured in
The sums are distributed throughout the trace: numerical simulations, experimentally via weak mea-
surements, and (if the eigenvalues of W and V are
" #
X 1) experimentally via alternative OTOC-measurement

A (v1 , w2 , v2 , w3 ) = Tr U
|w3 , w3 ihw3 , w3 |U schemes. We present numerical measurements in Sec. III.
w3 The weak-measurement scheme follows from Appendix A
" #" # of [37], reviewed in Sec. I D 4: Section I D 4 fea-
X X
|v2 , v2 ihv2 , v2 | U |w2 , w2 ihw2 , w2 |U tures projectors onto one-dimensional eigenspaces, e.g.,
v2 w2 |v1 , v1 ihv1 , v1 |. Those projectors are replaced with s
" #! onto higher-dimensional eigenspaces. Section II D de-
X tails how A can be inferred from alternative OTOC-
|v1 , v1 ihv1 , v1 | . (40)
measurement schemes.
v1
Define
X B. Analysis of the quasiprobability-measurement
W
w` := |w` , w` ihw` , w` | (41) schemes and comparison with other
w` OTOC-measurement schemes
as the projector onto the w` eigenspace of W, Section I D 4 reviews two schemes for inferring A : a
W(t) weak-measurement scheme and an interference scheme.
w`
:= U W
w` U (42) From A measurements, one can infer the OTOC F (t).
We evaluate our schemes pros and cons. Alternative
as the projector onto the w` eigenspace of W(t), and
schemes for measuring F (t) have been proposed [39, 41
X 46], and two schemes have been realized [47, 48]. We
Vv` := |v` , v` ihv` , v` | (43)
compare our schemes with alternatives, as summarized
v`
in Table I. For specificity, we focus on [41, 42, 44].
as the projector onto the v` eigenspace of V . Substituting The weak-measurement scheme augments the set
into Eq. (40) yields of techniques and platforms with which F (t) can be
measured. Alternative schemes rely on interferom-
etry [4143], controlled unitaries [41, 44], ultracold-
A (v1 , w2 , v2 , w3 ) = Tr w
W(t) V
3
W(t) V
v2 w 2 v1 . (44) atoms tools [43, 45, 46], and strong two-point measure-
ments [39]. Weak measurements, we have shown, belong
Asymmetry distinguishes Eq. (44) from Borns Rule in the OTOC-measurement toolkit. Such weak measure-
and from expectation values. The traces cyclicality im- ments are expected to be realizable, in the immediate
plies that future, with superconducting qubits [35, 5055], trapped
ions [5662], cavity QED [64, 65], ultracold atoms [63],

A (v1 , w2 , v2 , w3 ) = Tr W(t) V W(t) V and perhaps NMR [66, 67]. Circuits for weakly measuring
w3 v2 w2 v1 . (45)
qubit systems have been designed [36, 50]. Initial proof-
of-principle experiments might not require direct access
Imagine preparing , measuring V strongly, evolving S
to the qubits: The five superconducting qubits available
forward under U , measuring W strongly, evolving S back-
from IBM, via the cloud, might suffice [115]. Random
ward under U , measuring V strongly, evolving S forward
two-qubit unitaries could simulate chaotic Hamiltonian
under U , and measuring W. The probability of obtaining
evolution.
the outcomes v1 , w2 , v2 , and w3 , in that order, is
In many weak-measurement experiments, just one
weak measurement is performed per trial [1013]. Yet
Tr W(t) V W(t) V V W(t) V
w3 v2 w2 v1 v1 w2 v2 w3
W(t)
. (46) two weak measurements have recently been performed
sequentially [3234]. Experimentalists aimed to directly
W(t) W(t)
The operator w3 Vv2 w2 Vv1 conjugates symmet- measure general quantum states [11] and to infer about
rically. This operator multiplies asymmetrically in non-Hermitian observable-like operators. The OTOC
Eq. (45). Hence A does not obviously equal a proba- motivates a new application of recently realized sequen-
bility. tial weak measurements.
14
Yunger Halpern/ Yunger Halpern Swingle Yao Zhu

our weak meas. interferometry et al. et al. et al.
Key tools Weak Interference Interference, Ramsey interfer., Quantum
measurement Lochschmidt echo Renyi-entropy meas. clock

Whats inferable (1) F (t), A , F (K ) (t), AK
, <(F (t)) Regulated F (t)
from the mea- & or & K or |F (t)|2 correlator
surement? (2) F (t) & A Freg (t)
Generality Arbitrary Arbitrary Arbitrary Thermal: Arbitrary
of D(H) D(H) D(H) eH/T /Z D(H)
Ancilla Yes Yes Yes for <(F (t)), Yes Yes
needed? no for |F (t)|2
Ancilla coup- No Yes No No Yes
ling global?
How long must 1 weak Whole Whole Whole Whole
ancilla stay measurement protocol protocol protocol protocol
coherent?
# time 2 0 1 0 2
reversals
# copies of 1 1 1 2 1
needed / trial
Signal-to- To be deter- To be deter- Constant eN Constant
noise ratio mined [114] mined [114] in N in N
TABLE I: Comparison of our measurement schemes with alternatives: This paper focuses on the
weak-measurement and interference schemes for measuring the OTOC quasiprobability A or the coarse-grained
quasiprobability A . From A or A , one can infer the OTOC F (t). These schemes appear in [37], are reviewed in Sec. I D 4,
and are assessed in Sec. II B. We compare our schemes with the OTOC-measurement schemes in [41, 42, 44]. More
OTOC-measurement schemes appear in [39, 43, 4548]. Each row corresponds to a desirable quantity or to a resource
potentially challenging to realize experimentally. The regulated correlator Freg (t) [Eq. (104)] is expected to behave similarly
to F (t) [6, 42]. D(H) denotes the set of density operators defined on the Hilbert space H. denotes the initially prepared
state. Target states target are never prepared perfectly; may differ from target . Experimentalists can reconstruct by

trivially processing data taken to infer A [37] (Sec. V B). F (K ) (t) denotes the K-fold OTOC, which encodes K = 2K 1
(K )
time reversals. The conventional OTOC corresponds to K = 3. The quasiprobability behind F (K ) (t) is A (Sec. V E). N
denotes the system size, e.g., the number of qubits. The Swingle et al. and Zhu et al. schemes have constant signal-to-noise
ratios (SNRs) in the absence of environmental decoherence. The Yao et al. schemes SNR varies inverse-exponentially with
the systems entanglement entropy, SvN . The system occupies a thermal state eH/T /Z, so SvN log(2N ) = N .
Our schemes furnish not only the OTOC F (t), but also (C) The form of the state prepared. Suppose
more information: that we wish to evaluate F (t) on a target state
target . target might be difficult to prepare,
(1) From the weak-measurement scheme in [37], we can e.g., might be thermal. The prepared state
infer the following: approximates target . Consider performing the
weak-measurement protocol P with . One
(A) The OTOC quasiprobability A . The infers A . Summing A (.) values yields the
quasiprobability is more fundamental than form of . We can assess the preparations ac-
F (t), as combining A (.) values yields F (t): curacy without performing tomography inde-
X pendently. Whether this assessment meets ex-
F (t) = v1 w2 v2 w3 (47) perimentalists requirements for precision re-
(v1 ,v1 ),(w2 ,w2 ), mains to be seen. Details appear in Sec. V C.
(v2 ,v2 ),(w3 ,w3 )
A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) . (2) The weak-measurement protocol P is simplified

later in this section. Upon implementing the sim-
Equation (47) follows from Eq. (1). plified protocol, we can infer the following informa-
(B) The OTOC F (t). tion:
15
(A) The coarse-grained OTOC quasiprobability protocol, however, is short: Time need not be reversed
A . Though less fundamental than the fine- in any trial. Each trial features exactly one U or U , not
grained A , A implies the OTOCs form: both. Time can be difficult to reverse in some platforms,
for two reasons. Suppose that a Hamiltonian H gener-
X ates a forward evolution. A perturbation might lead
F (t) = v1 w2 v2 w3 A (v1 , w2 , v2 , w3 ) .
v1 ,w2 ,v2 ,w3
(H + ) to generate the reverse evolution. Perturba-
(48) tions can mar long-time measurements of F (t) [44]. Sec-
ond, systems interact with environments. Decoherence
might not be completely reversible [41]. Hence the lack
(B) The OTOC F (t).
of a need for time reversal, as in our interference scheme
(3) Upon implementing the interferometry scheme and in [42, 44], has been regarded as an advantage.
in [37], we can infer the following information: Unlike our interference scheme, the weak-measurement
scheme requires that time be reversed. Perturbations
(A) The OTOC quasiprobability A . threaten the weak-measurement scheme as they threaten
(B) The OTOC F (t). the Swingle et al. scheme [41]. s might threaten
the weak-measurement scheme more, because time is in-
(C) The form of the state prepared. verted twice in our scheme. Time is inverted only once

(D) All the K-fold OTOCs F (K ) (t), which gen- in [41]. However, our error might be expected to have
eralize the OTOC F (t). F (t) encodes three roughly the size of the Swingle et al. schemes error [114].

time reversals. F (K ) (t) encodes K = 2K Furthermore, tools for mitigating the Swingle et al.
1 = 3, 5, . . . time reversals. Details appear in schemes inversion error are being investigated [114]. Re-
Sec. V E. silience of the Swingle et al. scheme to decoherence
(K ) has been analyzed [41]. These tools may be applied to
(E) The quasiprobability A behind F (K ) (t), the weak-measurement scheme [114]. Like resilience, our
for all K (Sec. V E). schemes signal-to-noise ratios require further study.
We have delineated the information inferable from the As noted earlier, as the system size N grows, the num-
weak-measurement and interference schemes for measur- ber of trials required to infer A grows exponentially. So
ing A and F (t). Let us turn to other pros and cons. does the number of ancillas required to infer A : Measur-
The weak-measurement schemes ancillas need not cou- ing a degeneracy parameter w` or vm requires a mea-
ple to the whole system. One measures a system weakly surement of each spin. Yet the number of trials, and
by coupling an ancilla to the system, then measuring the number of ancillas, required to measure the coarse-
the ancilla strongly. Our weak-measurement protocol re- grained A remains constant as N grows. One can in-
quires one ancilla per weak measurement. Let us focus, fer A from weak measurements and, alternatively, from
for concreteness, on an A measurement for a general . other F (t)-measurement schemes (Sec. II D). A is less
The protocol involves three weak measurements and so fundamental than A , as A results from coarse-graining
three ancillas. Suppose that W and V manifest as one-
A . A , however, exhibits nonclassicality and OTOC
qubit Paulis localized at opposite ends of a spin chain.
Each ancilla need interact with only one site (Fig. 3). In time scales (Sec. III). Measuring A can balance the de-
contrast, the ancilla in [44] couples to the entire system. sire for fundamental knowledge with practicalities.
So does the ancilla in our interference scheme for mea- The weak-measurement scheme for inferring A can
suring A . Global couplings can be engineered in some be rendered more convenient. Section II A describes mea-
platforms, though other platforms pose challenges. Like surements of projectors . Experimentalists might prefer
our weak-measurement scheme, [41] and [42] require only measuring Pauli operators . Measuring Paulis suffices
local ancilla couplings. for inferring a multiqubit systems A : The relevant
In the weak-measurement protocol, each ancillas projects onto an eigenspace of a . Measuring the
state must remain coherent during only one weak yields 1. These possible outcomes map bijectively onto
measurementduring the action of one (composite) gate the possible -measurement outcomes. See Footnote 8
in a circuit. The first ancilla may be erased, then reused for mathematics.
in the third weak measurement. In contrast, each ancilla Our weak-measurement and interference schemes offer
in [41, 42, 44] remains in use throughout the protocol. the advantage of involving general operators. W and V
The Swingle et al. scheme for measuring <(F (t)), too, must be Hermitian or unitary, not necessarily one or the
requires an ancilla that remains coherent throughout the other. Suppose that W and V are unitary. Hermitian
protocol [41]. The longer an ancillas active-duty time, operators GW and GV generate W and V , as discussed
the more likely the ancillas state is to decohere. Like the in Sec. I B. GW and GV may be measured in place of W
weak-measurement sheme, the Swingle et al. scheme for and V . This flexibility expands upon the measurement
measuring |F (t)|2 requires no ancilla [41]. opportunities of, e.g., [41, 42, 44], which require unitary
Also in the interference scheme for measuring A [37], operators.
an ancilla remains active throughout the protocol. That Our weak-measurement and interference schemes of-
16
fer leeway in choosing not only W and V , but also . Ux depends on the system-detector coupling and on the
The state can assume any form D(H). In contrast, detector-measurement outcome.
infinite-temperature Gibbs states = 1/d were used The imbalance |p q| can be tuned experimentally.
in [47, 48]. Thermality of is assumed in [42]. Com- Our scheme has no need for a nonzero imbalance. We
mutation of with V is assumed in [39]. If shares assume that p equals q.
a V eigenbasis or the W(t) eigenbasis, e.g., if = 1/d,
our weak-measurement protocol simplifies from requiring
three sequential weak measurements to requiring two. 2. Notation
Let := x x + y y + z z
denote a vector of one-
C. Circuit for inferring A from weak qubit Pauli operators. The z basis serves as the com-
measurements putational basis in [116]. We will exchange the z basis
with the W eigenbasis, or with the V eigenbasis, in each
Consider a 1D chain S of N qubits. A circuit weak-measurement subcircuit.
implements the weak-measurement scheme reviewed in In our spin-chain example, W and V denote one-qubit
Sec. I D 4. We exhibit a circuit for measuring A . One Pauli operators localized on opposite ends of the chain
subcircuit implements each weak measurement. These S: W = W 1(N 1) , and V = 1(N 1) V . Unit
subcircuits result from augmenting Fig. 1 of [116]. V R3 are chosen such that n := n,
vectors W, for
Dressel et al. use the partial-projection formalism, n = W, V .
which we review first. We introduce notation, then re- The one-qubit Paulis eigendecompose as W =
view the weak-measurement subcircuit of [116]. Copies |+Wih+W| |WihW| and V = |+V ih+V |
of the subcircuit are embedded into our A -measurement |V ihV |. The whole-system operators eigendecompose
circuit. as W = W W V V
+ and V = + . A rotation op-
erator Rn maps the eigenstates to the n eigenstates:
z
Rn |+zi = |+ni, and Rn |zi = |ni.

1. Partial-projection operators We model weak W measurements with the partial-
projection operators
Partial-projection operators update a state after a W p
D+ := pW W + + 1 pW W and (49)
measurement that may provide incomplete information. p
W
Suppose that S begins in a state |i. Consider perform- D := 1 pW W + + pW W
. (50)
ing a measurement that could output + or . Let + and The V partial-projection operators are defined analo-
denote the projectors onto the + and eigenspaces. gously:
Parameters p, q [0, 1] quantify the correlation between p
V
the outcome and the premeasurement state. If |i is a + D+ := pV V+ + 1 pV V and (51)
eigenstate, the measurement has a probability p of out- V
p V V
D := 1 pV + + pV . (52)
putting +. If |i is a eigenstate, the measurement has
a probability q of outputting .
Suppose that outcome + obtains. We update |i 3. Weak-measurement subcircuit

using the partial-projection operator D+ := p + +
|i
1 q : |i 7 ||DD++|i|| 2 . If the measurement yields Figure 3a depicts a subcircuit for measuring n = W or

, we update |i with D := 1 p + + q . V weakly. To simplify notation, we relabel pn as p. Most
The measurement is strong if (p, q) = (0, 1) or (1, 0). of the subcircuit appears in Fig. 1 of [116]. We set the
D+ and D reduce to projectors. The measurement col- imbalance parameter to 0. We sandwich Fig. 1 of [116]
lapses |i onto an eigenspace. The measurement is weak between two one-qubit unitaries. The sandwiching inter-
if p and q lie close to 12 : D lies close to the normalized changes the computational basis with the n eigenbasis.
identity, 1d . Operating on |i with 1 barely changes the The subcircuit implements the following algorithm:
state. The measurement provides hardly any informa- (1) Rotate the n eigenbasis into the z eigenbasis, using
tion. Rn .
We modeled measurements with Kraus operators Mx
in Sec. I D 4. The polar decomposition of Mx [117] (2) Prepare an ancilla in a fiducial state |0i |+zi.
is a partial-projection operator. Consider measuring a
(3) Entangle S with the ancilla via a Z-controlled-Y :
qubits z . Recall that X denotes a detector observ-
If S is in state |0i, rotate the ancillas state coun-
able. Suppose that, if an X measurement yields x, a
z terclockwise (CCW) through a small angle 2
subsequent measurement of the spins p most likely about the y-axis. Let Ry () denote the one-qubit
yields +. The Kraus operator Mx = p(x) 1 + g(x) +
unitary that implements this rotation. If S is in
updates the q systems state. Mx is related to D+ by state |1i, rotate the ancillas state CCW through
D+ = Ux Mx Mx for some unitary Ux . The form of an angle , with Ry ().
17
through a small angle precisely can pose challenges [35].
4. Full circuit for weak-measurement scheme
Figure 3b shows the circuit for measuring A . The

full circuit contains three weak-measurement subcircuits.
Each ancilla serves in only one subcircuit. No ancilla
need remain coherent throughout the protocol, as dis-
cussed in Sec. II B. The ancilla used in the first V mea-
surement can be recycled for the final V measurement.
(a) The circuit simplifies in a special case. Suppose that
shares an eigenbasis with V or with W(t), e.g., = 1/d.
Only two weak measurements are needed, as discussed in
Sec. I D 4.
We can augment the circuit to measure A , rather
than A : During each weak measurement, every qubit
will be measured. The qubits can be measured individ-
ually: The N -qubit measurement can be a product of
local measurements. Consider, for concreteness, the first
weak measurement. Measuring just qubit N would yield
an eigenvalue v1 of V . We would infer whether qubit
N pointed upward or downward along the V axis. Mea-
suring all the qubits would yield a degeneracy parameter
v1 . We could define v` as encoding the V -components
(b)
of the other N 1 qubits angular momenta.
FIG. 3: Quantum circuit for inferring the
coarse-grained OTOC quasiprobability A from
weak measurements: We consider a system of N qubits D. How to infer A from other
prepared in a state . The local operators OTOC-measurement schemes
W = W 1(N 1) and V = 1(N 1) V manifest as
one-qubit Paulis. Weak measurements can be used to infer
the coarse-grained quasiprobability A . Combining values of F (t) can be inferred, we have seen, from the quasiprob-
A yields the OTOC F (t). Figure 3a depicts a subcircuit ability A and from the coarse-grained A . A can be
used to implement a weak measurement of n = W or V . An inferred from F (t)-measurement schemes, we show, if the

ancilla is prepared in a fiducial state |0i. A unitary Rn eigenvalues of W and V equal 1. We assume, through-
n z
rotates the qubits eigenbasis into its eigenbasis. out this section, that they do. The eigenvalues equal 1
Ry () rotates the ancillas state counterclockwise about if W and V are Pauli operators.
the y-axis through a small angle , controlled by the
systems z . The angles smallness guarantees the
The projectors (41) and (43) can be expressed as
measurements weakness. Rn rotates the systems z
eigenbasis back into the n eigenbasis. The ancillas z is 1 1
W
w` = (1 + w` W) and Vv` = (1 + v` V ) . (53)
measured strongly. The outcome, + or , dictates which 2 2
n
partial-projection operator D updates the state. Figure 3b
shows the circuit used to measure A . Three weak Consider substituting from Eqs. (53) into Eq. (44). Mul-
measurements, interspersed with three time evolutions (U , tiplying out yields sixteen terms. If h.i := Tr( .),
U , and U ), precede a strong measurement. Suppose that
the initial state, , commutes with W or V , e.g., = 1/d. 1h
Figure 3b requires only two weak measurements. A (v1 , w2 , v2 , w3 ) = 1 + (w2 + w3 ) hW(t)i
16

+ (v1 + v2 ) hV i + w2 w3 W 2 (t) + v1 v2 V 2
(4) Measure the ancillas z . If the measurement yields + (w2 v1 + w3 v1 + w3 v2 ) hW(t)V i + w2 v2 hV W(t)i

outcome +, D+ updates the systems state; and if + w2 w3 v1 W 2 (t)V + w3 v1 v2 W(t)V 2
, then D .
+ w2 w3 v2 hW(t)V W(t)i + w2 v1 v2 hV W(t)V i
z i
(5) Rotate the eigenbasis into the n eigenbasis, using + w2 w3 v1 v2 F (t) . (54)
Rn .
The measurement is weak because is small. Rotating If W(t) and V are unitary, they square to 1. Equa-
18
tion (54) simplifies to pected, the quasiprobabilitys qualitative behavior is sen-

sitive primarily to whether H is integrable, as well as to
1n the initial states form. We study two sets of parameters,
A (v1 , w2 , v2 , w3 ) = (1 + w2 w3 + v1 v2 )
16
+ [w2 + w3 (1 + v1 v2 )] hW(t)i + [v1 (1 + w2 w3 ) + v2 ] hV i Integrable: h = 0, g = 1.05 and
+ (w2 v1 + w3 v1 + w3 v2 ) hW(t)V i + w2 v2 hV W(t)i Nonintegrable: h = .5, g = 1.05 . (57)
+ w2 w3 v2 hW(t)V W(t)i + w2 v1 v2 hV W(t)V i We study several classes of initial states , including ther-
o
+ w2 w3 v1 v2 F (t) . (55) mal states, random pure states, and product states.
For W and V , we choose single-Pauli operators that
The first term is constant. The next two terms act nontrivially on just the chains ends. We illustrate
are single-observable expectation values. The next two with W = 1x or W = 1z and V = N x
or Nz
. These
terms are two-point correlation functions. hV W(t)V i operators are unitary and Hermitian. They square to
and hW(t)V W(t)i are time-ordered correlation functions. the identity, enabling us to use Eq. (55). We calculate
F (t) is the OTOC. F (t) is the most difficult to measure. the coarse-grained quasiprobability directly:
If one can measure it, one likely has the tools to infer
A . One can measure every term, for example, using the A (v1 , w2 , v2 , w3 ) = Tr W(t)
w3 V
W(t) V
v2 w2 v1 . (58)
set-up in [41]
1
For a Pauli operator O, O
a = 2 (1+aO) projects onto the
a {1, 1} eigenspace. We also compare the quasiprob-
III. NUMERICAL SIMULATIONS ability with the OTOC,
X
We now study the OTOC quasiprobabilitys physical F (t) = v1 w2 v2 w3 A (v1 , w2 , v2 , w3 ) (59)
v1 ,w2 ,v2 ,w3
content in two simple models. In this section, we study a
geometrically local 1D model, an Ising chain with trans-
[Eq. (48)].
verse and longitudinal fields. In Sec. IV, we study a geo-
F (t) deviates from one at roughly the time needed for
metrically nonlocal model known as the Brownian-circuit
information to propagate from one end of the chain to
model. This model effectively has a time-dependent
the other. This onset time lies approximately between
Hamiltonian.
t = 4 and t = 6, according to our the data. The systems
We compare the physics of A with that of the
length and the butterfly velocity vB set the onset time
OTOC. The time scales inherent in A , as compared to (Sec. I C). Every term in the Hamiltonian (56) is order-
the OTOCs time scales, particularly interest us. We one. Hence vB is expected to be order-one, too. In light of
study also nonclassical behaviorsnegative and nonreal our spin chains length, the data below are all consistent
valuesof A . Finally, we find a parallel with classical with a vB of approximately two.
chaos: The onset of scrambling breaks a symmetry. This
breaking manifests in bifurcations of A , reminiscent of
pitchfork diagrams. A. Thermal states
The Ising chain is defined on a Hilbert space of N
spin- 12 degrees of freedom. The total Hilbert space has We consider first thermal states eH/T . Data
dimensionality d = 2N . The single-site Pauli matrices are for the infinite-temperature (T = ) state, with W =
labeled {ix , iy , iz }, for i = 1, ..., N . The Hamiltonian is 1z , V = Nz
, and nonintegrable parameters, appear in
Figures 4, 5, and 6. The legend is labeled such that abcd
N
X 1 N
X N
X corresponds to w3 = (1)a , v2 = (1)b , w2 = (1)c , and
H = J iz i+1
z
h iz g ix . (56)
v1 = (1)d . This labelling corresponds to the order in
i=1 i=1 i=1
which the operators appear in Eq. (58).
The chain has open boundary conditions. Energies are Three behaviors merit comment. Generically, the
measured in units of J. Times are measured in units of coarse-grained quasiprobability is a complex number:
1/J. The interaction strength is thus set to one, J = 1, A (.) C. However,
A(1/d) is real. The imaginary
henceforth. We numerically study this model for N = 10 component = A(1/d) might appear nonzero in Fig. 6.
by exactly diagonalizing H. This system size suffices
for probing the quasiprobabilitys time scales. However, Yet = A(1/d) 1016 . This value equals zero, to
N = 10 does not necessarily illustrate the thermody- within machine precision. The second feature to notice is
namic limit. that the time required for A(1/d) to deviate from its ini-
When h = 0, this model is integrable and can be solved tial value equals approximately the time required for the
with noninteracting-fermion variables. When h 6= 0, the OTOC to deviate from its initial value. Third, although
model appears to be reasonably chaotic. These state- A(1/d) is real, it is negative and hence nonclassical for
ments meanings are clarified in the data below. As ex- some values of its arguments.
19
What about lower temperatures? Data for the T = 1 and w3 , and under the simultaneous negations of v1 and
thermal state are shown in Figures 7, 8, and 9. The v2 . These symmetries have operational significances: A
coarse-grained quasiprobability is no longer real. Here, remains constant under permutations and negations of
too, the time required for A to deviate significantly measurement outcomes in the weak-measurement scheme
from its initial value is comparable with the time scale (Sec. I D 4). Symmetries break as the system starts
of changes in F (t). This comparability characterizes the scrambling: F (t) shrinks, shrinking the final term in
real and imaginary parts of A . Both parts oscillate at Eq. (62). A starts depending not only on squares of
long times. In the small systems considered here, such w` -and-vm functions, but also on the eigenvalues indi-
oscillations can arise from finite-size effects, including the vidually.
energy spectrums discreteness. With nonintegrable pa- Whereas the shrinking of F (t) bifurcates the lower lines
rameters, this model has an energy gap N =10 = 2.92 in Fig. 5, the shrinking does not bifurcate the upper
above the ground state. The temperature T = 1 is lines. The reason is that each upper line corresponds to
smaller than the gap. Hence lowering T from to 1 w2 w3 = v1 v2 = 1. [At early times, |F (t)| is small enough
brings the thermal state close to the ground state. that any F (t)-dependent correction would fall within the
What about long-time behavior? At infinite tem- lines widths.] Hence the final term in Eq. (61) is pro-
perature, A(1/d) approaches a limiting form after the portional to hW(t)V i. This prediction is consistent
scrambling-onset time but before any recurrence time. with the observed splitting. The hW(t)V i term does not
Furthermore, A(1/d) can approach one of only a few pos- split the lower lines: Each lower line satisfies w2 = w3
sible limiting values, depending on the functions argu- and/or v1 = v2 . Hence the hW(t)V i term vanishes. We
ments. This behavior follows from the terms in Eq. (55). leave as an open question whether these pitchforks can
At infinite temperature, hWi = hV i = 0. Also the 3- be understood in terms of equilibria, like classical-chaos
point functions vanish, due to the traces cyclicity. We pitchforks [118].
expect the nontrivial 2- and 4-point functions to be small In contrast with the T = data, the T = 1 data oscil-
at late times. (Such smallness is visible in the 4-point late markedly at late times. We expect these oscillations
function in Fig. 4.) Hence Eq. (55) reduces as to decay to zero at late times, if the system is chaotic, in
the thermodynamic limit. Unlike at infinite temperature,
1 + w2 w3 + v1 v2 W and V can have nonzero expectation values. But, if all
A (v1 , w2 , v2 , w3 ) |{z}
. (60)
16 nontrivial connected correlation functions have decayed,
t
Eq. (55) still implies a simple dependence on the w` and
According to Eq. (60), the late-time values of A(1/d) vm parameters at late times.
should cluster around 3/16, 1/16, and 1/16. This ex- Finally, Figures 10 and 11 show the coarse-grained
pectation is roughly consistent with Fig. 5, modulo the quasiprobability at infinite temperature, A(1/d) , with in-
upper lines bifurcation. tegrable parameters. The imaginary part remains zero,
A bifurcation of A signals the breaking of a sym- so we do not show it. The difference from the behavior in
metry at the onset of scrambling. Similarly, pitchfork Figures 4 and 5 (which shows T = , nonintegrable-H
plots signal the breaking of a symmetry in classical data) is obvious. Most dramatic is the large revival that
chaos [118]. The symmetrys mathematical form follows occurs at what would, in the nonintegrable model, be a
from Eq. (55). At early times, W(t) commutes with V , late time. Although this is not shown, the quasiprob-
and F (t) 1. Suppose, for simplicity, that = 1/d. The ability depends significantly on the choice of operator.
expectation values hW(t)i and hV i vanish, because every This dependence is expected, since different Pauli opera-
Pauli has a zero trace. Equation (55) becomes tors have different degrees of complexity in terms of the
noninteracting-fermion variables.
1h
A (v1 , w2 , v2 , w3 ) = (1 + w2 w3 + v1 v2 + w2 w3 v1 v2 )
16 i
+ (w2 + w3 )(v1 + v2 ) hW(t)V i . (61) B. Random states
Suppose that w2 = w3 and/or v1 = v2 , as in the We now consider random pure states |ih| and
lower lines in Fig. 5. A (.) reduces to the constant nonintegrable parameters. Figures 12, 13, and 14 show
F (t) and A for the operator choice W = 1z and V = N
z
1 in a randomly chosen pure state. Broadly speaking, the

(1 + w2 w3 + v1 v2 + w2 w3 v1 v2 ) (62) features are similar to those exhibited by the infinite-
16
1 h i temperature = 1/d, with additional fluctuations.
= (1 + w2 w3 + v1 v2 )2 (w2 w3 )2 (v1 v2 )2 + 1 . The upper branch of lines in Fig. 13 exhibits dynamics
32
before the OTOC does. However, lines average posi-
The right-hand side depends on the eigenvalues w` and tions move significantly (the lower lines bifurcate, and
vm only through squares. A (.) remains invariant un- the upper lines shift downward) only after the OTOC
der the interchange of w2 with w3 , under the interchange begins to evolve. The early motion must be associated
of v1 with v2 , under the simultaneous negations of w2 with the early dynamics of the 2- and 3-point functions
20
1.0 1.0
0.8 0.8
0.6 0.6
Re[F] Re[F]
Im[F] 0.4 Im[F]
0.4
0.2
0.2
t
t 2 4 6 8 10 12 14
2 4 6 8 10 12 14
FIG. 4: Real and imaginary parts of F (t) as a function of FIG. 7: Real and imaginary parts of F (t) as a function of
time. T = thermal state. Nonintegrable parameters, time. T = 1 thermal state. Nonintegrable parameters,
N = 10, W = 1z , V = N
z
. N = 10, W = 1z , V = N
z
.
0.25
0000 1000 0000 1000
0.20
0.6
0001 1001 0001 1001
0.15 0010 1010 0010 1010
0.4 0011 1011
0011 1011
0.10
0100 1100 0100 1100
0.05 0101 1101
0101 1101 0.2
t 0110 1110 0110 1110

2 4 6 8 10 12 14
0111 1111 t 0111 1111
-0.05 2 4 6 8 10 12 14
FIG. 5: Real part of A as a function of time. T = FIG. 8: Real part of A as a function of time. T = 1
thermal state. Nonintegrable parameters, N = 10, W = 1z , thermal state. Nonintegrable parameters, N = 10, W = 1z ,
z z
V = N . V = N .
in Eq. (55). The late-time values are roughly consistent given random pure state corresponds to an infinite tem-
with those for = 1/d but fluctuate more pronouncedly. perature. The reason is the thermodynamic entropys
The agreement between random pure states and the monotonic increase with temperature. Since the thermo-
T = thermal state is expected, due to closed-system dynamic entropy gives the density of states, more states
thermalization [119, 120]. Consider assigning a tempera- correspond to higher temperatures. Most states corre-
ture to a pure state by matching its energy density with spond to infinite temperature.
the energy density of the thermal state eH/T /Z, cast as For the random states and system sizes N considered,
a function of temperature. With high probability, any if H is nonintegrable, the agreement with thermal results
is not complete. However, the physics appears qualita-
2. 10-17
0000 1000
0001 1001 0.010 0000 1000
t
2 4 6 8 10 12 14 0010 1010 0001 1001
-17
0011 1011 0.005 0010 1010
-2. 10
0100 1100 0011 1011
0101 1101 t
-4. 10-17 2 4 6 8 10 12 14 0100 1100
0110 1110
- 0.005 0101 1101
0111 1111
-6. 10-17 0110 1110
- 0.010 0111 1111
FIG. 6: Imaginary part of A as a function of time. T =

z
thermal state. Nonintegrable parameters,N = 10, W = 1 , FIG. 9: Imaginary part of A as a function of time. T = 1
V = N . To within machine precision, = A vanishes for
z
thermal state. Nonintegrable parameters, N = 10, W = 1z ,
z
all values of the arguments. V = N .
21
1.0
0.25 0000 1000
0.8 0.20 0001 1001
0.15
0010 1010
0.6 0011 1011
0.10
Re[F] 0100 1100
0.4 Im[F] 0.05
0101 1101
t 0110 1110
0.2 2 4 6 8 10 12 14
-0.05 0111 1111
t
2 4 6 8 10 12 14
FIG. 13: Real part of A as a function of time. Random

pure state. Nonintegrable parameters, N = 10, W = 1z ,
FIG. 10: Real and imaginary parts of F (t) as a function of V = Nz
.
time. T = thermal state. Integrable parameters, N = 10,
W = 1z , V = N
z
.
0.25 0.006
0000 1000
0000 1000
0.20 0.004 0001 1001
0001 1001
0.002 0010 1010
0.15 0010 1010
0011 1011
0.10
0011 1011 t
2 4 6 8 10 12 14 0100 1100
0100 1100
0.05 -0.002 0101 1101
0101 1101
t -0.004 0110 1110
2 4 6 8 10 12 14 0110 1110
0111 1111
-0.05 0111 1111 -0.006
FIG. 14: Imaginary part of A as a function of time.

FIG. 11: Real part of A as a function of time. T =
Random pure state. Nonintegrable parameters, N = 10,
thermal state. Integrable parameters, N = 10, W = 1z ,
z W = 1z , V = N
z
.
V = N .
tively similar.
to use W = 1z and V = N z
. For the Hamiltonian pa-
rameters chosen, this state lies far from the ground state.
The state therefore should correspond to a large effective
C. Product states
temperature. Figures 15, 16, and 17 show F (t) and A
for nonintegrable parameters.
Finally, we consider the product |+xiN of N copies
of the +1 x eigenstate (Figures 1517). We continue The real part of F (t) decays significantly from its ini-
tial value of one. The imaginary part of F (t) is nonzero
but remains small. These features resemble the infinite-
1.0 temperature features. However, the late-time F (t) values
are substantially larger than in the T = case and os-
0.8 cillate significantly.
0.6 Correspondingly, the real and imaginary

components
Re[F]
of A oscillate significantly. < A exhibits dynamics
0.4 Im[F]
before scrambling begins, as when is a random pure
0.2
state. The real and imaginary parts of A differ more
from their T = counterparts than F (t) differs from its
t counterpart. Some of this differing is apparently washed
2 4 6 8 10 12 14
out by the averaging needed to construct F (t) [Eq. (59)].
We expected pure product states to behave roughly like
FIG. 12: Real and imaginary parts of F (t) as a function of random pure states. The data support this expectation
time. Random pure state. Nonintegrable parameters, very roughly, at best. Whether finite-size effects cause
N = 10, W = 1z , V = N
z
.
this deviation, we leave as a question for further study.
22
1.0 classical physics (Sec. I A).
0.8 (2) The derived quantity P (W, W 0 ) is generically com-

plex, our results imply. Nonclassicality thus sur-
0.6
Re[F]
vives even the partial marginalization that defines
P [Eq. (32)]. In general, marginalization can cause
0.4 Im[F]
interference to dampen nonclassicality. (We ob-
0.2
serve such dampening in Property 6 of Sec. V A
and in Property 9 of Appendix A.)
t
2 4 6 8 10 12 14
(3) Random pure states quasiprobabilities resemble
the T = thermal states quasiprobability but
fluctuate more.
FIG. 15: Real and imaginary parts of F (t) as a function of
time. Product |+xiN of N copies of the +1 x eigenstate.
(4) Certain product states quasiprobabilities display
Nonintegrable parameters, N = 10, W = 1z , V = Nz
.
anomalously large fluctuations. We expected these
states to resemble random states more.
0.3 0000 1000 (5) The A s generated by integrable Hamiltonians dif-

0001 1001
fer markedly from the A s generated by noninte-
0010 1010
0.2
0011 1011
grable Hamiltonians. Both types of A s achieve
0100 1100
nonclassical values, however. We did not clearly
0.1
0101 1101 observe a third class of behavior.
0110 1110
2 4 6 8 10 12 14
t
0111 1111 (6) The time scale after which A changes significantly
is similar to the OTOC time scale. A can display
nontrivial early-time dynamics not visible in F (t).
This dynamics can arise, for example, because of
FIG. 16: Real part of A as a function of time. Product the 2-point function contained in the expansion of
|+xiN of N copies of the +1 x eigenstate. Nonintegrable A [see Eq. (55)].
parameters, N = 10, W = 1z , V = Nz
.
(7) A reveals that scrambling breaks a symmetry. Op-
erationally, the symmetry consists of invariances of
D. Summary A under permutations and negations of measure-
ment outcomes in the weak-measurement scheme
The main messages from this study are the following. (Sec. I D 4). The symmetry breaking manifests
(1) The coarse-grained quasiprobability A is generi- in bifurcations of A . These bifurcations evoke
cally complex. Exceptions include the T = ther- classical-chaos pitchfork diagrams, which also arise
mal state 1/d and states that share an eigenbasis when a symmetry breaks. One equilibrium point
with V or with W(t) [e.g., as in Eq. (30)]. Recall splits into three in the classical case [118]. Perhaps
that the KD distributions nonreality signals non- the quasiprobabilitys pitchforks can be recast in
terms of equilibria.
0.015 0000 1000

IV. CALCULATION OF A AVERAGED OVER
0.010 0001 1001
BROWNIAN CIRCUITS
0.005
0010 1010
0011 1011
t
2 4 6 8 10 12 14 0100 1100 We study a geometrically nonlocal modelthe
-0.005 0101 1101 Brownian-circuit model governed by a time-dependent
-0.010 0110 1110 Hamiltonian [71]. We access physics qualitatively differ-
-0.015
0111 1111 ent from the physics displayed in the numerics of Sec. III.
We also derive results for large systems and compare with
the finite-size numerics. Since the two models locality
properties differ, we do not expect agreement at early
FIG. 17: Imaginary part of A as a function of time. times. The late-time scrambled states, however, may be
Product |+xiN of N copies of the +1 x eigenstate. expected to share similarities. We summarize our main
Nonintegrable parameters, N = 10, W = 1z , V = Nz
.
findings at the end of the section.
23
We consider a system of N qubits governed by the (3) WVn11, W 11 o V , 1V W 1, 11WV :

random time-dependent Hamiltonian 1z (t)2z
Tr 2N
=: G(t),
X X , 1
H(t) Ji,ji j (t) ii j j . (63) (4) W 1W 1, 1V 1V : Tr = 1,
2N
i<j i ,j
(5) WV W 1, WV 1V , W 1WV , 1V WV : 0, and
The couplings J are time-dependent random variables. n z z z zo
(t) (t)
We denote the site-i identity operator and Pauli opera- (6) WV WV : Tr 1 22N 1 2 = F (t).
tors by i , for = 0, 1, 2, 3. According to the models
precise formulation, the time-evolution operator U (t) is These computations rely on = 1/2N . Each term that
a random variable that obeys contains an odd number of Pauli operators vanishes, due
to the traces cyclicality and to the Paulis tracelessness.
N We have introduced a 2-point function G(t). An overall
U (t + dt) U (t) = U (t)dt i dB(t) . (64)
2 factor of 1/16 comes from the projectors normalization.
The final terms dB(t) has the form Combining all the ingredients, we can express A in
terms of G and F . The result is
s
1 X X
dB(t) =
,
i j dBi,ji j (t) . (65) 16 A (v1 , w2 , v2 , w3 ) = (1 + w2 w3 + v1 v2 ) (68)
8(N 1) i<j , i j + (w2 + w3 )(v1 + v2 ) G + w2 w3 v1 v2 F.
i j
We will sometimes call Eq. (65) dB. dB is a Gaussian This result depends on = 1/2N , not on the form of the
random variable with zero mean and with variance dynamics. But to compute A, we must compute
n 0 0
o
,
EB dBi,j dBi0 ,j,0 = ,0 , 0 i,i0 j,j 0 dt. (66) G = EB {G} (69)
and
The expectation value EB is an average over realizations
of the noise B. We demand that dt dt = 0 and dB dt = 0, F = EB {F } . (70)
in accordance with the standard Ito calculus. dB(t) is
independent of U (t), i.e., of all previous dBs. The computation of F appears in the literature [3].
We wish to compute the average, over the ensemble F initially equals unity. It decays to zero around t =
defined by Eq. (64), of the coarse-grained quasiprobabil- 1
3 log N , the scrambling time. The precise functional form
ity: of F is not crucial. The basic physics is captured in a phe-
n o nomenological form inspired by AdS/CFT computations
A(v1 , w2 , v2 , w3 ) = EB A (v1 , w2 , v2 , w3 ) . (67) [3],
c2
1 + c1
F , (71)
A. Infinite-temperature thermal state 1/2N 1 + c1 e3t
wherein c1 1/N and c2 1.

We focus here on the infinite-temperature thermal To convey a sense of the physics, we review the simpler
state, = 1/2N , for two reasons. First, a system with calculation of G. The two-point function evolves accord-
a time-dependent Hamiltonian generically heats to infi- ing to
nite temperature with respect to any Hamiltonian in the
ensemble. Second, the T = state is convenient for 1 N
G(t + dt) = N Tr U (t) U (t)dt i dB U (t) 1z
calculations. A discussion of other states follows. 2 2
The ensemble remains invariant under single-site rota-
N
tions, and all qubits are equivalent. Therefore, all pos- U (t) U (t) dt + i U (t) dB 2z .
2
sible choices of single-site Pauli operators for W and V
(72)
are equivalent. Hence we choose W = 1z and V = 2z
without loss of generality. Using the usual rules of Ito stochastic calculus, particu-
Let us return to Eq. (54). Equation (54) results from larly Eq. (66) and dt dt = dB dt = 0, we obtain
substituting in for the projectors in A . The sum con-
tains 16 terms. To each term, each projector contributes 1
G(t + dt) G(t) = N dt G(t) + dt
the identity 1 or a nontrivial Pauli (W or V ). The terms 8(N 1)
are X X 1
N
EB Tr 1z (t)ii j j 2z ii j j . (73)
(1) 1111: Tr 21N = 1, i<j ,
i
2
j
(2) W 111, 1V 11, 11W 1, 111V : 0, We have applied the traces cyclicality in the second term.
24
The second terms value depends on whether i and/or (2) w2 w3 = 1, v1 v2 = 1: A = 1/16,

j equals 2. If i and/or j equals 2, the second term van-
P3 (3) w2 w3 = 1, v1 v2 = 1: A = 1/16, and
ishes because =0 z = 0. If neither i nor j is

2, ii j j commutes with 2z . The second term becomes (4) w2 w3 = 1, v1 v2 = 1: A = 1/16.
proportional to G. In (N 1)(N 2)/2 terms, i, j 6= 2.
An additional factor of 42 = 16 comes from the two sums Modulo the splitting of the upper two lines, this result is
over Pauli matrices. Hence broadly consistent with the long-time behavior in Fig. 5.
As the models in Sec. III and this section differ, the long-
G(t + dt) G(t) = 2dt G , (74) time behaviors need not agree perfectly. However, the
models appear to achieve qualitatively similar scrambled
or states at late times.
dG
= 2G. (75)
dt B. General state
This differential equation implies that G exponentially

decays from its initial value. The initial value is zero: Consider a general state , such that A assumes the
G(0) = G(0) = 0. Hence G(t) is identically zero. general form in Eq. (54). We still assume that W = 1z
and V = 2z . However, the results will, in general, now
Although it does not arise when we consider
depend on these choices via the initial condition . We
A, the ensemble-average autocorrelation function
still expect that, at late times, the results will not depend
EB {h1z (t)1z i} obeys a differential equation similar to
on the precise choices. Below, we use the notation h.i
the equation obeyed by G. In particular, the equation
Tr( .).
decays exponentially with an order-one rate.
We must consider 16 terms again. The general case
By the expectation values linearity and the vanishing
involves fewer simplifications. The terms are
of G,
(1) 1111: 1,
(1 + w2 w3 + v1 v2 ) + w2 w3 v1 v2 F
A= . (76) (2) W 111, 1V 11, 11W 1, 111V : h1z (t)i , h2z i,
16
This simple equation states that the ensemble-averaged (3) WV 11, W 11V , 1V W 1, 11WV :
quasiprobability depends only on the ensemble-averaged h1z (t) 2z i, h2z 1z (t)i,
OTOC F (t), at infinite temperature. The time scale of
(4) W 1W 1, 1V 1V : 1,
Fs decay is t = 13 log N . Hence this is the time scale of
changes in A. (5) WV W 1, WV 1V , W 1WV , 1V WV :
Equation (76) shows (as intuition suggests) that A de- h1z (t) 2z 1z (t)i, h1z (t) i, h2z i, h2z 1z (t) 2z i,
pends only on the combinations w2 w3 and v1 v2 . At t = 0, and
F(0) = 1. Hence A is
(6) WV WV : h1z (t) 2z 1z (t) 2z i = F (t).
1 + w2 w3 + v1 v2 + w2 w3 v1 v2
At=0 = . (77) Consider first the terms of the form qi (t) :=
16 EB {hiz (t)i}. The time derivative is
The cases are dqi
= N qi (79)
(1) w2 w3 = 1, v1 v2 = 1: A = 1/4, dt
1 X X
(2) w2 w3 = 1, v1 v2 = 1: A = 0, + EB {hj j kk U (t)iz U (t) j j kk i}.
8(N 1) ,
j<k j k
(3) w2 w3 = 1, v1 v2 = 1: A = 0, and
To simplify the second term, we use a trick. Since
(4) w2 w3 = 1, v1 v2 = 1: A = 0.
j j kk m n j j kk = m
m n m n
n , (80)
These values are consistent with Fig. 5 at t = 0. These
values degeneracies are consistent with the symmetries we may pass the factors of j j kk through U (t), at the
discussed in Sec. III and in Sec. V A (Property 7). cost of changing some Brownian weights. We must con-
At long times, F() = 0, so A is sider a different set of dBs, related to the originals by
minus signs. This alternative set of Brownian weights
1 + w2 w3 + v1 v2 has the original sets ensemble probability. Hence the
At= = . (78) ensemble average gives the same result. Therefore,
16

The cases are EB {hj j kk U (t) iz U (t) j j kk i}

(1) w2 w3 = 1, v1 v2 = 1: A = 3/16, = EB {hU (t)j j kk iz j j kk U (t) i}. (81)
25
If i = j and/or i = k, the sum over j and/or the sum q12 (t) = q1 (t) = q12 (t) , (88)
over k vanishes. If i equals neither j nor k, the Pauli
and
operators commute. The term reduces to qi . i equals
neither j nor k in (N 1)(N 2)/2 terms. A factor of f12 = F. (89)
16 comes from the sums over j and k . Hence
Hermiticity of the Pauli operators implies that f12 is real.
dqi Hence the ensemble-averaged OTOC F is real for this
= N qi + (N 2)qi = 2qi . (82) choice of . The ensemble-averaged A has the form
dt
Consider the terms of the form qij (t) := hiz (t)jz i. k1 + k2 q1 + k3 F
A= , (90)
Note that hjz iz (t)i = qij . We may reuse the trick intro- 16
duced above. [This trick fails only when more than two wherein
copies of U appear, as in F (t)]. To be precise,
k1 = (1 + v1 )(1 + v2 + w3 w2 ), (91)
EB {hm n U (t)iz U (t) m
m n m n z
n j i}
k2 = (1 + v1 )(w3 + w2 )(1 + v2 ), (92)
= EB {hU (t)m n i m n U (t) jz i}.
m n z m n
(83)
and
As before, the sums over kill the relevant term in the
time derivative of qij , unless i 6= m, n. Hence k3 = (1 + v1 )w3 v2 w2 . (93)
Equations (90)(93) imply that A = 0 unless v1 = 1.
dqij
= 2qij , (84) The time scale after which q1 decays is order-one. The
dt time required for F to decay is of order log N (although
as at infinite temperature. not necessarily exactly the same as for the T = state).
Item (5), in the list above, concerns products of three Therefore, the late-time value of A is well approximated
Ws and V s. We must consider four expectation values by
of Pauli products. As seen above, two of these terms k1 + k3 F
reduce to qi terms. By the trick used earlier, At1 = . (94)
16
EB {h2z U (t)1z U (t) 2z }
= EB {hU (t)2z 1z 2z U (t) } = q1 (t). (85) C. Summary
The other term we must consider is This study has the following main messages.
EB {hiz (t)jz iz (t)i} =: fij . Our trick will not work,
because there are multiple copies of U (t) that are not all (1) In this model, the ensemble-averaged quasiproba-
simultaneously switched as operators are moved around. bility varies on two time scales. The first time scale
At early times, when iz (t) and jz approximately is an order-one relaxation time. At later times, the
commute, this term approximately equals hjz i = qj (0). OTOC controls the physics entirely. F (t) varies
At later times, including around the scrambling time, after a time of order log N .
this term decays to zero. (2) While the late-time physics of A is controlled en-
The general expression for A becomes tirely by the ensemble-averaged F (t), the negative
16 A = 1 + w3 w2 + v1 v2 values of A show a nonclassicality that might not
be obvious from F (t) alone. Furthermore, we com-
+ (w3 + w2 ) q1 (t) + (v1 + v2 ) q2 (0)
puted only the first moment of A . The higher
+ (w3 v2 + w3 v1 + w2 v1 ) q12 (t) + v2 w2 q12 (t) moments are likely not determined by F (t) alone.
+ w3 v2 w2 f12 (t) + (w3 v1 v2 + w2 v1 v2 ) q1 (t)
(3) For T = , the late-time physics is qualitatively
+ w3 w2 v1 q2 (0) + w3 w2 v1 v2 F(t). (86) similar to the late-time physics of the geometrically
All these q functions obey known differential equations. local spin chain in Sec. III.
The functions decay after a time of order one. We do not (4) Nonclassicality, as signaled by negative values of
have explicit expressions for the f functions that appear.
A , is extremely robust. It survives the long-time
They are expected to vary after a time log N .
limit and the ensemble average. One might have
expected thermalization and interference to stamp
1. Special case: 2z eigenstate out nonclassicality. On the other hand, we expect
the circuit average to suppress the imaginary part
of A rapidly.
We have no controlled examples
In a concrete example, we suppose that is a +1 eigen-
state of 2z . Expressions simplify: in which = A remains nonzero at long times.
Finding further evidence for or against this conjec-
q2 (0) = 1, (87) ture remains an open problem.
26

V. THEORETICAL STUDY OF A The quasiprobability behind F (K ) (t), we find, is K -
extended.
We have discussed experimental measurements, nu- Recent quasiprobability advances involve out-of-time
merical simulations, and analytical calculations of the ordering, including in correlation functions [123127].
OTOC quasiprobability A . We now complement these Merging these works with the OTOC framework offers
discussions with mathematical properties and physical an opportunity for further research (Sec. VI).
interpretations. First, we define an extended Kirkwood-
Dirac distribution exemplified by A . We still denote by
B(H) the set of bounded operators defined on H. A. Mathematical properties of A
Definition 1 (K -extended Kirkwood-Dirac quasiprob- A shares some of its properties with the KD
ability). Let {|ai} , . . . , {|ki} and {|f i} denote orthonor- quasiprobability (Sec. I A 4). Properties of A imply
mal bases for the Hilbert space H. Let O B(H) de- properties of P (W, W 0 ), presented in Appendix A.
note a bounded operator defined on H. A K -extended
Kirkwood-Dirac quasiprobability for O is defined as10 Property 5. The OTOC quasiprobability is a map A :
D(H) {v1 } {v1 } {w2 } {w2 } {v2 } {v2 }
(K )
AO (a, . . . , k, f ) := hf |kihk| . . . |aiha|O|f i . (95) {w3 } {w3 } C . The domain is a composition of
the set D(H) of density operators defined on H and eight
We will focus mostly on density operators O = sets of complex numbers. The range is not necessarily
D(H). One infers A
(K )
by performing 2K 1 weak real: C R.
measurements, and one strong measurement, per trial. A depends on H and t implicitly through U . The
The order in which the bases are measured is the order KD quasiprobability in [14] depends implicitly on time
in which the labels a, . . . , k, f appear in the argument similarly (see Footnote 10). Outside of OTOC contexts,
(K )
of AO (.). The conventional KD quasiprobability is 1- D(H) may be replaced with B(H). K -extended KD dis-
extended. The OTOC quasiprobability A is 3-extended. tributions represent bounded operators, not only quan-
Our investigation parallels the exposition, in Sec. I A, tum states. C, not necessarily R, is the range also of
(K )
of the KD distribution. First, we present basic mathe- the K -fold generalization A . We expound upon the
matical properties. A , we show next, obeys an analog ranges complexity after discussing the number of argu-
of Bayes Theorem. Our analog generalizes the known ments of A .
analog (5). Our theorem reduces exponentially (in sys- Five effective arguments of A : On the left-hand side
tem size) the memory needed to compute weak values, in of Eq. (28), semicolons separate four tuples. Each tu-
certain cases. Third, we connect A with the operator- ple results from a measurement, e.g., of W. We coarse-
decomposition argument in Sec. I A 3. A consists of co- grained over the degeneracies in Sections II AIV. Hence
efficients in a decomposition of an operator 0 that re- each tuple often functions as one degree of freedom. We
sults from asymmetrically decohering . Summing A (.) treat A as a function of four arguments (and of ). The
values yields a KD representation for . This sum can KD quasiprobability has just two arguments (apart from
be used, in experimental measurements of A and the O). The need for four arises from the noncommutation
OTOC, to evaluate how accurately the desired initial of W(t) and V .
state was prepared. Fourth, we explore the relation- Complexity of A : The ability of A to assume non-
ship between out-of-time ordering and quasiprobabilities. real values mirrors Property 1 of the KD distribution.
Time-ordered correlators are moments of quasiprobabil- The Wigner function, in contrast, is real. The OTOC
ities that clearly reduce to classical probabilities. Fi- quasiprobabilitys real component, <(A ), parallels the
nally, we generalize beyond the OTOC, which encodes Terletsky-Margenau-Hill distribution. We expect non-
K = 3 time reversals. Let K := 12 (K + 1). A K-fold classical values of A to reflect nonclassical physics,

OTOC F (K ) (t) encodes K time reversals [121, 122]. as nonclassical values of the KD quasiprobability do
(Sec. I A).
Equations (28) and (29) reflect the ability of A to as-
sume nonreal values. Equation (28) would equal a real
10 Time evolutions may be incorporated into the bases. For ex- product of probabilities if the backward-process ampli-
ample, Eq. (10) features the 1-extended KD quasiprobability
tude A and the forward-process amplitude A had equal
hf 0 |aiha|0 |f 0 i. The 0 := Ut0 Ut0 results from time-evolving
arguments. But the arguments typically do not equal
a state . The |f 0 i := Ut00 t0 |f i results from time-evolving an each other. Equation (29) reveals conditions under which
(1)
A (.) R and 6 R. We illustrate the case with two
P
eigenket |f i of F = f f |f ihf |. We label (10) as A (, a, f ),
rather than as A (1) 0 0
( , a, f ). Why? One would measure (10) by examples and the 6 case with one example.
preparing , evolving the system, measuring A weakly, inferring
outcome a, evolving the system, measuring F , and obtaining out- Example 1 (Real A #1: t = 0, shared eigenbasis, ar-
come f . No outcome f 0 is obtained. Our notation is that in [14] bitrary ). Consider t = 0, at which U = 1. The oper-
and is consistent with the notation in [37]. ators W(t) = W and V share an eigenbasis, under the
27
assumption that [W, V ] = 0: {|w` , w` i} = {|v` , v` i}. W eigenstates and the V eigenstates are
With respect to that basis, N
z y 1
A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) h , +| , +i = ,
2
= w3 v2 w3 v2 v2 w2 v2 w2 w2 v1 w2 v1 N
1
X h z , +| y , i = ,
pj |hw3 , w3 |ji|2 2
j
N
z y i
R. (96) h , | , +i = , and
2
N
We have substituted into Eq. (29). We substituted in for i
h z , | y , i = . (100)
from Eq. (20). 2
Example 1 is consistent with the numerical simula- Suppose that = 1/d. A(1/d) (.) would have a chance
tions in Sec. III. According to Eq. (96), at t = 0, of being nonreal only if some |v` , v` i equaled | z , i.
P
degeneracies A =: A R. In Figures 9, 14, and 17, That | z , i would introduce an i into Eq. (29). But
the imaginary parts =(A ) clearly vanish at t = 0. In h z , | would introduce another i. The product would be
Fig. 6, =(A ) vanishes to within machine precision.11 real. Hence A(1/d) (.) R.
Consider a that lacks coherences relative to the
shared eigenbasis, 1/d. Example
e.g., = 1 implies that A is nonreal in the following example.

= A(1/d) at t = 0. But = A(1/d) remains zero for all Example 3 (Nonreal A : t = 0, nonshared eigenbases,
t in the numerical simulations. Why, if time evolution nondiagonal relative to both). Let t, W, V , {|w` , w` i},
deforms the W(t) eigenbasis from the V eigenbasis? The and {|vm , vm i} be as in Example 2.
reason appears to be a cancellation, as in Example 2. Suppose that has coherences relative to the W and V
Example 2 requires more notation. Let us focus on eigenbases. For instance, let = | x , +ih x , +|. Since
a chain of N spin- 21 degrees of freedom. Let denote | x , +i = 12 (| z , +i + | z , i),
the = x, y, z Pauli operator. Let | , i denote the
eigenstates, such that | , i = | , i. N -fold 1
= (| z , +ih z , +| + | z , +ih z , |
tensor products are denoted by | , i := | , iN . 2N/2
We denote by j the th Pauli operator that acts non- + | z , ih z , +| + | z , ih z , |)N . (101)
trivially on site j.
Let |w3 , w3 i = | z , i, such that its overlaps with V
Example 2 (Real A #2: t = 0, nonshared eigen- eigenstates can contain is. The final factor in Eq. (29)
bases, = 1/d). Consider the spin chain at t = 0, becomes
such that U = 1. Let W = 1z and V = N y
. Two
W eigenstates are | z , i. Two V eigenstates are 1 h
h iN hv1 , v1 ||w3 , w3 i = N/2 hv1 , v1 | | z , +iN
2
| y , +i = 12 (| z , +i + i| z , i) and | y , i = i
h iN + hv1 , v1 | | z , iN . (102)
1 (| z , +i i| z , i) . The overlaps between the
2
N
The first inner product evaluates to 1 , by
2
Eqs. (100). The second inner product evaluates to
N
11 The =(A ) in Fig. 6 equals zero identically, if w2 = w3 and/or i2 . Hence
if v1 = v2 . For general arguments,
=(A (v1 , w2 , v2 , w3 )) =
1 h
A (v1 , w2 , v2 , w3 ) 1 h N
i
2i hv1 , v1 ||w3 , w3 i = 1 + (i) . (103)
i 2N
(v1 , w2 , v2 , w3 ) .
A (97)

This expression is nonreal if N is odd.
The final term equals
h
W(t) W(t) V
i
W(t) V W(t)
Example 3, with the discussion after Example 1,
Tr w3 V v2 w2 v1 = Tr V
v1 w2 v2 w3 (98) shows how interference can eliminate
nonreality
from a

W(t) V W(t) V
= Tr w2 v2 w3 v1 = A (v1 , w3 , v2 , w2 ) . (99) quasiprobability. In Example 3, = A does not neces-

The first equality follows from projectors Hermiticity; and the sarily vanish. Hence the coarse-grained = A does not
second, from the traces cyclicality. Substituting into Eq. (97)
shows that A (.) is real if w2 = w3 . A (.) is real if v1 = v2 , by obviously vanish. But = A = 0 according to the dis-
an analogous argument. cussion after Example 1. Summing Example 3s nonzero
28

= A values must quench the quasiprobabilitys non- facilitate a proof [6]:12
reality. This quenching illustrates how interference can
wash out quasiprobabilities nonclassicality. Yet interfer- Freg (t) := Tr 1/4 W(t)1/4 V 1/4 W(t)1/4 V . (104)
ence does not always wash out nonclassicality. Section III
depicts A s that have nonzero imaginary components Freg (t) is expected to behave roughly like F (t) [6, 42].
(Figures 9, 14, and 17). Just as F (t) equals a moment of a sum over A , Freg (t)
Example 3 resonates with a finding in [107, 108]. Soli- equals a moment of a sum over
nas and Gasparinettis quasiprobability assumes nonreal
values when the initial state has coherences relative to Areg
(v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) (105)
the energy eigenbasis. 1/4 1/4
:= hw3 , w3 |U |v2 , v2 ihv2 , v2 | U |w2 , w2 i
hw2 , w2 |U 1/4 |v1 , v1 ihv1 , v1 |1/4 U |w3 , w3 i
Property 6. Marginalizing A (.) over all its arguments |v2 , v ihv2 , v |U
hw3 , w |U
3 2
|w2 , w i
2 2
(106)
except any one yields a probability distribution. |v1 , v ihv1 , v |U
|w3 , w i .
hw2 , w2 |U 1 1 3
Consider, as an example, summing Eq. (29) over every The proof is analogous to the proof of Theorem 1 in [37].
tuple except (w3 , w3 ). P
The outer products become res- Equation (106) depends on U := 1 eiH , which prop-
Z
olutions of unity, e.g., (w2 ,w ) |w2 , w2 ihw2 , w2 | = 1. agates in the complex-time variable := t 4T i
. The
2
A unitary cancels with its Hermitian conjugate: U U = Hermitian conjugate U =
1
e iH
propagates along
Z
1. The marginalization yields hw3 , w3 |U U |w3 , w3 i. i
= t + 4T .
This expression equals the probability that preparing , Areg has the symmetries of A(1/d) (Property 7)
time-evolving, and measuring the W eigenbasis yields the (eH/T /Z )
outcome (w3 , w3 ). for arbitrary T . One might expect Areg to behave sim-

ilarly to A , as Freg (t) behaves similarly to F (t). Nu-
This marginalization property, with the structural
merical simulations largely support P this expectation. We
and operational resemblances between A and the KD
compared A (.) with Areg (.) := degeneracies A (.) . The
quasiprobability, accounts for our calling A an extended distributions vary significantly over similar time scales
(K )
quasiprobability. The general K -extended A obeys and have similar shapes. Areg tends to have a smaller
Property 6. imaginary component and, as expected, more degenera-
cies.
The properties of A imply properties of P (W, W 0 ).
Property 7 (Symmetries of A(1/d) ). Let be the We discuss these properties in Appendix A.
infinite-temperature Gibbs state 1/d. The OTOC
quasiprobability A(1/d) has the following symmetries.
B. Bayes-type theorem and retrodiction with A
(A) A(1/d) (.) remains invariant under the si- We reviewed, in Sec. I A 2, the KD quasiprobabilitys
multaneous interchanges of (w2 , w2 ) with (1)
role in retrodiction. The KD quasiprobability A gen-
(w3 , w3 ) and (v1 , v1 ) with (v2 , v2 ): 0 0 0
eralizes the nontrivial part <(hf |aiha| |f i) of a condi-
A(1/d) (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) =
tional quasiprobability p(a|, f ) used to retrodict about
A(1/d) (v2 , v2 ; w3 , w3 ; v1 , v1 ; w2 , w2 ). (1)
an observable A. Does A play a role similar to A ?
It does. To show so, we generalize Sec. I A 2 to com-
posite observables. Let A, B, . . . , K denote K observ-
(B) Let t = 0, such that {|w` , w` i} = {|v` , v` i} (un- ables. K . . . BA might not be Hermitian but can be sym-
der the assumption that [W, V ] = 0). A(1/d) (.) re- metrized. For example, := K . . . A + A . . . K is an ob-
mains invariant under every cyclic permutation of servable.13 Which value is most reasonably attributable
its arguments.
Equation (29) can be recast as a trace. Property 7 12 The name regulated derives from quantum field theory. F (t)
follows from the traces cyclicality. Subproperty (B) re- contains operators W (t) and W(t) defined at the same space-
lies on the triviality of the t = 0 time-evolution operator: time point (and operators V and V defined at the same space-
U = 1. The symmetries lead to degeneracies visible in time point). Products of such operators encode divergences.
One can regulate divergences by shifting one operator to an-
numerical plots (Sec. III). other space-time point. The inserted 1/4 = 1/4 1
eH/4T shifts
Z
Analogous symmetries characterize a regulated operators along an imaginary-time axis.
13 := i(K . . . A A . . . K). An operator can be symmetrized
So is
quasiprobability. Maldacena et al. regulated F (t) to
29
to retrodictively? A weak value weak given by Eq. (3). A rightward-pointing arrow labels quantities in which
We derive an alternative expression for weak . In our ex- the outer products, |kihk|, . . . , |aiha|, are ordered analo-
pression, eigenvalues are weighted by K -extended KD gously to the first term K . . . A in . A leftward-pointing
quasiprobabilities. Our expression reduces exponentially, arrow labels quantities in which reading the outer
in the systems size, the memory required to calculate products |aiha|, . . . , |kihk| backwardfrom right to left
weak values, under certain conditions. We present gen- parallels reading K . . . A forward.
(K )
eral theorems about A , then specialize to the OTOC
A . Proof. The initial steps come from [14, Sec. II A], which
recapitulates [7375]. For every measurement outcome f ,
Theorem 1 (Retrodiction about composite observ- we assume, some number f is the guess most reasonably
ables). Consider a system S associated with a Hilbert attributable to . We combine P these best guesses into
space H. For concreteness,
P we assume P that H is dis- the effective observable est := f f |f 0 ihf 0 |. We must
crete. Let A = a|aiha| , . . . , K = k k|kihk| denote
a optimize our choice of {f }. We should quantify the
K observables defined on H. Let Ut denote the family of
distance between (1) the operator est we construct and
unitaries that propagates the state of S along time t.
(2) the operator we wish to infer about. We use the
Suppose that S begins in the state at time P t = 0, then weighted trace distance
evolves under Ut00 until t = t00 . Let F = f f |f ihf | de-
note an observable measured at t = t00 . Let f denote
D0 (, est ) = Tr 0 [ est ]2 . (114)
the outcome. Let t0 (0, t00 ) denote an intermediate
time. Define 0 := Ut0 Ut0 and |f 0 i := Ut00 t0 |f i as time- 0 serves as a positive prior bias [14].
evolved states. Let us substitute in for the form of est . Expanding
The value most reasonably attributable retrodictively to the square, then invoking the traces linearity, yields
the time-t0 := K . . . A + A . . . K is the weak value
h Xh
X D0 (, est ) = Tr(0 2 ) + f2 hf 0 |0 |f 0 i
weak (, f ) = (a . . . k) p (a, . . . , k|, f )
f
a,...,k i
i
f (hf | |f i + hf 0 |0 |f 0 i) .
0 0 0
(115)
+ p (k, . . . , a|, f ) . (107)
The weights are joint conditional quasiprobabilities. They factor equals 2<(hf 0 |0 |f 0 i). Adding
The parenthesized P
0 0 0 0 0 0 2
obey analogs of Bayes Theorem: and subtracting f hf | |f i[<(hf | |f i)] to and
from Eq. (115), we complete the square:
p (a, . . . , k, f |)
p (a, . . . , k|, f ) = (108) X
p(f |) D0 (, est ) = Tr(0 2 ) hf 0 |0 |f 0 i[<(hf 0 |0 |f 0 i)]2
<(hf 0 |kihk| . . . |aiha|0 |f 0 i) f
, (109) !2
hf 0 |0 |f 0 i X
0 0 0 <(hf 0 |0 |f 0 i)
+ hf | |f i f .
and hf 0 |0 |f 0 i
f
p (k, . . . , a, f |) (116)
p (k, . . . , a|, f ) = (110)
p(f |)
Our choice of {f } should minimize the distance (116).
<(hf 0 |aiha| . . . |kihk|0 |f 0 i)
. (111) We should set the square to zero:
hf 0 |0 |f 0 i
<(hf 0 |0 |f 0 i)
Complex generalizations of the weights numerators, f = . (117)
hf 0 |0 |f 0 i
A(K ) 0 0 0
, (a, . . . , k, f ) := hf |kihk| . . . |aiha| |f i (112)
Now, we deviate from [14, 7375]. We substitute the
and definition of into Eq. (117). Invoking the linearity of
< yields
A(K ) 0 0 0
, (k, . . . , a, f ) := hf |aiha| . . . |kihk| |f i , (113)
<(hf 0 |K . . . A0 |f 0 i) <(hf 0 |A . . . K0 |f 0 i)
are K -extended KD distributions. f = + .
hf 0 |0 |f 0 i hf 0 |0 |f 0 i
(118)
in multiple ways. Theorem 1 governs . Appendix B contains an We eigendecompose A, . . . , K. The eigenvalues, being
analogous result about . Theorem 1 extends trivially to Her- real, can be factored out of the <s. Defining the eigen-
mitian (already symmetrized) instances of K . . . A. Corollary 1 values coefficients as in Eqs. (109) and (111), we reduce
illustrates this extension. Eq. (118) to the form in Eq. (107).
30
Theorem 1 reduces exponentially, in system size, the (ii) substitute into Eq. (107).
space required to calculate weak , in certain cases.14 For
concreteness, we focus on a multiqubit system and on Let (n) denote the space required to compute weak ,
l-localPoperators A, . . . , K. An operator O is l-local if aside from the space required to store weak , with constant
O = j Oj , wherein each Oj operates nontrivially on, precision, using method (n) = (1), (2), in the asymptotic
at most, l qubits. Practicality motivates this focus: The limit. Method (1) requires a number of bits at least expo-
lesser the l, the more easily l-local operators can be mea- nential in the number K of local observables:
sured.
We use asymptotic notation from computer science: (1) = 2K . (119)
Let f f (N ) and g g(N ) denote any functions of
the system size. If g = O(f ), g grows no more quickly Method (2) requires a number of bits linear in K :
than (is upper-bounded by) a constant multiple of f in
the asymptotic limit, as N . If g = (f ), g grows (2) = O(K ) . (120)
at least as quickly as (is lower-bounded by) a constant
multiple of f in the asymptotic limit. If g = (f ), g Method (2) requires exponentiallyin K and so in N
is upper- and lower-bounded by f : g = O(f ), and g = less memory than Method (1).
(f ). If g = o(f ), g shrinks strictly more quickly than f
in the asymptotic limit. Proof. Using Method (1), one computes S . S is a
2N 2N complex matrix. The matrix has (2K ) nonzero
Theorem 2 (Weak-value space saver). Let S denote a elements: A, . . . , K are traceless, so each of AS , . . . , KS
system of N qubits. Let H denote the Hilbert space as- contains at least two nonzero elements. Each operator
sociated with S. Let |f 0 i H denote a pure state and at least doubles the number of nonzero elements in S .
0 D(H) denote a density operator. Let S denote any Specifying each complex number with constant precision
fixed orthonormal basis for H in which each basis element requires (1) bits. Hence Method (1) requires 2K
equals a tensor product of N factors, each of which oper- bits.
ates nontrivially on exactly one site. S may, for example, Let us turn to Method (2). We can store hf 0 |0 |f 0 i in
consist of tensor products of z eigenstates. a constant number of bits.
Let K denote any polynomial function of N : K Step (B) can be implemented with a counter variable
K (N ) = poly(N ). Let A, . . . , K denote K traceless l- CO for each local operator O, a running-total variable G,
local observables defined on H, for any constant l. Each and a current term variable T . CO is used to iterate
observable may, for example, be a tensor product of l through the nonzero eigenvalues of O (arranged in some
nontrivial Pauli operators and N l identity opera- fiducial order). O has O(2l ) nonzero eigenvalues. Hence
tors. The composite observable := P A...K + K...A CO requires O(l) bits. Hence the set of K counters CO
is
P not necessarily l-local. Let A = a a|aiha| , . . . , K = requires O(lK ) = O(K ) bits.
k k|kihk| denote eigenvalue decompositions of the local The following algorithm implements Step (B):
observables. Let OS denote the matrix that represents an
operator O relative to S. (i) If CK < its maximum possible value, proceed as
Consider being given the matrices AS , . . . , KS , 0S , and follows:
0
|f iS . From this information, the weak value weak can
be computed in two ways: (a) For each O = A, . . . , K, compute the (2CO )th
nonzero eigenvalue (according to the fiducial
(1) Conventional method ordering).
(A) Multiply and sum given matrices to form S = (b) Multiply the eigenvalues to form a . . . k. Store
KS . . . AS + AS . . . KS . the product in T .
(B) Compute hf 0 |0 |f 0 i = hf 0 |S 0S |f 0 iS . (c) For each O = A, . . . , K, calculate the (2CO )th
0
(C) Substitute into weak = <
hf |S S 0S |f 0 iS
. eigenvector column (according to some fidu-
0 0 0
hf | |f i cial ordering).
(2) K -factored method (d) Substitute the eigenvector columns into
Eqs. (109) and (111), to compute p (.) and
(A) Compute hf 0 |0 |f 0 i. p (.).
(B) For each nonzero term in Eq. (107), h
(e) Form (a . . . k) p (a, . . . , k|, f ) +
(i) calculate p (.) and p (.) from Eqs. (109)
and (111). p (k, . . . , a|, f ). Update T to this value.
(f) Add T to G.
(g) Erase T .
14 Space means memory, or number of bits, here. (h) Increment CK .
31
(ii) If CK equals its maximum possible value, increment offers insight into the result:
the counter of the preceding variable, J , in the X
list; reset CK to one; and, if J has not attained its weak (, f ) = (k . . . a)
p (k, . . . , a|, f )
maximum possible value, return to Step (i). Pro- k,...,a
ceed in this mannerincrementing counters; then X
resetting counters, incrementing preceding coun- + (a . . . k)
p (a, . . . , k|, f ) . (121)
a,...,k
ters, and returning to Step (i)until CA reaches
its maximum possible value. Then, halt.
Each sum parallels the sum in Eq. (6). Equation (121)
suggests that we are retrodicting about K . . . A indepen-
The space needed to store G is the space needed to
dently of A . . . K. But neither K . . . A nor A . . . K is Her-
store weak . This space does not contribute to (2) .
mitian. Neither operator seems measurable. Ascribing
How much space is needed to store T ? We must cal- a value to neither appears to have physical significance,
culate weak with constant precision. weak equals a prima facie.
sum of 2lK terms. Let j denote the error in term Yet non-Hermitian products BA have been measured
P2lK
j. The sum j=1 j must be O(1). This require- weakly [3234]. Weak measurements associate a value
ment is satisfied if 2lK (max | |) = o(1), which implies with the supposedly unphysical K . . . A, just as weak
j j
maxj |j | = o 2lK . We can specify each term, with a measurements enable us to infer supposedly unphysical
small-enough roundoff error, using O(lK ) = O(K ) bits. probability amplitudes A . The parallel between K . . . A
Altogether, the variables require O(K ) bits. As the and A can be expanded. K . . . A and A . . . K, being
set of variables does, so does the O-factored method. non-Hermitian, appear to lack physical significance inde-
pendently. Summing the operators forms an observable.
Similarly, probability amplitudes A and A appear to
Performing Method (2) requires slightly more time lack physical significance independently. Multiplying the
than performing Method (1). Yet Theorem 2 can benefit amplitudes forms a probability. But A and K . . . A can
computations about quantum many-body systems. Con- be inferred individually from weak measurements.
sider measuring a weak value of a quantum many-body We have generalized Sec. I A 2. Specializing to k = 3,
system. One might wish to predict the experiments out- and choosing forms for A, . . . K, yields an application of
come and to compare the outcome with the prediction. A to retrodiction.
Alternatively, consider simulating quantum many-body
systems independently of laboratory experiments, as in Corollary 1 (Retrodictive application of A ). Let S,
Sec. III. One must compute weak values numerically, us- H, , W(t), and V be defined as in Sec. I B. Suppose
ing large matrices. The memory required to store these that S is in state at P time t = 0. Suppose that the
matrices can limit computations. Theorem 2 can free up observable F = W = w3 ,w3 w3 |w3 , w3 ihw3 , w3 | of
space. S is measured at time t00 =Pt. Let (w3 , w3 ) denote
Two more aspects of retrodiction deserve exposition: the outcome. Let A = V = v1 ,v v1 |v1 , v1 ihv1 , v1 |,
P 1
related studies and the physical significance of K . . . A. B = W(t) =
w2 ,w2 w2 U |w2 , w2 ihw2 , w2 |U , and
Related studies: Sequential weak measurements have P
C = V = v2 ,v v2 |v2 , v2 ihv2 , v2 | . Let the composite
been proposed [11] and realized recently [3234]. Lun- 2
observable = ABC = V W(t)V . The value most rea-
deen and Bamber proposed a direct measurement of a
sonably attributable to retrodictively is the weak value
density operator [11]. Let denote a density operator de-
fined on a dimension-d Hilbert space H. Let Sa := {|a` i} X
and Sb := {|b` i} denote orthonormal mutually unbiased weak (; w3 , w3 ) = v1 w 2 v2
bases (MUBs) for H. The interbasis inner products have (v1 ,v1 ),(v2 ,v2 ),(w2 ,w2 )
constant magnitudes: |ha` |bm i| = 1d `, m. Consider p (v2 , v2 ; w2 , w2 ; v1 , v1 |; w3 , w3 ) . (122)

measuring Sa weakly, then Sb weakly, then Sa strongly,
in each of many trials. One can infer (1) a KD quasiprob- The weights are joint conditional quasiprobabilities that
ability for and (2) a matrix that represents relative obey an analog of Bayes Theorem:
to Sa [11].
KD quasiprobabilities are inferred from experimental p (v1 , v1 ; w2 , w2 ; v2 , v2 |; w3 , w3 )
measurements in [33, 34]. Two weak measurements are p (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 |)
= (123)
performed sequentially also in [32]. Single photons are p(w3 , w3 |)
used in [32, 33]. A beam of light is used in [34]. These
<(hw3 , w3 |U |v2 , v2 ihv2 , v2 |U |w2 , w2 i
experiments indicate the relevance of Theorem 1 to cur-
rent experimental capabilities. Additionally, composite hw2 , w2 |U |v1 , v1 ihv1 , v1 |U |w3 , w3 i)
observables AB + BA accompany KD quasiprobabilities /hw3 , w3 ||w3 , w3 i . (124)
in e.g., [128].
Physical significance of K . . . A: Rearranging Eq. (107) A complex generalization of the weights numerator is the
32
OTOC quasiprobability: Proof. We deform the argument in Sec. I A 3. Let the

{|ai}
in Sec. I A 3 be {|v2 , v2 i}. Let the {|f i} be
A(3)
, (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) U |w3 , w3 i . We sandwich
P between
resolutions of
= A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) . (125) P
unity: = ( a |aiha|) f |f ihf | . Rearranging yields
The OTOC quasiprobability, we have shown, assists X
with Bayesian-type inference, similarly to the KD distri- = |v2 , v2 ihw3 , w3 |U
bution. The inferred-about operator is V W(t)V , rather (v2 ,v2 ),(w3 ,w3 )
than the W(t)V W(t)V in the OTOC. The missing W(t)
hv2 , v2 |U |w3 , w3 i . (130)
plays the role of F . This structure parallels the weak-
measurement scheme in the main text of [37]: V , W(t), We wish to normalize the outer product, by dividing
and V are measured weakly. W(t) is, like F , then mea- by its trace. We assumed, in Sec. I A 3, that no interbasis
sured strongly. inner product vanishes. But inner products could vanish
here. Recall Example 1: When t = 0, W(t) and V share
an eigenbasis. That eigenbasis can have orthogonal states
C. A (.) values as coefficients in an operator
|i and |i. Hence hw3 , w3 |U |v2 , v2 i can equal h|i =
decomposition
0. No such term in Eq. (130) can be normalized.
We eliminate these terms from the sum with the con-
Let S denote any orthonormal operator basis for H. dition hw3 , w3 |U |v2 , v2 i 6= 0. The left-hand side of
Every state D(H) can be decomposed in terms of S, Eq. (130) is replaced with the 0 in Eq. (126). We di-
as in Sec. I A 3. The coefficients form a KD distribution. vide and multiply by the trace of each S element:
Does A consist of the coefficients in a state decomposi-
tion? X |v2 , v2 ihw3 , w3 |U
0 =
Summing A (.) values yields a coefficient in a decom- hw3 , w3 |U |v2 , v2 i
(v2 ,v2 ),(w3 ,w3 ) :
position of an operator 0 .15 0 results from asymmetri- hw3 ,w3 |U |v2 ,v2 i6=0
cally decohering . This decoherence relates to time-
reversal asymmetry. We expect 0 to tend to converge to hw3 , w3 |U |v2 , v2 ihv2 , v2 |U |w3 , w3 i . (131)
after the scrambling time t . By measuring A after The coefficients are KD-quasiprobability values.
t , one may infer how accurately one prepared the target Consider inserting, just leftward of the , the resolution
initial state. of unity
Theorem 3. Let
X X
0 := |v2 , v2 ihw3 , w3 |U 1 = U |w2 , w2 ihw2 , w2 |U
(v2 ,v2 ),(w3 ,w3 ) : w2 ,w2
hw3 ,w3 |U |v2 ,v2 i6=0
X
hv2 , v2 |U |w3 , w3 i (126) |v1 , v1 ihv1 , v1 | . (132)
denote the result of removing, from , the terms that con- v1 ,v1
nect the input state U |w3 , w3 i to the output state P P
In the resulting 0 decomposition, the
|v2 , v2 i. We define the set w2 ,w2 v1 ,v1
|v2 ,v2 ihw3 ,w3 |U
is pulled leftward, to just after the
|v2 , v2 ihw3 , w3 |U hw3 ,w3 |U |v2 ,v2 i This
.
S := (127) double sum becomes a sum of A s. The weights have 0
hw3 , w3 |U |v2 , v2 i hw3 ,w |U |v2 ,v i6=0
3 2
the form in Eq. (129).
0
of trace-one operators. decomposes in terms of S as
Theorem 3 would hold if were replaced with any
X (w , ) |v2 , v2 ihw3 , w3 |U bounded operator O B(H). Four more points merit
C(v23,vw3) . (128)
2 hw3 , w3 |U |v2 , v2 i discussion. We expect that, after the scrambling time t ,
(v2 ,v2 ),(w3 ,w3 ) :
hw3 ,w3 |U |v2 ,v2 i6=0 there tend to exist parameterizations {w` } and {vm }
such that S forms a basis. Such a tendency could facili-
The coefficients follow from summing values of the tate error estimates: Suppose that A is measured after
OTOC quasiprobability: t . One can infer the form of the state prepared at the
(w , )
X trials start. The target initial state may be difficult to
C(v23,vw3) := A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) . prepare, e.g., thermal. The preparation procedures ac-
2
(w2 ,w2 ), curacy can be assessed at a trivial cost. Third, the phys-
(v1 ,v1 )
ical interpretation of 0 merits investigation. The asym-
(129) metric decoherence relates to time-reversal asymmetry.
Fourth, the sum in Eq. (129) relates to a sum over trajec-
tories, a marginalization over intermediate-measurement
15 This 0 should not be confused with the 0 in Theorem 1. outcomes.
33
Relationship
n between scrambling and completeness of
o target, state target . Thermal states eH/T /Z are diffi-
|aihf |
S: The hf |ai in Sec. I A 3 forms a basis for D(H). cult to prepare, for example. How accurately was target
prepared? One may answer by comparing target with
But suppose that 0 6= . S fails to form a basis.
the KD quasiprobability A for .
What does this failure imply about W(t) and V ?
Reconstructing the KD quasiprobability requires
The failure is equivalent to the existence of a vanish-
a trivial sum over already-performed measurements
ing := |hw3 , w3 |U |v2 , v2 i|. Some vanishes if some
[Eq. (129)]. One could reconstruct independently via
degenerate eigensubspace H0 of W(t) is a degenerate
conventional quantum-state tomography [131]. The re-
eigensubspace of V : Every eigenspace of every Hermi-
tian operator has an orthogonal basis. H0 therefore has construction inferred from A may have lower precision,
an orthogonal basis. One basis element can be labeled due to the multiplicity of weak measurements and to the
U |w3 , w3 i; and the other, |v2 , v2 i. sum. But independent tomography would likely require
The sharing of an eigensubspace is equivalent to the extra measurements, exponentially many in the system
commutation of some component of W(t) with some com- size. Inferring A requires exponentially many measure-
ponent of V . The operators more likely commute before ments, granted.16 But, from these measurements, one
the scrambling time t than after. Scrambling is therefore can infer A , the OTOC, and . Upon reconstructing
expected to magnify the similarity between the OTOC the KD distribution for , one can recover a matrix rep-
quasiprobability A and the conventional KD distribu- resentation for via an integral transform [11].
tion. The asymmetrically decohered 0 : What does the de-
Let us illustrate with an extreme case. Suppose that composed operator 0 signify? 0 has the following prop-
all the s lie as far from zero as possible: erties: The term subtracted off in Eq. (126) has trace
zero. Hence 0 has trace one, like a density operator.
1 But the subtracted-off term is not Hermitian. Hence 0
= . (133) is not Hermitian, unlike a density operator. Nor is 0
d
anti-Hermitian, necessarily unitarity, or necessarily anti-
Equation (133) implies that W(t) and V eigenbases are unitary.
mutually unbiased biases (MUBs) [129]. MUBs are eigen- 0 plays none of the familiar rolesof state, observ-
bases of operators that maximize the lower bound in an able, or time-evolution operatorin quantum theory.
uncertainty relation [130]. If you prepare any eigen- The physical significance of 0 is not clear. Similar
state of one operator (e.g., U |w` , w` i) and measure quantities appear in weak-measurement theory: First,
the other operator (e.g., V ), all the possible outcomes non-Hermitian products BA of observables have been
have equal likelihoods. You have no information with measured weakly (see Sec. V B and [3234]). Second,
which to predict the outcome; your ignorance is maxi- nonsymmetrized correlation functions characterize quan-
mal. W(t) and V are maximally incompatible, in the tum detectors of photon absorptions and emissions [124].
quantum-information (QI) sense of entropic uncertainty Weak measurements imbue these examples with physi-
relations. Consistency between this QI sense of mutu- cal significance. We might therefore expect 0 to have
ally incompatible and the OTOC sense might be ex- physical significance. Additionally, since 0 is non-
pected: W(t) and V eigenbases might be expected to Hermitian, non-Hermitian quantum mechanics might of-
form MUBs after the scrambling time t . We elaborate fer insights [132].
on this possibility in Sec. VI C. The subtraction in Eq. (126) constitutes a removal of
KD quasiprobabilities are typically evaluated on coherences. But the subtraction is not equivalent to a de-
MUBs, such as position and momentum eigenbases [10, cohering channel [112], which outputs a density operator.
11, 34]. One therefore might expect A to relate more Hence our description of the decoherence as asymmetric.
closely the KD quasiprobability after t than before. The The asymmetry relates to the breaking time-reversal
OTOC motivates a generalization of KD studies beyond invariance. Let U |w3 , w3 i =: |w
3 i be fixed throughout
MUBs. the following argument (be represented, relative to any
Application: Evaluating a state preparations accuracy: given basis, by a fixed list of numbers). Suppose that
Experimentalists wish to measure the OTOC F (t) at = eH/T /Z. The removal of hv2 , v2 ||w 3 i terms from
each of many times t. One may therefore wish to measure is equivalent to the removal of hv2 , v2 |H|w 3 i terms from
A after t . Upon doing so, one may be able to infer not H: 7 0 H 7 H 0 . Imagine, temporarily, that H 0
only F (t), but also the accuracy with which one prepared could represent a Hamiltonian without being Hermitian.
the target initial state.
Suppose that, after t , some S that forms a basis for H.
Consider summing late-time A (.) values over (w2 , w2 ) 16 One could measure, instead of A , the coarse-grained quasiprob-
and (v1 , v1 ). The sum equals a KD quasiprobability ability A =:
P
degeneracies A (Sec. II A). From A , one could
for . The quasiprobability encodes all the information
infer the OTOC. Measuring A would require exponentially fewer
in [10, 11]. One can reconstruct the state that one measurements. But from A , one could not infer the KD distri-
prepared [3234]. bution. One could infer a coarse-grained KD distribution, akin
The prepared state might differ from the desired, or to a block-diagonal matrix representation for .
34
H 0 would generate a time evolution under which |w 3 i to probability values. Summing ATOC
under constraints
could not evolve into |v2 , v2 i. But |v2 , v2 i could evolve yields a complex distribution PTOC (W, W 0 ). The TOC
into |w3 i. The forward process would be allowed; the FTOC (t) equals a moment of PTOC (W, W 0 ), obeying a
reverse would be forbidden. Hence 7 0 relates to a Jarzynski-like equality.
breaking of time-reversal symmetry.
Interpretation of the sum in Eq. (129): Summing

A (.) values, in Eq. (129), yields a decomposition co- 1. Time-ordered correlator FTOC (t)
efficient C of 0 . Imagine introducing that sum into
Eq. (125). The OTOC quasiprobability A (.) would The OTOC equals a term in the expectation value h.i of
become a KD quasiprobability. Consider applying this the squared magnitude |.|2 of a commutator [. , .] [6, 30],
summed Eq. (125) in Eq. (122). We would change from

retrodicting about V W(t)V to retrodicting about the C(t) := [W(t), V ] [W(t), V ] (134)
leftmost V .

= W (t)V V W(t) V W (t)W(t)V
D. Relationship between out-of-time ordering and + 2<(F (t)) . (135)
quasiprobabilities
The second term is a time-ordered correlator (TOC),

The OTOC has been shown to equal a moment of FTOC (t) := V W (t)W(t)V . (136)
the complex distribution P (W, W 0 ) [37]. This equal-

ity echoes Jarzynskis [38]. Jarzynskis equality governs The first term, W (t)V V W(t) , exhibits similar
out-of-equilibrium statistical mechanics. Examples in- physics. Each term evaluates to one if W and V are
clude a quantum oscillator whose potential is dragged unitary. If W and V are nonunitary Hermitian opera-
quickly [133]. With such nonequilibrium systems, one tors, the TOC reaches its equilibrium value by the dissi-
can associate a difficult-to-measure, but useful, free- pation time td < t (Sec. I C). The TOC fails to reflect
energy difference F . Jarzynski

cast F in terms of the scrambling, which generates the OTOCs Lyapunov-type
characteristic function eW of a probability distribu- behavior at t (td , t ).
tion P (W ).17 Similarly, the difficult-to-measure, but use-
ful, OTOC F (t)D has been castE in terms of the character-
0
W 0) 2. TOC probability amplitude ATOC
istic function e(W + of the summed quasiprob-
ability P (W, W 0 ) [37].

We define
Jarzynskis classical probability must be replaced with
a quasiprobability because [W(t), V ] = 0. This re- ATOC (j; v1 , v1 ; w1 , w1 )

placement appeals to intuition: Noncommutation and
quasiprobabilities reflect nonclassicality as commuting := hw1 , w1 |U |v1 v1 ihv1 v1 |ji pj (137)
operators and probabilities do not. The OTOC registers
as the TOC probability amplitude. ATOC
governs a quan-
quantum-information scrambling unregistered by time- TOC
ordered correlators (TOCs). One might expect TOCs to tum process PA . Figure 18a, analogous to Fig. 2a,
TOC
equal moments of coarse-grained quasiprobabilities closer depicts PA , analogous to the PA in Sec. I D 1:
to probabilities than A is. (1) Prepare .
We prove this expectation. First, we review the TOC
FTOC (t). Then, we introduce the TOC analog ATOC of (2) Measure the eigenbasis, {|jihj|}.
the probability amplitude A [Eq. (27)]. A encodes no
time reversals, as expected. Multiplying a forward ampli- (3) Measure V .

tude ATOC
by a backward amplitude ATOC yields the (4) Evolve the system forward in time under U .
TOC quasiprobability ATOC . Inferring ATOC
requires
TOC
(5) Measure W.
only two weak measurements per trial. A reduces
to a probability if = V [Eq. (30)]. In contrast, un- Equation (137) represents the probability amplitude as-
der no known condition on do all A (.) values reduce sociated with the measurements yielding the outcomes
j, (v1 , v1 ), and (w1 , w1 ), in that order. All the measure-
ments are strong. ATOC is not a protocol for measuring
ATOC
. Rather, P TOC
A facilitates the physical interpre-
17 Let P (W ) denote a probability distribution over a random vari-
tation of ATOC .
able W . The characteristic function G(s) equals the Fourier
TOC
isW . Defining s as an imaginary-time PA results from eliminating, from PA , the initial
R
transform: G(s) := dW e
variable, is , yields eW . Jarzynskis equality reads, measurement, and U . A encodes two time rever-

W U, W
F
e =e . sals. ATOC encodes none, as one might expect.
35
case. The middle outcome can differ. This process cor-

Measure W. (w1 , w1 )
responds to the probability amplitude
ATOC
(j; v2 , v2 ; w1 , w1 )
-t
= hw1 , w1 |U |v2 , v2 |v2 , v2 |ji pj . (138)
U
Experiment Complex conjugation reverses the inner products, yield-
time
0 ing the reverse processs amplitude.
We multiply this reverse amplitude by the forward am-
Measure V . (v1 , v1 ) plitude (137). Summing over j yields the TOC quasiprob-
Prepare
.
Measure ability:
{|jihj|}. j
ATOC
(v1 , v1 ; w1 , w1 ; v2 , v2 )
X
(a) := ATOC
(j; v2 , v2 ; w1 , w1 ) ATOC
(j; v1 , v1 ; w1 , w1 )
j
(139)

Measure W. (w1 , w1 )

= hv2 , v2 |U |w1 , w1 ihw1 , w1 |U |v1 , v1 ||v2 , v2 i .
-t (140)
U
Like A , ATOC
is an extended Kirkwood-Dirac
Experiment
time quasiprobability.
ATOC is 2-extended, whereas A
0
is 3-extended. A TOC
can be inferred from a weak-
Measure V . (v2 , v2 )
measurement protocol P TOC :
Prepare
Measure
.
{|jihj|}. j (1) Prepare .
(2) Measure V weakly.
(b)
FIG. 18: Quantum processes described by the weakly.
(4) Measure W
probability amplitudes ATOC in the time-ordered
correlator (TOC) FTOC (t): FTOC (t), like F (t), equals a (5) Evolve the system backward under U .
moment of a summed quasiprobability (Theorem 4). The
quasiprobability, ATOC , equals a sum of multiplied (6) Measure V strongly.
probability amplitudes ATOC [Eq. (139)]. Each product P TOC
requires just two weak measurements. The weak-
contains two factors: ATOC (j; v1 , v1 ; w1 , w1 ) denotes the measurement protocol P for inferring A requires three.
probability amplitude associated with the forward process P TOC requires one time reversal; P requires two.
in Fig. 18a. The system, S, is prepared in a state . The
In a simple case, every ATOC (.) value reduces to a
eigenbasis {|jihj|} is measured, yielding outcome j. V is
measured, yielding outcome (v1 , v1 ). S is evolved forward probability value. Suppose that shares the V eigen-
in time under the unitary U . W is measured, yielding basis, as in Eq. (30). The (v2 , v2 ) in Eq. (140) comes
outcome (w1 , w1 ). Along the abscissa runs the time to equal (v1 , v1 ); Figures 18a and 18b become identical.
measured by a laboratory clock. Along the ordinate runs the Equation (140) reduces to
t in U := eiHt . The second factor in each ATOC product is
ATOC

ATOC (j; v2 , v2 ; w1 , w1 ) . This factor relates to the process V (v1 , v1 ; w1 , w1 ; v2 , v2 ) (141)

2
in Fig. 18b. The operations are those in Fig. 18a. The = |hw1 , w1 |U |v1 , v1 i| pv1 ,v1 v1 v2 v1 v2 (142)
processes initial measurements yield the same outcome. So
do the final measurements. The middle outcomes might = p(w1 , w1 |v1 , v1 ) pv1 ,v1 v1 v2 v1 v2 (143)
differ. Complex-conjugating ATOC yields the probability = p(v1 , v1 ; w1 , w1 ) v1 v2 v1 v2 . (144)
amplitude associated with the reverse process. Figures 18a
and 18b depict no time reversals. Each analogous OTOC The p(a|b) denotes the conditional probability that, if
figure (Fig. 2a and Fig. 2b) depicts two. b has occurred, a will occur. p(a; b) denotes the joint
probability that a and b will occur.
All values ATOC
V (.) of the TOC quasiprobability have
reduced to probability values. Not all values of AV re-
3. TOC quasiprobability ATOC

duce: The values associated with (v2 , v2 ) = (v1 , v1 )
or (w3 , w3 ) = (w2 , w2 ) reduce to products of probabil-
TOC
Consider a PA implementation that yields the out- ities. [See the analysis around Eq. (31).] The OTOC
comes j, (v2 , v2 ), and (w1 , w1 ). Such an implementa- quasiprobability encodes nonclassicalityviolations of
tion appears in Fig. 18b. The first and last outcomes [j the axioms of probabilitymore resilient than the TOC
and (w1 , w1 )] equal those in Fig. 18a, as in the OTOC quasiprobabilitys.
) 2 function of, WWTOC and WTOC 0 reducedner
(w
all
reduc, to
FA(K . .. .The PTOC average of
0 proba
The 22 w
hW(t)Vhas . . . W(t)Vhf (WiTOC W(t)V
)i W(t)V (14
K(t) :=OTOC Tr( Let) f. can
TOC than the
All TO
valu
ha(
probabilitie
probab
-fold | {z been defined
PTOC
} X
forms [112,
a complex
|and 113]: distribution.
0 0 {z } average
denote
OTOC
reduce
can
,ity.
OTOC
bAll
qua
r
function
hf
:= (W TOCof ,W W 0
TOC f (W)i TOCW , W
TOC . The
)P P
TOC (W
TOC TOC
(w
, W
than ucts
reduced
0reduce
(w
the
than
probabilitie
can
prob
probaof
reduc
),
(1T)
th
to 2 w22
2K PTOC X formsTOC a complex distribution.

TOC
2 K Let fOTOCdenoteTOC
4. can Com Al re
W
function ,W
of, W
WTOC and WW 0 0 0 (wnew ner
OTOC
,(w qua
redu , ) 2 w22
. .. .The PTOC
)average
0 of
F (K ) (t) := hW(t)V . . . W(t)V
0 TOC TOC
:=(WiTOC
hf Tr(TOC f (W TOC , TOC
W(t)V
)i TOC )P (144)
W(t)V
TOC (WTOC ,W
.than
Let
the
than
probabilit
4.proba
WTOC
can4.(1
Com
Tt
| {z } X | 0 {z } OTOC wea
ity. (wqu
OTOC
TOC

:=
W ,W
hf (WTOC , WTOC 0
f (W TOC
)iTOC , WTOC 36
0
TOC

)PTOC (WTOC
ogous
W , Let
W
than
to0the
than
4. TOC
th,
=Com
prob
Let (wtTW
=
2
comeK
W 4. TOC
2K X
5. Jarzynski-like 2K for the TOC
equality
,Heisenberg- TO
Each such correlation function := contains
W ,W
f (WK
TOC
W 0
0
TOC
)P
ogous new
sextu
ogousOTO
to0 t
t
TOC
TOC TOC (144)TOC
(W , W
constraints.
W
bilitytarie
WLet4.W
than
A Let
TO=Co
4. TOC
comewea
TOC TOC
TO
picture operators W(t) interleaved W 5.,Wwith K
Jarzynski-like time-0
equality oper-
0
The TOC obeys a Jarzynski-like equality analogous
for the TOC ogous
come
sextu
to
ogous th
s
0
Each such correlation Heisenberg- TOC TOC isfy the constra
sure cons
4. Complex TOC distribution PTOC (WTOC , WTOC ) .function contains K
Wconstraints
WLet = 4
K) K
Let W A
bility TOC TOC
ators
Each Vsuch F (correlation
(t) encodes Eq.2K
function
(11) in5.[31].
contains
Jarzynski-like
1 obeys
= time
K
equality
Heisenberg-
Ka Jarzynski-like for the TOC
reversals, come
bility
Pogous sextu
A
come
ogous
(W
isfy to
the TOC
TOCT
TO
W(t)operators
picture operatorsillustrated interleaved with The TOC time-0 equality analogou
W sure
constraints
isfy W
tari
the
constr con
K Koper-
TOC
Le =
picture in Fig. 19. W(t) We focus
Theorem
interleaved
Eq. The(11)
TOC
on Hermitian
in45.[31].
(Jarzynski-like
with
Jarzynski-like
obeys a
W
theorem
time-0 and
equality
Jarzynski-like
forVthe
oper-
for
equality
, TOC
the TOC).
bility come
come
P Athe
analogou

PA
bility
sex
ogou
T TOC TOC
TOC
TOC
(K) as in [6, 125], time-ordered correlator (132) obeys the Jarzynski-l isfy to W th
isfy
sure
(W
constr (w
con th
TOC T
Let WTOC and WTOC 0

denote random variables anal- ators V . F (t)ators
encodes
V . F 2K ( K for
) simplicity.
1 =equality
(t) encodes Eq. K
2The
TheoremK time
(11) in 4[31].
1 = reversals,
K
(Jarzynski-like time reversals,
theorem for the TOC).
constraint
bility
P isfy
bility
AAth
come
(W
P TOCTOC
TO
T
TO
T
The conventional OTOC TOC obeys a to

corresponds
time-ordered correlator K
Jarzynski-like
(132) 3 equality
=obeys andthe
analogou
A
isfy the
sure
Functions W
const co (
TO
illustrated infocus
Fig.(2)19. We focus(11) on Hermitian W ,and VEthe, Jarzynski-
illustrated in Fig. 19. We on Hermitian @(2WK)D and V+obeys
Theorem in4 [31].
(Jarzynski-like theorem for fTOC).
denote P anT
bility
ogous to thermodynamic work. We fix the constraints K = 2: F (t) = F (t). If
Eq.
K
equality <
time-ordered 3, Fcorrelator (t) is
W not
( (132) OTO.
W the
)
P A Func
average
Jarzynski-
0 to (W
of
isfy
0 A (t
ft
TOC

TOC
TOC
as in [6, 125], for simplicity. FTOC (t) = e TOC f deno

, (14
Function TOC
WTOC = w1 v2 and WTOC 0

= w1 v1 . (w1 and v2 need as in [6, 28], for simplicity. @ 0
Theorem 4@ (Jarzynski-like
equality theorem for the fhfTOC). (W average
denote
A X
PAa,W
W
TOC
TO
@ 2 Dto( K E , average
Function 0 TOC
The conventional OTOC F
corresponds
time-ordered
(t) = correlator e (132)
W = 3 and
+obeys
W the
)
=0
Jarzynski
f:=denote
0
Fun
hf (W
0
f, den
of
(1 a

The conventional K =OTOC2: F (t) =corresponds to, 3,0 K @(@K= 0 ) 3 and

TOC
D E TOC
not be complex-conjugated because they are real, as W(t) W(t) F (2) (t). wherein
TOC
Ifequality
K < 2@FR. 2
(t)( isW not + W OTO. )
hf (W
average
0 :=of
averag
0
f den
W
Fun
X
A WTOC ,W
TOC
(F ) (t) = @ W(t) e , hf(1 TOC Functio TOC 0
K = 2: F (t) = F (2)
TOC , hf=0
K 0 (W
f:=denote
WT
t (t). If K < 3, FProof. (t)proof

is @@not D OTO.
averag
(W
W and V are Hermitian.) Multiple outcome sextuples
TOC
The is2 analogous =0 X
to the proofEof, Theorem
average Fu
of
0 5.,
WTOC
wherein , 0 2 R. ( W
in [31]. + W
FTOC (t) = ) := hf :=(W
e TOC
0 0
TOC Con f, de(
(v2 , v2 ; w1 , w1 ; v1 , v1 ) satisfy these constraints. Each W(t)
wherein , 0 2@ R.@ 0
W(t) Proof. The proofW(t)
Thehf (W avera
:=
TOC
X
WTOC ,W
TO
W
... is analogous to the proof ofEq. , Theore 0

Cert
=0The
(11)
:= in
hf ([
5 W
sextuple corresponds to a quasiprobability ATOC (.). We t

U U Proof.
in [31].
wherein
E.
in [31].
The
,
proof
Higher-order0
2 R.
is analogous
OTOCs
to
as
the proof
moments
of
of
Theore
Theorem Eq. (11
stra
Con
longer
The
time-ordered
Theor
TOC:=
54 WTOC
sum the quasiprobabilities that satisfy the constraints: W(t) W(t) W(t) (summed) quasiprobabilities equality Eq. The (11) TOC
The
time-or
thes
in
Proof. The proof is analogous
t U U inE. ...
[31].Higher-order OTOCs as
Dierentiating a characteristic
to the proof ofEq.
Experiment
time
moments of
function of
Theore
Theorem
F longe
Cer
(11)
Eq.
equality
time-ordere
Theorem
again
The
(t) (1
Eq.
Theo
stra
TOC
in
=
(1
a
X 0

E. Higher-order
again yields higher-
(summed)
(summed)
OTOCs as moments
quasiprobabilities
and quasiprobabilities
higher-point correlation
longe
The
equality
time-ordere
Eq.P(11)
equality
F TO
time-o
(K
Theo
fu
equali Th i
TOC
0 wherein time-o
thes ,
PTOC (WTOC , WTOC ) := V V V
(v1 ,v1 ),(w1 ,w1 ),(v2 ,v2 )

U U
0 st
...
tions.
versals.
So does dierentiating
Dierentiating
E. . .Higher-order
.
ButDierentiating
each resulting
again yields higher-
Let K
(summed)
0 th
a characteristic
acorrelator
=W(t)V (2K pair
P (W, W 0 ) again and
Experiment
OTOCs as moments
encodes
characteristic
1and quasiprobabilities
higher-point
+ 1), for K
function of
time function
just three
= 1,
Theorem
again
F longe
time-order
again
wherein
2,P
:=
time
equality
correlation
Eq.
(t) (=
equalit
F whereinaga
(t) =
F
The
F fu
time
. (K
.where
TOC
TOC TO
..fu , TO
1 W(t)V pair again yields
V K
tions.
V 0
-fold
SoOTOCdoeshigher-has
2and
dierentiating
been V
higher-point
P (W,
defined [91,
0 correlation
W92]:
0
) againwhereinand equa
ag ,W
tions. .So. .does dierentiating Pencodes
(W, W just ) again F
andtime ag
wherei(t) TOC
ATOC
(v1 , v1 ; w1 , w1 ; v2 , v2 ) W (w1 v2 ) W 0 (w1 v1 ) . 1ststtime
ButDierentiating
But
versals.
again
each
each
(K ) yields
resulting
. . Let
.resulting
(2 K 0
higher-
correlator
a
Experiment 1
characteristic
0 -1)correlator
=th.time (2K
12.and +encodes
1),
higher-point
function
for justK =
three
three1,
correlation
again
A
2, :=
time(KF
.. ..f
T
1 W(t)V pair Fversals. (t) :=LethW(t)V 0 th

|Kreversal =.time W(t)V +}i1),
2 (2Kdefined
Tr(
for K W(t)V
0| = 1,{z
. .2,. W(t
wherein
.wher
. ..
reversal K 0
0 -foldSo OTOC hasW(t)V
{zbeen pair [91, 92]:
(145) 0

tions.
K -fold. .OTOC
does dierentiating
has2K
P (W,
been defined [91, 92]:
But each. resulting correlator encodes just three
0
W ) again
2K
and
tim
ag 0
(K ) 0
1. W(t)V i Tr( W(t)V . . .W(
(14
(K
VFIG. 19:1stK
V-fold out-of-time-ordered
time Fversals.
(K ) (t) :=
F 0 (t) := hW(t)VV
Let hW(t)V
(2
| K 0 -1)th. .time
0
= (2K
correlator + 1), for
.2. W(t)V}i Tr( W(t)V
.{z K | = 1, A
2,
. . W(
.{z .. .
reversal {z
[Eq.reversal
| has2K {z } contains |K 0 Heisenbe
PTOC forms a complex distribution. Let f denote any (OTOC): . . . Each
The conventional KOTOC -fold
such OTOCcorrelation
operators W(t)
(22)],
2K
been
function defined
encodes 0
0
[91,
just 92]:
interleaved with K time-0op 0
2K
2K
0
0
picture T (1
0 three time reversals. The K-fold
1st W(t)V pair th ) OTOC
(KW(t)V (K pair
F V .(t)F := 0hW(t)V
19: K-fold out-of-time-ordered F (K ) (t) encodes
0
. . . W(t)V
2K }i K W(t)V ...W
(1
function of WTOC and WTOC . The PTOC average of f is FIG.
K 1 = KThe
2(OTOC): = conventional
ators
3, 5, . . . timeEach
)
|(t) encodes
such correlation
reversals. correlation
The
correlator
{zfunction
time
0
1=
measured
Tr(
contains |K
time reversa
{z
K 00 Heisenb
Heisenb
...
Each
OTOC such
picturethe [Eq.
operators (22)],
2K function
0
W(t) interleaved
contains
encodes
interleaved just
with K 0 2K(K
G
time-0
K 0 time-0
0
op
by a laboratory clock runs along picture
abscissa.
operators W(t)
(K ) The ordinate
with T(
op
0 1st time (2
three time reversals. The Kators -1)
ators th time(K 00 )
-fold .. OTOC (t) F
encodes(t) 2Kencodes
0
hf (WTOC , WTOC )i (146) reversal
represents
the time parameter t,VVwhich F (K ) (t)
F may encodes 2K
be inverted 0
11 =
=K time revers
inK time revers
2K reversal
Each such correlation
1 = K = 3, 5, . . . time reversals. The time measured function contains K 0
Heisenb
X experiments. The orange, leftmost
by a laboratory clock runs along picturethe dot represents
operators the state
W(t) interleaved
abscissa. The ordinate with K 0 time-0G (K
Po
0 0 preparation . Each green dot represents W(t) or
:= f (WTOC , WTOC )PTOC (WTOC , WTOC ). V . F (K ) (t)a encodes 2Ka0 V1. = K time rever
0
ators
FIG. 19: K-fold represents
Each
the time parameter t,correlator
out-of-time-ordered
purple line represents a unitary
which may be inverted in
time evolution. The
0
WTOC ,WTOC experiments. The orange, leftmost dot represents(Kthe state
(OTOC): The conventional
diagram, scanned OTOC from left[Eq. to (23)], encodes
right, represents Fjust) (t), P
preparation . Each green dot represents a W(t) or a V .
scanned fromleft to right. (K)
three time reversals.
EachThe purple -fold
K line OTOCa F
represents unitary(t) encodes
time evolution. The

2K 1 = K = 3,diagram,
5, . . . time
scannedreversals.
from left The time
to right, measured
represents F (K ) (t),
(K ) The
5. Jarzynski-like equality for the TOC Theruns
scanned
by a laboratory clock greater
from thetothe
left
along K , abscissa.
the longer The
right. the distribution
ordinate P
tem
of which F (K ) (t) equals a moment. We define P (K ) in
represents the time
three
parameter t,recall
which may be inverted in liter
The
Thesteps:
experiments. TheAorange,
Wethe
greater
leftmost Kdot,the
the -extended
Klonger
represents thethe quasiprobability
distribution
state P (K ) feren
The TOC obeys a Jarzynski-like equality analogous to (K )
[Eq.F(91)].
(K ) We introduce measurable random(K tem
of which (t) equals a moment. We define P vari- )
in
preparation . Each
ables green dotWrepresents
0 a W(t) or a V. liter
Eq. (11) in [37]. three W ` andWe
steps: 0. These
`recall the K variables
-extendedparticipate in con-
quasiprobability The
Each purple line represents
(K ) a unitary (Ktime
)
Aintroduce
evolution. The fere
A
straints on sums
[Eq. (91)]. of We (.) values.
measurable
random vari- OTO
diagram, scanned ables
from W left to right,0 represents F (K ) (t),
` and W`0 . These variables participate in con-
Theorem 4 (Jarzynski-like theorem for the TOC). The Th
scanned from left straints
to right. on sums of A
(K )
(.) values. OT
time-ordered correlator (136) obeys the Jarzynski-like
equality
The greater the K , the longer the distribution P (K )

2 D (WTOC + 0 WTOC E of which F (K ) (t) equals a moment. We define P (K ) in
0
)
FTOC (t) = e , (147) three steps: We recall the K -extended quasiprobability
0 0 (K )
, =0 A [Eq. (95)]. We introduce measurable random vari-
ables W` and W`00 . These variables participate in con-
wherein , 0 R. (K )
straints on sums of A (.) values.
Proof. The proof is analogous to the proof of Theorem 1 Let us evaluate Eq. (95) on particular arguments:
in [37].
A(K

)
(v1 , v1 ; w2 , w2 ; . . . ; vK , vK ; wK+1 , wK+1 )
= hwK+1 , wK+1 |U |vK , vK ihvK , vK |U |wK , wK i
E. Higher-order OTOCs as moments of longer
(summed) quasiprobabilities . . . hw2 , w2 |U |v1 , v1 ihv1 , v1 |U |wK+1 , wK+1 i .
(149)
Differentiating a characteristic function again and
(K )
again yields higher- and higher-point correlation func- One can infer A from the interferometry scheme
tions. So does differentiating P (W, W 0 ) again and again. in [37] and from weak measurements. Upon implement-
But each resulting correlator encodes just K = 3 time ing one batch of the interferometry trials, one can infer
reversals. Let K = 21 (K +1) = 2, 3, . . ., for K = 3, 5, . . . (K )
A for all K -values: One has measured all the inner
A K-fold OTOC has been defined [121, 122]: products ha|U|bi. Multiplying together arbitrarily many
inner products yields an arbitrarily high-K quasiproba-
(K )
F (K ) (t) := hW(t)V . . . W(t)V i Tr( W(t)V . . . W(t)V ) . bility. Having inferred some A , one need not perform
| {z } | {z } (K +2) (K )
2K 2K new experiments to infer A . To infer A from
(148) weak measurements, one first prepares . One performs
37
K = 2K 1 weak measurements interspersed with uni- of that papers Eq. (15):

taries. (One measures V weakly, evolves with U , mea-
sures W weakly, evolves with U , etc.) Finally, one mea- G (K ) (s2 , . . . , sK+1 , s01 , . . . , s0K+1 ) = Tr(
sures W strongly. The strong measurement corresponds "
X
to the anomalous index K + 1 in (wK+1 , wK+1 ). U
|wK+1 , wK+1 ihwK+1 , wK+1 |
We define 2K random variables (wK+1 ,w
K+1
)
#
isK+1 wK+1
e U
W` {w` } ` = 2, 3, . . . , K + 1 and (150)
W`00 {v`0 } `0 = 1, 2, . . . , K . (151) " #
X
is0K vK
|vK , vK ihvK , vK |e ...
Consider fixing the values of the W` s and the W`00 s. (vK ,v
K
)
(K )
" #!
Certain quasiprobability values A (.) satisfy the con- X
is01 v1
straints W` = w` and W`0 = v`0 for all ` and `0 . Summing
0 |v1 , v1 ihv1 , v1 |e . (155)
these quasiprobability values yields (v1 ,v1 )
P (K ) (W2 , W3 , . . . , WK+1 , W10 , W20 , . . . , WK

0
) (152)
X X
:= The greater the K , the longer the quasiprobability
(K )
W2 ,W3 ,...,WK+1 W10 ,W20 ,...,W 0 A . The more weak measurements are required to
K
(K ) (K )
infer A . Differentiating A more does not raise
A(K

)
(v1 , v1 ; w2 , w2 ; . . . ; vK , vK ; wK+1 , wK+1 )
the number of time reversals encoded in the correlator.
W2 w2 . . . WK+1 wK+1 W10 v1 . . . WK 0 v
K .
VI. OUTLOOK
(K )
The characteristic function of P is
We have characterized the quasiprobability A that
X X
G (K ) (s2 , . . . , sK+1 , s01 , . . . , s0K ) := lies behind the OTOC F (t). A , we have argued, is
W2 ,...,WK+1 W10 ,...,W 0
an extension of the Kirkwood-Dirac distribution used
K
in quantum optics. We have analyzed and simplified
P (K ) (W2 , W3 , . . . , WK+1 , W10 , W20 , . . . , WK0
) measurement protocols for A , calculated A numeri-
0 0 0 0
cally and on average over Brownian circuits, and inves-
is2 W2 isK+1 WK+1 is1 W1
e ... e e . . . eisk WK . tigated mathematical properties. This work redounds
(153) upon quantum chaos, quasiprobability theory, and weak-
measurement physics. As the OTOC equals a combina-
tion of A (.) values, A provides more-fundamental infor-
The s` and s0`0 variables are regarded as imaginary-
mation about scrambling. The OTOC motivates general-
temperature variables, to parallel the fluctuation-relation
izations of, and fundamental questions about, KD theory.
literature (e.g., [134]): is` ` , and is0`0 `0 0 . Dif-
The OTOC also suggests a new application of sequential
ferentiating G (K ) yields the K-fold OTOC. weak measurements.
At this intersection of fields lie many opportunities.
Theorem 5 (Jarzynski-like equality for the K-fold We classify the opportunities by the tools that inspired
OTOC). The K-fold OTOC obeys the Jarzynski-like them: experiments, calculations, and abstract theory.
equality
A. Experimental opportunities
(K) 2K
F (t) =
2 . . . K+1 10 . . . K 0

+ We expect the weak-measurement scheme for A and
*
KX+1 K
X F (t) to be realizable in the immediate future. Candi-

exp ` W ` + `0 0 W`00 . date platforms include superconducting qubits, trapped
0
ions, ultracold atoms, cavity QED, and perhaps NMR.
`=2 ` =1 ` ,`0 0 =0 `,`0
(154) Experimentalists have developed key tools required to
implement the protocol [1013, 3135, 50, 63].
Achievable control and dissipation must be compared
Proof. The proof proceeds in analogy with the proof of with the conditions needed to infer the OTOC. Errors
Theorem 1 in [37]. For clarity, we emphasize the analog might be mitigated with tools under investigation [114].
38
B. Opportunities motivated by calculations that, in some cases, the distribution over possible val-
ues of |hv2 , v2 |U |w3 , w3 i| peaks at 1d . But the dis-
Numerical simulations and analytical calculations tribution approaches this form before t . Also, the dis-
point to three opportunities. tributions width seems constant in d. Further study is
Physical models OTOC quasiprobabilities may be required. The overlap between OTOC and two QI def-
evaluated. The Sachdev-Ye-Kitaev model, for example, initions of scrambling have been explored already: (1)
scrambles quickly [29, 30]. The quasiprobabilitys func- When the OTOC is small, a tripartite information is
tional form may suggest new insights into chaos. Our negative [28]. (2) An OTOC-like function is propor-
Brownian-circuit calculation (Sec. IV), while a first step, tional to a frame potential that quantifies pseudorandom-
involves averages over unitaries. Summing quasiproba- ness [121]. The relationship between the OTOC and a
bilities can cause interference to dampen nonclassical be- third QI sense of incompatibilityMUBs and entropic
haviors [14]. Additionally, while unitary averages model uncertainty relationsmerits investigation.
chaotic evolution, explicit Hamiltonian evolution might Second, A effectively has four arguments, apart from
provide different insights. Explicit Hamiltonian evolution (Sec. V A). The KD quasiprobability has two. This
would also preclude the need to calculate higher moments doubling of indices parallels the Choi-Jamiolkowski (CJ)
of the quasiprobability. representation of quantum channels [117]. Hosur et al.
In some numerical plots, the real part <(A ) bifur- have, using the CJ representation, linked F (t) to the tri-
cates. These bifurcations resemble classical-chaos pitch- partite information [28]. The extended KD distribution
forks [118]. Classical-chaos plots bifurcate when a differ- might be linked to information-theoretic quantities simi-
ential equations equilibrium point branches into three. larly.
The OTOC quasiprobability A might be recast in terms Third, our P (W, W 0 ) and weak-measurement protocol
of equilibria. Such a recasting would strengthen the par- resemble analogs in [107, 108]. {See [109111] for frame-
allel between classical chaos and the OTOC. works similar to Solinas and Gasparinettis (S&Gs).}
Finally, the Brownian-circuit calculation has untied Yet [107, 108] concern quantum thermodynamics, not
threads. We calculated only the first moment of A . the OTOC. The similarity between the quasiprobabilities
Higher moments may encode physics less visible in F (t). in [107, 108] and those in [37], their weak-measurement
Also, evaluating certain components of A requires new protocol and ours, and the thermodynamic agendas
calculational tools. These tools merit development, then in [107, 108] and [37] suggest a connection between the
application to A . An example opportunity is discussed projects [105, 106]. The connection merits investigation
after Eq. (86). and might yield new insights. For instance, S&G cal-
culate the heat dissipated by an open quantum system
that absorbs work [107, Sec. IV]. OTOC theory focuses
C. Fundamental-theory opportunities on closed systems. Yet experimental systems are open.
Dissipation endangers measurements of F (t). Solinas and
Gasparinettis toolkit might facilitate predictions about,
Seven opportunities concern the mathematical proper-
and expose interesting physics in, open-system OTOCs.
ties and physical interpretations of A .
Fourth, W and W 0 suggest understudies for work in
The KD quasiprobability prompts the question, Is the
quantum thermodynamics. Thermodynamics sprouted
OTOC definition of maximal noncommutation consis-
during the 1800s, alongside steam engines and facto-
tent with the mutually-unbiased-bases definition? Re-
ries. How much work a system could outputhow much
call Sec. V C: n
We decomposed
o an operator 0 in terms
|aihf | orderly energy one could reliably drawheld practi-
of a set S = hf |ai of operators. In the KD- cal importance. Todays experimentalists draw energy
hf |ai6=0
quasiprobability literature, the bases Sa = {|ai} and from power plants. Quantifying work may be less critical
Sf = {|f i} tend to be mutually unbiased (MU): |hf |ai| = than it was 150 years ago. What can replace work in the
1 a, f . Let A and B denote operators that have todays growing incarnation of thermodynamics, quan-
d
MU eigenbases. Substituting A and B into an uncer- tum thermodynamics? Coherence relative to the energy
tainty relation maximizes the lower bound on an uncer- eigenbasis is being quantified [135, 136]. The OTOC sug-
tainty [130]. In this quantum-information (QI) sense, A gests alternatives: W and W 0 are random variables, anal-
and B noncommute maximally. ogous to work, natural to quantum-information scram-
bling. The potential roles of W and W 0 within quantum
In Sec. V C, Sa = {|v2 , v2 i}, and Sf = U |w3 , w3 i .
These Ss are eigenbases of V and W(t). When do thermodynamics merit exploration.
we expect these eigenbases to be MU, as in the KD- Fifth, relationships amongst three ideas were identified
quasiprobability literature? After the scrambling time recently:
t after F (t) decays to zerowhen W(t) and V non- (1) We have linked quasiprobabilities with the OTOC,
commute maximally in the OTOC sense. following [37].
The OTOC provides one definition of maximal non-
commutation. MUBs provide a QI definition. To what (2) Aleiner et al. [137] and Haehl et al. [138, 139] have
extent do these definitions overlap? Initial results show linked the OTOC with Schwinger-Keldysh path in-
39
tegrals. time irreversibility. Tools from non-Hermitian quantum

mechanics might offer insight [132].
(3) Hofer has linked Schwinger-Keldysh path integrals
with quasiprobabilities [126].
ACKNOWLEDGEMENTS
The three ideasquasiprobabilities, the OTOC, and
Schwinger-Keldysh path integralsform the nodes of the This research was supported by NSF grant PHY-
triangle in Fig. 20. The triangles legs were discovered 0803371. Partial support came from the Walter Burke
recently; their joinings can be probed further. For ex- Institute for Theoretical Physics at Caltech. The In-
ample, Hofer focuses on single-timefold path integrals. stitute for Quantum Information and Matter (IQIM) is
OTOC path integrals contain multiple timefolds [137 an NSF Physics Frontiers Center supported by the Gor-
139]. Just as Hofers quasiprobabilities involve fewer don and Betty Moore Foundation. NYH thanks Jor-
timefolds than the OTOC quasiprobability A , the TOC dan Cotler and Paolo Solinas for pointing out the paral-
quasiprobability ATOC
(139) can be inferred from fewer lel with [107, 108]; David Ding for asking whether A
weak measurements than A can. One might expect represents a state; Mukund Rangamani for discussing
K-fold OTOCs; Michele Campisi, Snir Gazit, John
Hofers quasiprobabilities to relate to ATOC
. Kindred
Goold, Jonathan Jones, Leigh Samuel Martin, Oskar
works, linking quasiprobabilities with out-of-time order-
Painter, and Norman Yao for discussing experiments;
ing, include [123127].
and Christopher D. White and Elizabeth Crosson for
discussing computational complexity. Parts of this pa-
OTOC per were developed while NYH was visiting the Stanford
ITP and UCL. BGS is supported by the Simons Founda-
tion, as part of the It From Qubit collaboration; through
a Simons Investigator Award to Senthil Todadri; and by
Aleiner et al. (2016) Yunger Halpern MURI grant W911NF-14-1-0003 from ARO. JD is sup-
Haehl et al. (2016) (2017) ported by ARO Grant No. W911NF-15-1-0496.
Appendix A MATHEMATICAL PROPERTIES

Schwinger- OF P (W, W 0 )
Quasi-
Keldysh Hofer (2017)
probability
path integral Summing A , with constraints, yields P (W, W 0 )
[Eq. (32)]. Hence properties of A (Sec. V A) imply prop-
erties of P (W, W 0 ).
FIG. 20: Three interrelated ideas: Relationships
amongst the out-of-time-ordered correlator, Property 8. P (W, W 0 ) is a map from a composition
quasiprobabilities, and Schwinger-Keldysh path integrals of two sets of complex numbers to the complex numbers:
were articulated recently. P : {W } {W 0 } C. The range is not necessarily real:
C R.
Sixth, the OTOC equals a moment of the complex dis- Summing quasiprobability values can eliminate non-
tribution P (W, W 0 ) [37]. The OTOC has been bounded classical behavior: Interference can reduce quasiproba-
with general-relativity and Lieb-Robinson tools [6, 71]. bilities nonreality and negativity. Property 6 consists of
A more information-theoretic bound might follow from an example. One might expect P (W, W 0 ), a sum of A (.)
the Jarzynski-like equality in [37]. values, to be real. Yet P (W, W 0 ) is nonreal in many nu-
Finally, the KD distribution consists of the coefficients merical simulations (Sec. III).
in a decomposition of a quantum state D(H) [10, 11]
Property 9. Marginalizing P (W, W 0 ) over one argu-
(Sec.
n IoA 3). is decomposed in terms of a set S :=
|aihf |
ment yields a probability if shares the V eigenbasis or
hf |ai of operators. S forms a basis for H only if
the W(t) eigenbasis.
hf |ai 6= 0 a, f . The inner product has been nonzero Consider marginalizing Eq. (32) over W 0 . The
in experiments, because {|ai} and {|f i} are chosen to (w2 , w2 ) and (v1 , v1 ) sums can be performed explicitly:
be mutually unbiased bases (MUBs): They are eigen- X
bases of maximally noncommuting observables. The P (W ) := P (W, W 0 ) (A1)
OTOC, evaluated before the scrambling time t = t , mo- W0
tivates a generalization beyond MUBs. What if, F (t) X
= hw3 , w3 |U |v2 , v2 ihv2 , v2 |U |w3 , w3 i
prompts us to ask, hf |ai = 0 for some a, f (Sec. V C)?
(v2 ,v2 ),
The decomposition comes to be of an asymmetrically (w3 ,w3 )
decohered 0 . This decoherences physical significance
merits investigation. The asymmetry appears related to W (w3 v2 ) . (A2)
40
The final expression is not obviously a probability. Consider inferring A(1/d) or A(1/d) from weak mea-
But suppose that shares its eigenbasis with V or surements. From one trial, we infer about four random

with W(t). Suppose, for example, that has the form in variables: v1 , w2 , v2 and w3 . Each variable equals 1.
Eq. (30). Equation (A2) simplifies: The quadruple (v1 , w2 , v2 , w3 ) therefore assumes one of
X sixteen possible values. These four base variables are
P (W ) = p(v2 , v2 ; w3 , w3 ) W (w3 v2 ) . (A3) multiplied to form the composite variables W and W 0 .
(v2 ,v2 ), The tuple (W, W 0 ) assumes one of four possible values.
(w3 ,w3 ) Every (W, W 0 ) value can be formed from each of four val-
ues of (v1 , w2 , v2 , w3 ). Table II lists the tuple-quadruple
The p(v2 , v2 ; w3 , w3 ) := |hw3 , w3 |U |v2 , v2 i|2 pv2 ,v2 correspondences.
denotes the joint probability that a V measurement of Consider any quadruple associated with (W, W 0 ) =
yields (v2 , v2 ) and, after a subsequent evolution under (1, 1), e.g., (1, 1, 1, 1). Consider swapping w2 with w3
U, a W measurement yields (w3 , w ). and swapping v1 with v2 . The result, e.g., (1, 1, 1, 1),
3
Every factor in Eq. (A3) is nonnegative. Summing over leads to (W, W 0 ) = (1, 1). This double swap amounts
W yields a sum over the arguments to a cyclic permutation of the quadruples elements. This
P of A (.). The latter permutation is equivalent to a cyclic permutation of the
sum equals one, by Property 6: W P (W ) = 1. Hence
P (W ) [0, 1]. Hence P (W ) behaves as a probability. argument of the (A6) trace. This permutation pre-
We can generalize Property 9 to arbitrary Gibbs states serves the traces value while transforming the trace into
= eH/T /Z, using the regulated quasiprobability (106). P (1, 1). The trace originally equaled P (1, 1). Hence
The regulated OTOC (104) equals a moment of the com- P (1, 1) = P (1, 1).
plex distribution
X
Preg (W, W 0 ) := (A4) Appendix B RETRODICTION ABOUT THE
(v1 ,v1 ),(w2 ,w2 ),(v2 ,v2 )(w3 ,w3 ) SYMMETRIZED COMPOSITE OBSERVABLE
:= i(K . . . A A . . . K)
Areg
(v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) W (w3 v2 ) W 0 (w2 v1 ) .
The proof is analogous to the proof of Theorem 1 in [37]. Section V B concerns retrodiction about the sym-
W0 metrized observable := K . . . A + A . . . K. The prod-
PSumming over yields Preg (W ) :=
:= i(K . . . A
0
W 0 Preg (W, W ). We substitute in from Eq. (A4), then
uct K . . . A is symmetrized also in
for Areg from Eq. (106). We perform the sum over W 0 using K -extended
A . . . K). One can retrodict about ,

(K )
explicitly, then the sums over (w2 , w2 ) and (v1 , v1 ): KD quasiprobabilities A , similarly to in Theorem 1.
X The value most reasonably attributable retrodictively
Preg (W ) = |v2 , v i|2 W (w v ) .
|hw3 , w3 |U to the time-t0 value of is given by Eqs. (107), (108),
2 3 2
(v2 ,v2 ) and (110). The conditional quasiprobabilities on the
(w3 ,w3 ) right-hand sides of Eqs. (109) and (111) become
(A5)
=(hf 0 |kihk| . . . |aiha|0 |f 0 i)
This expression is real and nonnegative. Preg (W ) sums p (a, . . . , k, f |) = (B1)
hf 0 |0 |f 0 i
to one, as P (W ) does. Hence Preg (W ) [0, 1] acts as a
probability. and
0
Property 10 (Degeneracy of every P (W, W ) associated
=(hf 0 |aiha| . . . |kihk|0 |f 0 i)
with = 1/d and with eigenvalue-(1) operators W and p (k, . . . , a, f |) = . (B2)
V ). Let the eigenvalues of W and V be 1. For exam- hf 0 |0 |f 0 i
ple, let W and V be Pauli operators. Let = 1/d be the
infinite-temperature Gibbs state. The complex distribu- The extended KD distributions become
tion has the degeneracy P (1, 1) = P (1, 1).
A(K ) 0 0 0
, (, a, . . . , k, f ) = ihf |kihk| . . . |aiha| |f i (B3)
Property 10 follows from (1) Eq. (44) and (2) Prop-
erty 7 of A(1/d) . Item (2) can be replaced with the traces and
cyclicality. We reason as follows: P (W, W 0 ) is defined
in Eq. (32). Performing the sums over the degeneracies A(K ) 0 0
, (, k, . . . , a, f ) = ihf |aiha| . . . |kihk||f i . (B4)
yields A(1/d) . Substituting in from Eq. (44) yields
To prove this claim, we repeat the proof of Theorem 1
1 X requires
P (W, W ) = 0 W(t) V
Tr w v2 W(t) V until reaching Eq. (118). The definition of
w 2 v 1
d v ,w ,v ,w 3
that an i enter the argument of the first < and that a
1 2 2 3
i enter the argument of the second <. The identity
W (w3 v2 ) W 0 (w2 v1 ) . (A6) <(iz) = =(z), for z C, implies Eqs. (B1)(B4).
41
(W, W 0 ) (v1 , w2 , v2 , w3 )
(1, 1) (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1)
(1, 1) (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1)
(1, 1) (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1)
(1, 1) (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1)
TABLE II: Correspondence between tuples of composite variables and quadruples of base variables: From
each weak-measurement trial, one learns about a quadruple (v1 , w2 , v2 , w3 ). Suppose that the out-of-time-ordered-correlator
operators W and V have the eigenvalues w` , vm = 1. For example, suppose that W and V are Pauli operators. The
quadruples elements are combined into W := w3 v2 and W 0 := w2 v1 . Each (W, W 0 ) tuple can be formed from each of four
quadruples.
REFERENCES [25] A. A. Patel and S. Sachdev, ArXiv e-prints (2016),

1611.00003.
[1] S. H. Shenker and D. Stanford, Journal of High Energy [26] D. Chowdhury and B. Swingle, ArXiv e-prints (2017),
Physics 3, 67 (2014). 1703.02545.
[2] S. H. Shenker and D. Stanford, Journal of High Energy [27] A. A. Patel, D. Chowdhury, S. Sachdev, and B. Swingle,
Physics 12, 46 (2014). ArXiv e-prints (2017), 1703.07353.
[3] S. H. Shenker and D. Stanford, Journal of High Energy [28] P. Hosur, X.-L. Qi, D. A. Roberts, and B. Yoshida, Jour-
Physics 5, 132 (2015). nal of High Energy Physics 2, 4 (2016), 1511.04021.
[4] D. A. Roberts, D. Stanford, and L. Susskind, Journal [29] S. Sachdev and J. Ye, Phys. Rev. Lett. 70, 3339 (1993).
of High Energy Physics 3, 51 (2015). [30] A. Kitaev, A simple model of quantum holography,
[5] D. A. Roberts and D. Stanford, Physical Review Letters KITP strings seminar and Entanglement 2015 program,
115, 131603 (2015). 2015.
[6] J. Maldacena, S. H. Shenker, and D. Stanford, ArXiv [31] V. Bollen, Y. M. Sua, and K. F. Lee, Phys. Rev. A 81,
e-prints (2015), 1503.01409. 063826 (2010).
[7] H. J. Carmichael, Statistical Methods in Quantum Op- [32] Y. Suzuki, M. Iinuma, and H. F. Hofmann, New Journal
tics I: Master Equations and Fokker-Planck Equations of Physics 18, 103045 (2016).
(Springer-Verlag, 2002). [33] F. Piacentini et al., Phys. Rev. Lett. 117, 170402
[8] J. G. Kirkwood, Physical Review 44, 31 (1933). (2016).
[9] P. A. M. Dirac, Rev. Mod. Phys. 17, 195 (1945). [34] G. S. Thekkadath et al., Phys. Rev. Lett. 117, 120401
[10] J. S. Lundeen, B. Sutherland, A. Patel, C. Stewart, and (2016).
C. Bamber, Nature 474, 188 (2011). [35] T. C. White et al., npj Quantum Information 2, 15022
[11] J. S. Lundeen and C. Bamber, Phys. Rev. Lett. 108, (2016).
070402 (2012). [36] J. P. Groen et al., Phys. Rev. Lett. 111, 090506 (2013).
[12] C. Bamber and J. S. Lundeen, Phys. Rev. Lett. 112, [37] N. Yunger Halpern, Phys. Rev. A 95, 012120 (2017).
070405 (2014). [38] C. Jarzynski, Physical Review Letters 78, 2690 (1997).
[13] M. Mirhosseini, O. S. Maga na Loaiza, S. M. [39] M. Campisi and J. Goold, ArXiv e-prints (2016),
Hashemi Rafsanjani, and R. W. Boyd, Phys. Rev. Lett. 1609.05848.
113, 090402 (2014). [40] N. Tsuji, T. Shitara, and M. Ueda, ArXiv e-prints
[14] J. Dressel, Phys. Rev. A 91, 032116 (2015). (2016), 1612.08781.
[15] R. W. Spekkens, Phys. Rev. Lett. 101, 020401 (2008). [41] B. Swingle, G. Bentsen, M. Schleier-Smith, and P. Hay-
[16] C. Ferrie, Reports on Progress in Physics 74, 116001 den, ArXiv e-prints (2016), 1602.06271.
(2011). [42] N. Y. Yao et al., ArXiv e-prints (2016), 1607.01801.
[17] A. G. Kofman, S. Ashhab, and F. Nori, Physics Reports [43] A. Bohrdt, C. B. Mendl, M. Endres, and M. Knap,
520, 43 (2012). ArXiv e-prints (2016), 1612.02434.
[18] J. Dressel, M. Malik, F. M. Miatto, A. N. Jordan, and [44] G. Zhu, M. Hafezi, and T. Grover, ArXiv e-prints
R. W. Boyd, Rev. Mod. Phys. 86, 307 (2014). (2016), 1607.00079.
[19] M. Howard, J. Wallman, V. Veitch, and J. Emerson, [45] I. Danshita, M. Hanada, and M. Tezuka, ArXiv e-prints
Nature 510, 351 (2014). (2016), 1606.02454.
[20] N. Delfosse, P. Allard Guerin, J. Bian, and [46] N. Tsuji, P. Werner, and M. Ueda, Phys. Rev. A 95,
R. Raussendorf, Phys. Rev. X 5, 021003 (2015). 011601 (2017), 1610.01251.
[21] J. Maldacena, International Journal of Theoretical [47] J. Li et al., ArXiv e-prints (2016), 1609.01246.
Physics 38, 1113 (1999). [48] M. G arttner et al., ArXiv e-prints (2016), 1608.08938.
[22] E. Witten, Advances in Theoretical and Mathematical [49] K. X. Wei, C. Ramanathan, and P. Cappellaro, ArXiv
Physics 2, 253 (1998). e-prints (2016), 1612.05249.
[23] S. S. Gubser, I. R. Klebanov, and A. M. Polyakov, [50] S. Hacohen-Gourgy et al., Nature 538, 491 (2016).
Physics Letters B 428, 105 (1998), hep-th/9802109. [51] R. P. Rundle, T. Tilma, J. H. Samson, and M. J. Everitt,
[24] D. Stanford, Journal of High Energy Physics 10, 9 ArXiv e-prints (2016), 1605.08922.
(2016), 1512.07687. [52] M. Takita et al., Phys. Rev. Lett. 117, 210505 (2016).
42
[53] J. Kelly et al., Nature 519, 66 (2015). [93] H. Margenau and R. N. Hill, Prog. Theor. Phys. 26,
[54] R. W. Heeres et al., ArXiv e-prints (2016), 1608.02430. 722 (1961).
[55] D. Rist`e et al., Nature Communications 6, 6983 (2015). [94] L. M. Johansen and A. Luis, Phys. Rev. A 70, 052115
[56] S. A. Gardiner, J. I. Cirac, and P. Zoller, Phys. Rev. (2004).
Lett. 79, 4790 (1997). [95] H. M. Wiseman, Phys. Rev. A 65, 032111 (2002).
[57] S. K. Choudhary, T. Konrad, and H. Uys, Phys. Rev. [96] A. J. Leggett and A. Garg, Phys. Rev. Lett. 54, 857
A 87, 012131 (2013). (1985).
[58] L. G. Lutterbach and L. Davidovich, Phys. Rev. Lett. [97] C. Emary, N. Lambert, and F. Nori, Reports on
78, 2547 (1997). Progress in Physics 77, 016001 (2014).
[59] S. Debnath et al., Nature 536, 63 (2016). [98] H. F. Hofmann, New Journal of Physics 14, 043031
[60] T. Monz et al., Science 351, 1068 (2016). (2012).
[61] N. M. Linke et al., ArXiv e-prints (2016), 1611.06946. [99] J. Dressel and A. N. Jordan, Phys. Rev. A 85, 012107
[62] N. M. Linke et al., ArXiv e-prints (2017), 1702.01852. (2012).
[63] A. Browaeys, D. Barredo, and T. Lahaye, Journal of [100] H. F. Hofmann, Phys. Rev. A 89, 042115 (2014).
Physics B: Atomic, Molecular and Optical Physics 49, [101] H. F. Hofmann, New Journal of Physics 16, 063056
152001 (2016). (2014).
[64] C. Guerlin et al., Nature (London)448, 889 (2007), [102] J. Z. Salvail et al., Nat Photon 7, 316 (2013).
0707.3880. [103] M. Malik et al., Nat Commun 5 (2014), Article.
[65] K. W. Murch, S. J. Weber, C. Macklin, and I. Siddiqi, [104] G. A. Howland, D. J. Lum, and J. C. Howell, Opt.
Nature (London)502, 211 (2013), 1305.7270. Express 22, 18870 (2014).
[66] L. Xiao and J. A. Jones, Physics Letters A 359, 424 [105] J. Cotler, private communication, 2016.
(2006). [106] P. Solinas, private communication, 2016.
[67] D. Lu, A. Brodutch, J. Li, H. Li, and R. Laflamme, New [107] P. Solinas and S. Gasparinetti, Phys. Rev. E 92, 042150
Journal of Physics 16, 053015 (2014). (2015).
[68] W. Brown and O. Fawzi, ArXiv e-prints (2012), [108] P. Solinas and S. Gasparinetti, Phys. Rev. A 94, 052103
1210.6644. (2016).
[69] P. Hayden and J. Preskill, Journal of High Energy [109] J. J. Alonso, E. Lutz, and A. Romito, Phys. Rev. Lett.
Physics 2007, 120 (2007). 116, 080403 (2016).
[70] Y. Sekino and L. Susskind, Journal of High Energy [110] H. J. D. Miller and J. Anders, ArXiv e-prints (2016),
Physics 2008, 065 (2008). 1610.04285.
[71] N. Lashkari, D. Stanford, M. Hastings, T. Osborne, and [111] C. Elouard, D. A. Herrera-Mart, M. Clusel, and
P. Hayden, Journal of High Energy Physics 2013, 22 A. Auff`eves, npj Quantum Information 3, 9 (2017).
(2013). [112] M. A. Nielsen and I. L. Chuang, Quantum Computa-
[72] Y. Aharonov, D. Z. Albert, and L. Vaidman, Phys. Rev. tion and Quantum Information (Cambridge University
Lett. 60, 1351 (1988). Press, 2010).
[73] L. M. Johansen, Phys. Lett. A 329, 184 (2004). [113] J. von Neumann, Mathematische Grundlagen der Quan-
[74] M. J. W. Hall, Phys. Rev. A 64, 052103 (2001). tenmechanik (Springer, Berlin, 1932).
[75] M. J. W. Hall, Phys. Rev. A 69, 052113 (2004). [114] B. Swingle, G. Bentsen, M. Schleier-Smith, N. Yao, and
[76] L. D. Landau and E. M. Lifshitz, Statistical Physics N. Yunger Halpern, Resilience of the scrambling inter-
(Pergamon Press, Oxford, England, 1980). ferometer, in prep.
[77] J. Banerji, Contemporary Physics 48, 157 (2007). [115] IBM, The quantum experience.
[78] A. Rihaczek, IEEE Transactions on Information Theory [116] J. Dressel, T. A. Brun, and A. N. Korotkov, Phys. Rev.
14, 369 (1968). A 90, 032302 (2014).
[79] L. Cohen, Proceedings of the IEEE 77, 941 (1989). [117] J. Preskill, Quantum computation: Ch. 3: Foundations
[80] N. W. M. Ritchie, J. G. Story, and R. G. Hulet, Phys. of quantum theory ii: Measurement and evolution, Lec-
Rev. Lett. 66, 1107 (1991). ture notes, 2015.
[81] G. J. Pryde, J. L. OBrien, A. G. White, T. C. Ralph, [118] S. Strogatz, Non-linear Dynamics and Chaos: With
and H. M. Wiseman, Phys. Rev. Lett. 94, 220405 applications to Physics, Biology, Chemistry and Engi-
(2005). neering (Perseus Books, 2000).
[82] J. Dressel, C. J. Broadbent, J. C. Howell, and A. N. [119] L. DAlessio, Y. Kafri, A. Polkovnikov, and
Jordan, Phys. Rev. Lett. 106, 040402 (2011). M. Rigol, Advances in Physics 65, 239 (2016),
[83] M. F. Pusey, Phys. Rev. Lett. 113, 200401 (2014). http://dx.doi.org/10.1080/00018732.2016.1198134.
[84] M. Waegell et al., ArXiv e-prints (2016), 1609.06046. [120] C. Gogolin and J. Eisert, Reports on Progress in Physics
[85] C. Ferrie and J. Combes, Phys. Rev. Lett. 113, 120404 79, 056001 (2016).
(2014). [121] D. A. Roberts and B. Yoshida, ArXiv e-prints (2016),
[86] L. Vaidman, ArXiv e-prints (2014), 1409.5386. 1610.04903.
[87] E. Cohen, ArXiv e-prints (2014), 1409.8555. [122] F. M. Haehl, R. Loganayagam, P. Narayan, and
[88] Y. Aharonov and D. Rohrlich, ArXiv e-prints (2014), M. Rangamani, ArXiv e-prints (2017), 1701.02820.
1410.0381. [123] V. Manko and R. V. Mendes, Physica D: Nonlinear
[89] D. Sokolovski, ArXiv e-prints (2014), 1410.0570. Phenomena 145, 330 (2000).
[90] A. Brodutch, Phys. Rev. Lett. 114, 118901 (2015). [124] A. Bednorz, C. Bruder, B. Reulet, and W. Belzig, Phys.
[91] C. Ferrie and J. Combes, Phys. Rev. Lett. 114, 118902 Rev. Lett. 110, 250404 (2013).
(2015). [125] D. Oehri, A. V. Lebedev, G. B. Lesovik, and G. Blatter,
[92] Y. P. Terletsky, JETP 7, 1290 (1937). Phys. Rev. B 93, 045308 (2016).
43
[126] P. P. Hofer, ArXiv e-prints (2017), 1702.00998. [134] P. Talkner, E. Lutz, and P. H anggi, Phys. Rev. E 75,
[127] J. Lee and I. Tsutsui, ArXiv e-prints (2017), 1703.06068. 050102 (2007).
[128] J. J. Halliwell, Phys. Rev. A 93, 022123 (2016). [135] M. Lostaglio, D. Jennings, and T. Rudolph, Nature
[129] T. Durt, B.-G. Englert, I. Bengtsson, and Communications 6, 6383 (2015), Article.

K. Zyczkowski, ArXiv e-prints (2010), 1004.3348. [136] V. Narasimhachar and G. Gour, Nature Communica-
[130] P. J. Coles, M. Berta, M. Tomamichel, and S. Wehner, tions 6, 7689 EP (2015).
ArXiv e-prints (2015), 1511.04857. [137] I. L. Aleiner, L. Faoro, and L. B. Ioffe, Annals of Physics
[131] M. Paris and J. Rehacek, editors, Quantum State Es- 375, 378 (2016).
timation, Lecture Notes in Physics Vol. 649 (Springer, [138] F. M. Haehl, R. Loganayagam, and M. Rangamani,
Berlin, Heidelberg, 2004). ArXiv e-prints (2016), 1610.01940.
[132] N. Moiseyev, Non-Hermitian Quantum Mechanics [139] F. M. Haehl, R. Loganayagam, and M. Rangamani,
(Cambridge UP, 2011). ArXiv e-prints (2016), 1610.01941.
[133] S. An et al., Nature Physics 11, 193 (2015).

1704 01971

Hochgeladen von

Dokumentinformationen

Originaltitel

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

1704 01971

Hochgeladen von

Copyright:

Verfügbare Formate

The quasiprobability behind the out-of-time-ordered correlator

Nicole Yunger Halpern,1 Brian Swingle,2, 3, 4 and Justin Dressel5, 6

quasiprobability, we argue, is an extension of the KD distribution. We explore the quasiprobabil-

Two topics have been flourishing independently: A :

II. Experimentally measuring A and the coarse-grained A 12

III. Numerical simulations 18

IV. Calculation of A averaged over Brownian circuits 22

Appendix A. Mathematical properties of P (W, W 0 ) 39

The Kirkwood-Dirac quasiprobability is defined as fol-

hf 0 |aiha|i0 ihi0 |f 0 i , (7) hf |ai =

weak value Eq. (3) becomes n o

1 2 N C. The out-of-time-ordered correlator

Given two unitary operators W and V , the out-of-time-

F (t) := hW (t)V W(t)V i Tr(W (t)V W(t)V ) .

illustrates this realization. The initial and final measure-

Mx Mx (5) Evolve the system backward under U .

(2) Evolve the system backward under U .

II. EXPERIMENTALLY MEASURING A AND

shifted leftward: Nor does A equal an expectation value. Expectation

Yunger Halpern/ Yunger Halpern Swingle Yao Zhu

A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) . (2) The weak-measurement protocol P is simplified

Rn |+zi = |+ni, and Rn |zi = |ni.

through a small angle precisely can pose challenges [35].

4. Full circuit for weak-measurement scheme

Figure 3b shows the circuit for measuring A . The

tion (54) simplifies to pected, the quasiprobabilitys qualitative behavior is sen-

1 in a randomly chosen pure state. Broadly speaking, the

t 0110 1110 0110 1110

FIG. 6: Imaginary part of A as a function of time. T =

0.8 0.20 0001 1001

FIG. 13: Real part of A as a function of time. Random

FIG. 14: Imaginary part of A as a function of time.

0.6 Correspondingly, the real and imaginary

1.0 classical physics (Sec. I A).

0.8 (2) The derived quantity P (W, W 0 ) is generically com-

0.3 0000 1000 (5) The A s generated by integrable Hamiltonians dif-

0.015 0000 1000

We consider a system of N qubits governed by the (3) WVn11, W 11 o V , 1V W 1, 11WV :

wherein c1 1/N and c2 1.

The second terms value depends on whether i and/or (2) w2 w3 = 1, v1 v2 = 1: A = 1/16,

This differential equation implies that G exponentially

constant magnitudes: |ha` |bm i| = 1d `, m. Consider p (v2 , v2 ; w2 , w2 ; v1 , v1 |; w3 , w3 ) . (122)

OTOC quasiprobability: Proof. We deform the argument in Sec. I A 3. Let the

ability P (W, W 0 ) [37].

case. The middle outcome can differ. This process cor-

2K PTOC X formsTOC a complex distribution.

Let WTOC and WTOC 0

The conventional OTOC TOC obeys a to

as in [6, 125], for simplicity. FTOC (t) = e TOC f deno

WTOC = w1 v2 and WTOC 0

The conventional K =OTOC2: F (t) =corresponds to, 3,0 K @(@K= 0 ) 3 and

(F ) (t) = @ W(t) e , hf(1 TOC Functio TOC 0

t (t). If K < 3, FProof. (t)proof

... is analogous to the proof ofEq. , Theore 0

sextuple corresponds to a quasiprobability ATOC (.). We t

(v1 ,v1 ),(w1 ,w1 ),(v2 ,v2 )

1 W(t)V pair Fversals. (t) :=LethW(t)V 0 th

K = 2K 1 weak measurements interspersed with uni- of that papers Eq. (15):

P (K ) (W2 , W3 , . . . , WK+1 , W10 , W20 , . . . , WK

tegrals. time irreversibility. Tools from non-Hermitian quantum

Appendix A MATHEMATICAL PROPERTIES

REFERENCES [25] A. A. Patel and S. Sachdev, ArXiv e-prints (2016),

Das könnte Ihnen auch gefallen

0.6 Correspondingly, the real and imaginary