Beruflich Dokumente
Kultur Dokumente
sured [47]. OTOCs of symmetric observables have been Sections III and IV feature calculations of A . First,
measured with infinite-temperature trapped ions [48] and we numerically simulate a transverse-field Ising model.
in nuclear spin chains [49]. Weak measurements offer a A changes significantly, we find, over time scales rele-
distinct toolkit, opening new platforms and regimes to vant to the OTOC. The quasiprobabilitys behavior dis-
OTOC measurements. The weak-measurement scheme tinguishes nonintegrable from integrable Hamiltonians.
in [37] is expected to provide a near-term challenge for The quasiprobabilitys negativity and nonreality remains
superconducting qubits [35, 5055], trapped ions [5662], robust with respect to substantial quantum interference.
ultracold atoms [63], cavity quantum electrodynamics We then calculate an average, over Brownian circuits,
(QED) [64, 65], and perhaps NMR [66, 67]. of A . Brownian circuits model chaotic dynamics: The
We investigate the quasiprobability A that lies be- system is assumed to evolve, at each time step, under
hind the OTOC. The study consists of three branches: random two-qubit couplings [6871].
We discuss experimental measurements, calculate (a
A final theory section concerns mathematical prop-
coarse-grained) A , and explore mathematical properties. erties and physical interpretations of A . A shares some,
Not only does quasiprobability theory shed new light on though not all, of its properties with the KD distribu-
the OTOC. The OTOC also inspires questions about tion. The OTOC motivates a generalization of a Bayes-
quasiprobabilities and motivates weak-measurement ex- type theorem obeyed by the KD distribution [14, 7275].
perimental challenges. The generalization exponentially shrinks the memory re-
The paper is organized as follows. In a technical intro- quired to compute weak values, in certain cases. The
duction, we review the KD quasiprobability, the OTOC, OTOC also motivates a generalization of decompositions
the OTOC quasiprobability A , and schemes for measur- of quantum states . This decomposition property may
ing A . We also introduce our set-up and notation. help experimentalists assess how accurately they pre-
Next, we discuss experimental measurements. We in- pared the desired initial state when measuring F (t). A
troduce a coarse-graining A of A . The coarse-graining time-ordered correlator FTOC (t) analogous to F (t), we
involves a projection trick that decreases, exponen- show next, depends on a quasiprobability that can reduce
tially in system size, the number of trials required to to a probability. The OTOC quasiprobability lies farther
infer F (t) from weak measurements. We evaluate pros from classical probabilities than the TOC quasiprobabil-
and cons of the quasiprobability-measurement schemes ity, as the OTOC registers quantum-information scram-
in [37]. We also compare our schemes with alternative bling that FTOC (t) does not. Finally, we recall that the
F (t)-measurement schemes [41, 42, 44]. We then present OTOC encodes three time reversals. OTOCs that en-
a circuit for weakly measuring a qubit systems A . Fi- code more equal moments of longer quasiprobabilities.
nally, we show how to infer the coarse-grained A from We conclude with theoretical and experimental opportu-
alternative OTOC-measurement schemes (e.g., [41]). nities.
CONTENTS
I. Technical introduction 3
A. The KD quasiprobability in quantum optics 3
1. Phase-space representations, alternative quasiprobabilities, and history 4
2. Bayes-type theorem and retrodiction with the KD quasiprobability 4
3. Decomposing operators in terms of KD-quasiprobability coefficients 5
4. Properties of the KD quasiprobability 6
B. Set-up 6
C. The out-of-time-ordered correlator 7
D. Introducing the quasiprobability A behind the OTOC 8
1. Quantum probability amplitude A 8
2. The OTOC quasiprobability A 9
3. Complex distribution P (W, W 0 ) 10
4. Weak-measurement and interference schemes for inferring A 10
3. Weak-measurement subcircuit 16
4. Full circuit for weak-measurement scheme 17
D. How to infer A from other OTOC-measurement schemes 17
V. Theoretical study of A 26
A. Mathematical properties of A 26
B. Bayes-type theorem and retrodiction with A 28
C. A (.) values as coefficients in an operator decomposition 32
D. Relationship between out-of-time ordering and quasiprobabilities 34
1. Time-ordered correlator FTOC (t) 34
2. TOC probability amplitude ATOC 34
3. TOC quasiprobability ATOC 35
0
4. Complex TOC distribution PTOC (WTOC , WTOC ) 36
5. Jarzynski-like equality for the TOC 36
E. Higher-order OTOCs as moments of longer (summed) quasiprobabilities 36
VI. Outlook 37
A. Experimental opportunities 37
B. Opportunities motivated by calculations 38
C. Fundamental-theory opportunities 38
Acknowledgements 39
:= i(K . . . A A . . . K)
Appendix B. Retrodiction about the symmetrized composite observable 40
References 41
I. TECHNICAL INTRODUCTION A. The KD quasiprobability in quantum optics
retrodiction, or inference about the past, reviewed in The KD quasiprobability resembles a little brother of
Sec. I A 2. How to decompose an operator O in terms theirs, whom hardly anyone has heard of [77]. Kirkwood
of KD-quasiprobability values appears in Sec. I A 3. The and Dirac defined the quasiprobability independently in
quasiprobability has mathematical properties reviewed in 1933 [8] and 1945 [9]. Their finds remained under the
Sec. I A 4. radar for decades. Rihaczek rediscovered the distribu-
Much of this section parallels Sec. V, our theoretical tion in 1968, in classical-signal processing [78, 79]. (The
investigation of the OTOC quasiprobability. More back- KD quasiprobability is sometimes called the Kirkwood-
ground appears in [14]. Rihaczek distribution.) The quantum communitys at-
tention has revived recently. Reasons include experimen-
tal measurements, mathematical properties, and applica-
1. Phase-space representations, alternative tions to retrodiction and state decompositions.
quasiprobabilities, and history
Phase-space distributions form a mathematical toolkit 2. Bayes-type theorem and retrodiction with the KD
applied in Liouville mechanics [76]. Let S denote a sys- quasiprobability
tem of 6N degrees of freedom (DOFs). An example sys-
tem consists of N particles, lacking internal DOFs, in a
Prediction is inference about the future. Retrodiction
three-dimensional space. We index the particles with i
is inference about the past. One uses the KD quasiproba-
and let = x, y, z. The th component qi of particle
bility to infer about a time t0 , using information about an
is position is conjugate to the th component p i of the event that occurred before t0 and information about an
particles momentum. The variables qi and p label the
i event that occurred after t0 . This forward-and-backward
axes of phase space.
propagation evokes the OTOCs out-of-time ordering.
Suppose that the system contains many DOFs: N
We borrow notation from, and condense the explana-
1. Tracking all the DOFs is difficult. Which phase-
tion in, [14]. Let S denote a discrete quantum system.
space point S occupies, at any instant, may be un-
Consider preparing S in a state |ii at time t = 0. Sup-
known. The probability that, at time t, S occupies an
pose that S evolves under a time-independent Hamilto-
infinitesimal volume element localized at (q1x , . . . , pzN ) is
nian that generates the family Ut of unitaries. Let F
({qi }, {p
i }; t) d
3N
q d3N p. The phase-space distribution
denotePan observable measured at time t00 > 0. Let
({qi }, {pi }; t) is a probability density.
qi and p
F = f f |f ihf | be the eigendecomposition, and let f
i seem absent from quantum mechanics (QM), denote the outcome.
prima facie. Most introductions to QM cast quantum P
states in terms of operators, Dirac kets |i, and wave Let A = a a|aiha| be the eigendecomposition of an
functions (x). Classical variables are relegated to mea- observable that fails to commute with F . Let t0 denote
surement outcomes and to the classical limit. Wigner, a time in (0, t00 ). Which value can we most reasonably
Moyal, and others represented QM in terms of phase attribute to the systems time-t0 A, knowing that S was
space [7]. These representations are used most in quan- prepared in |ii and that the final measurement yielded
tum optics. f?
In such a representation, a quasiprobability density re- Propagating the initial state forward to time t0 yields
0
places the statistical-mechanical probability density .1 |i i := Ut0 |ii. Propagating the final state backward yields
Yet quasiprobabilities violate axioms of probability [16]. |f 0 i := Ut00 t0 |f i. Our best guess about A is the weak
Probabilities are nonnegative, for example. Quasiproba- value [36, 7375, 8082]
bilities can assume negative values, associated with non-
classical physics such as contextuality [1418, 20], and hf 0 |A|i0 i
Aweak (i, f ) := < . (3)
nonreal values. Relaxing different axioms leads to differ- hf 0 |i0 i
ent quasiprobabilities. Different quasiprobabilities cor-
respond also to different orderings of noncommutative The real part of a complex number z is denoted by <(z).
operators [9]. The best-known quasiprobabilities include The guesss accuracy is quantified with a distance metric
the Wigner function, the Glauber-Sudarshan P represen- (Sec. V B) and with comparisons to weak-measurement
tation, and the Husimi Q function [7]. data.
Aharonov et al. discovered weak values in 1988 [72].
Weak values be anomalous, or strange: Aweak can ex-
ceed the greatest eigenvalue amax of A and can dip be-
1 We will focus on discrete quantum systems, motivated by a spin- low the least eigenvalue amin . Anomalous weak values
chain example. Discrete systems are governed by quasiprobabil- concur with negative quasiprobabilities and nonclassical
ities, which resemble probabilities. Continuous systems are gov-
erned by quasiprobability densities, which resemble probability
physics [14, 17, 18, 83, 84]. Debate has surrounded weak
densities. Our quasiprobabilities can be replaced with quasiprob- values role in quantum mechanics [8591].
ability densities, and our sums can be replaced with integrals, in, The weak value Aweak , we will show, depends on the
e.g., quantum field theory. KD quasiprobability. We replace the A in Eq. (3) with its
5
eigendecomposition. Factoring out the eigenvalues yields 3. Decomposing operators in terms of KD-quasiprobability
X 0 coefficients
hf |aiha|i0 i
Aweak (i, f ) = a< . (4)
hf 0 |i0 i
a The KD distribution can be interpreted not only in
The weight <(.) is a conditional quasiprobability. It re- terms of retrodiction, but also in terms of operation de-
sembles a conditional probabilitythe likelihood that, if compositions [10, 11]. Quantum-information scientists
|ii was prepared and the measurement yielded f , a is the decompose qubit states in terms of Pauli operators. Let
value most reasonably attributable to A. Multiplying = x x + y y
+ z z
denote a vector of the one-qubit
and dividing the argument by hi0 |f 0 i yields Paulis. Let n R3 denote a unit vector. Let denote
any state of a qubit, a two-level quantum system. can
< (hf 0 |aiha|i0 ihi0 |f 0 i) be expressed as = 21 (1 + n ) . The identity operator
p(a|i, f ) := . (5)
|hf 0 |i0 i|2 is denoted by 1. The n components n` constitute decom-
position coefficients. The KD quasiprobability consists of
Substituting into Eq. (4) yields coefficients in a more general decomposition.
X
Aweak (i, f ) = a p(a|i, f ) . (6) Let S denote a discrete quantum system associated
a
with a Hilbert space H. Let {|f i} and {|ai} denote or-
thonormal bases for H. Let O B(H) denote a bounded
Equation (6) illustrates why negative quasiprobabili- operator defined on H. Consider operating on each side
ties concur with anomalous weak values. Suppose that of O with a resolution of unity:
Aweak > amax . Some large eigenvalue a = alarge must
!
correspond to a high quasiprobability value Pp(alarge |i, f ). X X
The quasiprobability values sum to one: ap(a|i, f ) = O = 1O 1 = |aiha| O |f ihf | (11)
1. Hence another quasiprobability value must compen- a f
sate for p(alarge |i, f ). Some p(asmall |i, f ), associated with X
a smaller eigenvalue asmall , must be negative. = |aihf | ha|O|f i . (12)
The numerator of Eq. (5) is the Terletsky-Margenau- a,f
Hill (TMH) quasiprobability [73, 9294]. The TMH dis-
tribution is the real part of a complex number. That Suppose that every element of {|ai} has a nonzero overlap
complex generalization, with every element of {|f i}:
4. Properties of the KD quasiprobability Equation (17) can be expressed in terms of jointly con-
ditional distributions. Let p(a|i, f ) denote the proba-
The KD quasiprobability shares some, but not all, of bility that an event a will occur, given that an event i
its properties with other quasiprobabilities. The notation occurred and that f occurred subsequently. p(a, f |i) is
below is defined as it has been throughout Sec. I A. defined similarly. What is the joint probability p(i, f, a)
(1)
that i, f , and a will occur? We can construct two ex-
Property 1. The KD quasiprobability AO (a, f ) maps pressions:
B(H) {a} {f } to C . The domain is a composition
of the set B(H) of bounded operators and two sets of real p(i, f, a) = p(a|i, f ) p(i, f ) = p(a, f |i) p(i) . (18)
numbers. The range is the set C of complex numbers, not
The joint probability p(i, f ) equals p(f |i) p(i). This p(i)
necessarily the set R of real numbers.
cancels with the p(i) on the right-hand side of Eq. (18).
The Wigner function assumes only real values. Only Solving for p(a|i, f ) yields Bayes Theorem for jointly
by dipping below zero can the Wigner function deviate conditional probabilities,
from classical probabilistic behavior. The KD distribu-
tions negativity has the following physical significance: p(a, f |i)
p(a|i, f ) = . (19)
Imagine measuring two (commuting) observables, A and p(f |i)
B, simultaneously. The measurement has some probabil-
Equation (5) echoes Eq. (19). The KD quasiproba-
ity p(a; b) of yielding the values a and b. Now, suppose
bilitys Bayesian behavior [12, 100] has been applied to
that A does not commute with B. No joint probability
quantum state tomography [10, 11, 13, 101104] and to
distribution p(a; b) exists. Infinitely precise values can-
quantum foundations [98].
not be ascribed to noncommuting observables simultane-
Having reviewed the KD quasiprobability, we approach
ously. Negative quasiprobability values are not observed
the extended KD quasiprobability behind the OTOC.
directly: Observable phenomena are modeled by aver-
We begin by concretizing our set-up, then reviewing the
ages over quasiprobability values. Negative values are
OTOC.
visible only on scales smaller than the physical coarse-
graining scale. But negativity causes observable effects,
visible in sequential measurements. Example effects in- B. Set-up
clude anomalous weak values [14, 17, 18, 72, 83, 84] and
violations of Leggett-Garg inequalities [96, 97].
Unlike the Wigner function, the KD distribution can This section concerns the set-up and notation used
assume nonreal values. Consider measuring two non- throughout the rest of this paper. Our framework is mo-
commuting observables sequentially. How much does the tivated by the OTOC, which describes quantum many-
first measurement affect the second measurements out- body systems. Examples include black holes [1, 30], the
come? This disturbance is encoded in the KD distribu- Sachdev-Ye-Kitaev model [29, 30], other holographic sys-
tions imaginary component [98101]. tems [2123] and spin chains. We consider a system S
associated with a Hilbert space H of dimensionality d.
(1)
Property 2. Summing A (a, f ) over a yields a proba- The system evolves under a Hamiltonian H that might
(1)
bility distribution. So does summing A (a, f ) over f . be nonintegrable or integrable. H generates the time-
evolution operator U := eiHt .
Consider substituting O = into Eq. (2). Summing over We will have to sum or integrate over spectra. For
a yields hf ||f i. This inner product equals a probability, concreteness, we sum, supposing that H is discrete. A
by Borns Rule. spin-chain example, discussed next, motivates our choice.
Property 3. The KD quasiprobability is defined as in Our sums can be replaced with integrals unless, e.g., we
Eq. (2) regardless of whether {a} and {f } are discrete. evoke spin chains explicitly.
We will often illustrate with a one-dimensional (1D)
The KD distribution and the Wigner function were de- chain of spin- 12 degrees of freedom. Figure 1 illustrates
fined originally for continuous systems. Discretizing the the chain, simulated numerically in Sec. III. Let N denote
Wigner function is less straightforward [16, 20]. the number of spins. This systems H has dimensionality
Property 4. The KD quasiprobability obeys an analog d = 2N .
of Bayes Theorem, Eq. (5). We will often suppose that S occupies, or is initialized
to, a state
Bayes Theorem governs the conditional probability X
p(f |i) that an event f will occur, given that an event = pj |jihj| D(H) . (20)
i has occurred. p(f |i) is expressed in terms of the con- j
ditional probability p(i|f ) and the absolute probabilities
p(i) and p(f ): The set of density operators defined on H is denoted by
D(H), as in Sec. I A. Orthonormal eigenstates are in-
p(i|f ) p(f ) dexed by j; eigenvalues are denoted by pj . Much litera-
p(f |i) = . (17)
p(i) ture focuses on temperature-T thermal states eH/T /Z.
7
that perturbing the system with V does not significantly D. Introducing the quasiprobability A behind the
change the expectation value of W(t). This physics is OTOC
expected for a chaotic system, which effectively loses
its memory of its initial conditions. In contrast, the F (t) was shown, in [37], to equal a moment of a
magnitude-squared commutator C(t) is the expectation summed quasiprobability. We review this result, estab-
value of a positive operator. The cancellations that zero lished in four steps: A quantum probability amplitude
out h[W(t), V ]i cannot occur. Mathematically, though A is reviewed in Sec. I D 1 . Amplitudes are combined
the diagonal matrix elements of [W(t), V ] may be small, to form the quasiprobability A in Sec. I D 2. Summing
the operator can be large.
A (.) values, with constraints, yields a complex distri-
We can gain intuition about the manifestation of chaos
bution P (W, W 0 ) in Sec. I D 3. Differentiating P (W, W 0 )
in F (t) from a simple quantum system that has a chaotic
yields the OTOC. A can be inferred experimentally from
semiclassical limit. Let W = q and V = p for some
a weak-measurement scheme and from interference. We
position q and momentum p:
review these schemes in Sec. I D 4.
C(t) = h[q(t), p]2 i ~2 e2L t . (26)
This L is a classical Lyapunov exponent. The final 1. Quantum probability amplitude A
expression follows from the Correspondence Principle:
Commutators are replaced with i~ times the correspond- The OTOC quasiprobability A is defined in terms of
ing Poisson bracket. The Poisson bracket of q(t) with p probability amplitudes A . The A s are defined in terms
equals the derivative of the final position with respect to of the following process, PA :
the initial position. This derivative reflects the butterfly
effect in classical chaos, i.e., sensitivity to initial condi- (1) Prepare .
tions. The growth of C(t), and the deviation of F (t) from
the TOC terms, provide a quantum generalization of the (2) Measure the eigenbasis, {|jihj|}.
butterfly effect.
Within this simple quantum system, the analog of the (3) Evolve S forward in time under U .
dissipation time may be regarded as td 1 L . The ana-
(4) Measure W.
log of the scrambling time is t 1 L ln
~ . The de-
notes some measure of the accessible phase-space volume. (5) Evolve S backward under U .
Suppose that the phase space is large in units of ~. The
scrambling time is much longer than the dissipation time: (6) Measure V .
t td . Such a parametric separation between the time
scales characterizes the systems that interest us most. (7) Evolve S forward under U .
In more general chaotic systems, the value of t de-
pends on whether the interactions are geometrically local
(8) Measure W.
and on W and V . Consider, as an example, a spin chain
governed by a local Hamiltonian. Suppose that W and Suppose that the measurements yield the outcomes j,
V are local operators that act nontrivially on spins sepa- (w1 , w1 ), (v1 , v1 ), and (w2 , w2 ). Figure 2a illustrates
rated by a distance `. The scrambling time is generically this process. The process corresponds to the probability
proportional to `. For this class of local models, `/t de- amplitude4
fines a velocity vB called the butterfly velocity. Roughly,
the butterfly velocity reflects how quickly initially local A (j; w1 , w1 ; v1 , v1 ; w2 , w2 ) := hw2 , w2 |U |v1 , v1 i
Heisenberg operators grow in space. hv1 , v1 |U |w1 , w1 ihw1 , w1 |U |ji pj . (27)
Consider a system in which td is separated parametri-
cally from t . The rate of change of F (t) [rather, a reg- We do not advocate for performing PA in any exper-
ulated variation on F (t)] was shown to obey a nontrivial iment. PA is used to define A and to interpret A
bound. Parameterize the OTOC as F (t) TOC eL t . physically. Instances of A are combined into A . A
The parameter 1 encodes the separation of scales. weak-measurement protocol can be used to measure A
The exponent L obeys L 2kB T in thermal equi- experimentally. An interference protocol can be used to
librium at temperature T [6]. kB denotes Boltzmanns measure A (and so A ) experimentally.
constant. Black holes in the AdS/CFT duality saturate
this bound, exhibiting maximal chaos [1, 30].
More generally, L and vB control the operators
growth and the spread of chaos. The OTOC has thus at- 4 We order the arguments of A differently than in [37]. Our
tracted attention for a variety of reasons, including (but ordering here parallels our later ordering of the quasiprobabil-
not limited to) the possibilities of nontrivial bounds on itys argument. Weak-measurement experiments motivate the
quantum dynamics, a new probe of quantum chaos, and quasiprobability arguments ordering. This motivation is de-
a signature of black holes in AdS/CFT. tailed in Footnote 6.
9
U U U A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 )
Experiment
time
= hw3 , w3 |U |v2 , v2 ihv2 , v2 |U |w2 , w2 i
0
hw2 , w2 |U |v1 , v1 ihv1 , v1 |U |w3 , w3 i . (29)
Measure Measure A simple example illustrates how A nearly equals a
{|jihj|}. j V . (v2 , v2 ) probability. Suppose that an eigenbasis of coincides
Prepare
. with {|v` , v` ihv` , v` |} or with U |w` , w` ihw` , w` |U .
Suppose, for example, that
(b) X
= V := pv` ,v` |v` , v` ihv` , v` | . (30)
FIG. 2: Quantum processes described by the v` ,v`
probability amplitudes A in the out-of-time-ordered One such is the infinite-temperature Gibbs state
correlator (OTOC): These figures, and parts of this
1/d. Another example is easier to prepare: Sup-
caption, appear in [37]. The OTOC quasiprobability A x
pose that S consists of N spins and that V = N .
results from summing products A (.)A (.). Each A (.) x
denotes a probability amplitude [Eq. (27)], so each product One V equals a product of N eigenstates. Let
resembles a probability. But the amplitudes arguments (v2 , v2 ) = (v1 , v1 ). [An analogous argument follows
differthe amplitudes correspond to different quantum from (w3 , w3 ) = (w2 , w2 ).] Equation (29) reduces to
processesbecause the OTOC operators W(t) and V fail to
|hw2 , w2 |U |v1 , v1 i|2 |hw3 , w3 |U |v1 , v1 i|2 pv1 ,v1 .
commute, typically. Figure 2a illustrates the process
described by the A (.); and Fig. 2b, the process described by (31)
the A (.). Time, as measured by a laboratory clock,
increases from left to right. Each
P process begins with the
preparation of the state = j pj |jihj| and a measurement 5 Familiarity with tensors might incline one to sum over the
of the states eigenbasis. Three evolutions (U , U , and U ) (w2 , w2 ) shared by the trajectories. But we are not invok-
then alternate with three measurements of observables (W, ing tensors. More importantly, summing over (w2 , w2 ) intro-
V , and W). Figures 2a and 2b are used to define A , rather duces a v1 v2 v1 v2 that eliminates one (v` , v` ) degree of free-
dom. The resulting quasiprobability would not lie behind the
than showing protocols for measuring A .
OTOC. One could, rather than summing over (w1 , w1 ), sum
over (w3 , w3 ). Either way, one sums over one trajectorys first
W outcome. We sum over (w1 , w ) to maintain consistency
1
with [37].
2. The OTOC quasiprobability A 6 In [37], the left-hand sides arguments are ordered differently and
are condensed into the shorthand (w, v, w , v ). Experiments
motivate our reordering: Consider inferring A (a, b, c, d) from
The quasiprobabilitys definition is constructed as fol- experimental measurements. In each trial, one (loosely speaking)
lows. Consider a realization of PA that yields the out- weakly measures a, then b, then c; and then measures d strongly.
comes j, (w3 , w3 ), (v2 , v2 ), and (w2 , w2 ). Figure 2b As the measurements are ordered, so are the arguments.
10
Each square modulus equals a conditional probability. couplings are represented by constraints like W (w3 v2 )
pv1 ,v1 equals the probability that, if is measured with and W 0 (w2 v1 ) . Suppose that the detectors measure the
respect to {|v` , v` ihv` , v` |}, outcome (v1 , v1 ) obtains. systems Hamiltonian. Subtracting the measurements
In this simple case, some of the quasiprobabil- outcomes yields the work performed during the protocol.
itys valuesthe quasiprobability evaluated on certain The distribution over possible work values is a quasiprob-
argumentsreduce to products of probabilities. Not all ability. Their quasiprobability is a Husimi Q-function,
the quasiprobabilitys values reduce so. All the values of whereas the OTOC quasiprobability is a KD distribu-
the quasiprobability behind time-ordered correlators can tion [108]. Related frameworks appear in [109111]. The
reduce to probabilities, we show in Sec. V D. relationship between those thermodynamics frameworks
and our thermodynamically motivated OTOC framework
merits exploration.
3. Complex distribution P (W, W 0 )
A is summed, in [37], to form a complex distribu- 4. Weak-measurement and interference schemes for
tion P (W, W 0 ). Let W := w3 v2 and W 0 := w2 v1 inferring A
denote random variables calculable from measurement
outcomes. If W and V are Paulis, (W, W 0 ) can equal
(1, 1), (1, 1), (1, 1), or (1, 1). Consider fixing the A can be inferred from weak measurements and from
value of (W, W 0 ). For example, let (W, W 0 ) = (1, 1). interference. We focus mostly on weak measurements.
Consider the octuples (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) First, we briefly review the interference scheme.
that satisfy the constraints W = w3 v2 and W 0 = w2 v1 . The interference scheme in [37] differs from other in-
Each octuple corresponds to a quasiprobability value terference schemes for measuring F (t) [4143]: From
A (.). Summing these quasiprobability values yields the [37] interference scheme, one can infer not only F (t),
X but also A . Time need not be inverted (H need not
P (W, W 0 ) := (32) be negated) in any trial. The scheme is detailed in
(v1 ,v1 ),(w2 ,w2 ),(v2 ,v2 ),(w3 ,w3 ) Appendix B of [37]. The system is coupled to an an-
cilla prepared in a superposition 12 (|0i + |1i). A uni-
A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) W (w3 v2 ) W 0 (w2 v1 ) .
tary, conditioned on the ancilla, rotates the systems
The Kronecker delta is represented by ab . P (W, W 0 ) state. The ancilla and system are measured projectively.
functions analogously to the probability distribution, in From many trials measurement data, one infers ha|U|bi,
the fluctuation-relation paper [38], over values of ther- wherein U = U or U and a, b = (w` , w` ), (vm , vm ).
modynamic work. These inner products are multiplied together to form
The OTOC equals a moment of P (W, W 0 ) [Eq. (1)], A [Eq. (29)]. If shares neither the V nor the W(t)
which equals a constrained sum over A [37]. Hence eigenbasis, quantum-state tomography is needed to infer
our labeling of A as the quasiprobability behind the hv1 , v1 |U |w3 , w3 i.
OTOC. Equation (32) expresses the useful, difficult-to- The weak-measurement scheme is introduced in Sec.
measure F (t) in terms of a characteristic function of a II B 3 of [37]. A simple case, in which = 1/d, is
(summed) quasiprobability, as Jarzynski [38] expresses detailed in Appendix A of [37]. Recent weak measure-
a useful, difficult-to-measure free-energy difference F ments [1013, 3135], some used to infer KD distribu-
in terms of a characteristic function of a probability. tions, inspired our weak A -measurement proposal. We
Quasiprobabilities reflect nonclassicality (contextuality) review weak measurements, a Kraus-operator model for
as probabilities do not; so, too, does F (t) reflect nonclas- measurements, and the A -measurement scheme.
sicality (noncommutation) as F does not. Review of weak measurements: Measurements can al-
The definition of P involves arbitrariness: The measur- ter quantum systems states. A weak measurement
able random variables, and P , may be defined differently. barely disturbs the measured systems state. In ex-
We use alternative definitions in Sec. V E. Those alter- change, the measurement provides little information
natives facilitate the construction of OTOCs that encode about the system. Yet one can infer much by performing
more time reversals. All possible definitions share two many trials and processing the outcome statistics.
properties: (i) The arguments W , etc. denote random Extreme disturbances result from strong measure-
variables inferable from measurement outcomes. (ii) P ments [112]. The measured systems state collapses onto
results from summing A (.) values subject to constraints a subspace.PFor example, let denote the initial state.
ab . Let A = a a|aiha| denote the measured observables
P (W, W 0 ) resembles a work distribution constructed eigendecomposition. A strong measurement has a prob-
by Solinas and Gasparinetti (S&G) [105, 106]. They ability ha||ai of projecting onto |ai.
study fluctuation-relation contexts, rather than the One canPimplement a measurement with an ancilla.
OTOC. S&G propose a definition for the work performed Let X = x x|xihx| denote an ancilla observable. One
on a quantum system [107, 108]. The system is coupled correlates A with X via an interaction unitary. Von Neu-
weakly to detectors at a protocols start and end. The mann modeled such unitaries with Vint := eig AX [14,
11
113]. The parameter g signifies the interaction strength.7 under 1) and a probability |g(x)|2 of projecting onto
An ancilla observablee.g., Xis measured strongly. |v` , v` ihv` , v` |.
The greater the g, the stronger the correlation between Weak-measurement scheme for inferring the OTOC
A and X. A is measured strongly if it is correlated with quasiprobability A : Weak measurements have been used
X maximally, if a one-to-one mapping interrelates the to measure KD quasiprobabilities [1013, 31, 32, 34, 35].
xs and the as. Suppose that the X measurement yields These experiments techniques can be applied to infer A
x. We say that an A measurement has yielded some and, from A , the OTOC. Our scheme involves three se-
outcome ax . quential weak measurements per trial (if is arbitrary)
Suppose that g is small. A is correlated imperfectly or two [if shares the V or the W(t)
eigenbasis, e.g., if
with X. The X-measurement outcome, x, provides in- = 1/d]. The weak measurements alternate with time
complete information about A. The value most reason- evolutions and precede a strong measurement.
ably attributable to A remains ax . But a subsequent We review the general and simple-case protocols. A
measurement of A would not necessarily yield ax . In projection trick, introduced in Sec. II A, reduces expo-
exchange for forfeiting information about A, we barely nentially the number of trials required to infer about A
disturb the systems initial state. We can learn more and F (t). The weak-measurement and interference pro-
about A by measuring A weakly in each of many trials, tocols are analyzed in Sec. II B. A circuit for implement-
then processing measurement statistics. ing the weak-measurement scheme appears in Sec. II C.
Kraus-operator model for measurement: Kraus oper- Suppose that does not share the V or the W(t)
eigen-
ators [112] model the system-of-interest evolution in- basis. One implements the following protocol, P:
duced by a weak P measurement. Let us choose P for VA
to equal V = v` ,v v |v
` ` , v` ihv ` , v` | = v ` v ` v ` . (1) Prepare .
`
Vv` projects onto the v` eigenspace. Let denote the
(2) Measure V weakly. (Couple the systems V weakly
systems initial state, and let |Di denote the detectors
to some observable X of a clean ancilla. Measure
initial state.
X strongly.)
Suppose that the X measurement yields x. The sys-
tems state evolves under the Kraus operator (3) Evolve the system forward in time under U .
Mx = hx|Vint |Di (33) weakly. (Couple the systems W
(4) Measure W
= hx|Di 1 weakly to some observable Y of a clean ancilla.
+ hx| eigX 1 |Di |v` , v` ihv` , v` | (34) Measure Y strongly.)
of outputting x. The dimensionless parameter g(x) C many times. From the measurement statistics, one in-
is derived from g. We can roughly interpret Mx statis- fers the probability Pweak (x; y; z; w3 , w3 ) that any given
tically: In any given trial, the coupling has a probabil- trial will yield the outcome quadruple (x; y; z; w3 , w3 ).
ity p(x) of failing to disturb the system (of evolving From this probability, one infers the quasiprobability
A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ). The probability has
the form
7 A and X are dimensionless: To form them, we multiply dimen- Pweak (x; y; z; w3 , w3 ) = hw3 , w3 |U Mz U My U Mx
sionful observables by natural scales of the subsystems. These
scales are incorporated into g. Mx U My U Mz U |w3 , w3 i . (37)
8 Experimentalists might prefer measuring Pauli operators to
measuring projectors. Measuring Paulis can suffice, as discussed We integrate over x, y, and z, to take advantage of
in Sec. II. Paulis square to the identity, rather than to themselves: all measurement statistics. We substitute in for the
( )2 = 1, for = x, y, z. Equation (34) becomes
Kraus operators from Eq. (36), then multiply out. Two
g X) |Di 1 ihx| sin (
hx| cos ( g X) |Di . (35)
terms combine into <(A (.)). The other terms form in-
dependently measurable background terms. To infer
12
=(A (.)), one performs P many more times, using differ- one, on interference. We simplify, evaluate, and augment
ent couplings. Details appear in Appendix A of [37]. these schemes.
W and V are local. Their degeneracies therefore scale First, we introduce a projection trick: Sum-
with the system size. If S consists of N spin- 12 de- ming over degeneracies turns one-dimensional projectors
grees of freedom, |w` |, |v` | 2N . Exponentially many (e.g., |w` , w` ihw` , w` |) into projectors onto degener-
A (.) values must be inferred. Exponentially many tri- ate eigenspaces (e.g., W w` ). The coarse-grained OTOC
als must be performed. We sidestep this exponential- quasiprobability A results. This trick decreases ex-
ity in Sec. II A: One measures eigenprojectors of the ponentially the number of trials required to infer the
degenerate W and V , rather than of the nondegener- OTOC from weak measurements.9 Section II B concerns
ate W and V . The one-dimensional |v` , v ihv` , v | of pros and cons of the weak-measurement and interference
` `
Eq. (34) is replaced with Vv` . From the weak mea- schemes for measuring A and F (t). We also compare
surements,
P one infers the coarse-grained quasiprobabil- those schemes with alternative schemes for measuring
ity degeneracies A (.) =: A (.). Summing A (.) values F (t). Section II C illustrates a circuit for implementing
yields the OTOC. the weak-measurement scheme. Section II D shows how
Suppose that shares the V or the W(t) eigenba- to infer A not only from the measurement schemes in
sis. The number of weak measurements reduces to two. Sec. I D 4, but also with alternative OTOC-measurement
For example, suppose that is the infinite-temperature proposals (e.g., [41]) (if the eigenvalues of W and V are
Gibbs state 1/d. The protocol P becomes 1).
eigenstate |w3 , w i.
(1) Prepare a W 3
Define
X B. Analysis of the quasiprobability-measurement
W
w` := |w` , w` ihw` , w` | (41) schemes and comparison with other
w` OTOC-measurement schemes
as the projector onto the w` eigenspace of W, Section I D 4 reviews two schemes for inferring A : a
W(t) weak-measurement scheme and an interference scheme.
w`
:= U W
w` U (42) From A measurements, one can infer the OTOC F (t).
We evaluate our schemes pros and cons. Alternative
as the projector onto the w` eigenspace of W(t), and
schemes for measuring F (t) have been proposed [39, 41
X 46], and two schemes have been realized [47, 48]. We
Vv` := |v` , v` ihv` , v` | (43)
compare our schemes with alternatives, as summarized
v`
in Table I. For specificity, we focus on [41, 42, 44].
as the projector onto the v` eigenspace of V . Substituting The weak-measurement scheme augments the set
into Eq. (40) yields of techniques and platforms with which F (t) can be
measured. Alternative schemes rely on interferom-
etry [4143], controlled unitaries [41, 44], ultracold-
A (v1 , w2 , v2 , w3 ) = Tr w
W(t) V
3
W(t) V
v2 w 2 v1 . (44) atoms tools [43, 45, 46], and strong two-point measure-
ments [39]. Weak measurements, we have shown, belong
Asymmetry distinguishes Eq. (44) from Borns Rule in the OTOC-measurement toolkit. Such weak measure-
and from expectation values. The traces cyclicality im- ments are expected to be realizable, in the immediate
plies that future, with superconducting qubits [35, 5055], trapped
ions [5662], cavity QED [64, 65], ultracold atoms [63],
A (v1 , w2 , v2 , w3 ) = Tr W(t) V W(t) V and perhaps NMR [66, 67]. Circuits for weakly measuring
w3 v2 w2 v1 . (45)
qubit systems have been designed [36, 50]. Initial proof-
of-principle experiments might not require direct access
Imagine preparing , measuring V strongly, evolving S
to the qubits: The five superconducting qubits available
forward under U , measuring W strongly, evolving S back-
from IBM, via the cloud, might suffice [115]. Random
ward under U , measuring V strongly, evolving S forward
two-qubit unitaries could simulate chaotic Hamiltonian
under U , and measuring W. The probability of obtaining
evolution.
the outcomes v1 , w2 , v2 , and w3 , in that order, is
In many weak-measurement experiments, just one
weak measurement is performed per trial [1013]. Yet
Tr W(t) V W(t) V V W(t) V
w3 v2 w2 v1 v1 w2 v2 w3
W(t)
. (46) two weak measurements have recently been performed
sequentially [3234]. Experimentalists aimed to directly
W(t) W(t)
The operator w3 Vv2 w2 Vv1 conjugates symmet- measure general quantum states [11] and to infer about
rically. This operator multiplies asymmetrically in non-Hermitian observable-like operators. The OTOC
Eq. (45). Hence A does not obviously equal a proba- motivates a new application of recently realized sequen-
bility. tial weak measurements.
14
TABLE I: Comparison of our measurement schemes with alternatives: This paper focuses on the
weak-measurement and interference schemes for measuring the OTOC quasiprobability A or the coarse-grained
quasiprobability A . From A or A , one can infer the OTOC F (t). These schemes appear in [37], are reviewed in Sec. I D 4,
and are assessed in Sec. II B. We compare our schemes with the OTOC-measurement schemes in [41, 42, 44]. More
OTOC-measurement schemes appear in [39, 43, 4548]. Each row corresponds to a desirable quantity or to a resource
potentially challenging to realize experimentally. The regulated correlator Freg (t) [Eq. (104)] is expected to behave similarly
to F (t) [6, 42]. D(H) denotes the set of density operators defined on the Hilbert space H. denotes the initially prepared
state. Target states target are never prepared perfectly; may differ from target . Experimentalists can reconstruct by
trivially processing data taken to infer A [37] (Sec. V B). F (K ) (t) denotes the K-fold OTOC, which encodes K = 2K 1
(K )
time reversals. The conventional OTOC corresponds to K = 3. The quasiprobability behind F (K ) (t) is A (Sec. V E). N
denotes the system size, e.g., the number of qubits. The Swingle et al. and Zhu et al. schemes have constant signal-to-noise
ratios (SNRs) in the absence of environmental decoherence. The Yao et al. schemes SNR varies inverse-exponentially with
the systems entanglement entropy, SvN . The system occupies a thermal state eH/T /Z, so SvN log(2N ) = N .
Our schemes furnish not only the OTOC F (t), but also (C) The form of the state prepared. Suppose
more information: that we wish to evaluate F (t) on a target state
target . target might be difficult to prepare,
(1) From the weak-measurement scheme in [37], we can e.g., might be thermal. The prepared state
infer the following: approximates target . Consider performing the
weak-measurement protocol P with . One
(A) The OTOC quasiprobability A . The infers A . Summing A (.) values yields the
quasiprobability is more fundamental than form of . We can assess the preparations ac-
F (t), as combining A (.) values yields F (t): curacy without performing tomography inde-
X pendently. Whether this assessment meets ex-
F (t) = v1 w2 v2 w3 (47) perimentalists requirements for precision re-
(v1 ,v1 ),(w2 ,w2 ), mains to be seen. Details appear in Sec. V C.
(v2 ,v2 ),(w3 ,w3 )
(A) The coarse-grained OTOC quasiprobability protocol, however, is short: Time need not be reversed
A . Though less fundamental than the fine- in any trial. Each trial features exactly one U or U , not
grained A , A implies the OTOCs form: both. Time can be difficult to reverse in some platforms,
for two reasons. Suppose that a Hamiltonian H gener-
X ates a forward evolution. A perturbation might lead
F (t) = v1 w2 v2 w3 A (v1 , w2 , v2 , w3 ) .
v1 ,w2 ,v2 ,w3
(H + ) to generate the reverse evolution. Perturba-
(48) tions can mar long-time measurements of F (t) [44]. Sec-
ond, systems interact with environments. Decoherence
might not be completely reversible [41]. Hence the lack
(B) The OTOC F (t).
of a need for time reversal, as in our interference scheme
(3) Upon implementing the interferometry scheme and in [42, 44], has been regarded as an advantage.
in [37], we can infer the following information: Unlike our interference scheme, the weak-measurement
scheme requires that time be reversed. Perturbations
(A) The OTOC quasiprobability A . threaten the weak-measurement scheme as they threaten
(B) The OTOC F (t). the Swingle et al. scheme [41]. s might threaten
the weak-measurement scheme more, because time is in-
(C) The form of the state prepared. verted twice in our scheme. Time is inverted only once
(D) All the K-fold OTOCs F (K ) (t), which gen- in [41]. However, our error might be expected to have
eralize the OTOC F (t). F (t) encodes three roughly the size of the Swingle et al. schemes error [114].
time reversals. F (K ) (t) encodes K = 2K Furthermore, tools for mitigating the Swingle et al.
1 = 3, 5, . . . time reversals. Details appear in schemes inversion error are being investigated [114]. Re-
Sec. V E. silience of the Swingle et al. scheme to decoherence
(K ) has been analyzed [41]. These tools may be applied to
(E) The quasiprobability A behind F (K ) (t), the weak-measurement scheme [114]. Like resilience, our
for all K (Sec. V E). schemes signal-to-noise ratios require further study.
We have delineated the information inferable from the As noted earlier, as the system size N grows, the num-
weak-measurement and interference schemes for measur- ber of trials required to infer A grows exponentially. So
ing A and F (t). Let us turn to other pros and cons. does the number of ancillas required to infer A : Measur-
The weak-measurement schemes ancillas need not cou- ing a degeneracy parameter w` or vm requires a mea-
ple to the whole system. One measures a system weakly surement of each spin. Yet the number of trials, and
by coupling an ancilla to the system, then measuring the number of ancillas, required to measure the coarse-
the ancilla strongly. Our weak-measurement protocol re- grained A remains constant as N grows. One can in-
quires one ancilla per weak measurement. Let us focus, fer A from weak measurements and, alternatively, from
for concreteness, on an A measurement for a general . other F (t)-measurement schemes (Sec. II D). A is less
The protocol involves three weak measurements and so fundamental than A , as A results from coarse-graining
three ancillas. Suppose that W and V manifest as one-
A . A , however, exhibits nonclassicality and OTOC
qubit Paulis localized at opposite ends of a spin chain.
Each ancilla need interact with only one site (Fig. 3). In time scales (Sec. III). Measuring A can balance the de-
contrast, the ancilla in [44] couples to the entire system. sire for fundamental knowledge with practicalities.
So does the ancilla in our interference scheme for mea- The weak-measurement scheme for inferring A can
suring A . Global couplings can be engineered in some be rendered more convenient. Section II A describes mea-
platforms, though other platforms pose challenges. Like surements of projectors . Experimentalists might prefer
our weak-measurement scheme, [41] and [42] require only measuring Pauli operators . Measuring Paulis suffices
local ancilla couplings. for inferring a multiqubit systems A : The relevant
In the weak-measurement protocol, each ancillas projects onto an eigenspace of a . Measuring the
state must remain coherent during only one weak yields 1. These possible outcomes map bijectively onto
measurementduring the action of one (composite) gate the possible -measurement outcomes. See Footnote 8
in a circuit. The first ancilla may be erased, then reused for mathematics.
in the third weak measurement. In contrast, each ancilla Our weak-measurement and interference schemes offer
in [41, 42, 44] remains in use throughout the protocol. the advantage of involving general operators. W and V
The Swingle et al. scheme for measuring <(F (t)), too, must be Hermitian or unitary, not necessarily one or the
requires an ancilla that remains coherent throughout the other. Suppose that W and V are unitary. Hermitian
protocol [41]. The longer an ancillas active-duty time, operators GW and GV generate W and V , as discussed
the more likely the ancillas state is to decohere. Like the in Sec. I B. GW and GV may be measured in place of W
weak-measurement sheme, the Swingle et al. scheme for and V . This flexibility expands upon the measurement
measuring |F (t)|2 requires no ancilla [41]. opportunities of, e.g., [41, 42, 44], which require unitary
Also in the interference scheme for measuring A [37], operators.
an ancilla remains active throughout the protocol. That Our weak-measurement and interference schemes of-
16
fer leeway in choosing not only W and V , but also . Ux depends on the system-detector coupling and on the
The state can assume any form D(H). In contrast, detector-measurement outcome.
infinite-temperature Gibbs states = 1/d were used The imbalance |p q| can be tuned experimentally.
in [47, 48]. Thermality of is assumed in [42]. Com- Our scheme has no need for a nonzero imbalance. We
mutation of with V is assumed in [39]. If shares assume that p equals q.
a V eigenbasis or the W(t) eigenbasis, e.g., if = 1/d,
our weak-measurement protocol simplifies from requiring
three sequential weak measurements to requiring two. 2. Notation
Let := x x + y y + z z
denote a vector of one-
C. Circuit for inferring A from weak qubit Pauli operators. The z basis serves as the com-
measurements putational basis in [116]. We will exchange the z basis
with the W eigenbasis, or with the V eigenbasis, in each
Consider a 1D chain S of N qubits. A circuit weak-measurement subcircuit.
implements the weak-measurement scheme reviewed in In our spin-chain example, W and V denote one-qubit
Sec. I D 4. We exhibit a circuit for measuring A . One Pauli operators localized on opposite ends of the chain
subcircuit implements each weak measurement. These S: W = W 1(N 1) , and V = 1(N 1) V . Unit
subcircuits result from augmenting Fig. 1 of [116]. V R3 are chosen such that n := n,
vectors W, for
Dressel et al. use the partial-projection formalism, n = W, V .
which we review first. We introduce notation, then re- The one-qubit Paulis eigendecompose as W =
view the weak-measurement subcircuit of [116]. Copies |+Wih+W| |WihW| and V = |+V ih+V |
of the subcircuit are embedded into our A -measurement |V ihV |. The whole-system operators eigendecompose
circuit. as W = W W V V
+ and V = + . A rotation op-
erator Rn maps the eigenstates to the n eigenstates:
z
The measurement is weak because is small. Rotating If W(t) and V are unitary, they square to 1. Equa-
18
What about lower temperatures? Data for the T = 1 and w3 , and under the simultaneous negations of v1 and
thermal state are shown in Figures 7, 8, and 9. The v2 . These symmetries have operational significances: A
coarse-grained quasiprobability is no longer real. Here, remains constant under permutations and negations of
too, the time required for A to deviate significantly measurement outcomes in the weak-measurement scheme
from its initial value is comparable with the time scale (Sec. I D 4). Symmetries break as the system starts
of changes in F (t). This comparability characterizes the scrambling: F (t) shrinks, shrinking the final term in
real and imaginary parts of A . Both parts oscillate at Eq. (62). A starts depending not only on squares of
long times. In the small systems considered here, such w` -and-vm functions, but also on the eigenvalues indi-
oscillations can arise from finite-size effects, including the vidually.
energy spectrums discreteness. With nonintegrable pa- Whereas the shrinking of F (t) bifurcates the lower lines
rameters, this model has an energy gap N =10 = 2.92 in Fig. 5, the shrinking does not bifurcate the upper
above the ground state. The temperature T = 1 is lines. The reason is that each upper line corresponds to
smaller than the gap. Hence lowering T from to 1 w2 w3 = v1 v2 = 1. [At early times, |F (t)| is small enough
brings the thermal state close to the ground state. that any F (t)-dependent correction would fall within the
What about long-time behavior? At infinite tem- lines widths.] Hence the final term in Eq. (61) is pro-
perature, A(1/d) approaches a limiting form after the portional to hW(t)V i. This prediction is consistent
scrambling-onset time but before any recurrence time. with the observed splitting. The hW(t)V i term does not
Furthermore, A(1/d) can approach one of only a few pos- split the lower lines: Each lower line satisfies w2 = w3
sible limiting values, depending on the functions argu- and/or v1 = v2 . Hence the hW(t)V i term vanishes. We
ments. This behavior follows from the terms in Eq. (55). leave as an open question whether these pitchforks can
At infinite temperature, hWi = hV i = 0. Also the 3- be understood in terms of equilibria, like classical-chaos
point functions vanish, due to the traces cyclicity. We pitchforks [118].
expect the nontrivial 2- and 4-point functions to be small In contrast with the T = data, the T = 1 data oscil-
at late times. (Such smallness is visible in the 4-point late markedly at late times. We expect these oscillations
function in Fig. 4.) Hence Eq. (55) reduces as to decay to zero at late times, if the system is chaotic, in
the thermodynamic limit. Unlike at infinite temperature,
1 + w2 w3 + v1 v2 W and V can have nonzero expectation values. But, if all
A (v1 , w2 , v2 , w3 ) |{z}
. (60)
16 nontrivial connected correlation functions have decayed,
t
Eq. (55) still implies a simple dependence on the w` and
According to Eq. (60), the late-time values of A(1/d) vm parameters at late times.
should cluster around 3/16, 1/16, and 1/16. This ex- Finally, Figures 10 and 11 show the coarse-grained
pectation is roughly consistent with Fig. 5, modulo the quasiprobability at infinite temperature, A(1/d) , with in-
upper lines bifurcation. tegrable parameters. The imaginary part remains zero,
A bifurcation of A signals the breaking of a sym- so we do not show it. The difference from the behavior in
metry at the onset of scrambling. Similarly, pitchfork Figures 4 and 5 (which shows T = , nonintegrable-H
plots signal the breaking of a symmetry in classical data) is obvious. Most dramatic is the large revival that
chaos [118]. The symmetrys mathematical form follows occurs at what would, in the nonintegrable model, be a
from Eq. (55). At early times, W(t) commutes with V , late time. Although this is not shown, the quasiprob-
and F (t) 1. Suppose, for simplicity, that = 1/d. The ability depends significantly on the choice of operator.
expectation values hW(t)i and hV i vanish, because every This dependence is expected, since different Pauli opera-
Pauli has a zero trace. Equation (55) becomes tors have different degrees of complexity in terms of the
noninteracting-fermion variables.
1h
A (v1 , w2 , v2 , w3 ) = (1 + w2 w3 + v1 v2 + w2 w3 v1 v2 )
16 i
+ (w2 + w3 )(v1 + v2 ) hW(t)V i . (61) B. Random states
Suppose that w2 = w3 and/or v1 = v2 , as in the We now consider random pure states |ih| and
lower lines in Fig. 5. A (.) reduces to the constant nonintegrable parameters. Figures 12, 13, and 14 show
F (t) and A for the operator choice W = 1z and V = N
z
1.0 1.0
0.8 0.8
0.6 0.6
Re[F] Re[F]
Im[F] 0.4 Im[F]
0.4
0.2
0.2
t
t 2 4 6 8 10 12 14
2 4 6 8 10 12 14
FIG. 4: Real and imaginary parts of F (t) as a function of FIG. 7: Real and imaginary parts of F (t) as a function of
time. T = thermal state. Nonintegrable parameters, time. T = 1 thermal state. Nonintegrable parameters,
N = 10, W = 1z , V = N
z
. N = 10, W = 1z , V = N
z
.
0.25
0000 1000 0000 1000
0.20
0.6
0001 1001 0001 1001
0.15 0010 1010 0010 1010
0.4 0011 1011
0011 1011
0.10
0100 1100 0100 1100
0.05 0101 1101
0101 1101 0.2
FIG. 5: Real part of A as a function of time. T = FIG. 8: Real part of A as a function of time. T = 1
thermal state. Nonintegrable parameters, N = 10, W = 1z , thermal state. Nonintegrable parameters, N = 10, W = 1z ,
z z
V = N . V = N .
in Eq. (55). The late-time values are roughly consistent given random pure state corresponds to an infinite tem-
with those for = 1/d but fluctuate more pronouncedly. perature. The reason is the thermodynamic entropys
The agreement between random pure states and the monotonic increase with temperature. Since the thermo-
T = thermal state is expected, due to closed-system dynamic entropy gives the density of states, more states
thermalization [119, 120]. Consider assigning a tempera- correspond to higher temperatures. Most states corre-
ture to a pure state by matching its energy density with spond to infinite temperature.
the energy density of the thermal state eH/T /Z, cast as For the random states and system sizes N considered,
a function of temperature. With high probability, any if H is nonintegrable, the agreement with thermal results
is not complete. However, the physics appears qualita-
2. 10-17
0000 1000
0001 1001 0.010 0000 1000
t
2 4 6 8 10 12 14 0010 1010 0001 1001
-17
0011 1011 0.005 0010 1010
-2. 10
0100 1100 0011 1011
0101 1101 t
-4. 10-17 2 4 6 8 10 12 14 0100 1100
0110 1110
- 0.005 0101 1101
0111 1111
-6. 10-17 0110 1110
- 0.010 0111 1111
1.0
0.25 0000 1000
0.15
0010 1010
0.6 0011 1011
0.10
Re[F] 0100 1100
0.4 Im[F] 0.05
0101 1101
t 0110 1110
0.2 2 4 6 8 10 12 14
-0.05 0111 1111
t
2 4 6 8 10 12 14
0.25 0.006
0000 1000
0000 1000
0.20 0.004 0001 1001
0001 1001
0.002 0010 1010
0.15 0010 1010
0011 1011
0.10
0011 1011 t
2 4 6 8 10 12 14 0100 1100
0100 1100
0.05 -0.002 0101 1101
0101 1101
t -0.004 0110 1110
2 4 6 8 10 12 14 0110 1110
0111 1111
-0.05 0111 1111 -0.006
tively similar.
to use W = 1z and V = N z
. For the Hamiltonian pa-
rameters chosen, this state lies far from the ground state.
The state therefore should correspond to a large effective
C. Product states
temperature. Figures 15, 16, and 17 show F (t) and A
for nonintegrable parameters.
Finally, we consider the product |+xiN of N copies
of the +1 x eigenstate (Figures 1517). We continue The real part of F (t) decays significantly from its ini-
tial value of one. The imaginary part of F (t) is nonzero
but remains small. These features resemble the infinite-
1.0 temperature features. However, the late-time F (t) values
are substantially larger than in the T = case and os-
0.8 cillate significantly.
We will sometimes call Eq. (65) dB. dB is a Gaussian This result depends on = 1/2N , not on the form of the
random variable with zero mean and with variance dynamics. But to compute A, we must compute
n 0 0
o
,
EB dBi,j dBi0 ,j,0 = ,0 , 0 i,i0 j,j 0 dt. (66) G = EB {G} (69)
and
The expectation value EB is an average over realizations
of the noise B. We demand that dt dt = 0 and dB dt = 0, F = EB {F } . (70)
in accordance with the standard Ito calculus. dB(t) is
independent of U (t), i.e., of all previous dBs. The computation of F appears in the literature [3].
We wish to compute the average, over the ensemble F initially equals unity. It decays to zero around t =
defined by Eq. (64), of the coarse-grained quasiprobabil- 1
3 log N , the scrambling time. The precise functional form
ity: of F is not crucial. The basic physics is captured in a phe-
n o nomenological form inspired by AdS/CFT computations
A(v1 , w2 , v2 , w3 ) = EB A (v1 , w2 , v2 , w3 ) . (67) [3],
c2
1 + c1
F , (71)
A. Infinite-temperature thermal state 1/2N 1 + c1 e3t
(2) W 111, 1V 11, 11W 1, 111V : 0, We have applied the traces cyclicality in the second term.
24
dG
= 2G. (75)
dt B. General state
(3) w2 w3 = 1, v1 v2 = 1: A = 0, and
To simplify the second term, we use a trick. Since
(4) w2 w3 = 1, v1 v2 = 1: A = 0.
j j kk m n j j kk = m
m n m n
n , (80)
These values are consistent with Fig. 5 at t = 0. These
values degeneracies are consistent with the symmetries we may pass the factors of j j kk through U (t), at the
discussed in Sec. III and in Sec. V A (Property 7). cost of changing some Brownian weights. We must con-
At long times, F() = 0, so A is sider a different set of dBs, related to the originals by
minus signs. This alternative set of Brownian weights
1 + w2 w3 + v1 v2 has the original sets ensemble probability. Hence the
At= = . (78) ensemble average gives the same result. Therefore,
16
The cases are EB {hj j kk U (t) iz U (t) j j kk i}
(1) w2 w3 = 1, v1 v2 = 1: A = 3/16, = EB {hU (t)j j kk iz j j kk U (t) i}. (81)
25
If i = j and/or i = k, the sum over j and/or the sum q12 (t) = q1 (t) = q12 (t) , (88)
over k vanishes. If i equals neither j nor k, the Pauli
and
operators commute. The term reduces to qi . i equals
neither j nor k in (N 1)(N 2)/2 terms. A factor of f12 = F. (89)
16 comes from the sums over j and k . Hence
Hermiticity of the Pauli operators implies that f12 is real.
dqi Hence the ensemble-averaged OTOC F is real for this
= N qi + (N 2)qi = 2qi . (82) choice of . The ensemble-averaged A has the form
dt
Consider the terms of the form qij (t) := hiz (t)jz i. k1 + k2 q1 + k3 F
A= , (90)
Note that hjz iz (t)i = qij . We may reuse the trick intro- 16
duced above. [This trick fails only when more than two wherein
copies of U appear, as in F (t)]. To be precise,
k1 = (1 + v1 )(1 + v2 + w3 w2 ), (91)
EB {hm n U (t)iz U (t) m
m n m n z
n j i}
k2 = (1 + v1 )(w3 + w2 )(1 + v2 ), (92)
= EB {hU (t)m n i m n U (t) jz i}.
m n z m n
(83)
and
As before, the sums over kill the relevant term in the
time derivative of qij , unless i 6= m, n. Hence k3 = (1 + v1 )w3 v2 w2 . (93)
Equations (90)(93) imply that A = 0 unless v1 = 1.
dqij
= 2qij , (84) The time scale after which q1 decays is order-one. The
dt time required for F to decay is of order log N (although
as at infinite temperature. not necessarily exactly the same as for the T = state).
Item (5), in the list above, concerns products of three Therefore, the late-time value of A is well approximated
Ws and V s. We must consider four expectation values by
of Pauli products. As seen above, two of these terms k1 + k3 F
reduce to qi terms. By the trick used earlier, At1 = . (94)
16
EB {h2z U (t)1z U (t) 2z }
= EB {hU (t)2z 1z 2z U (t) } = q1 (t). (85) C. Summary
The other term we must consider is This study has the following main messages.
EB {hiz (t)jz iz (t)i} =: fij . Our trick will not work,
because there are multiple copies of U (t) that are not all (1) In this model, the ensemble-averaged quasiproba-
simultaneously switched as operators are moved around. bility varies on two time scales. The first time scale
At early times, when iz (t) and jz approximately is an order-one relaxation time. At later times, the
commute, this term approximately equals hjz i = qj (0). OTOC controls the physics entirely. F (t) varies
At later times, including around the scrambling time, after a time of order log N .
this term decays to zero. (2) While the late-time physics of A is controlled en-
The general expression for A becomes tirely by the ensemble-averaged F (t), the negative
16 A = 1 + w3 w2 + v1 v2 values of A show a nonclassicality that might not
be obvious from F (t) alone. Furthermore, we com-
+ (w3 + w2 ) q1 (t) + (v1 + v2 ) q2 (0)
puted only the first moment of A . The higher
+ (w3 v2 + w3 v1 + w2 v1 ) q12 (t) + v2 w2 q12 (t) moments are likely not determined by F (t) alone.
+ w3 v2 w2 f12 (t) + (w3 v1 v2 + w2 v1 v2 ) q1 (t)
(3) For T = , the late-time physics is qualitatively
+ w3 w2 v1 q2 (0) + w3 w2 v1 v2 F(t). (86) similar to the late-time physics of the geometrically
All these q functions obey known differential equations. local spin chain in Sec. III.
The functions decay after a time of order one. We do not (4) Nonclassicality, as signaled by negative values of
have explicit expressions for the f functions that appear.
A , is extremely robust. It survives the long-time
They are expected to vary after a time log N .
limit and the ensemble average. One might have
expected thermalization and interference to stamp
1. Special case: 2z eigenstate out nonclassicality. On the other hand, we expect
the circuit average to suppress the imaginary part
of A rapidly.
We have no controlled examples
In a concrete example, we suppose that is a +1 eigen-
state of 2z . Expressions simplify: in which = A remains nonzero at long times.
Finding further evidence for or against this conjec-
q2 (0) = 1, (87) ture remains an open problem.
26
V. THEORETICAL STUDY OF A The quasiprobability behind F (K ) (t), we find, is K -
extended.
We have discussed experimental measurements, nu- Recent quasiprobability advances involve out-of-time
merical simulations, and analytical calculations of the ordering, including in correlation functions [123127].
OTOC quasiprobability A . We now complement these Merging these works with the OTOC framework offers
discussions with mathematical properties and physical an opportunity for further research (Sec. VI).
interpretations. First, we define an extended Kirkwood-
Dirac distribution exemplified by A . We still denote by
B(H) the set of bounded operators defined on H. A. Mathematical properties of A
Definition 1 (K -extended Kirkwood-Dirac quasiprob- A shares some of its properties with the KD
ability). Let {|ai} , . . . , {|ki} and {|f i} denote orthonor- quasiprobability (Sec. I A 4). Properties of A imply
mal bases for the Hilbert space H. Let O B(H) de- properties of P (W, W 0 ), presented in Appendix A.
note a bounded operator defined on H. A K -extended
Kirkwood-Dirac quasiprobability for O is defined as10 Property 5. The OTOC quasiprobability is a map A :
D(H) {v1 } {v1 } {w2 } {w2 } {v2 } {v2 }
(K )
AO (a, . . . , k, f ) := hf |kihk| . . . |aiha|O|f i . (95) {w3 } {w3 } C . The domain is a composition of
the set D(H) of density operators defined on H and eight
We will focus mostly on density operators O = sets of complex numbers. The range is not necessarily
D(H). One infers A
(K )
by performing 2K 1 weak real: C R.
measurements, and one strong measurement, per trial. A depends on H and t implicitly through U . The
The order in which the bases are measured is the order KD quasiprobability in [14] depends implicitly on time
in which the labels a, . . . , k, f appear in the argument similarly (see Footnote 10). Outside of OTOC contexts,
(K )
of AO (.). The conventional KD quasiprobability is 1- D(H) may be replaced with B(H). K -extended KD dis-
extended. The OTOC quasiprobability A is 3-extended. tributions represent bounded operators, not only quan-
Our investigation parallels the exposition, in Sec. I A, tum states. C, not necessarily R, is the range also of
(K )
of the KD distribution. First, we present basic mathe- the K -fold generalization A . We expound upon the
matical properties. A , we show next, obeys an analog ranges complexity after discussing the number of argu-
of Bayes Theorem. Our analog generalizes the known ments of A .
analog (5). Our theorem reduces exponentially (in sys- Five effective arguments of A : On the left-hand side
tem size) the memory needed to compute weak values, in of Eq. (28), semicolons separate four tuples. Each tu-
certain cases. Third, we connect A with the operator- ple results from a measurement, e.g., of W. We coarse-
decomposition argument in Sec. I A 3. A consists of co- grained over the degeneracies in Sections II AIV. Hence
efficients in a decomposition of an operator 0 that re- each tuple often functions as one degree of freedom. We
sults from asymmetrically decohering . Summing A (.) treat A as a function of four arguments (and of ). The
values yields a KD representation for . This sum can KD quasiprobability has just two arguments (apart from
be used, in experimental measurements of A and the O). The need for four arises from the noncommutation
OTOC, to evaluate how accurately the desired initial of W(t) and V .
state was prepared. Fourth, we explore the relation- Complexity of A : The ability of A to assume non-
ship between out-of-time ordering and quasiprobabilities. real values mirrors Property 1 of the KD distribution.
Time-ordered correlators are moments of quasiprobabil- The Wigner function, in contrast, is real. The OTOC
ities that clearly reduce to classical probabilities. Fi- quasiprobabilitys real component, <(A ), parallels the
nally, we generalize beyond the OTOC, which encodes Terletsky-Margenau-Hill distribution. We expect non-
K = 3 time reversals. Let K := 12 (K + 1). A K-fold classical values of A to reflect nonclassical physics,
OTOC F (K ) (t) encodes K time reversals [121, 122]. as nonclassical values of the KD quasiprobability do
(Sec. I A).
Equations (28) and (29) reflect the ability of A to as-
sume nonreal values. Equation (28) would equal a real
10 Time evolutions may be incorporated into the bases. For ex- product of probabilities if the backward-process ampli-
ample, Eq. (10) features the 1-extended KD quasiprobability
tude A and the forward-process amplitude A had equal
hf 0 |aiha|0 |f 0 i. The 0 := Ut0 Ut0 results from time-evolving
arguments. But the arguments typically do not equal
a state . The |f 0 i := Ut00 t0 |f i results from time-evolving an each other. Equation (29) reveals conditions under which
(1)
A (.) R and 6 R. We illustrate the case with two
P
eigenket |f i of F = f f |f ihf |. We label (10) as A (, a, f ),
rather than as A (1) 0 0
( , a, f ). Why? One would measure (10) by examples and the 6 case with one example.
preparing , evolving the system, measuring A weakly, inferring
outcome a, evolving the system, measuring F , and obtaining out- Example 1 (Real A #1: t = 0, shared eigenbasis, ar-
come f . No outcome f 0 is obtained. Our notation is that in [14] bitrary ). Consider t = 0, at which U = 1. The oper-
and is consistent with the notation in [37]. ators W(t) = W and V share an eigenbasis, under the
27
assumption that [W, V ] = 0: {|w` , w` i} = {|v` , v` i}. W eigenstates and the V eigenstates are
With respect to that basis, N
z y 1
A (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) h , +| , +i = ,
2
= w3 v2 w3 v2 v2 w2 v2 w2 w2 v1 w2 v1 N
1
X h z , +| y , i = ,
pj |hw3 , w3 |ji|2 2
j
N
z y i
R. (96) h , | , +i = , and
2
N
We have substituted into Eq. (29). We substituted in for i
h z , | y , i = . (100)
from Eq. (20). 2
Example 1 is consistent with the numerical simula- Suppose that = 1/d. A(1/d) (.) would have a chance
tions in Sec. III. According to Eq. (96), at t = 0, of being nonreal only if some |v` , v` i equaled | z , i.
P
degeneracies A =: A R. In Figures 9, 14, and 17, That | z , i would introduce an i into Eq. (29). But
the imaginary parts =(A ) clearly vanish at t = 0. In h z , | would introduce another i. The product would be
Fig. 6, =(A ) vanishes to within machine precision.11 real. Hence A(1/d) (.) R.
Consider a that lacks coherences relative to the
shared eigenbasis, 1/d. Example
e.g., = 1 implies that A is nonreal in the following example.
= A(1/d) at t = 0. But = A(1/d) remains zero for all Example 3 (Nonreal A : t = 0, nonshared eigenbases,
t in the numerical simulations. Why, if time evolution nondiagonal relative to both). Let t, W, V , {|w` , w` i},
deforms the W(t) eigenbasis from the V eigenbasis? The and {|vm , vm i} be as in Example 2.
reason appears to be a cancellation, as in Example 2. Suppose that has coherences relative to the W and V
Example 2 requires more notation. Let us focus on eigenbases. For instance, let = | x , +ih x , +|. Since
a chain of N spin- 21 degrees of freedom. Let denote | x , +i = 12 (| z , +i + | z , i),
the = x, y, z Pauli operator. Let | , i denote the
eigenstates, such that | , i = | , i. N -fold 1
= (| z , +ih z , +| + | z , +ih z , |
tensor products are denoted by | , i := | , iN . 2N/2
We denote by j the th Pauli operator that acts non- + | z , ih z , +| + | z , ih z , |)N . (101)
trivially on site j.
Let |w3 , w3 i = | z , i, such that its overlaps with V
Example 2 (Real A #2: t = 0, nonshared eigen- eigenstates can contain is. The final factor in Eq. (29)
bases, = 1/d). Consider the spin chain at t = 0, becomes
such that U = 1. Let W = 1z and V = N y
. Two
W eigenstates are | z , i. Two V eigenstates are 1 h
h iN hv1 , v1 ||w3 , w3 i = N/2 hv1 , v1 | | z , +iN
2
| y , +i = 12 (| z , +i + i| z , i) and | y , i = i
h iN + hv1 , v1 | | z , iN . (102)
1 (| z , +i i| z , i) . The overlaps between the
2
N
The first inner product evaluates to 1 , by
2
Eqs. (100). The second inner product evaluates to
N
11 The =(A ) in Fig. 6 equals zero identically, if w2 = w3 and/or i2 . Hence
if v1 = v2 . For general arguments,
=(A (v1 , w2 , v2 , w3 )) =
1 h
A (v1 , w2 , v2 , w3 ) 1 h N
i
2i hv1 , v1 ||w3 , w3 i = 1 + (i) . (103)
i 2N
(v1 , w2 , v2 , w3 ) .
A (97)
This expression is nonreal if N is odd.
The final term equals
h
W(t) W(t) V
i
W(t) V W(t)
Example 3, with the discussion after Example 1,
Tr w3 V v2 w2 v1 = Tr V
v1 w2 v2 w3 (98) shows how interference can eliminate
nonreality
from a
W(t) V W(t) V
= Tr w2 v2 w3 v1 = A (v1 , w3 , v2 , w2 ) . (99) quasiprobability. In Example 3, = A does not neces-
The first equality follows from projectors Hermiticity; and the sarily vanish. Hence the coarse-grained = A does not
second, from the traces cyclicality. Substituting into Eq. (97)
shows that A (.) is real if w2 = w3 . A (.) is real if v1 = v2 , by obviously vanish. But = A = 0 according to the dis-
an analogous argument. cussion after Example 1. Summing Example 3s nonzero
28
= A values must quench the quasiprobabilitys non- facilitate a proof [6]:12
reality. This quenching illustrates how interference can
wash out quasiprobabilities nonclassicality. Yet interfer- Freg (t) := Tr 1/4 W(t)1/4 V 1/4 W(t)1/4 V . (104)
ence does not always wash out nonclassicality. Section III
depicts A s that have nonzero imaginary components Freg (t) is expected to behave roughly like F (t) [6, 42].
(Figures 9, 14, and 17). Just as F (t) equals a moment of a sum over A , Freg (t)
Example 3 resonates with a finding in [107, 108]. Soli- equals a moment of a sum over
nas and Gasparinettis quasiprobability assumes nonreal
values when the initial state has coherences relative to Areg
(v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) (105)
the energy eigenbasis. 1/4 1/4
:= hw3 , w3 |U |v2 , v2 ihv2 , v2 | U |w2 , w2 i
hw2 , w2 |U 1/4 |v1 , v1 ihv1 , v1 |1/4 U |w3 , w3 i
Property 6. Marginalizing A (.) over all its arguments |v2 , v ihv2 , v |U
hw3 , w |U
3 2
|w2 , w i
2 2
(106)
except any one yields a probability distribution. |v1 , v ihv1 , v |U
|w3 , w i .
hw2 , w2 |U 1 1 3
Consider, as an example, summing Eq. (29) over every The proof is analogous to the proof of Theorem 1 in [37].
tuple except (w3 , w3 ). P
The outer products become res- Equation (106) depends on U := 1 eiH , which prop-
Z
olutions of unity, e.g., (w2 ,w ) |w2 , w2 ihw2 , w2 | = 1. agates in the complex-time variable := t 4T i
. The
2
A unitary cancels with its Hermitian conjugate: U U = Hermitian conjugate U =
1
e iH
propagates along
Z
1. The marginalization yields hw3 , w3 |U U |w3 , w3 i. i
= t + 4T .
This expression equals the probability that preparing , Areg has the symmetries of A(1/d) (Property 7)
time-evolving, and measuring the W eigenbasis yields the (eH/T /Z )
outcome (w3 , w3 ). for arbitrary T . One might expect Areg to behave sim-
ilarly to A , as Freg (t) behaves similarly to F (t). Nu-
This marginalization property, with the structural
merical simulations largely support P this expectation. We
and operational resemblances between A and the KD
compared A (.) with Areg (.) := degeneracies A (.) . The
quasiprobability, accounts for our calling A an extended distributions vary significantly over similar time scales
(K )
quasiprobability. The general K -extended A obeys and have similar shapes. Areg tends to have a smaller
Property 6. imaginary component and, as expected, more degenera-
cies.
The properties of A imply properties of P (W, W 0 ).
Property 7 (Symmetries of A(1/d) ). Let be the We discuss these properties in Appendix A.
infinite-temperature Gibbs state 1/d. The OTOC
quasiprobability A(1/d) has the following symmetries.
B. Bayes-type theorem and retrodiction with A
(A) A(1/d) (.) remains invariant under the si- We reviewed, in Sec. I A 2, the KD quasiprobabilitys
multaneous interchanges of (w2 , w2 ) with (1)
role in retrodiction. The KD quasiprobability A gen-
(w3 , w3 ) and (v1 , v1 ) with (v2 , v2 ): 0 0 0
eralizes the nontrivial part <(hf |aiha| |f i) of a condi-
A(1/d) (v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) =
tional quasiprobability p(a|, f ) used to retrodict about
A(1/d) (v2 , v2 ; w3 , w3 ; v1 , v1 ; w2 , w2 ). (1)
an observable A. Does A play a role similar to A ?
It does. To show so, we generalize Sec. I A 2 to com-
posite observables. Let A, B, . . . , K denote K observ-
(B) Let t = 0, such that {|w` , w` i} = {|v` , v` i} (un- ables. K . . . BA might not be Hermitian but can be sym-
der the assumption that [W, V ] = 0). A(1/d) (.) re- metrized. For example, := K . . . A + A . . . K is an ob-
mains invariant under every cyclic permutation of servable.13 Which value is most reasonably attributable
its arguments.
Equation (29) can be recast as a trace. Property 7 12 The name regulated derives from quantum field theory. F (t)
follows from the traces cyclicality. Subproperty (B) re- contains operators W (t) and W(t) defined at the same space-
lies on the triviality of the t = 0 time-evolution operator: time point (and operators V and V defined at the same space-
U = 1. The symmetries lead to degeneracies visible in time point). Products of such operators encode divergences.
One can regulate divergences by shifting one operator to an-
numerical plots (Sec. III). other space-time point. The inserted 1/4 = 1/4 1
eH/4T shifts
Z
Analogous symmetries characterize a regulated operators along an imaginary-time axis.
13 := i(K . . . A A . . . K). An operator can be symmetrized
So is
quasiprobability. Maldacena et al. regulated F (t) to
29
to retrodictively? A weak value weak given by Eq. (3). A rightward-pointing arrow labels quantities in which
We derive an alternative expression for weak . In our ex- the outer products, |kihk|, . . . , |aiha|, are ordered analo-
pression, eigenvalues are weighted by K -extended KD gously to the first term K . . . A in . A leftward-pointing
quasiprobabilities. Our expression reduces exponentially, arrow labels quantities in which reading the outer
in the systems size, the memory required to calculate products |aiha|, . . . , |kihk| backwardfrom right to left
weak values, under certain conditions. We present gen- parallels reading K . . . A forward.
(K )
eral theorems about A , then specialize to the OTOC
A . Proof. The initial steps come from [14, Sec. II A], which
recapitulates [7375]. For every measurement outcome f ,
Theorem 1 (Retrodiction about composite observ- we assume, some number f is the guess most reasonably
ables). Consider a system S associated with a Hilbert attributable to . We combine P these best guesses into
space H. For concreteness,
P we assume P that H is dis- the effective observable est := f f |f 0 ihf 0 |. We must
crete. Let A = a|aiha| , . . . , K = k k|kihk| denote
a optimize our choice of {f }. We should quantify the
K observables defined on H. Let Ut denote the family of
distance between (1) the operator est we construct and
unitaries that propagates the state of S along time t.
(2) the operator we wish to infer about. We use the
Suppose that S begins in the state at time P t = 0, then weighted trace distance
evolves under Ut00 until t = t00 . Let F = f f |f ihf | de-
note an observable measured at t = t00 . Let f denote
D0 (, est ) = Tr 0 [ est ]2 . (114)
the outcome. Let t0 (0, t00 ) denote an intermediate
time. Define 0 := Ut0 Ut0 and |f 0 i := Ut00 t0 |f i as time- 0 serves as a positive prior bias [14].
evolved states. Let us substitute in for the form of est . Expanding
The value most reasonably attributable retrodictively to the square, then invoking the traces linearity, yields
the time-t0 := K . . . A + A . . . K is the weak value
h Xh
X D0 (, est ) = Tr(0 2 ) + f2 hf 0 |0 |f 0 i
weak (, f ) = (a . . . k) p (a, . . . , k|, f )
f
a,...,k i
i
f (hf | |f i + hf 0 |0 |f 0 i) .
0 0 0
(115)
+ p (k, . . . , a|, f ) . (107)
The weights are joint conditional quasiprobabilities. They factor equals 2<(hf 0 |0 |f 0 i). Adding
The parenthesized P
0 0 0 0 0 0 2
obey analogs of Bayes Theorem: and subtracting f hf | |f i[<(hf | |f i)] to and
from Eq. (115), we complete the square:
p (a, . . . , k, f |)
p (a, . . . , k|, f ) = (108) X
p(f |) D0 (, est ) = Tr(0 2 ) hf 0 |0 |f 0 i[<(hf 0 |0 |f 0 i)]2
<(hf 0 |kihk| . . . |aiha|0 |f 0 i) f
, (109) !2
hf 0 |0 |f 0 i X
0 0 0 <(hf 0 |0 |f 0 i)
+ hf | |f i f .
and hf 0 |0 |f 0 i
f
p (k, . . . , a, f |) (116)
p (k, . . . , a|, f ) = (110)
p(f |)
Our choice of {f } should minimize the distance (116).
<(hf 0 |aiha| . . . |kihk|0 |f 0 i)
. (111) We should set the square to zero:
hf 0 |0 |f 0 i
<(hf 0 |0 |f 0 i)
Complex generalizations of the weights numerators, f = . (117)
hf 0 |0 |f 0 i
A(K ) 0 0 0
, (a, . . . , k, f ) := hf |kihk| . . . |aiha| |f i (112)
Now, we deviate from [14, 7375]. We substitute the
and definition of into Eq. (117). Invoking the linearity of
< yields
A(K ) 0 0 0
, (k, . . . , a, f ) := hf |aiha| . . . |kihk| |f i , (113)
<(hf 0 |K . . . A0 |f 0 i) <(hf 0 |A . . . K0 |f 0 i)
are K -extended KD distributions. f = + .
hf 0 |0 |f 0 i hf 0 |0 |f 0 i
(118)
in multiple ways. Theorem 1 governs . Appendix B contains an We eigendecompose A, . . . , K. The eigenvalues, being
analogous result about . Theorem 1 extends trivially to Her- real, can be factored out of the <s. Defining the eigen-
mitian (already symmetrized) instances of K . . . A. Corollary 1 values coefficients as in Eqs. (109) and (111), we reduce
illustrates this extension. Eq. (118) to the form in Eq. (107).
30
Theorem 1 reduces exponentially, in system size, the (ii) substitute into Eq. (107).
space required to calculate weak , in certain cases.14 For
concreteness, we focus on a multiqubit system and on Let (n) denote the space required to compute weak ,
l-localPoperators A, . . . , K. An operator O is l-local if aside from the space required to store weak , with constant
O = j Oj , wherein each Oj operates nontrivially on, precision, using method (n) = (1), (2), in the asymptotic
at most, l qubits. Practicality motivates this focus: The limit. Method (1) requires a number of bits at least expo-
lesser the l, the more easily l-local operators can be mea- nential in the number K of local observables:
sured.
We use asymptotic notation from computer science: (1) = 2K . (119)
Let f f (N ) and g g(N ) denote any functions of
the system size. If g = O(f ), g grows no more quickly Method (2) requires a number of bits linear in K :
than (is upper-bounded by) a constant multiple of f in
the asymptotic limit, as N . If g = (f ), g grows (2) = O(K ) . (120)
at least as quickly as (is lower-bounded by) a constant
multiple of f in the asymptotic limit. If g = (f ), g Method (2) requires exponentiallyin K and so in N
is upper- and lower-bounded by f : g = O(f ), and g = less memory than Method (1).
(f ). If g = o(f ), g shrinks strictly more quickly than f
in the asymptotic limit. Proof. Using Method (1), one computes S . S is a
2N 2N complex matrix. The matrix has (2K ) nonzero
Theorem 2 (Weak-value space saver). Let S denote a elements: A, . . . , K are traceless, so each of AS , . . . , KS
system of N qubits. Let H denote the Hilbert space as- contains at least two nonzero elements. Each operator
sociated with S. Let |f 0 i H denote a pure state and at least doubles the number of nonzero elements in S .
0 D(H) denote a density operator. Let S denote any Specifying each complex number with constant precision
fixed orthonormal basis for H in which each basis element requires (1) bits. Hence Method (1) requires 2K
equals a tensor product of N factors, each of which oper- bits.
ates nontrivially on exactly one site. S may, for example, Let us turn to Method (2). We can store hf 0 |0 |f 0 i in
consist of tensor products of z eigenstates. a constant number of bits.
Let K denote any polynomial function of N : K Step (B) can be implemented with a counter variable
K (N ) = poly(N ). Let A, . . . , K denote K traceless l- CO for each local operator O, a running-total variable G,
local observables defined on H, for any constant l. Each and a current term variable T . CO is used to iterate
observable may, for example, be a tensor product of l through the nonzero eigenvalues of O (arranged in some
nontrivial Pauli operators and N l identity opera- fiducial order). O has O(2l ) nonzero eigenvalues. Hence
tors. The composite observable := P A...K + K...A CO requires O(l) bits. Hence the set of K counters CO
is
P not necessarily l-local. Let A = a a|aiha| , . . . , K = requires O(lK ) = O(K ) bits.
k k|kihk| denote eigenvalue decompositions of the local The following algorithm implements Step (B):
observables. Let OS denote the matrix that represents an
operator O relative to S. (i) If CK < its maximum possible value, proceed as
Consider being given the matrices AS , . . . , KS , 0S , and follows:
0
|f iS . From this information, the weak value weak can
be computed in two ways: (a) For each O = A, . . . , K, compute the (2CO )th
nonzero eigenvalue (according to the fiducial
(1) Conventional method ordering).
(A) Multiply and sum given matrices to form S = (b) Multiply the eigenvalues to form a . . . k. Store
KS . . . AS + AS . . . KS . the product in T .
(B) Compute hf 0 |0 |f 0 i = hf 0 |S 0S |f 0 iS . (c) For each O = A, . . . , K, calculate the (2CO )th
0
(C) Substitute into weak = <
hf |S S 0S |f 0 iS
. eigenvector column (according to some fidu-
0 0 0
hf | |f i cial ordering).
(2) K -factored method (d) Substitute the eigenvector columns into
Eqs. (109) and (111), to compute p (.) and
(A) Compute hf 0 |0 |f 0 i. p (.).
(B) For each nonzero term in Eq. (107), h
(e) Form (a . . . k) p (a, . . . , k|, f ) +
(i) calculate p (.) and p (.) from Eqs. (109)
and (111). p (k, . . . , a|, f ). Update T to this value.
(f) Add T to G.
(g) Erase T .
14 Space means memory, or number of bits, here. (h) Increment CK .
31
(ii) If CK equals its maximum possible value, increment offers insight into the result:
the counter of the preceding variable, J , in the X
list; reset CK to one; and, if J has not attained its weak (, f ) = (k . . . a)
p (k, . . . , a|, f )
maximum possible value, return to Step (i). Pro- k,...,a
ceed in this mannerincrementing counters; then X
resetting counters, incrementing preceding coun- + (a . . . k)
p (a, . . . , k|, f ) . (121)
a,...,k
ters, and returning to Step (i)until CA reaches
its maximum possible value. Then, halt.
Each sum parallels the sum in Eq. (6). Equation (121)
suggests that we are retrodicting about K . . . A indepen-
The space needed to store G is the space needed to
dently of A . . . K. But neither K . . . A nor A . . . K is Her-
store weak . This space does not contribute to (2) .
mitian. Neither operator seems measurable. Ascribing
How much space is needed to store T ? We must cal- a value to neither appears to have physical significance,
culate weak with constant precision. weak equals a prima facie.
sum of 2lK terms. Let j denote the error in term Yet non-Hermitian products BA have been measured
P2lK
j. The sum j=1 j must be O(1). This require- weakly [3234]. Weak measurements associate a value
ment is satisfied if 2lK (max | |) = o(1), which implies with the supposedly unphysical K . . . A, just as weak
j j
maxj |j | = o 2lK . We can specify each term, with a measurements enable us to infer supposedly unphysical
small-enough roundoff error, using O(lK ) = O(K ) bits. probability amplitudes A . The parallel between K . . . A
Altogether, the variables require O(K ) bits. As the and A can be expanded. K . . . A and A . . . K, being
set of variables does, so does the O-factored method. non-Hermitian, appear to lack physical significance inde-
pendently. Summing the operators forms an observable.
Similarly, probability amplitudes A and A appear to
Performing Method (2) requires slightly more time lack physical significance independently. Multiplying the
than performing Method (1). Yet Theorem 2 can benefit amplitudes forms a probability. But A and K . . . A can
computations about quantum many-body systems. Con- be inferred individually from weak measurements.
sider measuring a weak value of a quantum many-body We have generalized Sec. I A 2. Specializing to k = 3,
system. One might wish to predict the experiments out- and choosing forms for A, . . . K, yields an application of
come and to compare the outcome with the prediction. A to retrodiction.
Alternatively, consider simulating quantum many-body
systems independently of laboratory experiments, as in Corollary 1 (Retrodictive application of A ). Let S,
Sec. III. One must compute weak values numerically, us- H, , W(t), and V be defined as in Sec. I B. Suppose
ing large matrices. The memory required to store these that S is in state at P time t = 0. Suppose that the
matrices can limit computations. Theorem 2 can free up observable F = W = w3 ,w3 w3 |w3 , w3 ihw3 , w3 | of
space. S is measured at time t00 =Pt. Let (w3 , w3 ) denote
Two more aspects of retrodiction deserve exposition: the outcome. Let A = V = v1 ,v v1 |v1 , v1 ihv1 , v1 |,
P 1
related studies and the physical significance of K . . . A. B = W(t) =
w2 ,w2 w2 U |w2 , w2 ihw2 , w2 |U , and
Related studies: Sequential weak measurements have P
C = V = v2 ,v v2 |v2 , v2 ihv2 , v2 | . Let the composite
been proposed [11] and realized recently [3234]. Lun- 2
observable = ABC = V W(t)V . The value most rea-
deen and Bamber proposed a direct measurement of a
sonably attributable to retrodictively is the weak value
density operator [11]. Let denote a density operator de-
fined on a dimension-d Hilbert space H. Let Sa := {|a` i} X
and Sb := {|b` i} denote orthonormal mutually unbiased weak (; w3 , w3 ) = v1 w 2 v2
bases (MUBs) for H. The interbasis inner products have (v1 ,v1 ),(v2 ,v2 ),(w2 ,w2 )
Relationship
n between scrambling and completeness of
o target, state target . Thermal states eH/T /Z are diffi-
|aihf |
S: The hf |ai in Sec. I A 3 forms a basis for D(H). cult to prepare, for example. How accurately was target
prepared? One may answer by comparing target with
But suppose that 0 6= . S fails to form a basis.
the KD quasiprobability A for .
What does this failure imply about W(t) and V ?
Reconstructing the KD quasiprobability requires
The failure is equivalent to the existence of a vanish-
a trivial sum over already-performed measurements
ing := |hw3 , w3 |U |v2 , v2 i|. Some vanishes if some
[Eq. (129)]. One could reconstruct independently via
degenerate eigensubspace H0 of W(t) is a degenerate
conventional quantum-state tomography [131]. The re-
eigensubspace of V : Every eigenspace of every Hermi-
tian operator has an orthogonal basis. H0 therefore has construction inferred from A may have lower precision,
an orthogonal basis. One basis element can be labeled due to the multiplicity of weak measurements and to the
U |w3 , w3 i; and the other, |v2 , v2 i. sum. But independent tomography would likely require
The sharing of an eigensubspace is equivalent to the extra measurements, exponentially many in the system
commutation of some component of W(t) with some com- size. Inferring A requires exponentially many measure-
ponent of V . The operators more likely commute before ments, granted.16 But, from these measurements, one
the scrambling time t than after. Scrambling is therefore can infer A , the OTOC, and . Upon reconstructing
expected to magnify the similarity between the OTOC the KD distribution for , one can recover a matrix rep-
quasiprobability A and the conventional KD distribu- resentation for via an integral transform [11].
tion. The asymmetrically decohered 0 : What does the de-
Let us illustrate with an extreme case. Suppose that composed operator 0 signify? 0 has the following prop-
all the s lie as far from zero as possible: erties: The term subtracted off in Eq. (126) has trace
zero. Hence 0 has trace one, like a density operator.
1 But the subtracted-off term is not Hermitian. Hence 0
= . (133) is not Hermitian, unlike a density operator. Nor is 0
d
anti-Hermitian, necessarily unitarity, or necessarily anti-
Equation (133) implies that W(t) and V eigenbases are unitary.
mutually unbiased biases (MUBs) [129]. MUBs are eigen- 0 plays none of the familiar rolesof state, observ-
bases of operators that maximize the lower bound in an able, or time-evolution operatorin quantum theory.
uncertainty relation [130]. If you prepare any eigen- The physical significance of 0 is not clear. Similar
state of one operator (e.g., U |w` , w` i) and measure quantities appear in weak-measurement theory: First,
the other operator (e.g., V ), all the possible outcomes non-Hermitian products BA of observables have been
have equal likelihoods. You have no information with measured weakly (see Sec. V B and [3234]). Second,
which to predict the outcome; your ignorance is maxi- nonsymmetrized correlation functions characterize quan-
mal. W(t) and V are maximally incompatible, in the tum detectors of photon absorptions and emissions [124].
quantum-information (QI) sense of entropic uncertainty Weak measurements imbue these examples with physi-
relations. Consistency between this QI sense of mutu- cal significance. We might therefore expect 0 to have
ally incompatible and the OTOC sense might be ex- physical significance. Additionally, since 0 is non-
pected: W(t) and V eigenbases might be expected to Hermitian, non-Hermitian quantum mechanics might of-
form MUBs after the scrambling time t . We elaborate fer insights [132].
on this possibility in Sec. VI C. The subtraction in Eq. (126) constitutes a removal of
KD quasiprobabilities are typically evaluated on coherences. But the subtraction is not equivalent to a de-
MUBs, such as position and momentum eigenbases [10, cohering channel [112], which outputs a density operator.
11, 34]. One therefore might expect A to relate more Hence our description of the decoherence as asymmetric.
closely the KD quasiprobability after t than before. The The asymmetry relates to the breaking time-reversal
OTOC motivates a generalization of KD studies beyond invariance. Let U |w3 , w3 i =: |w
3 i be fixed throughout
MUBs. the following argument (be represented, relative to any
Application: Evaluating a state preparations accuracy: given basis, by a fixed list of numbers). Suppose that
Experimentalists wish to measure the OTOC F (t) at = eH/T /Z. The removal of hv2 , v2 ||w 3 i terms from
each of many times t. One may therefore wish to measure is equivalent to the removal of hv2 , v2 |H|w 3 i terms from
A after t . Upon doing so, one may be able to infer not H: 7 0 H 7 H 0 . Imagine, temporarily, that H 0
only F (t), but also the accuracy with which one prepared could represent a Hamiltonian without being Hermitian.
the target initial state.
Suppose that, after t , some S that forms a basis for H.
Consider summing late-time A (.) values over (w2 , w2 ) 16 One could measure, instead of A , the coarse-grained quasiprob-
and (v1 , v1 ). The sum equals a KD quasiprobability ability A =:
P
degeneracies A (Sec. II A). From A , one could
for . The quasiprobability encodes all the information
infer the OTOC. Measuring A would require exponentially fewer
in [10, 11]. One can reconstruct the state that one measurements. But from A , one could not infer the KD distri-
prepared [3234]. bution. One could infer a coarse-grained KD distribution, akin
The prepared state might differ from the desired, or to a block-diagonal matrix representation for .
34
H 0 would generate a time evolution under which |w 3 i to probability values. Summing ATOC
under constraints
could not evolve into |v2 , v2 i. But |v2 , v2 i could evolve yields a complex distribution PTOC (W, W 0 ). The TOC
into |w3 i. The forward process would be allowed; the FTOC (t) equals a moment of PTOC (W, W 0 ), obeying a
reverse would be forbidden. Hence 7 0 relates to a Jarzynski-like equality.
breaking of time-reversal symmetry.
Interpretation of the sum in Eq. (129): Summing
A (.) values, in Eq. (129), yields a decomposition co- 1. Time-ordered correlator FTOC (t)
efficient C of 0 . Imagine introducing that sum into
Eq. (125). The OTOC quasiprobability A (.) would The OTOC equals a term in the expectation value h.i of
become a KD quasiprobability. Consider applying this the squared magnitude |.|2 of a commutator [. , .] [6, 30],
summed Eq. (125) in Eq. (122). We would change from
retrodicting about V W(t)V to retrodicting about the C(t) := [W(t), V ] [W(t), V ] (134)
leftmost V .
= W (t)V V W(t) V W (t)W(t)V
D. Relationship between out-of-time ordering and + 2<(F (t)) . (135)
quasiprobabilities
The second term is a time-ordered correlator (TOC),
The OTOC has been shown to equal a moment of FTOC (t) := V W (t)W(t)V . (136)
the complex distribution P (W, W 0 ) [37]. This equal-
ity echoes Jarzynskis [38]. Jarzynskis equality governs The first term, W (t)V V W(t) , exhibits similar
out-of-equilibrium statistical mechanics. Examples in- physics. Each term evaluates to one if W and V are
clude a quantum oscillator whose potential is dragged unitary. If W and V are nonunitary Hermitian opera-
quickly [133]. With such nonequilibrium systems, one tors, the TOC reaches its equilibrium value by the dissi-
can associate a difficult-to-measure, but useful, free- pation time td < t (Sec. I C). The TOC fails to reflect
energy difference F . Jarzynski
cast F in terms of the scrambling, which generates the OTOCs Lyapunov-type
characteristic function eW of a probability distribu- behavior at t (td , t ).
tion P (W ).17 Similarly, the difficult-to-measure, but use-
ful, OTOC F (t)D has been castE in terms of the character-
0
W 0) 2. TOC probability amplitude ATOC
istic function e(W + of the summed quasiprob-
variable, is , yields eW . Jarzynskis equality reads, measurement, and U . A encodes two time rever-
W U, W
F
e =e . sals. ATOC encodes none, as one might expect.
35
. .. .The PTOC
)average
0 of
F (K ) (t) := hW(t)V . . . W(t)V
0 TOC TOC
:=(WiTOC
hf Tr(TOC f (W TOC , TOC
W(t)V
)i TOC )P (144)
W(t)V
TOC (WTOC ,W
.than
Let
the
than
probabilit
4.proba
WTOC
can4.(1
Com
Tt
| {z } X | 0 {z } OTOC wea
ity. (wqu
OTOC
TOC
:=
W ,W
hf (WTOC , WTOC 0
f (W TOC
)iTOC , WTOC 36
0
TOC
)PTOC (WTOC
ogous
W , Let
W
than
to0the
than
4. TOC
th,
=Com
prob
Let (wtTW
=
2
comeK
W 4. TOC
2K X
5. Jarzynski-like 2K for the TOC
equality
,Heisenberg- TO
Each such correlation function := contains
W ,W
f (WK
TOC
W 0
0
TOC
)P
ogous new
sextu
ogousOTO
to0 t
t
TOC
TOC TOC (144)TOC
(W , W
constraints.
W
bilitytarie
WLet4.W
than
A Let
TO=Co
4. TOC
comewea
TOC TOC
TO
picture operators W(t) interleaved W 5.,Wwith K
Jarzynski-like time-0
equality oper-
0
The TOC obeys a Jarzynski-like equality analogous
for the TOC ogous
come
sextu
to
ogous th
s
0
Each such correlation Heisenberg- TOC TOC isfy the constra
sure cons
4. Complex TOC distribution PTOC (WTOC , WTOC ) .function contains K
Wconstraints
WLet = 4
K) K
Let W A
bility TOC TOC
ators
Each Vsuch F (correlation
(t) encodes Eq.2K
function
(11) in5.[31].
contains
Jarzynski-like
1 obeys
= time
K
equality
Heisenberg-
Ka Jarzynski-like for the TOC
reversals, come
bility
Pogous sextu
A
come
ogous
(W
isfy to
the TOC
TOCT
TO
W(t)operators
picture operatorsillustrated interleaved with The TOC time-0 equality analogou
W sure
constraints
isfy W
tari
the
constr con
K Koper-
TOC
Le =
picture in Fig. 19. W(t) We focus
Theorem
interleaved
Eq. The(11)
TOC
on Hermitian
in45.[31].
(Jarzynski-like
with
Jarzynski-like
obeys a
W
theorem
time-0 and
equality
Jarzynski-like
forVthe
oper-
for
equality
, TOC
the TOC).
bility come
come
P Athe
analogou
PA
bility
sex
ogou
T TOC TOC
TOC
TOC
(K) as in [6, 125], time-ordered correlator (132) obeys the Jarzynski-l isfy to W th
isfy
sure
(W
constr (w
con th
TOC T
illustrated infocus
Fig.(2)19. We focus(11) on Hermitian W ,and VEthe, Jarzynski-
illustrated in Fig. 19. We on Hermitian @(2WK)D and V+obeys
Theorem in4 [31].
(Jarzynski-like theorem for fTOC).
denote P anT
bility
ogous to thermodynamic work. We fix the constraints K = 2: F (t) = F (t). If
Eq.
K
equality <
time-ordered 3, Fcorrelator (t) is
W not
( (132) OTO.
W the
)
P A Func
average
Jarzynski-
0 to (W
of
isfy
0 A (t
ft
TOC
TOC
TOC
K = 2: F (t) = F (2)
TOC , hf=0
K 0 (W
f:=denote
WT
sum the quasiprobabilities that satisfy the constraints: W(t) W(t) W(t) (summed) quasiprobabilities equality Eq. The (11) TOC
The
time-or
thes
in
Proof. The proof is analogous
t U U inE. ...
[31].Higher-order OTOCs as
Dierentiating a characteristic
to the proof ofEq.
Experiment
time
moments of
function of
Theore
Theorem
F longe
Cer
(11)
Eq.
equality
time-ordere
Theorem
again
The
(t) (1
Eq.
Theo
stra
TOC
in
=
(1
a
X 0
E. Higher-order
again yields higher-
(summed)
(summed)
OTOCs as moments
quasiprobabilities
and quasiprobabilities
higher-point correlation
longe
The
equality
time-ordere
Eq.P(11)
equality
F TO
time-o
(K
Theo
fu
equali Th i
TOC
0 wherein time-o
thes ,
PTOC (WTOC , WTOC ) := V V V
versals.
So does dierentiating
Dierentiating
E. . .Higher-order
.
ButDierentiating
each resulting
again yields higher-
Let K
(summed)
0 th
a characteristic
acorrelator
=W(t)V (2K pair
P (W, W 0 ) again and
Experiment
OTOCs as moments
encodes
characteristic
1and quasiprobabilities
higher-point
+ 1), for K
function of
time function
just three
= 1,
Theorem
again
F longe
time-order
again
wherein
2,P
:=
time
equality
correlation
Eq.
(t) (=
equalit
F whereinaga
(t) =
F
The
F fu
time
. (K
.where
TOC
TOC TO
..fu , TO
1 W(t)V pair again yields
V K
tions.
V 0
-fold
SoOTOCdoeshigher-has
2and
dierentiating
been V
higher-point
P (W,
defined [91,
0 correlation
W92]:
0
) againwhereinand equa
ag ,W
tions. .So. .does dierentiating Pencodes
(W, W just ) again F
andtime ag
wherei(t) TOC
ATOC
(v1 , v1 ; w1 , w1 ; v2 , v2 ) W (w1 v2 ) W 0 (w1 v1 ) . 1ststtime
ButDierentiating
But
versals.
again
each
each
(K ) yields
resulting
. . Let
.resulting
(2 K 0
higher-
correlator
a
Experiment 1
characteristic
0 -1)correlator
=th.time (2K
12.and +encodes
1),
higher-point
function
for justK =
three
three1,
correlation
again
A
2, :=
time(KF
.. ..f
T
(K ) 0
1. W(t)V i Tr( W(t)V . . .W(
(14
(K
VFIG. 19:1stK
V-fold out-of-time-ordered
time Fversals.
(K ) (t) :=
F 0 (t) := hW(t)VV
Let hW(t)V
(2
| K 0 -1)th. .time
0
= (2K
correlator + 1), for
.2. W(t)V}i Tr( W(t)V
.{z K | = 1, A
2,
. . W(
.{z .. .
reversal {z
[Eq.reversal
| has2K {z } contains |K 0 Heisenbe
PTOC forms a complex distribution. Let f denote any (OTOC): . . . Each
The conventional KOTOC -fold
such OTOCcorrelation
operators W(t)
(22)],
2K
been
function defined
encodes 0
0
[91,
just 92]:
interleaved with K time-0op 0
2K
2K
0
0
picture T (1
0 three time reversals. The K-fold
1st W(t)V pair th ) OTOC
(KW(t)V (K pair
F V .(t)F := 0hW(t)V
19: K-fold out-of-time-ordered F (K ) (t) encodes
0
. . . W(t)V
2K }i K W(t)V ...W
(1
function of WTOC and WTOC . The PTOC average of f is FIG.
K 1 = KThe
2(OTOC): = conventional
ators
3, 5, . . . timeEach
)
|(t) encodes
such correlation
reversals. correlation
The
correlator
{zfunction
time
0
1=
measured
Tr(
contains |K
time reversa
{z
K 00 Heisenb
Heisenb
...
Each
OTOC such
picturethe [Eq.
operators (22)],
2K function
0
W(t) interleaved
contains
encodes
interleaved just
with K 0 2K(K
G
time-0
K 0 time-0
0
op
by a laboratory clock runs along picture
abscissa.
operators W(t)
(K ) The ordinate
with T(
op
0 1st time (2
three time reversals. The Kators -1)
ators th time(K 00 )
-fold .. OTOC (t) F
encodes(t) 2Kencodes
0
hf (WTOC , WTOC )i (146) reversal
represents
the time parameter t,VVwhich F (K ) (t)
F may encodes 2K
be inverted 0
11 =
=K time revers
inK time revers
2K reversal
Each such correlation
1 = K = 3, 5, . . . time reversals. The time measured function contains K 0
Heisenb
X experiments. The orange, leftmost
by a laboratory clock runs along picturethe dot represents
operators the state
W(t) interleaved
abscissa. The ordinate with K 0 time-0G (K
Po
0 0 preparation . Each green dot represents W(t) or
:= f (WTOC , WTOC )PTOC (WTOC , WTOC ). V . F (K ) (t)a encodes 2Ka0 V1. = K time rever
0
ators
FIG. 19: K-fold represents
Each
the time parameter t,correlator
out-of-time-ordered
purple line represents a unitary
which may be inverted in
time evolution. The
0
WTOC ,WTOC experiments. The orange, leftmost dot represents(Kthe state
(OTOC): The conventional
diagram, scanned OTOC from left[Eq. to (23)], encodes
right, represents Fjust) (t), P
preparation . Each green dot represents a W(t) or a V .
scanned fromleft to right. (K)
three time reversals.
EachThe purple -fold
K line OTOCa F
represents unitary(t) encodes
time evolution. The
2K 1 = K = 3,diagram,
5, . . . time
scannedreversals.
from left The time
to right, measured
represents F (K ) (t),
(K ) The
5. Jarzynski-like equality for the TOC Theruns
scanned
by a laboratory clock greater
from thetothe
left
along K , abscissa.
the longer The
right. the distribution
ordinate P
tem
of which F (K ) (t) equals a moment. We define P (K ) in
represents the time
three
parameter t,recall
which may be inverted in liter
The
Thesteps:
experiments. TheAorange,
Wethe
greater
leftmost Kdot,the
the -extended
Klonger
represents thethe quasiprobability
distribution
state P (K ) feren
The TOC obeys a Jarzynski-like equality analogous to (K )
[Eq.F(91)].
(K ) We introduce measurable random(K tem
of which (t) equals a moment. We define P vari- )
in
preparation . Each
ables green dotWrepresents
0 a W(t) or a V. liter
Eq. (11) in [37]. three W ` andWe
steps: 0. These
`recall the K variables
-extendedparticipate in con-
quasiprobability The
Each purple line represents
(K ) a unitary (Ktime
)
Aintroduce
evolution. The fere
A
straints on sums
[Eq. (91)]. of We (.) values.
measurable
random vari- OTO
diagram, scanned ables
from W left to right,0 represents F (K ) (t),
` and W`0 . These variables participate in con-
Theorem 4 (Jarzynski-like theorem for the TOC). The Th
scanned from left straints
to right. on sums of A
(K )
(.) values. OT
time-ordered correlator (136) obeys the Jarzynski-like
equality
The greater the K , the longer the distribution P (K )
2 D (WTOC + 0 WTOC E of which F (K ) (t) equals a moment. We define P (K ) in
0
)
FTOC (t) = e , (147) three steps: We recall the K -extended quasiprobability
0 0 (K )
, =0 A [Eq. (95)]. We introduce measurable random vari-
ables W` and W`00 . These variables participate in con-
wherein , 0 R. (K )
straints on sums of A (.) values.
Proof. The proof is analogous to the proof of Theorem 1 Let us evaluate Eq. (95) on particular arguments:
in [37].
A(K
)
(v1 , v1 ; w2 , w2 ; . . . ; vK , vK ; wK+1 , wK+1 )
= hwK+1 , wK+1 |U |vK , vK ihvK , vK |U |wK , wK i
E. Higher-order OTOCs as moments of longer
(summed) quasiprobabilities . . . hw2 , w2 |U |v1 , v1 ihv1 , v1 |U |wK+1 , wK+1 i .
(149)
Differentiating a characteristic function again and
(K )
again yields higher- and higher-point correlation func- One can infer A from the interferometry scheme
tions. So does differentiating P (W, W 0 ) again and again. in [37] and from weak measurements. Upon implement-
But each resulting correlator encodes just K = 3 time ing one batch of the interferometry trials, one can infer
reversals. Let K = 21 (K +1) = 2, 3, . . ., for K = 3, 5, . . . (K )
A for all K -values: One has measured all the inner
A K-fold OTOC has been defined [121, 122]: products ha|U|bi. Multiplying together arbitrarily many
inner products yields an arbitrarily high-K quasiproba-
(K )
F (K ) (t) := hW(t)V . . . W(t)V i Tr( W(t)V . . . W(t)V ) . bility. Having inferred some A , one need not perform
| {z } | {z } (K +2) (K )
2K 2K new experiments to infer A . To infer A from
(148) weak measurements, one first prepares . One performs
37
VI. OUTLOOK
(K )
The characteristic function of P is
We have characterized the quasiprobability A that
X X
G (K ) (s2 , . . . , sK+1 , s01 , . . . , s0K ) := lies behind the OTOC F (t). A , we have argued, is
W2 ,...,WK+1 W10 ,...,W 0
an extension of the Kirkwood-Dirac distribution used
K
in quantum optics. We have analyzed and simplified
P (K ) (W2 , W3 , . . . , WK+1 , W10 , W20 , . . . , WK0
) measurement protocols for A , calculated A numeri-
0 0 0 0
cally and on average over Brownian circuits, and inves-
is2 W2 isK+1 WK+1 is1 W1
e ... e e . . . eisk WK . tigated mathematical properties. This work redounds
(153) upon quantum chaos, quasiprobability theory, and weak-
measurement physics. As the OTOC equals a combina-
tion of A (.) values, A provides more-fundamental infor-
The s` and s0`0 variables are regarded as imaginary-
mation about scrambling. The OTOC motivates general-
temperature variables, to parallel the fluctuation-relation
izations of, and fundamental questions about, KD theory.
literature (e.g., [134]): is` ` , and is0`0 `0 0 . Dif-
The OTOC also suggests a new application of sequential
ferentiating G (K ) yields the K-fold OTOC. weak measurements.
At this intersection of fields lie many opportunities.
Theorem 5 (Jarzynski-like equality for the K-fold We classify the opportunities by the tools that inspired
OTOC). The K-fold OTOC obeys the Jarzynski-like them: experiments, calculations, and abstract theory.
equality
A. Experimental opportunities
(K) 2K
F (t) =
2 . . . K+1 10 . . . K 0
+ We expect the weak-measurement scheme for A and
*
KX+1 K
X F (t) to be realizable in the immediate future. Candi-
exp ` W ` + `0 0 W`00 . date platforms include superconducting qubits, trapped
0
ions, ultracold atoms, cavity QED, and perhaps NMR.
`=2 ` =1 ` ,`0 0 =0 `,`0
(154) Experimentalists have developed key tools required to
implement the protocol [1013, 3135, 50, 63].
Achievable control and dissipation must be compared
Proof. The proof proceeds in analogy with the proof of with the conditions needed to infer the OTOC. Errors
Theorem 1 in [37]. For clarity, we emphasize the analog might be mitigated with tools under investigation [114].
38
B. Opportunities motivated by calculations that, in some cases, the distribution over possible val-
ues of |hv2 , v2 |U |w3 , w3 i| peaks at 1d . But the dis-
Numerical simulations and analytical calculations tribution approaches this form before t . Also, the dis-
point to three opportunities. tributions width seems constant in d. Further study is
Physical models OTOC quasiprobabilities may be required. The overlap between OTOC and two QI def-
evaluated. The Sachdev-Ye-Kitaev model, for example, initions of scrambling have been explored already: (1)
scrambles quickly [29, 30]. The quasiprobabilitys func- When the OTOC is small, a tripartite information is
tional form may suggest new insights into chaos. Our negative [28]. (2) An OTOC-like function is propor-
Brownian-circuit calculation (Sec. IV), while a first step, tional to a frame potential that quantifies pseudorandom-
involves averages over unitaries. Summing quasiproba- ness [121]. The relationship between the OTOC and a
bilities can cause interference to dampen nonclassical be- third QI sense of incompatibilityMUBs and entropic
haviors [14]. Additionally, while unitary averages model uncertainty relationsmerits investigation.
chaotic evolution, explicit Hamiltonian evolution might Second, A effectively has four arguments, apart from
provide different insights. Explicit Hamiltonian evolution (Sec. V A). The KD quasiprobability has two. This
would also preclude the need to calculate higher moments doubling of indices parallels the Choi-Jamiolkowski (CJ)
of the quasiprobability. representation of quantum channels [117]. Hosur et al.
In some numerical plots, the real part <(A ) bifur- have, using the CJ representation, linked F (t) to the tri-
cates. These bifurcations resemble classical-chaos pitch- partite information [28]. The extended KD distribution
forks [118]. Classical-chaos plots bifurcate when a differ- might be linked to information-theoretic quantities simi-
ential equations equilibrium point branches into three. larly.
The OTOC quasiprobability A might be recast in terms Third, our P (W, W 0 ) and weak-measurement protocol
of equilibria. Such a recasting would strengthen the par- resemble analogs in [107, 108]. {See [109111] for frame-
allel between classical chaos and the OTOC. works similar to Solinas and Gasparinettis (S&Gs).}
Finally, the Brownian-circuit calculation has untied Yet [107, 108] concern quantum thermodynamics, not
threads. We calculated only the first moment of A . the OTOC. The similarity between the quasiprobabilities
Higher moments may encode physics less visible in F (t). in [107, 108] and those in [37], their weak-measurement
Also, evaluating certain components of A requires new protocol and ours, and the thermodynamic agendas
calculational tools. These tools merit development, then in [107, 108] and [37] suggest a connection between the
application to A . An example opportunity is discussed projects [105, 106]. The connection merits investigation
after Eq. (86). and might yield new insights. For instance, S&G cal-
culate the heat dissipated by an open quantum system
that absorbs work [107, Sec. IV]. OTOC theory focuses
C. Fundamental-theory opportunities on closed systems. Yet experimental systems are open.
Dissipation endangers measurements of F (t). Solinas and
Gasparinettis toolkit might facilitate predictions about,
Seven opportunities concern the mathematical proper-
and expose interesting physics in, open-system OTOCs.
ties and physical interpretations of A .
Fourth, W and W 0 suggest understudies for work in
The KD quasiprobability prompts the question, Is the
quantum thermodynamics. Thermodynamics sprouted
OTOC definition of maximal noncommutation consis-
during the 1800s, alongside steam engines and facto-
tent with the mutually-unbiased-bases definition? Re-
ries. How much work a system could outputhow much
call Sec. V C: n
We decomposed
o an operator 0 in terms
|aihf | orderly energy one could reliably drawheld practi-
of a set S = hf |ai of operators. In the KD- cal importance. Todays experimentalists draw energy
hf |ai6=0
quasiprobability literature, the bases Sa = {|ai} and from power plants. Quantifying work may be less critical
Sf = {|f i} tend to be mutually unbiased (MU): |hf |ai| = than it was 150 years ago. What can replace work in the
1 a, f . Let A and B denote operators that have todays growing incarnation of thermodynamics, quan-
d
MU eigenbases. Substituting A and B into an uncer- tum thermodynamics? Coherence relative to the energy
tainty relation maximizes the lower bound on an uncer- eigenbasis is being quantified [135, 136]. The OTOC sug-
tainty [130]. In this quantum-information (QI) sense, A gests alternatives: W and W 0 are random variables, anal-
and B noncommute maximally. ogous to work, natural to quantum-information scram-
bling. The potential roles of W and W 0 within quantum
In Sec. V C, Sa = {|v2 , v2 i}, and Sf = U |w3 , w3 i .
These Ss are eigenbases of V and W(t). When do thermodynamics merit exploration.
we expect these eigenbases to be MU, as in the KD- Fifth, relationships amongst three ideas were identified
quasiprobability literature? After the scrambling time recently:
t after F (t) decays to zerowhen W(t) and V non- (1) We have linked quasiprobabilities with the OTOC,
commute maximally in the OTOC sense. following [37].
The OTOC provides one definition of maximal non-
commutation. MUBs provide a QI definition. To what (2) Aleiner et al. [137] and Haehl et al. [138, 139] have
extent do these definitions overlap? Initial results show linked the OTOC with Schwinger-Keldysh path in-
39
The final expression is not obviously a probability. Consider inferring A(1/d) or A(1/d) from weak mea-
But suppose that shares its eigenbasis with V or surements. From one trial, we infer about four random
with W(t). Suppose, for example, that has the form in variables: v1 , w2 , v2 and w3 . Each variable equals 1.
Eq. (30). Equation (A2) simplifies: The quadruple (v1 , w2 , v2 , w3 ) therefore assumes one of
X sixteen possible values. These four base variables are
P (W ) = p(v2 , v2 ; w3 , w3 ) W (w3 v2 ) . (A3) multiplied to form the composite variables W and W 0 .
(v2 ,v2 ), The tuple (W, W 0 ) assumes one of four possible values.
(w3 ,w3 ) Every (W, W 0 ) value can be formed from each of four val-
ues of (v1 , w2 , v2 , w3 ). Table II lists the tuple-quadruple
The p(v2 , v2 ; w3 , w3 ) := |hw3 , w3 |U |v2 , v2 i|2 pv2 ,v2 correspondences.
denotes the joint probability that a V measurement of Consider any quadruple associated with (W, W 0 ) =
yields (v2 , v2 ) and, after a subsequent evolution under (1, 1), e.g., (1, 1, 1, 1). Consider swapping w2 with w3
U, a W measurement yields (w3 , w ). and swapping v1 with v2 . The result, e.g., (1, 1, 1, 1),
3
Every factor in Eq. (A3) is nonnegative. Summing over leads to (W, W 0 ) = (1, 1). This double swap amounts
W yields a sum over the arguments to a cyclic permutation of the quadruples elements. This
P of A (.). The latter permutation is equivalent to a cyclic permutation of the
sum equals one, by Property 6: W P (W ) = 1. Hence
P (W ) [0, 1]. Hence P (W ) behaves as a probability. argument of the (A6) trace. This permutation pre-
We can generalize Property 9 to arbitrary Gibbs states serves the traces value while transforming the trace into
= eH/T /Z, using the regulated quasiprobability (106). P (1, 1). The trace originally equaled P (1, 1). Hence
The regulated OTOC (104) equals a moment of the com- P (1, 1) = P (1, 1).
plex distribution
X
Preg (W, W 0 ) := (A4) Appendix B RETRODICTION ABOUT THE
(v1 ,v1 ),(w2 ,w2 ),(v2 ,v2 )(w3 ,w3 ) SYMMETRIZED COMPOSITE OBSERVABLE
:= i(K . . . A A . . . K)
Areg
(v1 , v1 ; w2 , w2 ; v2 , v2 ; w3 , w3 ) W (w3 v2 ) W 0 (w2 v1 ) .
The proof is analogous to the proof of Theorem 1 in [37]. Section V B concerns retrodiction about the sym-
W0 metrized observable := K . . . A + A . . . K. The prod-
PSumming over yields Preg (W ) :=
:= i(K . . . A
0
W 0 Preg (W, W ). We substitute in from Eq. (A4), then
uct K . . . A is symmetrized also in
for Areg from Eq. (106). We perform the sum over W 0 using K -extended
A . . . K). One can retrodict about ,
(K )
explicitly, then the sums over (w2 , w2 ) and (v1 , v1 ): KD quasiprobabilities A , similarly to in Theorem 1.
X The value most reasonably attributable retrodictively
Preg (W ) = |v2 , v i|2 W (w v ) .
|hw3 , w3 |U to the time-t0 value of is given by Eqs. (107), (108),
2 3 2
(v2 ,v2 ) and (110). The conditional quasiprobabilities on the
(w3 ,w3 ) right-hand sides of Eqs. (109) and (111) become
(A5)
=(hf 0 |kihk| . . . |aiha|0 |f 0 i)
This expression is real and nonnegative. Preg (W ) sums p (a, . . . , k, f |) = (B1)
hf 0 |0 |f 0 i
to one, as P (W ) does. Hence Preg (W ) [0, 1] acts as a
probability. and
0
Property 10 (Degeneracy of every P (W, W ) associated
=(hf 0 |aiha| . . . |kihk|0 |f 0 i)
with = 1/d and with eigenvalue-(1) operators W and p (k, . . . , a, f |) = . (B2)
V ). Let the eigenvalues of W and V be 1. For exam- hf 0 |0 |f 0 i
ple, let W and V be Pauli operators. Let = 1/d be the
infinite-temperature Gibbs state. The complex distribu- The extended KD distributions become
tion has the degeneracy P (1, 1) = P (1, 1).
A(K ) 0 0 0
, (, a, . . . , k, f ) = ihf |kihk| . . . |aiha| |f i (B3)
Property 10 follows from (1) Eq. (44) and (2) Prop-
erty 7 of A(1/d) . Item (2) can be replaced with the traces and
cyclicality. We reason as follows: P (W, W 0 ) is defined
in Eq. (32). Performing the sums over the degeneracies A(K ) 0 0
, (, k, . . . , a, f ) = ihf |aiha| . . . |kihk||f i . (B4)
yields A(1/d) . Substituting in from Eq. (44) yields
To prove this claim, we repeat the proof of Theorem 1
1 X requires
P (W, W ) = 0 W(t) V
Tr w v2 W(t) V until reaching Eq. (118). The definition of
w 2 v 1
d v ,w ,v ,w 3
that an i enter the argument of the first < and that a
1 2 2 3
i enter the argument of the second <. The identity
W (w3 v2 ) W 0 (w2 v1 ) . (A6) <(iz) = =(z), for z C, implies Eqs. (B1)(B4).
41
(W, W 0 ) (v1 , w2 , v2 , w3 )
(1, 1) (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1)
(1, 1) (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1)
(1, 1) (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1)
(1, 1) (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1), (1, 1, 1, 1)
TABLE II: Correspondence between tuples of composite variables and quadruples of base variables: From
each weak-measurement trial, one learns about a quadruple (v1 , w2 , v2 , w3 ). Suppose that the out-of-time-ordered-correlator
operators W and V have the eigenvalues w` , vm = 1. For example, suppose that W and V are Pauli operators. The
quadruples elements are combined into W := w3 v2 and W 0 := w2 v1 . Each (W, W 0 ) tuple can be formed from each of four
quadruples.
[53] J. Kelly et al., Nature 519, 66 (2015). [93] H. Margenau and R. N. Hill, Prog. Theor. Phys. 26,
[54] R. W. Heeres et al., ArXiv e-prints (2016), 1608.02430. 722 (1961).
[55] D. Rist`e et al., Nature Communications 6, 6983 (2015). [94] L. M. Johansen and A. Luis, Phys. Rev. A 70, 052115
[56] S. A. Gardiner, J. I. Cirac, and P. Zoller, Phys. Rev. (2004).
Lett. 79, 4790 (1997). [95] H. M. Wiseman, Phys. Rev. A 65, 032111 (2002).
[57] S. K. Choudhary, T. Konrad, and H. Uys, Phys. Rev. [96] A. J. Leggett and A. Garg, Phys. Rev. Lett. 54, 857
A 87, 012131 (2013). (1985).
[58] L. G. Lutterbach and L. Davidovich, Phys. Rev. Lett. [97] C. Emary, N. Lambert, and F. Nori, Reports on
78, 2547 (1997). Progress in Physics 77, 016001 (2014).
[59] S. Debnath et al., Nature 536, 63 (2016). [98] H. F. Hofmann, New Journal of Physics 14, 043031
[60] T. Monz et al., Science 351, 1068 (2016). (2012).
[61] N. M. Linke et al., ArXiv e-prints (2016), 1611.06946. [99] J. Dressel and A. N. Jordan, Phys. Rev. A 85, 012107
[62] N. M. Linke et al., ArXiv e-prints (2017), 1702.01852. (2012).
[63] A. Browaeys, D. Barredo, and T. Lahaye, Journal of [100] H. F. Hofmann, Phys. Rev. A 89, 042115 (2014).
Physics B: Atomic, Molecular and Optical Physics 49, [101] H. F. Hofmann, New Journal of Physics 16, 063056
152001 (2016). (2014).
[64] C. Guerlin et al., Nature (London)448, 889 (2007), [102] J. Z. Salvail et al., Nat Photon 7, 316 (2013).
0707.3880. [103] M. Malik et al., Nat Commun 5 (2014), Article.
[65] K. W. Murch, S. J. Weber, C. Macklin, and I. Siddiqi, [104] G. A. Howland, D. J. Lum, and J. C. Howell, Opt.
Nature (London)502, 211 (2013), 1305.7270. Express 22, 18870 (2014).
[66] L. Xiao and J. A. Jones, Physics Letters A 359, 424 [105] J. Cotler, private communication, 2016.
(2006). [106] P. Solinas, private communication, 2016.
[67] D. Lu, A. Brodutch, J. Li, H. Li, and R. Laflamme, New [107] P. Solinas and S. Gasparinetti, Phys. Rev. E 92, 042150
Journal of Physics 16, 053015 (2014). (2015).
[68] W. Brown and O. Fawzi, ArXiv e-prints (2012), [108] P. Solinas and S. Gasparinetti, Phys. Rev. A 94, 052103
1210.6644. (2016).
[69] P. Hayden and J. Preskill, Journal of High Energy [109] J. J. Alonso, E. Lutz, and A. Romito, Phys. Rev. Lett.
Physics 2007, 120 (2007). 116, 080403 (2016).
[70] Y. Sekino and L. Susskind, Journal of High Energy [110] H. J. D. Miller and J. Anders, ArXiv e-prints (2016),
Physics 2008, 065 (2008). 1610.04285.
[71] N. Lashkari, D. Stanford, M. Hastings, T. Osborne, and [111] C. Elouard, D. A. Herrera-Mart, M. Clusel, and
P. Hayden, Journal of High Energy Physics 2013, 22 A. Auff`eves, npj Quantum Information 3, 9 (2017).
(2013). [112] M. A. Nielsen and I. L. Chuang, Quantum Computa-
[72] Y. Aharonov, D. Z. Albert, and L. Vaidman, Phys. Rev. tion and Quantum Information (Cambridge University
Lett. 60, 1351 (1988). Press, 2010).
[73] L. M. Johansen, Phys. Lett. A 329, 184 (2004). [113] J. von Neumann, Mathematische Grundlagen der Quan-
[74] M. J. W. Hall, Phys. Rev. A 64, 052103 (2001). tenmechanik (Springer, Berlin, 1932).
[75] M. J. W. Hall, Phys. Rev. A 69, 052113 (2004). [114] B. Swingle, G. Bentsen, M. Schleier-Smith, N. Yao, and
[76] L. D. Landau and E. M. Lifshitz, Statistical Physics N. Yunger Halpern, Resilience of the scrambling inter-
(Pergamon Press, Oxford, England, 1980). ferometer, in prep.
[77] J. Banerji, Contemporary Physics 48, 157 (2007). [115] IBM, The quantum experience.
[78] A. Rihaczek, IEEE Transactions on Information Theory [116] J. Dressel, T. A. Brun, and A. N. Korotkov, Phys. Rev.
14, 369 (1968). A 90, 032302 (2014).
[79] L. Cohen, Proceedings of the IEEE 77, 941 (1989). [117] J. Preskill, Quantum computation: Ch. 3: Foundations
[80] N. W. M. Ritchie, J. G. Story, and R. G. Hulet, Phys. of quantum theory ii: Measurement and evolution, Lec-
Rev. Lett. 66, 1107 (1991). ture notes, 2015.
[81] G. J. Pryde, J. L. OBrien, A. G. White, T. C. Ralph, [118] S. Strogatz, Non-linear Dynamics and Chaos: With
and H. M. Wiseman, Phys. Rev. Lett. 94, 220405 applications to Physics, Biology, Chemistry and Engi-
(2005). neering (Perseus Books, 2000).
[82] J. Dressel, C. J. Broadbent, J. C. Howell, and A. N. [119] L. DAlessio, Y. Kafri, A. Polkovnikov, and
Jordan, Phys. Rev. Lett. 106, 040402 (2011). M. Rigol, Advances in Physics 65, 239 (2016),
[83] M. F. Pusey, Phys. Rev. Lett. 113, 200401 (2014). http://dx.doi.org/10.1080/00018732.2016.1198134.
[84] M. Waegell et al., ArXiv e-prints (2016), 1609.06046. [120] C. Gogolin and J. Eisert, Reports on Progress in Physics
[85] C. Ferrie and J. Combes, Phys. Rev. Lett. 113, 120404 79, 056001 (2016).
(2014). [121] D. A. Roberts and B. Yoshida, ArXiv e-prints (2016),
[86] L. Vaidman, ArXiv e-prints (2014), 1409.5386. 1610.04903.
[87] E. Cohen, ArXiv e-prints (2014), 1409.8555. [122] F. M. Haehl, R. Loganayagam, P. Narayan, and
[88] Y. Aharonov and D. Rohrlich, ArXiv e-prints (2014), M. Rangamani, ArXiv e-prints (2017), 1701.02820.
1410.0381. [123] V. Manko and R. V. Mendes, Physica D: Nonlinear
[89] D. Sokolovski, ArXiv e-prints (2014), 1410.0570. Phenomena 145, 330 (2000).
[90] A. Brodutch, Phys. Rev. Lett. 114, 118901 (2015). [124] A. Bednorz, C. Bruder, B. Reulet, and W. Belzig, Phys.
[91] C. Ferrie and J. Combes, Phys. Rev. Lett. 114, 118902 Rev. Lett. 110, 250404 (2013).
(2015). [125] D. Oehri, A. V. Lebedev, G. B. Lesovik, and G. Blatter,
[92] Y. P. Terletsky, JETP 7, 1290 (1937). Phys. Rev. B 93, 045308 (2016).
43
[126] P. P. Hofer, ArXiv e-prints (2017), 1702.00998. [134] P. Talkner, E. Lutz, and P. H anggi, Phys. Rev. E 75,
[127] J. Lee and I. Tsutsui, ArXiv e-prints (2017), 1703.06068. 050102 (2007).
[128] J. J. Halliwell, Phys. Rev. A 93, 022123 (2016). [135] M. Lostaglio, D. Jennings, and T. Rudolph, Nature
[129] T. Durt, B.-G. Englert, I. Bengtsson, and Communications 6, 6383 (2015), Article.
K. Zyczkowski, ArXiv e-prints (2010), 1004.3348. [136] V. Narasimhachar and G. Gour, Nature Communica-
[130] P. J. Coles, M. Berta, M. Tomamichel, and S. Wehner, tions 6, 7689 EP (2015).
ArXiv e-prints (2015), 1511.04857. [137] I. L. Aleiner, L. Faoro, and L. B. Ioffe, Annals of Physics
[131] M. Paris and J. Rehacek, editors, Quantum State Es- 375, 378 (2016).
timation, Lecture Notes in Physics Vol. 649 (Springer, [138] F. M. Haehl, R. Loganayagam, and M. Rangamani,
Berlin, Heidelberg, 2004). ArXiv e-prints (2016), 1610.01940.
[132] N. Moiseyev, Non-Hermitian Quantum Mechanics [139] F. M. Haehl, R. Loganayagam, and M. Rangamani,
(Cambridge UP, 2011). ArXiv e-prints (2016), 1610.01941.
[133] S. An et al., Nature Physics 11, 193 (2015).