Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences
Published:https://doi.org/10.1098/rspa.2015.0435

    Abstract

    We address the problem of properly quantifying information in quantum theory. Brukner and Zeilinger proposed the concept of an operationally invariant measure based on measurement statistics. Their measure of information is calculated with probabilities generated in a complete set of mutually complementary observations. This approach was later criticized for several reasons. We show that some critical points can be overcome by means of natural extension or reformulation of the Brukner–Zeilinger approach. In particular, this approach is connected with symmetric informationally complete measurements. The ‘total information’ of Brukner and Zeilinger can further be treated in the context of mutually unbiased measurements as well as general symmetric informationally complete measurements. The Brukner–Zeilinger measure of information is also examined in the case of detection inefficiencies. It is shown to be decreasing under the action of bistochastic maps. The Brukner–Zeilinger total information can be used for estimating the map norm of quantum operations.

    1. Introduction

    Quantum information science has currently made impressive advances in both theory and practice [1]. Feynman emphasized that quantum systems are very hard to be simulated at the classical level [2]. On the other hand, such a negative claim also inspires a positive reason for trying to build quantum computers [3]. Quantum key distribution has provided a long-term technological solution already implemented in a lot of commercial products [4,5]. Quantum algorithms allow to solve efficiently a number of important problems, which are currently intractable [6,7]. Developments in quantum information processing stimulated a renewed interest to foundations of quantum mechanics. This subject is a thriving, lively and controversial field of research [8,9]. Currently, conceptual questions are often reformulated in information-theoretic terms. Actually, results of a quantum measurement are finally recorded in some row of statistical data. Hence, we have come across a problem to quantify an amount of information that could be extracted from such data.

    The problem of determining quantum state quite differs from the classical formulation. There are many possible scenarios to be imagined. Attacking a system of quantum key distribution, Eve is typically bused with discriminating between two or more alternatives known to her a priori. During an individual attack, she captures only a single information carrier. An opposite situation deals with a very large ensemble of identical copies. In practice, a number of copies is never infinite though large. Our experience leads to the following conclusion. The proportion of times that the given outcome occurs settles down to some value as the number of trials becomes larger and larger. The ultimate value of this proportion is meant as the probability of the given outcome. Dealing with quantum systems, the observer can take different experiments, which might even completely exclude each other. For example, the state of a spin-12 system is often considered to be estimated with measurements of the three orthogonal components of spin [10]. In more than two dimensions, such complementary measurements are formulated in terms of the so-called mutually unbiased bases (MUBs). This concept was actually considered by Schwinger [11].

    To approach properly an informational measure, Brukner & Zeilinger considered the following situation [10]. Suppose that we know probabilities of all outcomes and try to guess a number of occurrences of the prescribed outcome among finite experimental trials. Of course, our prediction will allow an amount of uncertainty, which can be estimated with respect to some confidence interval. Taking an uncertainty per single trial and summing it for all outcomes, one naturally leads to a measure of uncertainty in one experiment [10]. It is shown to be 1 minus the sum of squared probabilities. Hence, Brukner and Zeilinger defined a measure of information in one experiment and in a set experiments. For d+1 MUBs, the corresponding total information turned to be operationally invariant in the following sense [10]. The sum of the individual measures of information for mutually complementary observations is invariant with respect to a choice of the particular set of complementary observations. In other words, this sum is invariant under unitary rotations of the measured state. The latter implies that there is no information flow between the system of interest and its environment [10].

    MUBs are an interesting mathematical object as well as an important tool in many physical issues [12]. Such bases can be used in quantum key distribution, state reconstruction, quantum error correction, detection of quantum entanglement and other topics. MUBs are connected with symmetric informationally complete measurements. A positive operator-valued measure (POVM) is said to be informationally complete, if its statistics determine completely the quantum state [13,14]. To increase an efficiency at determining the state, elements of such a measurement should have rank one. An informationally complete POVM is called symmetric, when all pairwise inner products between the POVM elements are equal [15]. In general, the maximal number of MUBs in d dimensions is still an open question [12]. When d is a prime power, the answer d+1 is known [12]. Constructions of d+1 MUBs for such d rely on properties of prime powers and on an underlying finite field [16]. It also seems to be hard to get a unified way for building a symmetric informationally complete POVM (SIC-POVM) in all dimensions.

    The authors of Kalev & Gour [17] introduced the concept of mutually unbiased measurements (MUMs). The core idea is that elements of such a measurement are not rank one. This method does not reach the maximal efficiency but is easy to construct. It turns out that a complete set of d+1 MUMs can be built explicitly for arbitrary finite d [17]. An utility of such measurements in quantum information science deserves further investigations. It is also unknown whether rank-one SIC-POVMs exist in all finite dimensions. The positive answer was obtained with a weaker condition that POVM elements are not rank one. The authors of Gour & Kalev [18] proved the existence of general SIC-POVMs in all finite dimensions. It is not insignificant that general SIC-POVMs can be constructed within a unified approach. Studies of MUMs and general SIC-POVMs were continued in [1922]. We will show that these measurements are interesting in the context of the Brukner–Zeilinger approach [10,2325]. This approach to quantifying an amount of information will be shown to be realizable within three additional types of quantum measurements.

    The paper is organized as follows. In §2, preliminary material is reviewed. In particular, we recall the definitions of MUMs and general SIC-POVMs. Section 3 is devoted to a general discussion of the Brukner–Zeilinger approach to quantification of information in quantum measurements. Its treatment in terms of Tsallis' entropies of degree 2 is mentioned. In §4, we show that an operationally invariant measure of information can be approached within the three measurement schemes. They are, respectively, based on a single SIC-POVM, on a set of d+1 MUMs, and on a general SIC-POVM. These measurement schemes give an alternative to d+1 MUBs known only for prime power dimensions. In §5, the Brukner–Zeilinger approach is examined for the case of detection inefficiencies, when the ‘no-click’ events are allowed. In §6, we show that the Brukner–Zeilinger total information cannot increase under the action of bistochastic maps. Relations between the Brukner–Zeilinger approach and non-unitality are examined in §7. In §8, we conclude the paper with a summary of results.

    2. Preliminaries

    In this section, we review the required material on MUMs and general SIC-POVMs. Let L(Hd) be the space of linear operators on d-dimensional Hilbert space Hd. By L+(Hd), we denote the set of positive semi-definite operators on Hd. By Ls.a.(Hd), we mean the d2-dimensional real space of Hermitian operators on Hd. A state of d-level system is represented by density operator ρL+(Hd) normalized as tr(ρ)=1. For operators X,YL(Hd), their Hilbert–Schmidt inner product is defined by Watrous [26]

    X,YHS:=tr(XY).2.1
    Quantum measurements are commonly dealt in terms of the POVM formalism [27]. We consider a set of elements AjL+(Hd) such that the completeness relation holds, namely
    jAj=1d.2.2
    Here, the 1d denotes the identity operator on Hd. The set A={Aj} is called a POVM. For pre-measurement state ρ, the probability of jth outcome is written as [27]
    pj(A|ρ)=tr(Ajρ).2.3
    It is of key importance that the number of different outcomes can be more than the dimensionality of Hd [27]. Of course, in practice, POVM measurements involve auxiliary systems, so that degrees of freedom are actually added.

    Let B(1)={|bj(1)} and B(2)={|bk(2)} be two orthonormal bases in Hd. They are mutually unbiased if and only if for all j and k,

    |bj(1)|bk(2)|=1d.2.4
    The set B={B(1),,B(L)} is formed by MUBs, when each two bases from this set are mutually unbiased. The measurement in one basis cannot give anything about the state, which was prepared in another basis. This property is essential in some schemes of quantum key distribution.

    Let us recall symmetric informationally complete measurements. In d-dimensional Hilbert space, we consider a set of d2 rank-one operators of the form

    Nj=1d|ϕjϕj|.2.5
    If the normalized vectors |ϕj〉 all satisfy the condition
    |ϕj|ϕk|2=1d+1(jk),2.6
    the set N={Nj} is a symmetric informationally complete POVM (SIC-POVM) [15]. It was conjectured that SIC-POVMs exist in all dimensions [28]. The existence of SIC-POVMs has been shown analytically or numerically for all dimensions up to 67 [29]. For a discussion of connections between MUBs and SIC-POVMs (see [30] and references therein). Weyl–Heisenberg (WH) covariant SIC-sets of states in prime dimensions are examined in [31]. WH SIC-sets, whenever they exist, consist solely of minimum uncertainty states with respect to Rényi's 2-entropy for a complete set of MUBs [31]. The authors of Dall'Arno et al. [32] derived bounds on accessible information and informational power for the case of SIC-sets of quantum states. In general, informationally complete sets of positive matrices are discussed in the book [33]. The authors of Klappenecker et al. [34] discussed approximate versions of a SIC-POVM, when a small deviation from uniformity of the inner products is allowed.

    Basic constructions of MUBs concern the case, when d is a prime power. If d is another composite number, maximal sets of MUBs are an open problem [12]. We can try to approach ‘unbiasedness’ with weaker conditions. The authors of Kalev & Gour [17] proposed the concept of MUMs. They consider two POVM measurements P={Pj} and Q={Qk}. Each of them contains d elements such that

    tr(Pj)=tr(Qk)=12.7
    and
    tr(PjQk)=1d.2.8
    Thus, the POVM elements are all of trace one, but now not of rank one. The formula (2.8) replaces (2.4). The Hilbert–Schmidt product of two elements from the same POVM depends on the so-called efficiency parameter ϰ [17]. It holds that
    tr(PjPk)=δjkϰ+(1δjk)1ϰd1,2.9
    and similarly for the elements of Q. The efficiency parameter obeys [17]
    1d<ϰ1.2.10
    For ϰ=1/d, we have the trivial case, in which Pj=1d/d for all j. The value ϰ=1, when possible, leads to the standard case of MUBs. More precise bounds on ϰ will depend on a construction of measurement operators. The efficiency parameter shows how close the measurement operators are to rank-one projectors [17]. For the given ϰ, we take the set P={P(1),,P(L)} of POVMs satisfying (2.9). When each two POVMs also obey conditions of the forms (2.7) and (2.8), the set P is a set of MUMs. Allowing ϰ1, the authors of Kalev & Gour [17] built d+1 MUMs in d-dimensional Hilbert space for arbitrary d. Their construction is based on the generators of SU(d). For the given d, the parameter ϰ ranges in the interval, which is determined by the smallest or largest eigenvalues of some traceless operators. In this regard, we cannot fix ϰ without specifying d. Of course, the efficiency parameter should approach 1 as close as possible.

    Similar ideas can be used in building general SIC-POVMs. For all finite d, a common construction has been given [18]. Consider a POVM with d2 elements Mj, which satisfy the following two conditions. First, for all j=1,…,d2, one has

    tr(MjMj)=a.2.11
    Second, the pairwise inner products are all symmetrical, namely
    tr(MjMk)=b(jk).2.12
    Then the operators Mj form a general SIC-POVM. Combining the conditions (2.11) and (2.12) with the completeness relation finally gives [18]
    b=1add(d21).2.13
    We also get tr(Mj)=1/d for all j=1,…,d2. Therefore, the value a is the only parameter that characterizes the type of a general SIC-POVM. This parameter is restricted as [18]
    1d3<a1d2.2.14
    The value a=1/d3 corresponds to the case Mj=1d/d2, which does not give an informationally complete POVM. The value a=1/d2 is achieved, when the POVM elements are all rank-one [18]. The latter is actually the case of usual SIC-POVMs, when POVM elements are represented in terms of the corresponding unit vectors as (2.5). Even if SIC-POVMs exist in all dimensions, they are rather hard to construct. Similarly to usual SIC-POVMs, general SIC-POVMs have a specific structure that makes them appropriate in determining an informational content of a quantum state.

    In §5, we will use monotonicity of the relative entropy under the action of trace-preserving completely positive (TPCP) maps. So, we recall some required material. Let us consider a linear map

    Φ:L(Hd)L(Hm).2.15
    To describe physical processes, linear maps have to be completely positive [35,36]. Let idn be the identity map on L(Hn), where the n-dimensional space Hn is assigned to a reference system. The complete positivity implies that the map Φidn is positive for all n. Completely positive maps are often called quantum operations [35]. Each completely positive map can be represented in the form [26,35]
    Φ(X)=iKiXKi.2.16
    Here, the Kraus operators Ki map the input space Hd to the output space Hm. The map preserves the trace, when the Kraus operators satisfy
    iKiKi=1d.2.17
    Trace-preserving quantum operations are usually referred to as quantum channels. Applying to a POVM measurement, the formula (2.17) merely gives the completeness relation [35].

    3. On definition of the Brukner–Zeilinger information

    Quantum theory can shortly be characterized as a formal scheme for representing states together with rules for computing the probabilities of different outcomes of an experiment [27]. In this regard, the notion of quantum state is rather a list of the statistical properties of an ensemble of identically prepared systems. In a series of papers [10,2325], Brukner and Zeilinger considered the question of informational content of an unknown quantum state. To quantify the amount of information, a prospective measure should have some natural properties. These properties are also connected with a proper choice of individual experiments or rather a set of experiments. Choosing experiments, the observer can actually manage different kinds of information that will manifest themselves, although the total amount of information is apparently limited [8].

    Let us consider an experiment, in which a non-degenerate d-dimensional observable is measured. This test is actually connected with the corresponding basis B={|bj}. As a rule, the observer has only a limited number of systems to work with. Keeping the probability distribution pj(B|ρ)=bj|ρ|bj, the observer try to guess how many times a specific outcome will occur. In such situation, the number of occurrences of some outcome in future repetitions cannot be expected precisely [10]. The authors of Brukner & Zeilinger [10] suggested to characterize the experimenter's uncertainty by the quantity

    UBZ(B|ρ):=1j=1dpj(B|ρ)2.3.1
    This approach is motivated with considering mean-square-deviation for uncertainty in the number of occurrences. It will be convenient to introduce the index of coincidence
    C(B|ρ):=j=1dpj(B|ρ)2.3.2
    We then have UBZ(B|ρ)=1C(B|ρ). The case of complete lack of information in an experiment corresponds to the uniform distribution. Hence, Brukner & Zeilinger [10,23] proposed to define the measure of information as
    IBZ(B|ρ):=j=1d(pj(B|ρ)1d)2.3.3
    In principle, the right-hand side of (3.3) could be rescaled by appropriate normalization factor [10]. The latter is chosen with respect to the context. Since the uniform distribution is obtained with the completely mixed state ρ=1d/d, we can rewrite the Brukner–Zeilinger information as
    IBZ(B|ρ)=C(B|ρ)C(B|ρ).3.4
    As we will see, this form is useful in studies of the case with detection inefficiencies. Here, the uniform distribution is not a good reference point for comparison. It is also convenient for generalizing the approach to POVM measurements. Indeed, for a POVM measurement, the number of outcomes typically exceeds dimensionality [27].

    When the observer have many copies of the same quantum state, he will rather tend to measure the state in several mutually complementary bases. For example, the state of spin-12 could be measured along one of three orthogonal axes. The authors of Brukner & Zeilinger [10,23] defined the total information content by summarizing the measures (3.3) for all complementary tests. Suppose that we have the set B of d+1 MUBs in d-dimensional space. For any density matrix ρL+(Hd), one then gives [37,38]

    BBC(B|ρ)=1+tr(ρ2).3.5
    Thus, the sum of indices of coincidence is determined by the quantity tr(ρ2) usually called purity [36]. Then the total information is represented as
    BBIBZ(B|ρ)=tr(ρ2)tr(ρ2)=tr(ρ2)1d.3.6
    It must be stressed here that this quantity is invariant under unitary transformations of ρ. When we have a set BL={B(1),,B(L)} of L MUBs, there holds [39]
    BBLC(B|ρ)L1d+tr(ρ2).3.7
    For the case L<d+1, the sum of L indices of coincidence cannot be determined in terms of purity solely. Hence, we can only write the inequality
    BBLIBZ(B|ρ)tr(ρ2)tr(ρ2).3.8
    The left-hand side of (3.8) is generally changed under unitary transformations of ρ.

    The question about invariance or non-invariance under unitary transformations can be illustrated with the three spin-12 measurements along mutually orthogonal axes [23]. For one and the same spin state, the three coordinate axes may be oriented arbitrarily. Here, the total information (3.6) does not depend on such an orientation. Indeed, any axes rotation can be reformulated as a unitary transformation of the given state. The eigenbases of the three Pauli observables are mutually unbiased, whence the total information (3.6) is invariant under unitary transformations of the state.

    The Shannon entropy is one of the basic notions of information theory. If a measurement is described by the probabilities pj(B|ρ), then the Shannon entropy is written as

    H1(B|ρ):=j=1dpj(B|ρ)lnpj(B|ρ).3.9
    Summing the Shannon measures for all the bases, we obtain some total characteristic. It turned out that such total characteristic is generally not invariant under unitary transformations. The authors of Brukner & Zeilinger [23] clearly exemplified this fact with the three spin-12 measurements along orthogonal axes. As a total measure of informational character, the sum of three Shannon entropies has several counterintuitive properties [23]. First, it can be different for states of the same purity. Second, it changes in time even for a completely isolated system. Third, it depends on particular details of an experimental set-up. Even in two dimensions, therefore, the mentioned approach to quantifying information in quantum measurements seems to be inappropriate.

    Thus, the sum of the Shannon entropies of generated probability distributions is generally not invariant even for the case, when d+1 MUBs exist. In contrast, the total information (3.6) is constant here. Note that the Brukner–Zeilinger information can be interpreted in entropic terms. For 0<α≠1, the Tsallis α-entropy of generated probability distribution pj(B|ρ) is defined by

    Hα(B|ρ):=11α(j=1d{pj(B|ρ)}α1).3.10
    This entropy is widely used in non-extensive statistical mechanics due to Tsallis [40]. For α=2, the corresponding Tsallis entropy is connected with the index of coincidence as
    H2(B|ρ)=1C(B|ρ).3.11
    Hence, we represent the Brukner–Zeilinger information as
    IBZ(B|ρ)=H2(B|ρ)H2(B|ρ).3.12
    Thus, the Brukner–Zeilinger measure shows a reduction in the uncertainty due to a deviation of the density matrix from the completely mixed one. However, the uncertainty is quantified by the Tsallis entropy of degree α=2.

    4. Three schemes with special types of quantum measurements

    In this section, we will discuss use of the Brukner–Zeilinger approach with a SIC-POVM, with a complete set of MUMs, and with a general SIC-POVM. In each of these cases, we finally obtain an information measure operationally invariant in the terminology of Brukner & Zeilinger [10]. To apply the result (3.5), we have to perform d+1 projective measurements, if the required MUBs all exist. So, it is interesting to examine the Brukner–Zeilinger total information with other quantum measurements. For a POVM A={Aj}, we define

    IBZ(A|ρ)=C(A|ρ)C(A|ρ),4.1
    where C(A|ρ) is the sum of all squared probabilities of the form (2.3). The definition (4.1) is a natural generalization of the formula (3.4).

    We first mention that a single POVM measurement is sufficient for our purposes. Suppose that N={Nj} is a symmetric informationally complete POVM in d dimensions. As was shown in [41], the corresponding index of coincidence is equal to

    C(N|ρ)=j=1d2pj(N|ρ)2=tr(ρ2)+1d(d+1).4.2
    That is, for a SIC-POVM, the index of coincidence is expressed in terms of purity of the given density matrix. For the completely mixed state, we have
    C(N|ρ)=tr(ρ2)+1d(d+1)=1d2.4.3
    For a SIC-POVM, the Brukner–Zeilinger information is represented as
    IBZ(N|ρ)=tr(ρ2)tr(ρ2)d(d+1).4.4
    This quantity is merely obtained by dividing the total information (3.6) by d(d+1). In this regard, the quantity (4.4) can also be treated as a measure of total information. It is important since SIC-POVMs could exist for those values of d, for which d+1 MUBs do not exist. Say, for MUBs, we do not know the answer already for d=6, whereas the existence of SIC-POVMs has been shown for d≤67 [29]. Of course, any SIC-POVM is more complicated for implementation than a single projective measurement. However, we need d+1 projective measurements for calculating (3.6). Even if d+1 MUBs exist, the scheme with them may require more costs than the scheme based on a single SIC-POVM. In this respect, the result (4.4) is also significant. At the same time, constructions of SIC-POVMs for sufficiently larger d may rather be complicated. We will further see that the Brukner–Zeilinger concept of total information can be developed with d+1 MUMs and with a general SIC-POVM. These types of measurement are interesting in the sense that each of them allows a unified theoretical description.

    For arbitrary d, we can built a set of d+1 MUMs of some efficiency ϰ<1 [17]. We shall now consider the Brukner–Zeilinger approach with such measurements. Let P be a set d+1 MUMs of the efficiency ϰ in d-dimensional space. As was shown in [21,22], we then have

    PPC(P|ρ)=1+1ϰ+(ϰd1)tr(ρ2)d1.4.5
    For pure states, the right-hand side of (4.5) reads 1+ϰ. The latter was obtained in [17]. With a set PL={P(1),,P(L)} of L MUMs, we can only write the inequality [22]
    PPLC(P|ρ)L1d+1ϰ+(ϰd1)tr(ρ2)d1.4.6
    Due to (4.5), we have arrived at a conclusion. For the complete set P of d+1 MUMs of the efficiency ϰ and any density matrix ρL+(Hd), one gives
    PPIBZ(P|ρ)=ϰd1d1[tr(ρ2)tr(ρ2)].4.7
    The right-hand side of (4.7) increases proportionally to the efficiency parameter ϰ. At the prescribed efficiency, the sum of the Brukner–Zeilinger information measures is determined by purity solely. For ϰ=1, the result (4.7) is reduced to (3.6). The latter, however, depends on the existence of a complete set of mutually complementary observables. Among other critical points, this fact was mentioned in [42]. On the other hand, a set of d+1 MUMs with some ϰ<1 has been constructed for arbitrary d [17]. Except for ϰ=1, MUMs are not projective. Together, a set of d+1 MUMs involves d(d+1) POVM elements. The scheme with a general SIC-POVM seems to be more effective, since it involves only d2 POVM elements.

    Let us proceed to the case of general SIC-POVMs. It is interesting, since general SIC-POVMs can be built within a scheme common for all d [18]. On the contrary, a unified approach to constructing SIC-POVMs with rank-one elements hardly exists. Moreover, the existence of usual SIC-POVMs for all d is plausible but still not proved. For a general SIC-POVM M, we have [20]

    C(M|ρ)=j=1d2pj(M|ρ)2=(ad31)tr(ρ2)+d(1ad)d(d21).4.8
    where the parameter a characterizes this POVM. Due to (4.8), for any density matrix ρL+(Hd), we then get
    IBZ(M|ρ)=ad31d(d21)[tr(ρ2)tr(ρ2)].4.9
    This quantity expresses the total information associated with the general SIC-POVM M. For a=1/d2, the result (4.9) is naturally reduced to (4.4). Thus, the Brukner–Zeilinger approach to quantifying total information of the given quantum state can be realized, at least in principle, with MUMs as well as with a general SIC-POVM.

    In this section, we have shown that the Brukner–Zeilinger concept of total information can be realized within the three measurement schemes. They are, respectively, based on a single SIC-POVM, on a set of d+1 MUMs, and on a general SIC-POVM. We are sure of the existence of the complete set of MUBs only for specific values of the dimensionality. We can also recall that even the case d=6 is still not understood. For this reason, an alternative realization of the Brukner–Zeilinger approach is certainly interesting. On the other hand, implementation of such experimental schemes may not be easy due to very special structure of measurement operators. So, the developed approach should take into account a role of detection inefficiencies. In this regard, the authors of Shafiee et al. [43] criticized the Brukner–Zeilinger approach. In the next section, we examine the question in more details.

    5. Formulation for measurements with detection inefficiencies

    In practice, measurement devices inevitably suffer from losses. The authors of Shafiee et al. [43] considered the Brukner–Zeilinger approach in the case of non-zero probability of the no-click event. For definiteness, we first describe this case for complementary measurements in MUBs. Let the parameter η∈[0;1] characterize a detector efficiency. The no-click event is presented by additional outcome . Assume that for any basis B, the inefficiency-free distribution {pj(B|ρ)} is altered as

    pj(η)(B|ρ)=ηpj(B|ρ)andp(η)(B|ρ)=1η.5.1
    In other words, we mean detectors of the same efficiency for all of the used MUBs. This assumption seems to be physically natural and has been adopted in [43]. In essence, the above formulation coincides with the first model of detection inefficiencies applied in [44]. On the other hand, the authors of Chaves & Fritz [44] focus on measurements in cycle scenarios of the Bell type.

    It was noted that the Brukner–Zeilinger approach may have some doubts in application to more realistic models of the experiment. In principle, we could expect that the total information should vanish with negligible η. At a glance, however, one comes across an opposite situation. The authors of Shafiee et al. [43] illustrated this conclusion with the three spin-12 measurements along orthogonal axes. They calculated the sum of three quantities of the form (3.3) for different η∈[0;1] and found the following. First, the minimum of the sum is reached at some intermediate value of η>0. Second, for η→0+, the sum becomes even larger than for the inefficiency-free case η=1. Such results gave a ground for criticizing the Brukner–Zeilinger approach [43].

    In our opinion, these doubts may be overcome with a proper modification of the form (3.3). Here, we compare obtained probability distributions with the uniform one. However, such a comparison is meaningful only in the inefficiency-free case η=1. In the distribution (5.1), one of probabilities depends on detectors solely. As its value is 1−η, the uniform distribution does not have actual bearing for the case η<1. Instead, we propose to compare the actual probability distribution with the distribution obtained with the completely mixed input. It can be reached by replacing (3.3) with (3.4). More precisely, for the case of detection inefficiencies, we use the quantity

    IBZ(η)(B|ρ)=C(η)(B|ρ)C(η)(B|ρ)=H2(η)(B|ρ)H2(η)(B|ρ).5.2
    The superscripts emphasize here that the information measures are all calculated with actual ‘distorted’ probabilities. Apparently, preparing the completely mixed state is not difficult. For the existing experimental set-up, therefore, statistics with the completely mixed input can be observed and stored. Stored data will be used in future for applications of the definition (5.2). Thus, we shall consider a more realistic case of detection inefficiencies on the base of (5.2). It was shown in [45] that for all α>0, we have
    Hα(η)(B|ρ)=ηαHα(B|ρ)+hα(η),5.3
    where Hα(η)(B|ρ) is the α-entropy of ‘distorted’ distribution (5.1). Of course, the binary entropy hα(η) is written as
    hα(η)=11α(ηα+(1η)α1).5.4
    For α=1, results of the form (5.3) were applied in studying entropic Bell inequalities with detector inefficiencies [44]. We will also assume that for a POVM A={Aj}, the inefficiency-free probabilities pj(A|ρ) are actually altered similarly to (5.1). For α=2, we then have
    H2(η)(A|ρ)=η2H2(A|ρ)+h2(η).5.5
    The left-hand side of (5.5) is the entropy calculated with actual measurement statistics.

    We can now reformulate the results (3.6), (4.4), (4.7) and (4.9) in the case of detection inefficiencies. It is for this reason that we modified definition of the Brukner–Zeilinger information according to (5.2). That is, the terms with ρ* also take into account an influence of no-click events. Combining (3.6) with (5.3) for α=2, we have arrived at a conclusion. When d+1 MUBs exist and form the set B, the total information with actually observed statistics is equal to

    BBIBZ(η)(B|ρ)=η2[tr(ρ2)tr(ρ2)].5.6
    When the parameter η decreases, the total information also decreases proportionally to the square of η. With a negligible efficiency of detection, no information about the system could be obtained. This very natural picture motivates the proposed definition (5.2).

    Using the described model of inefficiencies, we further obtain the following relations. If a POVM N is symmetric informationally complete then

    IBZ(η)(N|ρ)=η2tr(ρ2)tr(ρ2)d(d+1).5.7
    This result is obtained by combining (4.4) with (5.5). For the complete set P of d+1 MUMs of the efficiency ϰ, we also rewrite (4.7) as
    PPIBZ(η)(P|ρ)=η2ϰd1d1[tr(ρ2)tr(ρ2)].5.8
    Due to (4.9), for a general SIC-POVM M, we have
    IBZ(η)(M|ρ)=η2ad31d(d21)[tr(ρ2)tr(ρ2)].5.9
    The right-hand side of any of the formulae (5.6)–(5.9) monotonically increases with the detection efficiency η. Thus, criticism related to detection inefficiencies is truly overcome by a proper modification of the definition of the Brukner–Zeilinger measure. The idea is that the probability distribution used for referencing should take into account the parameter η. In principle, the results (5.6)–(5.9) could be adopted for verification of concrete experimental set-ups with respect to their efficiency. Of course, the inefficiency model used is very simple in character. Probably, more sophisticated models of detection inefficiencies could be developed. Nevertheless, our discussion has shown that the Brukner–Zeilinger approach can well be placed in the context of real experiments with a limited efficiency.

    6. Monotonicity under the action of bistochastic maps

    We have seen that, for some special measurements, the Brukner–Zeilinger total information can exactly be expressed in terms of purity of the quantum state of interest. In effect, the four information measures (3.6), (4.4), (4.7) and (4.9) are all proportional to the quantity

    tr(ρ2)tr(ρ2)=tr(ρ2)1d.6.1
    So, we can treat it as a quantum measure of informational content of the given quantum state. Luo [46] showed the following fact. The quantity (6.1) is directly connected with usual quantum-mechanical variance averaged over every orthonormal basis in Ls.a.(Hd). Hence, the Brukner–Zeilinger concept of invariant information is supported within a more traditional point of view. We will provide another interesting interpretation for (6.1). This interpretation allows the study of the monotonicity of the Brukner–Zeilinger information under the action of quantum stochastic maps.

    The relative entropy is a very important measure of statistical distinguishability [35]. In the classical regime, the relative entropy is also known as the Kullback–Leibler divergence [47]. Its extension to entropic functions of the Tsallis type was discussed in [48,49]. Let supp(ρ)Hd be the subspace spanned by those eigenvectors that correspond to strictly positive eigenvalues of ρ. This subspace is typically called the support of ρ [35]. For density operators ρ and σ, the quantum relative entropy is expressed as [35]

    D1(ρσ):={tr(ρlnρρlnσ),ifsupp(ρ)supp(σ),+,otherwise.6.2
    Many fundamental results of quantum information theory are closely related to properties of the relative entropy [35,50].

    The divergence (6.2) was generalized in several ways. To connect the Brukner–Zeilinger approach, we will use quantum divergences of the Tsallis type. For α(1;+), the Tsallis α-divergence is defined as

    Dα(ρσ):={1α1[tr(ρασ1α)1],if supp(ρ)supp(σ),+,otherwise..6.3
    For α∈(0;1), we merely use the first entry without conditions. Up to a factor, this relative entropy is a particular case of quasi-entropies introduced by Petz [51]. Quasi-entropies are a quantum counterpart of Csiszár's f-divergences [52]. For more details, see [53,54] and references therein. It is easy to see that the quantity (6.1) can be represented as
    tr(ρ2)tr(ρ2)=1dD2(ρρ).6.4
    This formula gives a connection of the Brukner–Zeilinger total information with the Tsallis relative entropy.

    One of the basic properties of the quantum relative entropy is its monotonicity under the action of TPCP maps [35]. As has been shown, the four information measures (3.6), (4.4), (4.7) and (4.9) are invariant with respect to unitary transformations. Keeping the measurement set-up, we now aim to compare the Brukner–Zeilinger measure before and after the action of TPCP maps. For this reason, we will focus on the case of the same input and output space. Then Kraus operators of the operator-sum representation (2.16) are expressed by square matrices.

    In classical regime, the relative α-entropy of Tsallis' type is monotone for all α≥0 [49]. Due to non-commutativity, the quantum case is more complicated in character. The quantum α-divergence (6.3) is monotone under the action of TPCP maps for α∈(0;2]. That is, for α∈(0;2] and arbitrary TPCP map Φ, we have

    Dα(Φ(ρ)Φ(σ))Dα(ρσ).6.5
    This claim is based on the general approach of Hiai et al. [54] and the following results of matrix analysis. The function ξξα is matrix concave on [0;+) for 0≤α≤1 and matrix convex on [0;+) for 1≤α≤2 (see, respectively, theorems 4.2.3 and 1.5.8 in [55]).

    Bistochastic maps form an important class of TPCP maps. Recall that we consider the case of the same input and output space. Taking arbitrary operators X,YL(Hd), the adjoint map is defined by Watrous [26]

    Φ(X),YHS=X,Φ(Y)HS.6.6
    For the completely positive map (2.16), its adjoint is represented as
    Φ(X)=iKiXKi.6.7
    If this adjoint is trace preserving, then Kraus operators of Φ also obey
    iKiKi=1d.6.8
    If a quantum map is completely positive and its Kraus operators satisfy both (2.17) and (6.8), the map is called bistochastic [36]. Bistochastic maps can be treated as a quantum counterpart of bistochastic matrix, which act in the space of probability vectors. The principal fact is that the completely mixed state is a fixed point of any bistochastic map, namely
    Φ(ρ)=ρ.6.9
    This property is referred to as the unitality of the map [35,55]. Combining (6.5) with (6.9), we have arrived at a conclusion. For α∈(0;2] and all bistochastic maps Φ:L(Hd)L(Hd), one gets
    Dα(Φ(ρ)ρ)Dα(ρρ).6.10
    We will use (6.10) for α=2. Thus, the quantity (6.4) cannot increase under the action of bistochastic maps. In other words, for bistochastic maps, we write
    tr(Φ(ρ)2)tr(ρ2)tr(ρ2)tr(ρ2).6.11
    Due to (6.11), we see that the quantities (3.6), (4.4), (4.7) and (4.9) can only decrease under the action of bistochastic maps. As was recently shown in [56], bistochastic quantum operation can only increase quantum entropies of very general class.

    Since the four measures (3.6), (4.4), (4.7) and (4.9) depend on purity of the state, they are all invariant with respect to unitary transformation. In the terminology of Brukner & Zeilinger [10], they are all operationally invariant measures of information. The unitary invariance has been treated as one of basic reasons for using just this approach to quantification of information in quantum measurements. Further, the above information measures cannot increase under the action of bistochastic maps. For a bistiochastic map, its adjoint is a TPCP map as well. Here, the property (6.9) plays a key role. Quantum fluctuation theorems form another direction, in which unitality seems to be very important. As was claimed in [57], unitality replaces microreversibility as the restriction for the physicality of reverse processes. Significance of unitality or non-unitality of quantum stochastic maps deserves further investigations. In the next section, we will discuss some relations between this question and the Brukner–Zeilinger total information.

    7. Non-unital maps and the Brukner–Zeilinger approach

    We have seen that the quantity (6.1) can only decrease under the action of bistochastic maps. It is natural to expect that (6.1) may be increased for non-unital quantum operations. In this section, we will study connections of the Brukner–Zeilinger total information with characterization of such maps. The latter seems to be closely related with quantum fluctuation theorems. Recent advances in dealing with small quantum systems have led to growing interest in their thermodynamics [58]. A certain progress has been connected with studies of the Jarzynski equality [59] and related fluctuation theorems [60,61]. Recent studies are mainly concentrated on formulations for open quantum systems [6266]. Some of such results have been shown to be valid in the case of bistochastic maps [57,67]. Jarzynski equality for quantum stochastic maps can naturally be formulated in terms of the non-unitality operator [68]. It turns out that norms of this operator can be evaluated within the Brukner–Zeilinger approach.

    Operators of interest are often characterized by means of norms. Some of them are especially important. To each XL(Hd), we assign |X|L+(Hd) as the unique positive square root of XX. The eigenvalues of |X| counted with multiplicities are the singular values of X denoted by sj(X). For q[1;], the Schatten q-norm is defined as [26]

    Xq:=(j=1dsj(X)q)1/q.7.1
    This family includes the trace norm ∥X1=tr|X| for q=1, the Hilbert–Schmidt norm X2=X,X HS1/2 for q=2, and the spectral norm
    X=max{sj(X):1jd}7.2
    for q=. These norms are widely used in quantum information theory. They also give a tool for characterizing linear maps. For a linear map Φ, its norm is defined as
    Φ:=sup{Φ(X):X=1}.7.3
    We will use the following fact proved, e.g. in item 2.3.8 of [55]. If a map Φ:L(Hd)L(Hd) is positive, then
    Φ=Φ(1d).7.4
    In terms of the completely mixed state, we write Φ=dΦ(ρ).

    For a linear map Φ:L(Hd)L(Hd), the non-unitality operator is written as [68]

    GΦ:=Φ(ρ)ρ.7.5
    This operator is zero for all bistochastic maps. For TPCP maps, the Hilbert–Schmidt norm of (7.5) is immediately expressed in terms of the Brukner–Zeilinger measure of information. Indeed, the squared Hilbert–Schmidt norm of GΦ is written as
    Φ(1d)1d,Φ(1d)1d HS=tr(Φ(1d)2)2tr(Φ(1d))+d=tr(Φ(1d)2)d.7.6
    Here, we recall that Φ(1d)L+(Hd) is Hermitian and tr(Φ(1d))=d due to preservation of the trace. Dividing (7.6) by d2 and taking the square root, for a TPCP map we have
    GΦ2=tr(Φ(ρ)2)1d=tr(Φ(ρ)2)tr(ρ2).7.7
    Thus, obtaining the Brukner–Zeilinger total information also allows calculation of the Hilbert–Schmidt norm of the non-unitality operator.

    The difference tr(Φ(ρ)2)tr(ρ2) can be evaluated by means of measurements schemes described in §§3 and 4. When an unknown quantum channel is given as some black box, we prepare the completely mixed state with putting it into the black box. The output Φ(ρ*) is further subjected to one of measurement schemes available for the given d. This run is repeated as many times as required for collecting measurement statistics. Statistical data should be sufficient for evaluation of the left-hand side of one of the relations (3.6), (4.4), (4.7) and (4.9). Thus, we obtain the quantity (6.1) for ρ=Φ(ρ*) and apply (7.7).

    Using the result (7.7), for quantum operations, we can estimate from above the map norm (7.4). We will use a relation between vector norms proved in [41]. It was later applied for deriving fine-grained uncertainty relations for a set of MUBs and a set of MUMs [69]. As follows from the results of appendix A of [41], for any operator XL(Hd) we have

    X1d(X1+d1dX22X12).7.8
    For a TPCP map, we have ∥Φ(ρ*)∥1=tr(Φ(ρ*))=1. Combining this with (7.8) gives
    Φ(ρ)1d(1+d1dΦ(ρ)221).7.9
    Due to (7.7) and Φ(ρ)22=tr(Φ(ρ)2), multiplying (7.9) by d leads to
    Φ1+d(d1)GΦ2.7.10
    Thus, for quantum operations, the map norm (7.3) is bounded from above in terms of the Hilbert–Schmidt norm of the corresponding non-unitality operator. For bistochastic maps, we have Φ=1d=1 and GΦ=0, so that the inequality (7.10) is saturated here.

    The above findings can further be illustrated with the following example. Let {|i} i=1d be an orthonormal basis in Hd. We consider the quantum operation Ψ:L(Hd)L(Hd) with Kraus operators

    Ki=|i0i|,7.11
    where |i0〉 is some prescribed state of the basis. This map represents the complete contraction to a pure state. Taking |i0〉 as a ground state, one can describe the process of spontaneous emission in atomic physics. In a certain sense, quantum operations of the form (7.11) enjoy extreme non-unitality. The condition (2.17) is clearly satisfied, whereas
    Ψ(1d)=i=1dKiKi=d|i0i0|.7.12
    In this example, we have Ψ(ρ*)=|i0〉〈i0| and tr(Ψ(ρ)2)tr(ρ2)=11/d. Hence, the Brukner–Zeilinger information reaches its maximal value. We also note that the inequality (7.10) is saturated here. Indeed, substituting the term GΨ2=11/d into the right-hand side of (7.10) results in the value d that is exactly Ψ=Ψ(1d). Thus, quantum operations of the form (7.11) show a behaviour quite opposite to bistochastic maps.

    8. Conclusion

    We have considered the Brukner–Zeilinger approach to quantifying information in quantum measurements on a finite-level system. This problem is essential due to recent advances in quantum information processing. The original formulation of Brukner and Zeilinger was based on projective measurements in the complete set of MUBs. This formulation is therefore restricted, since even the case of MUBs in dimensionality 6 is still not resolved [12]. We have shown that the idea of operationally invariant measure of information can truly be realized within the three schemes based on special types of quantum measurements. Namely, these schemes, respectively, use a single SIC-POVM, a complete set of MUMs, and a single general SIC-POVM. Such measurements are easy to construct. In addition, costs on the schemes with a single SIC-POVM may be less. The Brukner–Zeilinger measure of information was also criticized on the following ground. In real experiments, the ‘no-click’ events inevitably occur. Some doubts in the case of detection inefficiencies were discussed in [43]. Such criticism is overcome by means of natural reformulation of the approach considered. Namely, the uniform distribution is a good reference only for the inefficiency-free case. Otherwise, we should use for comparison some probability distribution that takes into account a real efficiency of detectors. The desired probability distribution is naturally obtained by putting the completely mixed state into real experiments. The corresponding data can be stored and further used for calculating required quantities. Information measures of the Brukner–Zeilinger type are not only unitarily invariant, they cannot also increase under the action of bistochastic maps. Using this approach for characterization of non-unital TPCP maps is considered. If a quantum channel is given as black box, the measurement schemes described can be used for determining the Hilbert–Schmidt norm of the non-unitality operator. Potential applications of information measures of the Brukner–Zeilinger type in quantum information science deserve further investigations. The authors of Deutsch & Marletto [70] recently proposed the constructor theory of information, which is aimed to derive the properties of information entirely from the laws of physics. It would be interesting to study measures of information in quantum theory within the constructor theory.

    Competing interests

    I declare I have no competing interests.

    Funding

    I received no funding for this study.

    Acknowledgements

    I am grateful to the anonymous reviewers for useful comments.

    Footnotes

    Published by the Royal Society. All rights reserved.