On the Brukner–Zeilinger approach to information in quantum measurements
Abstract
We address the problem of properly quantifying information in quantum theory. Brukner and Zeilinger proposed the concept of an operationally invariant measure based on measurement statistics. Their measure of information is calculated with probabilities generated in a complete set of mutually complementary observations. This approach was later criticized for several reasons. We show that some critical points can be overcome by means of natural extension or reformulation of the Brukner–Zeilinger approach. In particular, this approach is connected with symmetric informationally complete measurements. The ‘total information’ of Brukner and Zeilinger can further be treated in the context of mutually unbiased measurements as well as general symmetric informationally complete measurements. The Brukner–Zeilinger measure of information is also examined in the case of detection inefficiencies. It is shown to be decreasing under the action of bistochastic maps. The Brukner–Zeilinger total information can be used for estimating the map norm of quantum operations.
1. Introduction
Quantum information science has currently made impressive advances in both theory and practice [1]. Feynman emphasized that quantum systems are very hard to be simulated at the classical level [2]. On the other hand, such a negative claim also inspires a positive reason for trying to build quantum computers [3]. Quantum key distribution has provided a long-term technological solution already implemented in a lot of commercial products [4,5]. Quantum algorithms allow to solve efficiently a number of important problems, which are currently intractable [6,7]. Developments in quantum information processing stimulated a renewed interest to foundations of quantum mechanics. This subject is a thriving, lively and controversial field of research [8,9]. Currently, conceptual questions are often reformulated in information-theoretic terms. Actually, results of a quantum measurement are finally recorded in some row of statistical data. Hence, we have come across a problem to quantify an amount of information that could be extracted from such data.
The problem of determining quantum state quite differs from the classical formulation. There are many possible scenarios to be imagined. Attacking a system of quantum key distribution, Eve is typically bused with discriminating between two or more alternatives known to her a priori. During an individual attack, she captures only a single information carrier. An opposite situation deals with a very large ensemble of identical copies. In practice, a number of copies is never infinite though large. Our experience leads to the following conclusion. The proportion of times that the given outcome occurs settles down to some value as the number of trials becomes larger and larger. The ultimate value of this proportion is meant as the probability of the given outcome. Dealing with quantum systems, the observer can take different experiments, which might even completely exclude each other. For example, the state of a spin- system is often considered to be estimated with measurements of the three orthogonal components of spin [10]. In more than two dimensions, such complementary measurements are formulated in terms of the so-called mutually unbiased bases (MUBs). This concept was actually considered by Schwinger [11].
To approach properly an informational measure, Brukner & Zeilinger considered the following situation [10]. Suppose that we know probabilities of all outcomes and try to guess a number of occurrences of the prescribed outcome among finite experimental trials. Of course, our prediction will allow an amount of uncertainty, which can be estimated with respect to some confidence interval. Taking an uncertainty per single trial and summing it for all outcomes, one naturally leads to a measure of uncertainty in one experiment [10]. It is shown to be 1 minus the sum of squared probabilities. Hence, Brukner and Zeilinger defined a measure of information in one experiment and in a set experiments. For d+1 MUBs, the corresponding total information turned to be operationally invariant in the following sense [10]. The sum of the individual measures of information for mutually complementary observations is invariant with respect to a choice of the particular set of complementary observations. In other words, this sum is invariant under unitary rotations of the measured state. The latter implies that there is no information flow between the system of interest and its environment [10].
MUBs are an interesting mathematical object as well as an important tool in many physical issues [12]. Such bases can be used in quantum key distribution, state reconstruction, quantum error correction, detection of quantum entanglement and other topics. MUBs are connected with symmetric informationally complete measurements. A positive operator-valued measure (POVM) is said to be informationally complete, if its statistics determine completely the quantum state [13,14]. To increase an efficiency at determining the state, elements of such a measurement should have rank one. An informationally complete POVM is called symmetric, when all pairwise inner products between the POVM elements are equal [15]. In general, the maximal number of MUBs in d dimensions is still an open question [12]. When d is a prime power, the answer d+1 is known [12]. Constructions of d+1 MUBs for such d rely on properties of prime powers and on an underlying finite field [16]. It also seems to be hard to get a unified way for building a symmetric informationally complete POVM (SIC-POVM) in all dimensions.
The authors of Kalev & Gour [17] introduced the concept of mutually unbiased measurements (MUMs). The core idea is that elements of such a measurement are not rank one. This method does not reach the maximal efficiency but is easy to construct. It turns out that a complete set of d+1 MUMs can be built explicitly for arbitrary finite d [17]. An utility of such measurements in quantum information science deserves further investigations. It is also unknown whether rank-one SIC-POVMs exist in all finite dimensions. The positive answer was obtained with a weaker condition that POVM elements are not rank one. The authors of Gour & Kalev [18] proved the existence of general SIC-POVMs in all finite dimensions. It is not insignificant that general SIC-POVMs can be constructed within a unified approach. Studies of MUMs and general SIC-POVMs were continued in [19–22]. We will show that these measurements are interesting in the context of the Brukner–Zeilinger approach [10,23–25]. This approach to quantifying an amount of information will be shown to be realizable within three additional types of quantum measurements.
The paper is organized as follows. In §2, preliminary material is reviewed. In particular, we recall the definitions of MUMs and general SIC-POVMs. Section 3 is devoted to a general discussion of the Brukner–Zeilinger approach to quantification of information in quantum measurements. Its treatment in terms of Tsallis' entropies of degree 2 is mentioned. In §4, we show that an operationally invariant measure of information can be approached within the three measurement schemes. They are, respectively, based on a single SIC-POVM, on a set of d+1 MUMs, and on a general SIC-POVM. These measurement schemes give an alternative to d+1 MUBs known only for prime power dimensions. In §5, the Brukner–Zeilinger approach is examined for the case of detection inefficiencies, when the ‘no-click’ events are allowed. In §6, we show that the Brukner–Zeilinger total information cannot increase under the action of bistochastic maps. Relations between the Brukner–Zeilinger approach and non-unitality are examined in §7. In §8, we conclude the paper with a summary of results.
2. Preliminaries
In this section, we review the required material on MUMs and general SIC-POVMs. Let be the space of linear operators on d-dimensional Hilbert space . By , we denote the set of positive semi-definite operators on . By , we mean the d2-dimensional real space of Hermitian operators on . A state of d-level system is represented by density operator normalized as tr(ρ)=1. For operators , their Hilbert–Schmidt inner product is defined by Watrous [26]
Let and be two orthonormal bases in . They are mutually unbiased if and only if for all j and k,
Let us recall symmetric informationally complete measurements. In d-dimensional Hilbert space, we consider a set of d2 rank-one operators of the form
Basic constructions of MUBs concern the case, when d is a prime power. If d is another composite number, maximal sets of MUBs are an open problem [12]. We can try to approach ‘unbiasedness’ with weaker conditions. The authors of Kalev & Gour [17] proposed the concept of MUMs. They consider two POVM measurements and . Each of them contains d elements such that
Similar ideas can be used in building general SIC-POVMs. For all finite d, a common construction has been given [18]. Consider a POVM with d2 elements
In §5, we will use monotonicity of the relative entropy under the action of trace-preserving completely positive (TPCP) maps. So, we recall some required material. Let us consider a linear map
3. On definition of the Brukner–Zeilinger information
Quantum theory can shortly be characterized as a formal scheme for representing states together with rules for computing the probabilities of different outcomes of an experiment [27]. In this regard, the notion of quantum state is rather a list of the statistical properties of an ensemble of identically prepared systems. In a series of papers [10,23–25], Brukner and Zeilinger considered the question of informational content of an unknown quantum state. To quantify the amount of information, a prospective measure should have some natural properties. These properties are also connected with a proper choice of individual experiments or rather a set of experiments. Choosing experiments, the observer can actually manage different kinds of information that will manifest themselves, although the total amount of information is apparently limited [8].
Let us consider an experiment, in which a non-degenerate d-dimensional observable is measured. This test is actually connected with the corresponding basis . As a rule, the observer has only a limited number of systems to work with. Keeping the probability distribution , the observer try to guess how many times a specific outcome will occur. In such situation, the number of occurrences of some outcome in future repetitions cannot be expected precisely [10]. The authors of Brukner & Zeilinger [10] suggested to characterize the experimenter's uncertainty by the quantity
When the observer have many copies of the same quantum state, he will rather tend to measure the state in several mutually complementary bases. For example, the state of spin- could be measured along one of three orthogonal axes. The authors of Brukner & Zeilinger [10,23] defined the total information content by summarizing the measures (3.3) for all complementary tests. Suppose that we have the set of d+1 MUBs in d-dimensional space. For any density matrix , one then gives [37,38]
The question about invariance or non-invariance under unitary transformations can be illustrated with the three spin- measurements along mutually orthogonal axes [23]. For one and the same spin state, the three coordinate axes may be oriented arbitrarily. Here, the total information (3.6) does not depend on such an orientation. Indeed, any axes rotation can be reformulated as a unitary transformation of the given state. The eigenbases of the three Pauli observables are mutually unbiased, whence the total information (3.6) is invariant under unitary transformations of the state.
The Shannon entropy is one of the basic notions of information theory. If a measurement is described by the probabilities , then the Shannon entropy is written as
Thus, the sum of the Shannon entropies of generated probability distributions is generally not invariant even for the case, when d+1 MUBs exist. In contrast, the total information (3.6) is constant here. Note that the Brukner–Zeilinger information can be interpreted in entropic terms. For 0<α≠1, the Tsallis α-entropy of generated probability distribution is defined by
4. Three schemes with special types of quantum measurements
In this section, we will discuss use of the Brukner–Zeilinger approach with a SIC-POVM, with a complete set of MUMs, and with a general SIC-POVM. In each of these cases, we finally obtain an information measure operationally invariant in the terminology of Brukner & Zeilinger [10]. To apply the result (3.5), we have to perform d+1 projective measurements, if the required MUBs all exist. So, it is interesting to examine the Brukner–Zeilinger total information with other quantum measurements. For a POVM , we define
We first mention that a single POVM measurement is sufficient for our purposes. Suppose that is a symmetric informationally complete POVM in d dimensions. As was shown in [41], the corresponding index of coincidence is equal to
For arbitrary d, we can built a set of d+1 MUMs of some efficiency [17]. We shall now consider the Brukner–Zeilinger approach with such measurements. Let be a set d+1 MUMs of the efficiency in d-dimensional space. As was shown in [21,22], we then have
Let us proceed to the case of general SIC-POVMs. It is interesting, since general SIC-POVMs can be built within a scheme common for all d [18]. On the contrary, a unified approach to constructing SIC-POVMs with rank-one elements hardly exists. Moreover, the existence of usual SIC-POVMs for all d is plausible but still not proved. For a general SIC-POVM , we have [20]
In this section, we have shown that the Brukner–Zeilinger concept of total information can be realized within the three measurement schemes. They are, respectively, based on a single SIC-POVM, on a set of d+1 MUMs, and on a general SIC-POVM. We are sure of the existence of the complete set of MUBs only for specific values of the dimensionality. We can also recall that even the case d=6 is still not understood. For this reason, an alternative realization of the Brukner–Zeilinger approach is certainly interesting. On the other hand, implementation of such experimental schemes may not be easy due to very special structure of measurement operators. So, the developed approach should take into account a role of detection inefficiencies. In this regard, the authors of Shafiee et al. [43] criticized the Brukner–Zeilinger approach. In the next section, we examine the question in more details.
5. Formulation for measurements with detection inefficiencies
In practice, measurement devices inevitably suffer from losses. The authors of Shafiee et al. [43] considered the Brukner–Zeilinger approach in the case of non-zero probability of the no-click event. For definiteness, we first describe this case for complementary measurements in MUBs. Let the parameter η∈[0;1] characterize a detector efficiency. The no-click event is presented by additional outcome . Assume that for any basis , the inefficiency-free distribution is altered as
It was noted that the Brukner–Zeilinger approach may have some doubts in application to more realistic models of the experiment. In principle, we could expect that the total information should vanish with negligible η. At a glance, however, one comes across an opposite situation. The authors of Shafiee et al. [43] illustrated this conclusion with the three spin- measurements along orthogonal axes. They calculated the sum of three quantities of the form (3.3) for different η∈[0;1] and found the following. First, the minimum of the sum is reached at some intermediate value of η>0. Second, for η→0+, the sum becomes even larger than for the inefficiency-free case η=1. Such results gave a ground for criticizing the Brukner–Zeilinger approach [43].
In our opinion, these doubts may be overcome with a proper modification of the form (3.3). Here, we compare obtained probability distributions with the uniform one. However, such a comparison is meaningful only in the inefficiency-free case η=1. In the distribution (5.1), one of probabilities depends on detectors solely. As its value is 1−η, the uniform distribution does not have actual bearing for the case η<1. Instead, we propose to compare the actual probability distribution with the distribution obtained with the completely mixed input. It can be reached by replacing (3.3) with (3.4). More precisely, for the case of detection inefficiencies, we use the quantity
We can now reformulate the results (3.6), (4.4), (4.7) and (4.9) in the case of detection inefficiencies. It is for this reason that we modified definition of the Brukner–Zeilinger information according to (5.2). That is, the terms with ρ* also take into account an influence of no-click events. Combining (3.6) with (5.3) for α=2, we have arrived at a conclusion. When d+1 MUBs exist and form the set , the total information with actually observed statistics is equal to
Using the described model of inefficiencies, we further obtain the following relations. If a POVM is symmetric informationally complete then
6. Monotonicity under the action of bistochastic maps
We have seen that, for some special measurements, the Brukner–Zeilinger total information can exactly be expressed in terms of purity of the quantum state of interest. In effect, the four information measures (3.6), (4.4), (4.7) and (4.9) are all proportional to the quantity
The relative entropy is a very important measure of statistical distinguishability [35]. In the classical regime, the relative entropy is also known as the Kullback–Leibler divergence [47]. Its extension to entropic functions of the Tsallis type was discussed in [48,49]. Let be the subspace spanned by those eigenvectors that correspond to strictly positive eigenvalues of ρ. This subspace is typically called the support of ρ [35]. For density operators ρ and σ, the quantum relative entropy is expressed as [35]
The divergence (6.2) was generalized in several ways. To connect the Brukner–Zeilinger approach, we will use quantum divergences of the Tsallis type. For , the Tsallis α-divergence is defined as
One of the basic properties of the quantum relative entropy is its monotonicity under the action of TPCP maps [35]. As has been shown, the four information measures (3.6), (4.4), (4.7) and (4.9) are invariant with respect to unitary transformations. Keeping the measurement set-up, we now aim to compare the Brukner–Zeilinger measure before and after the action of TPCP maps. For this reason, we will focus on the case of the same input and output space. Then Kraus operators of the operator-sum representation (2.16) are expressed by square matrices.
In classical regime, the relative α-entropy of Tsallis' type is monotone for all α≥0 [49]. Due to non-commutativity, the quantum case is more complicated in character. The quantum α-divergence (6.3) is monotone under the action of TPCP maps for α∈(0;2]. That is, for α∈(0;2] and arbitrary TPCP map Φ, we have
Bistochastic maps form an important class of TPCP maps. Recall that we consider the case of the same input and output space. Taking arbitrary operators , the adjoint map is defined by Watrous [26]
Since the four measures (3.6), (4.4), (4.7) and (4.9) depend on purity of the state, they are all invariant with respect to unitary transformation. In the terminology of Brukner & Zeilinger [10], they are all operationally invariant measures of information. The unitary invariance has been treated as one of basic reasons for using just this approach to quantification of information in quantum measurements. Further, the above information measures cannot increase under the action of bistochastic maps. For a bistiochastic map, its adjoint is a TPCP map as well. Here, the property (6.9) plays a key role. Quantum fluctuation theorems form another direction, in which unitality seems to be very important. As was claimed in [57], unitality replaces microreversibility as the restriction for the physicality of reverse processes. Significance of unitality or non-unitality of quantum stochastic maps deserves further investigations. In the next section, we will discuss some relations between this question and the Brukner–Zeilinger total information.
7. Non-unital maps and the Brukner–Zeilinger approach
We have seen that the quantity (6.1) can only decrease under the action of bistochastic maps. It is natural to expect that (6.1) may be increased for non-unital quantum operations. In this section, we will study connections of the Brukner–Zeilinger total information with characterization of such maps. The latter seems to be closely related with quantum fluctuation theorems. Recent advances in dealing with small quantum systems have led to growing interest in their thermodynamics [58]. A certain progress has been connected with studies of the Jarzynski equality [59] and related fluctuation theorems [60,61]. Recent studies are mainly concentrated on formulations for open quantum systems [62–66]. Some of such results have been shown to be valid in the case of bistochastic maps [57,67]. Jarzynski equality for quantum stochastic maps can naturally be formulated in terms of the non-unitality operator [68]. It turns out that norms of this operator can be evaluated within the Brukner–Zeilinger approach.
Operators of interest are often characterized by means of norms. Some of them are especially important. To each , we assign as the unique positive square root of
For a linear map , the non-unitality operator is written as [68]
The difference can be evaluated by means of measurements schemes described in §§3 and 4. When an unknown quantum channel is given as some black box, we prepare the completely mixed state with putting it into the black box. The output Φ(ρ*) is further subjected to one of measurement schemes available for the given d. This run is repeated as many times as required for collecting measurement statistics. Statistical data should be sufficient for evaluation of the left-hand side of one of the relations (3.6), (4.4), (4.7) and (4.9). Thus, we obtain the quantity (6.1) for ρ=Φ(ρ*) and apply (7.7).
Using the result (7.7), for quantum operations, we can estimate from above the map norm (7.4). We will use a relation between vector norms proved in [41]. It was later applied for deriving fine-grained uncertainty relations for a set of MUBs and a set of MUMs [69]. As follows from the results of appendix A of [41], for any operator we have
The above findings can further be illustrated with the following example. Let be an orthonormal basis in . We consider the quantum operation with Kraus operators
8. Conclusion
We have considered the Brukner–Zeilinger approach to quantifying information in quantum measurements on a finite-level system. This problem is essential due to recent advances in quantum information processing. The original formulation of Brukner and Zeilinger was based on projective measurements in the complete set of MUBs. This formulation is therefore restricted, since even the case of MUBs in dimensionality 6 is still not resolved [12]. We have shown that the idea of operationally invariant measure of information can truly be realized within the three schemes based on special types of quantum measurements. Namely, these schemes, respectively, use a single SIC-POVM, a complete set of MUMs, and a single general SIC-POVM. Such measurements are easy to construct. In addition, costs on the schemes with a single SIC-POVM may be less. The Brukner–Zeilinger measure of information was also criticized on the following ground. In real experiments, the ‘no-click’ events inevitably occur. Some doubts in the case of detection inefficiencies were discussed in [43]. Such criticism is overcome by means of natural reformulation of the approach considered. Namely, the uniform distribution is a good reference only for the inefficiency-free case. Otherwise, we should use for comparison some probability distribution that takes into account a real efficiency of detectors. The desired probability distribution is naturally obtained by putting the completely mixed state into real experiments. The corresponding data can be stored and further used for calculating required quantities. Information measures of the Brukner–Zeilinger type are not only unitarily invariant, they cannot also increase under the action of bistochastic maps. Using this approach for characterization of non-unital TPCP maps is considered. If a quantum channel is given as black box, the measurement schemes described can be used for determining the Hilbert–Schmidt norm of the non-unitality operator. Potential applications of information measures of the Brukner–Zeilinger type in quantum information science deserve further investigations. The authors of Deutsch & Marletto [70] recently proposed the constructor theory of information, which is aimed to derive the properties of information entirely from the laws of physics. It would be interesting to study measures of information in quantum theory within the constructor theory.
Competing interests
I declare I have no competing interests.
Funding
I received no funding for this study.
Acknowledgements
I am grateful to the anonymous reviewers for useful comments.