Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences
You have accessResearch article

A model of adaptive decision-making from representation of information environment by quantum fields

    Abstract

    We present the mathematical model of decision-making (DM) of agents acting in a complex and uncertain environment (combining huge variety of economical, financial, behavioural and geopolitical factors). To describe interaction of agents with it, we apply the formalism of quantum field theory (QTF). Quantum fields are a purely informational nature. The QFT model can be treated as a far relative of the expected utility theory, where the role of utility is played by adaptivity to an environment (bath). However, this sort of utility–adaptivity cannot be represented simply as a numerical function. The operator representation in Hilbert space is used and adaptivity is described as in quantum dynamics. We are especially interested in stabilization of solutions for sufficiently large time. The outputs of this stabilization process, probabilities for possible choices, are treated in the framework of classical DM. To connect classical and quantum DM, we appeal to Quantum Bayesianism. We demonstrate the quantum-like interference effect in DM, which is exhibited as a violation of the formula of total probability, and hence the classical Bayesian inference scheme.

    This article is part of the themed issue ‘Second quantum revolution: foundational questions’.

    1. Introduction

    This paper is devoted to applications of the mathematical formalism of quantum theory to the modelling of decision-making (henceforth denoted as ‘DM’) in a context where there exists deep uncertainty1 about both the possible actions of other decision-makers and the surrounding complex information environment. This information environment can include belief states of decision-makers, memory recollections and external economic (or also financial, social and geopolitical) contexts.2 In economics, the information environment is typically formalized via the introduction of the notions of (i) public (non-scarce) and private (scarce) information (flows); and the notions of (ii) produced information (a dataset which needs buying for instance) and emergent information (time which reveals information for instance). The economics and finance literature has published research on how one can measure the economic value of information. The notion of ‘private information price of risk’ for instance measures how private information triggers changes in the price of risk within an asset pricing context [2,3]. The idea of ‘degrees of information’ is sometimes also considered such as ‘first-order’ information (which ties the information to an event) and ‘higher-order’ information (which refers to the information itself (for instance what is the source of the information)). The notion of ‘common knowledge’ which we briefly mention below is an example of such higher-order information. For a detailed discussion of many of the basic concepts (see [4], p. 171 and p. 224).

    The idea of ‘information environment’ is also used in physics and very often the terms ‘bath’ or ‘reservoir’ are used. In a social science adapted context (starting from physics), we can then speak of ‘mental bath’ or ‘mental reservoir’.

    In the model presented here in this paper, agents are not aiming to maximize their utility, but they instead adapt their behaviour to information gained from other agents and the environment. To describe this process of adaptivity, we adopt the mathematical apparatus of quantum field theory (QFT). In particular, the information environment is represented mathematically by quantum fields. We call this approach QFT-inspired. It has to be sharply distinguished from a variety of applications where genuine quantum physics is applied to cognition and DM.

    The adaptive approach which we will introduce is more realistic compared to the utility-maximizing stance used in basic economics. However, the ensuing formalism is much more demanding. As is well known, from a basic economics perspective, the usual assumption made is that in the absence of any arbitrage3 opportunities, there should exist a so-called budget feasible plan which cannot be more preferred by any other budget feasible plan [6].

    From the brief discussion so far, it is intuitive that our DM model will differ crucially from the classical expected utility theory introduced by von Neumann & Morgenstern [7]. Nevertheless, the QFT-inspired adaptive model can be treated as a far relative of utility-based models of DM, including expected utility. The role of utility is played by the degree of adaptivity to the surrounding (informational) environment, i.e. the mental bath. However, it is impossible to encode ‘environment-adaptivity’ by a numerical utility-function. As we will show, it is encoded by the dynamics of quantum decision operators.

    Within the mainstream literature, in cognitive psychology and behavioural economics, several models have been developed to measure certain effects such as ‘order’ effects and ‘conjunction’ and ‘disjunction’ effects. Those are effects which have been repeatedly observed in experimental laboratory settings and some of those effects have relevance to some of the important paradoxes which have affected the integrity of the axiomatic structure of key expected utility models (such as the von Neumann–Morgenstern expected utility model (Allais’ paradox; see [8]) and the Savage expected utility model (Ellsberg paradox, see [9]). See also Machina [10]). A very rich literature has emerged and we must mention the work of Tversky & Kahneman [11] and Tversky & Shafir [12]. However, we need to note that those effects are now also modelled with a so-called ‘quantum-like’ approach. In essence, this approach uses formalisms from quantum mechanics to augment the modelling of non-physics problems, such as cognition. For representative work in this area, see [1344] and references therein.

    Finally, the quantum-like approach has recently been successfully applied to the theory of common knowledge and the modelling of violations of Aumann's theorem about the impossibility to agree on disagreeing [36,37].

    As was already pointed out, we are interested in modelling the DM process as being adaptive to the information environment. An agent Inline Formula is not driven to maximize his utility, but he rather tries to adapt his behaviour to information which he is gaining (during the DM process) from his environment Inline Formula

    There are two well-known main approaches to the modelling of quantum adaptive dynamics. One approach is based on the theory of open quantum systems. Here, the state of an agent is extracted from the general state of the compound system ‘agent+environment’ and the dynamics are reduced (by using the trace operation) to the state space of this agent (the environment is encoded in the operator coefficients of the quantum master equation).4 Another approach is based on the study of the general dynamics of the state of the compound system, ‘agent+environment’, and then averages and probabilities corresponding to the agents’ decisions are extracted from the complete dynamical state [45]. In the latter approach, an environment is represented as a quantum field. In this paper, we proceed with this QFT-inspired approach by using the mathematical methods developed in recent papers by Bagarello [1921] and in the monograph [18].

    The main advantage of the open quantum system approach is the reduction of the state dimension to the dimension of the agent's state. The main problem is that an environment appears in a very formal encoding and it is not easy to extract its features from the coefficients of the quantum master equation. Thus, the model is phenomenological.5

    The QFT-inspired approach is more difficult analytically, because the scenarios of evolution are presented by taking into account all degrees of freedom of the environment.6 It can be difficult to construct analytical solutions of the dynamical equations for the state evolution, even if in many relevant cases this can be done (see §§3 and 4). Sometimes, however, numerical methods are really what is needed. This is the case, for instance, of nonlinear models, which will not play any role in this paper. Among the advantages of the QFT-inspired approach, we can point to the possibility of describing dynamics in terms of solely pure states.

    At the beginning of this paper, we mentioned the idea of ‘deep uncertainty’. From the quantum information point of view [46] a pure state represents the maximally available information about the context (in our case the context of DM). However, the availability of maximal information does not mean the resolution of uncertainty. We say that the uncertainty can be very deep. The pure state representation means that maximal information (about the context) is encoded in it. The most tricky point of quantum information theory (and, in fact, quantum theory in general) is that there exist states of compound systems, e.g. ‘agent+environment’, which are pure. This means they encode maximal information, but the states of subsystems, e.g. the state of the agent, are not pure (i.e. they do not encode maximally available information about the situation). Such states of compound systems are called entangled. Entanglement is nowadays considered as the main distinguishing feature of quantum theory.

    Let us now come back to the objective of this paper. We will study a very general DM model represented in the form of a game played by two players Inline Formula and Inline Formula interacting with two environments, Inline Formula and Inline Formula Such players can be agents of a market, e.g. corporations, traders of the financial market, political parties. A decision is taken only after the interactions Inline Formula and Inline Formula are considered. The reflections of the agents generated by the interactions modify the agents’ mental states. Thus, as was already emphasized, the key notion of such a DM model is not utility, but rather the interactions between agents and their environments and the adaptivity to their impact.7

    The QFT-inspired model for this sort of games involving DM under uncertainty (including the uncertainty generated by a complex surrounding environment (be it mental, economic, social, etc.)) was considered in papers by Bagarello [19,20]. In these papers, one of the main characteristics of the analysis was that, at the beginning of the process of DM t=0, each agent was in a sharp state. By this we mean that each agent knew, at t=0, exactly which was her own choice. Thus, initially, an agent was able to resolve her internal mental uncertainty. Then, in the process of DM, her certainty was destroyed as a result of reflections generated by interactions with another agent and the environment.

    Of course, this initial certainty of the agent's state is a strongly simplifying assumption which does not match reality (e.g. when we consider traders in the financial market).

    In this paper, we present a more realistic model by considering the possibility that, initially, an agent is also being in a state of uncertainty which is mathematically represented as a superposition of quantum(-like) pure states corresponding to the concrete choices. Such a superposition encodes a very deep uncertainty which cannot be modelled in the framework of classical probability (CP) [27,35]. The superposition of decisions induces a kind of interference between possible decisions of agents (see §4). In probabilistic terms, this interference is visible as a violation of the classical formula of total probability: it is perturbed by an additional term. We will see that, in comparison with [1921], the dynamical behaviour of the decision functions (DFs; see below) changes drastically. In particular, a sort of overall noise appears because of the presence of some interference effects in the system.

    2. Why quantum(-like)?

    The original general idea of Bagarello [18] was to formally use creation and annihilation operators, the basic building blocks of QFT, in macroscopic systems so one can possibly consider dynamical systems. These operators can be used to represent very complex dynamical processes composed of elementary acts of creation and annihilation of new states of systems. These states can be of any nature: physical, mental, biological. This is the minimal interpretation of the QFT-inspired approach. From this formal viewpoint, the application of the mathematical formalism of quantum theory is just a matter of convenience, and in fact it proved to be useful in several applications in different contexts [1821].

    In our concrete model, the states under consideration are possible strategies of players Inline Formula and Inline Formula. The creation and annihilation operators are explored to model the process of how players reflect upon their strategies. These reflections are composed of elementary acts of transition of the player's mental state from one strategy to another (‘annihilation’ of the previously chosen strategy and ‘creation’ of the opposite strategy). We remark that this model, as well as the quantum physical model in general, is epistemic, i.e. it does not represent the internal (and extremely complex) processes in the neural system leading to such transitions. It provides only the formal operator representation of these transitions. However, this formal model is rather rich mathematically; see §§3 and 4. We can make the argument that without such an operational reduction of complexity, it would be really impossible to proceed to concrete solutions of dynamical equations. This operational reduction in complexity is one the most important features motivating the exploration of the quantum formalism.

    However, the application of this formalism has also deeper foundational consequences. The creation and annihilation operators act on some complex Hilbert space. Therefore, the states of this space can form superpositions. In the quantum formalism, the presence of superpositions of states lead to non-classical probabilistic effects. One of them is the interference of probabilities. In quantum physics, this interference effect can be written as a violation of the formula of total probability. The latter is one of the basic laws of CP.

    As was mentioned in the introduction, the consideration of superpositions of states is very natural for the modelling of mental phenomena. They represent uncertainty at the level of an individual decision-maker. Moreover, for superpositions, our model exhibits the aforementioned deviation from CP theory, in the form of the violation of the formula of total probability; see §5c.

    We also remark that the use of superpositions as initial states generates oscillations (figures 13, §4) which are absent for dynamics starting with states representing ‘definite strategies’, [19].

    Figure 1.

    Figure 1. The DFs n1(t) (a,b) and n2(t) (c,d) for parameters Inline Formula, N1=0, N2=1, μex=500, μcoop=0 and for Inline Formula (a,c) and Inline Formula (b,d). (Online version in colour.)

    In short, the use of the quantum formalism for DM can be justified as follows:

    • — It provides a powerful operational tool for the representation of reflections of decision-makers.

    • — It provides the unique possibility to represent uncertainty in the mental representation of the decision problem of an individual decision-maker.8

    3. Quantum field theory-inspired model of decision-making in two players game and its dynamics

    In this section, we will discuss the details of our (QFT-inspired) model, constructing first the vectors of the players, the Hamiltonian of the system and deducing out of it, the differential equations of motion and their solution, and we will be particularly interested in its asymptotic (in time t) behaviour.

    In our game, we have two players, Inline Formula and Inline Formula. Each player could operate, at t=0, two possible choices, 0 and 1. Hence, we have four different possibilities, to which, following Bagarello [19], we associate four different and mutually orthogonal vectors in a four-dimensional Hilbert space Inline Formula. These vectors are φ0,0, φ1,0, φ0,1 and φ1,1. The first vector, φ0,0, describes the fact that, at t=0, the two players have both chosen 0 (0102). Of course, such a choice can change during the time evolution of the system. Analogously, φ0,1 describes the fact that, at t=0, the first player has chosen 0, while the second has chosen 1 (0112). And so on. We see that Inline Formula is an orthonormal basis for Inline Formula. The general mental state vector of the system Inline Formula (i.e. of the two players), for t=0, is a linear combination

    Display Formula
    3.1
    where it is natural to assume that Inline Formula in order to normalize the total probability. Indeed, for instance, we interpret |α0,0|2 as the probability that Inline Formula is, at t=0, in a state φ0,0, i.e. that both Inline Formula and Inline Formula have chosen 0. Analogous interpretations can be given to the other coefficients.

    We construct the states φk,l by using two fermionic operators, i.e. two operators b1 and b2, satisfying the following canonical anti-commutation rules (CAR):

    Display Formula
    3.2
    where k,l=0,1. Here, Inline Formula is the identity operator and {x,y}=xy+yx is the anticommutator between x and y. Then, we take φ0,0 as the vacuum of b1 and b2: b1φ0,0=b2φ0,0=0, and build up the other vectors out of it:
    Display Formula

    Remarks 3.1

    (1) Note that Inline Formula could be equivalently constructed starting from the vector Inline Formula which is annihilated by Inline Formula and Inline Formula, and then constructing the other vectors of Inline Formula by acting on φ1,1 with b1 and b2.

    (2) The reason why we introduce the operators in (3.2) is because they are used to write down a Hamiltonian-like operator for the system Inline Formula, from which we deduce the dynamics of the observables of Inline Formula, i.e. the variables needed to describe Inline Formula, and Inline Formula in particular, via the Heisenberg rule.

    An explicit representation of these vectors and operators can be found in many textbooks in quantum mechanics [48]. For instance, Inline Formula, where Inline Formula and Inline Formula. Then,

    Display Formula
    and so on. The matrix form of the operators bj and Inline Formula are also quite simple. For instance,
    Display Formula
    and so on.

    Let now Inline Formula be the number operator of the jth player: the CAR above implies that Inline Formula and Inline Formula, k,l=0,1. Then, because of what we discussed before, the eigenvalues of these operators correspond to the choice operated by the two players at t=0. For instance, φ1,0 corresponds to the choice 1102, just because ‘one’ is the eigenvalue of Inline Formula and ‘zero’ is the eigenvalue of Inline Formula. It is natural, therefore, to call Inline Formula and Inline Formula the strategy operators (at t=0). Moreover, as bj and Inline Formula modify the attitude of Inline Formula, they can be called the reflection operators. We repeat that fermionic operators are in use, because the eigenvalues of Inline Formula are exactly 0 and 1, which are the only possible choices of the players of our game.

    Our main effort now consists in giving a dynamics to the number operators Inline Formula, following the scheme described in [18]. Therefore, what we first need is to introduce a Hamiltonian H for the system. Then, we will use this Hamiltonian to deduce the dynamics of the number operators as Inline Formula, and finally we will compute the mean values of these operators on some suitable state which describes (see below) the status of the system at t=0. We refer the reader to [18,19] for the details of our construction. Here, we just recall that H is the Hamiltonian of an open system, because the two players Inline Formula and Inline Formula, in order to take their decision, need also to interact with their environments Inline Formula and Inline Formula. Contrary to what happens for the players, whose situation can be described in a simple four-dimensional Hilbert space, these environments are naturally defined in an infinite-dimensional Hilbert space. For this reason, they can be thought to describe some subsystem with infinite (or very many) degrees of freedom, as the neurons in the brain, for instance. If we adopt this interpretation, Inline Formula can be seen as the neural system of Inline Formula, j=1,2.

    The full Hamiltonian H [19], is the following:

    Display Formula
    3.3

    Here, ωj, λj, μex and μcoop are real quantities, and Ωj(k) are real functions. In analogy with the bj's, we use fermionic operators Bj(k) and Inline Formula to describe the environment:

    Display Formula
    3.4
    which have to be added to those in (3.2). Moreover, each Inline Formula anti-commutes with each Inline Formula: Inline Formula for all j, l and k. Here, X stands for X or X. The various terms of H can be understood as follows: (i) H0 is the free Hamiltonian, which produces no time evolution for the strategy operators Inline Formula. This is because Inline Formula, and because H reduces to H0 in the absence of interactions (i.e. when λj=μex=μcoop=0). This is in agreement with our idea that the strategies of Inline Formula and Inline Formula can be modified only in the presence of interactions. (ii) HI describes the interaction between the players and their neural systems. Of course, the one discussed here is a special kind of interaction, which is useful because it produces an analytical solution for the time evolution of (the mean values of) the strategy operators. Other choices could be considered, but these would, quite likely, break down this nice aspect of the model. (iii) Hint describes two different interactions between Inline Formula and Inline Formula. When μcoop=0, the two players act differently, while they behave in the same way when μex=0. Of course, when both μcoop and μex are not zero, the dynamics are even richer. We refer the reader to [19] for more details on (3.3).

    The Heisenberg equations of motion Inline Formula can now be deduced by using the CAR (3.2) and (3.4) and using H given in (3.3):

    Display Formula
    3.5
    j=1,2. The solution of this system of equations has been found similarly to [19], and it looks like
    Display Formula
    3.6
    where we have introduced the following quantities:
    Display Formula
    and where Ωj(k)=Ωjk, Ωj>0, Inline Formula and Inline Formula, j=1,2.

    As already stated, the next step consists in taking the average of the time evolution of the strategy operators, Inline Formula, on a state over the full system Inline Formula, where Inline Formula has already been introduced, Inline Formula, and Inline Formula. These states are assumed to be tensor products of vector states for Inline Formula and states on the environment in the following way: for each operator of the form Inline Formula, Inline Formula being an operator of Inline Formula and Inline Formula an operator of the environment, we have

    Display Formula
    Here, Ψ0 is the vector introduced in (3.1), while Inline Formula is a state satisfying the following standard properties [49]:
    Display Formula
    3.7
    for some constant Nj. Also, Inline Formula for all j and l.

    Remark 3.2

    At first sight, this expression for the state on Inline Formula introduces a sort of asymmetry between Inline Formula and Inline Formula, because their states look to be of a different nature: the one over Inline Formula is a vector state, while the one over Inline Formula is a linear positive functional over the algebra of the fermionic operators Bj(k) and Inline Formula. This is not really surprising, because Inline Formula involves just two degrees of freedom, while Inline Formula involves an infinite number of degrees of freedom. Nevertheless, using the so-called Gelfand–Naimark–Segal construction [45], it is possible to present Inline Formula as a vector state.

    After a few computations, noting V (t)=eiUt and calling Vk,l(t) its (k,l)-matrix element, we deduce the following general formulae for the DFs of Inline Formula and Inline Formula, which extend those found in [19] in the absence of interference:

    Display Formula
    3.8
    Here, we have introduced
    Display Formula
    3.9
    Display Formula
    3.10
    Display Formula
    3.11
    In formula (3.8), we have clearly divided contributions of three different natures: Inline Formula contains contributions only due to the players Inline Formula and Inline Formula. Their analytic expressions become particularly simple if the initial state Ψ0 is just one of the vectors φj,k, i.e. if all the coefficients αk,l in (3.1) are zero, except one. In this particular situation, all the contributions in Inline Formula, which again only refer to Inline Formula and Inline Formula, are zero. For this reason, we call Inline Formula and Inline Formulainterference terms: they are only present if Ψ0 is some superposition of eigenvectors of the (time zero) number operators. Otherwise, they simply disappear. Finally, Inline Formula and Inline Formula arise because of the interaction of the players with the environments: as we see from (3.11), they are both zero if λ1 and λ2 in the Hamiltonian are both equal to zero, and they do not depend on the explicit form of Ψ0.

    A detailed analysis of what happens when there are no interference terms can be found in [19], where the focus was mainly on the asymptotic behaviour of the DFs in the absence of interferences. Here, on the other hand, we want to analyse what happens when the interference does exist already at t=0 (i.e. when more than just one coefficient αj,k is non-zero), and we are also interested in the behaviour of the DFs for finite time.

    4. Analysis of the results

    In the figures plotted in this section, we will call Inline Formula the following choice of parameters of H: ω1=1, ω2=2, Ω1=Ω2=0.1, λ1=λ2=0.5, and Inline Formula the second choice: ω1=0.1, ω2=0.2, Ω1=Ω2=1, λ1=1 and λ2=0.7. Also, we call Inline Formula and Inline Formula the following choices of the parameters αk,l in (3.1): Inline Formula, while Inline Formula.

    Of course, several other choices could also be considered. However, the ones we are fixing here cover already different situations. In particular, while in Inline Formula the interaction parameters λ1 and λ2 are equal, they are different in Inline Formula. Also, while in Inline Formula each ωj is bigger than each Ωk, the opposite holds for Inline Formula. This is important, because it is known that the ωj's are related to the inertia of the player Inline Formula [18,19]. Moreover, as for the choices Inline Formula and Inline Formula, the difference is clear: in Inline Formula all the coefficients in (3.1) are equal, and in particular there is no relative phase between the various φk,l's in Ψ0, whilst this is not so when adopting the choice Inline Formula. And, as we will see, this makes indeed a big difference: we clearly see that, even if (as expected from what is deduced in [19]) the asymptotic values of the two DFs appear to be independent of the choice of Inline Formula and Inline Formula, there exists a certain time window in which the choice of the αk,l's really change the behaviours of the functions. More concretely, if we add a phase in the coefficients defining the original vector Ψ0, we may observe quite large oscillations. Then, interference terms in Ψ0 make it, in general, quite difficult to get a decision. In particular, this is the effect of the relative phases in the interference coefficients (figure 1b,d), while if these coefficients have all the same phases, a decision can be reached quite soon (figure 1a,c). However, if we wait for a sufficiently long time, in both cases we reach the same final values of the DFs: the asymptotic values of the DFs only depend on the state of the environment, and not on the particular choice of Ψ0.

    These conclusions are confirmed by other choices of the state on the environment. For instance, we plot again the DFs for the choices Inline Formula (figure 2a,c) and Inline Formula (figure 2b,d) and for the same choice of the parameters as in figure 1, while the state of the environment is chosen different, because we take now N1=N2=1. We see that there is no particular difference between the two DFs n1(t) and n2(t) (this is possibly due to the fact that N1=N2). However, adding the phases to the αk,l creates, again, a lot of noise in the DM process, noise which disappears, but only after a sufficiently long time.

    Figure 2.

    Figure 2. The DFs n1(t) (a,b) and n2(t) (c,d) for parameters Inline Formula, N1=1, N2=1, μex=500, μcoop=0 and for Inline Formula (a,c) and Inline Formula (b,d). (Online version in colour.)

    It is useful to stress that, changing further the parameters of the Hamiltonian, does not really affect our conclusions. Indeed, figure 3 shows that even with different choices of the parameters in H, the choice Inline Formula with μex=100 (rather than μex=500, as in the previous figures), phases create noise, and again, this noise becomes smaller and smaller after some time.

    Figure 3.

    Figure 3. The DFs n1(t) (a,b) and n2(t) (c,d) for parameters Inline Formula, N1=0, N2=1, μex=100, μcoop=0 and for Inline Formula (a,c) and Inline Formula (b,d). (Online version in colour.)

    In this case, the existence of an asymptotic limit for the DFs is less evident, but this is only due to the small time interval considered. It is not hard to imagine that we could still recover a clear asymptotic value for nj(t) if we consider a time interval larger than just [0,0.5]. However, we will not do this here in order to keep the range of the figures uniform.

    So far, we have taken μcoop=0. The same behaviour is observed if we take μcoop≠0: we could check (but we do not include the plots here) that the choice Inline Formula is again much more noisy than Inline Formula.

    Also, no particular difference arises if we consider both μex and μcoop different from zero, especially when they are different enough (again, we do not include the plots here). However, interestingly enough, this noise is not so evident if μex and μcoop are not so different (figure 4). It seems that when the two terms in Hint (see (3.3)) act in cooperation, they become capable to filter the noise, making the effect of the phases in αk,l not so strong. This is an interesting feature, which is surely worthy of a deeper analysis.

    Figure 4.

    Figure 4. The DFs n1(t) (a,b) and n2(t) (c,d) for parameters Inline Formula, N1=0, N2=1, μex=10, μcoop=10 and for Inline Formula (a,c) and Inline Formula (b,d). (Online version in colour.)

    5. Discussion

    In this section, we shall discuss the output of our model in detail.

    (a) State's interpretation

    As is well known, the present situation in quantum foundations is characterized by a huge diversity of interpretations of the quantum state. The two main classes of interpretations correspond to the statistical and individual viewpoints on a quantum state [50] for details. By the former a quantum state encodes probabilities for the results of measurements for an ensemble of identically prepared quantum systems.

    We investigated the behaviour of the averages of the strategy operators of the two players Inline Formula and Inline Formula interacting with the corresponding environments (mental baths) Inline Formula and Inline Formula. One of the main outputs of our quantum-like model is that these averages stabilize for Inline Formula (for natural Hamiltonians describing the interaction between players and their interactions with mental baths). To interpret this result, we have to fix one of the interpretations of a quantum state.

    The straightforward interpretation of this stabilization result can be presented on the basis of the statistical interpretation of quantum states. Here, we consider a very large ensemble of pairs of players, and the averages of the strategy operators are treated as the ensemble averages. Our stabilization result implies that, on average (with respect to this ensemble), the strategies of players stabilize to fixed values in the very long run of the DM reflections (this is encoded in creation and annihilation operators forming the Hamiltonian of the game).

    Now, we switch to the individual interpretation of a quantum state. Here, the question of the interpretation of probabilities encoded in a quantum state is more complicated than in the case of the statistical interpretation. It seems that, for our applications of the mathematical formalism of quantum mechanics to the operational modelling of DM, the most natural interpretation of probability is the subjective interpretation. Recently, this interpretation became popular in quantum information theory and it is known as Quantum Bayesianism (QBism) [51,52].9

    In the QBist framework, the average stabilization output of our model can be interpreted in the following way:

    At the initial instant of time t=0, each pair of players assigns their individual subjective probabilities to selections of their strategies,

    Display Formula
    These probabilities depend on the initial state Ψ0 of the pair of players. By using the representation of Ψ0 as superposition of the ‘certainty states’ φk,l (see (3.1)) and Born's rule, these probabilities can be represented as
    Display Formula
    for j=0,1. In fact, for instance, we have p1(0)=|〈φ00,Ψ0〉|2+|〈φ01,Ψ0〉|2 and so on. If Ψ0 is non-factorizable (entangled), then p1(j) cannot be expressed solely in terms of the state of Inline Formula and vice versa. We emphasize that, in the case of entanglement, for a pure state Ψ0, the states of players are mixed states and they are represented by density operators.

    Starting with the initial state Ψ0, and hence with the probabilities pi(j), each decision maker updates continuously her state by taking into account the impact of the environment and the feedback from the update of the state of the co-player. This state dynamics studied in previous sections generates the corresponding dynamics of the subjective probabilities, Inline Formula. The essence of exploring the quantum-like model indicates that the genuine dynamics are not a dynamics of probabilities, but rather a dynamics of the state. Metaphorically, we can say that the dynamics of probabilities are a shadow of the mental state dynamics. The state dynamics can be expressed (see §3) in terms of the strategy operators.

    A direct computation shows that the strategy operators Inline Formula (at t=0) can be represented in the form (spectral decomposition)

    Display Formula
    5.1
    and
    Display Formula
    5.2
    where (|f〉〈g|)h=〈g,hf for all Inline Formula. The ones in (5.1)–(5.2) are orthogonal projectors onto the subspaces L1 with the basis (φ1,0,φ1,1) and L2 with the basis (φ0,1,φ11). Those are the specifics of the fermionic representation of the reflection and strategy operators.

    Now, with simple manipulations, we observe that

    Display Formula
    moving from the Schrödinger to the Heisenberg representation. Therefore, the averages of these operators, i=1,2, are equal to the probabilities of selecting the decision s=1:
    Display Formula
    5.3
    These are nothing else than DFs considered in §3 (see (3.8)). This clarifies the relation between the subjective probabilities and the DFs.

    (b) The process of decision-making: the subjective probability viewpoint

    Each decision-maker starts with assignment of the subjective probabilities Inline Formula where Inline Formula. Then, she begins her reflections on the possible selections of the strategies (adapted to the environment and interactions with the other decision-maker) and her subjective probabilities fluctuate; see §§3 and 4.

    We have seen that the structure of fluctuations depends crucially on the initial state Ψ0. In the process of DM, the magnitude of these fluctuations decreases and subjective probabilities stabilize to the fixed values. These are the probabilities which are used by the decision-maker to make her choice. After the determination of her subjective probabilities, she proceeds as a classical decision-maker. For the instant of time t, the odds in favour of the decision labelled by 1 are given by the proportion

    Display Formula
    5.4
    We consider the instance of time τ when the decision is made. This τ corresponds to a diminishing of the fluctuations to the minimal level. In the theoretical model, Inline Formula. However, in reality a decision-maker cannot wait for an infinite time to make the decision. The decision-maker uses some threshold for fluctuations ϵ>0 and the decision instant τ is determined by ϵ.

    If Oi(1;τ)>1, then Inline Formula makes the decision labelled by 1. If Oi(1;τ)<1, then she makes the opposite decision (if Oi(1;τ)=1, she either makes the decision randomly with probability Inline Formula or she repeats the process of DM with a modified initial state Ψ0). We remark that, in the absence of the mental baths, the subjective probabilities assigned by a decision-maker to the possible strategies would fluctuate forever, as deduced, for instance, in [20] in a different context.

    (c) Violation of the law of total probability: ‘interference of probabilities’

    As was emphasized by Feynman & Hibbs [54], the probabilistic data from interference experiments with quantum systems, e.g. the two-slit experiment, can be treated as violating the laws of CP. They pointed to a violation of the additivity of probability. Feynman's argument was reformulated by one of the coauthors of this paper in terms of conditional probabilities—as a violation of the law of total probability (LTP) [35]. In this form, this argument is nicely applicable to the process of DM. The classical Bayesian scheme of DM is fundamentally based on the LTP. Its violation leads to non-Bayesian schemes of DM and the quantum formalism provides one of such schemes.10

    We recall the LTP. Consider two random variables ξ and η taking discrete values, ξ=α1,…,αN and η=β1,…,βM. Then, in the classical probabilistic framework (the measure-theoretic model of probability [55] it can be proved that

    Display Formula
    5.5
    It is important to remark that the conditional probability is defined by the Bayes formula: p(η=β | ξ=α)=p(η=β,ξ=α)/p(ξ=α), where p(η=β,ξ=α) is the joint probability distribution of the pair of random variables. As is well known, in general, for quantum observables, the joint probability is not well defined. Therefore, the classical Bayes formula for conditional probabilities is inapplicable and one can expect that, for quantum conditional probabilities which are defined in the Hilbert space formalism, the formula (5.5) can be violated [35].

    The classical LTP in our setting can be formulated as follows. We remind that the probabilities pi(s;t),s=0,1, depend on the initial state Ψ0, i.e. in fact, we could even use the notation pi(s;t|Ψ0). Among the possible initial mental states of the pairs of players Inline Formula the states φk,m,k,m=0,1, play a very special role. Here, initially the players were completely sure in their strategies, Inline Formula with the strategy k and Inline Formula with the strategy m. In (5.5), the probabilities pi(s;t|k,m)≡pi(s;t|φk,m),s=0,1, can be identified with the conditional probabilities p(η=β|ξ=α) with α=(k,m) and β=0,1. The probabilities

    Display Formula
    5.6
    encode uncertainty in the determination of the pairs of strategies (k,m) for the initial mental state Ψ0. In (5.5), they are identified with the probabilities p(ξ=α).

    The interference-like representation of the DFs (3.8) can be rewritten as a quantum generalization of the LTP:

    Display Formula
    5.7
    Here, the phase θi(s;t | (k,m,j,n) encodes a sort of interference between the pairs of strategies (k,m) and (j,n). In our quantum model, this phase can be easily calculated by using the phases of the states Ψ0 and φk,m,φj,n.

    As was shown in [35], in the purely probabilistic framework (i.e. without direct appealing to the Hilbert space formalism) a violation of the LTP means the impossibility to embed all probabilities in (5.7), into a single CP space. Thus, the appearance of the additional interference-like term disturbing the classical LTP (see (5.7)) shows that CP has the restricted domain of applications. For our applications to DM, it is more useful to interpret this result in terms of the interrelation between classical and non-classical logics. CP theory is based on classical Boolean logic. Therefore, the impossibility of using CP implies the impossibility of using the Boolean logic. Thus, in our quantum-like model the actions of the players are not ruled solely by the laws of Boolean logics. They can make decisions with reasoning based on non-classical logic.

    6. Conclusion and perspectives

    The DM under uncertainty plays a crucial role in economics, game theory and many areas of social science. Until now, the most intensive and successful modelling of the DM process was performed on the basis of Bernoulli's idea to explore utility functions (which originated in his attempting to solve the St Petersburg paradox). Several theories have ensued since then: expected utility theory, prospect theory, cumulative prospect theory and other approaches. In spite of the aforementioned success of these utility-based models, it is impossible to ignore the growing dissatisfaction by the present state of the art in DM. In particular, the number of paradoxes have increased over the years: Allais, Ellsberg, Machina, and one recently counted 39 paradoxes [56]. Already, in subjective uncertainty utility models, the representation of the agent's utility by a function is a fuzzy problem. Individuals may use a huge variety of possible utility functions and the determination of the class of possible functions is a complicated problem [5759].

    One can treat DM as the interaction of an agent with a complex information environment which includes a variety of behavioural, economic, social and geopolitical factors as well as beliefs about states of other agents (e.g. the financial market). Therefore, it is natural to model DM as an (environment-)adaptive dynamical process. In principle, one may try to encode such a complex information environment by a single function encoding the utility of a context. However, this would definitely be a high simplification of the mathematical representation of the DM context.

    The most advanced mathematical model of interaction of a system with an environment is presented in quantum theory. The modern quantum information viewpoint on quantum theory [40,46,47] justifies the possibility to apply this formalism outside of physics. Here, the keyword is adaptivity to the environment. In this paper, we have applied the quantum model of adaptive dynamics to the modelling of the DM process. Our model is QFT-inspired: the information environment is modelled with the aid of quantum fields which are treated as operational quantities carrying information. In such a DM model, an agent does not maximize (expected) utility, but she searches for decisions matching ‘demands’ of the environment and (agent's representation) of the belief states of other agents involved in the DM process. This dynamical adaptive process is based on the representation of the DM context by a quantum pure state carrying maximal available information about the situation.

    We have found analytical and numerical solutions for operator-valued functions representing the DM process and their expectations, and we considered the problem of stabilization for Inline Formula. The output of stabilization is considered as the classical (mixed) decision strategy. Of course, the real DM process cannot take an infinite time and we are interested in approximate stabilization. We have carefully analysed the interpretational issues of our model (see §5). In particular, we have looked at the coupling between subjective probability and the private agent interpretation of quantum mechanics (QBism) [51,52].

    This paper is of a conceptual nature. Its aim is to present a quite general model of DM under uncertainty as an adaptive dynamical process of evolution of the belief state of an agent interacting with a complex information environment (e.g. the financial market). This paper also wishes to demonstrate the mathematical power of this general model. Our proposed model can be explored for a variety of problems in economics, sociology and politics. See Bagarello [1921] and Khrennikova [38]11 for concrete applications of special variations of the presented model. Others are in progress.

    Data accessibility

    This article has no additional data.

    Authors' contributions

    F.B. proposed the model and its solution. A.K. and E.H. worked on the interpretation of the model in the context of DM.

    Competing interests

    We declare we have no competing interests.

    Funding

    This work received partial financial support by the grant ‘Mathematical Modeling of Complex Hierarchic Systems’ of Linnaeus University and by the EU project ‘Quantum Information Access and Retrieval Theory’ (QUARTZ), grant no. 721321.

    Acknowledgements

    One of the authors (F.B.) acknowledges partial support from the University of Palermo and from G.N.F.M., and the discussion on the paper was started during his visit to Linnaeus University (supported by the grant ‘Mathematical Modelling of Complex Hierarchic Systems’). F.B. also wishes to thank Prof. A. Busacca for his strong support for this project.

    Footnotes

    1 We will discuss below what we mean with ‘deep uncertainty’.

    2 The notion ‘state of the world’ used in economics (and DM) and financial asset pricing can be considered as the counterpart of these physical notions [1].

    3 An arbitrage opportunity can be formally defined [5]. Essentially, an arbitrage opportunity arises when no investment needs to be made to obtain a positive cash flow. This is akin to a situation where a positive return on an investment exists without the investment itself having any risk of its own. From a risk–reward perspective, there is a positive reward for no risk.

    4 This approach was used for a wide class of problems in DM, psychology, politics and biology [1417,32,38].

    5 Another complication is that it is impossible to represent the quantum adaptive dynamics as dynamics of pure states. The impact of the environment can transfer a pure initial state into a mixed quantum state (represented by a density operator).

    6 Such an environment contains both the complex internal mental representations of agents as well as, e.g. the market situation or the current state of the political arena.

    7 It is important to stress that in quantum theory ‘interactions’ cannot be imagined as actions of forces, as we know them in classical physics. For our applications, the most useful is the information interpretation of quantum theory [46,47]. See Plotnitsky [40] for applications of the information interpretation of quantum theory to DM.

    8 We remark that CP theory also can be used to represent uncertainty. But a probability measure represents uncertainty in an ensemble, i.e. a statistical uncertainty (and not in individual setting). It seems that the quantum representation of uncertainty matches the modelling of uncertainty ‘in the head of an individual’; see §5a for further discussion.

    9 We do not claim that QBism is the proper interpretation of quantum physics. See Khrennikov [50] for reflections from one of the authors of this paper on the applicability of QBism in quantum physics. But it is very natural to use it for our purpose [33,50] for a motivation. We also remark that originally classical game theory and the theory of DM was formulated by von Neumann & Morgenstern [7] in the statistical probabilistic framework: probabilities were treated from the frequentist viewpoint. However, Savage [53] reformulated it by using the subjectivist viewpoint on probability. Nowadays, the latter is dominating the foundations of DM and the axiomatic foundation of economics, but with the strong emphasize of the role of Bayesian inference.

    10 Of course, a non-Bayesian scheme need not be precisely quantum. It is not clear whether the quantum formalism can cover all probabilistic phenomena arising in DM [22,23].

    11 We remark that the QFT-inspired model generalizes the quantum dynamical DM-model of Pothos & Busemeyer [42]. It also has close connections with the dynamical DM-model based on the quantum master equation [1417].

    One contribution of 15 to a theme issue ‘Second quantum revolution: foundational questions’.

    Published by the Royal Society. All rights reserved.

    References