Constructor theory of probability

Unitary quantum theory, having no Born Rule, is non-probabilistic. Hence the notorious problem of reconciling it with the unpredictability and appearance of stochasticity in quantum measurements. Generalizing and improving upon the so-called ‘decision-theoretic approach’, I shall recast that problem in the recently proposed constructor theory of information—where quantum theory is represented as one of a class of superinformation theories, which are local, non-probabilistic theories conforming to certain constructor-theoretic conditions. I prove that the unpredictability of measurement outcomes (to which constructor theory gives an exact meaning) necessarily arises in superinformation theories. Then I explain how the appearance of stochasticity in (finitely many) repeated measurements can arise under superinformation theories. And I establish sufficient conditions for a superinformation theory to inform decisions (made under it) as if it were probabilistic, via a Deutsch–Wallace-type argument—thus defining a class of decision-supporting superinformation theories. This broadens the domain of applicability of that argument to cover constructor-theory compliant theories. In addition, in this version some of the argument's assumptions, previously construed as merely decision-theoretic, follow from physical properties expressed by constructor-theoretic principles.


Introduction
Quantum theory without the Born Rule (hereinafter: unitary quantum theory) is deterministic [1]. Its viability as a universal physical theory has long been debated [1][2][3][4]. A contentious issue is how to reconcile its determinism with the unpredictability and appearance of stochasticity  in quantum measurements [2,4,5]. Two problems emerge: (i) how unpredictability can occur in unitary quantum theory, as, absent the Born Rule and 'collapse' processes, single measurements do not deliver single observed outcomes ( §1b), and (ii) how unitary quantum theory, being non-probabilistic, can adequately account for the appearance of stochasticity in repeated measurements ( §1c).
These problems also arise in the constructor theory of information [6] ( § §2 and 3). In that context unitary quantum theory is one of a class of superinformation theories, most of them yet to be discovered, which are elegantly characterized by a simple, exact, constructortheoretic condition. Specifically, certain physical systems permitted under such theories-called superinformation media-exhibit all the most distinctive properties of quantum systems. Like all theories conforming to the principles [7] of constructor theory, superinformation theories are expressed solely via statements about possible and impossible tasks, and are necessarily nonprobabilistic. So a task being 'possible' in this sense means that it could be performed with arbitrarily high accuracy-not that it will happen with non-zero probability. Just as for unitary quantum theory, therefore, an explanation is required for how superinformation theories could account for unpredictable measurement outcomes and apparently stochastic processes.
To provide this, I shall first provide an exact criterion for unpredictability in constructor theory ( §4); then I shall show that unpredictability necessarily arises in superinformation theories (including quantum theory) as a result of the impossibility of cloning certain states-thereby addressing problem (i). Then, I shall generalize and improve upon an existing class of proposed solutions to problem (ii) in quantum theory-known as the decision-theoretic approach [1,[8][9][10], by recasting them in constructor theory. This will entail expressing a number of physical conditions on superinformation theories for them to support the decision-theoretic approach-thus defining a class of decision-supporting superinformation theories, which include unitary quantum theory ( § §5-7). As I shall outline ( §1d), switching to constructor theory widens the domain of applicability of such approaches, to cover certain generalizations of quantum theory; it also clarifies the assumptions on which Deutsch-Wallace-type approaches are based, by revealing that most of the assumptions are not decision-theoretic, as previously thought [1,8], but physical.

(a) Constructor theory
Constructor theory is a proposed fundamental theory of physics [7], consisting of principles that underlie other physical theories (such as laws of motion of elementary particles, etc.), called subsidiary theories in this context. Its mode of explanation requires physical laws to be expressed exclusively via statements about which physical transformations (more precisely, tasks- §2) are possible, which are impossible, and why. This is a radical departure from the prevailing conception of fundamental physics, which instead expresses laws as predictions about what happens, given dynamical equations and boundary conditions in space-time.
As I shall explain ( §2), constructor theory is not just a framework (e.g. [11], or category theory [12]) for reformulating existing theories: its principles are proposed physical laws, which supplement the content of existing theories. They express regularities among subsidiary theories, including new ones that the prevailing conception cannot adequately capture. They thereby address some of those theories' unsolved problems and illuminate the theories' underlying meaning, informing the development of successors-just as, for instance, the principle of energy conservation explains invariance properties of the dynamical laws, supplementing their explanatory content without changing their predictive content, providing criteria to guide the search for future theories.
In this work, I appeal to the principles of the constructor theory of information [6]. They express the regularities in physical laws that are implicitly required by theories of information (e.g. Shannon's) as exact statements about possible and impossible tasks, thus giving a physical meaning to the hitherto fuzzily defined notion of information. Notions such as measurement and distinguishability, which are problematic to express in quantum theory, yet are essential to the decision-theoretic approach, can be exactly expressed in constructor theory.

(b) Unpredictability
The distinction between unpredictability and randomness is difficult to pin down in quantum theory, especially unitary quantum theory, but it can be naturally expressed in the more general context of constructor theory. Unpredictability occurs in quantum systems even given perfect knowledge of dynamical laws and initial conditions. 1 When a perfect measurer of a quantum observablê X-say the x-component of the spin of a spin-1/2 particle-is presented with the particle in a superposition (or mixture) of eigenvectors ofX, say |0 and |1 , it is impossible to predict reliably which outcome (0 or 1) will be observed. But in unitary quantum theory, a perfect measurement ofX is merely a unitary transformation on the source S a (the system to be measured) and the target S b (the 'tape' of the measurer): which implies, by the linearity of quantum theory, for arbitrary complex amplitudes α, β. As no wavefunction collapse occurs, there is no single 'observed outcome'. All possible outcomes occur simultaneously: in what sense, then, are they unpredictable? Additional explanations are needed-e.g. Everett's [13] is that the observer differentiates into multiple instances, each observing a different outcome, whence the impossibility of predicting which one [1,5]. Such accounts, however, can only ever be approximate in quantum theory, as they rely on emergent notions such as observed outcomes and 'universes'. Also, unpredictability is a counterfactual property: it is not about what will happen, but what cannot be made to happen. So, while the prevailing conception struggles to accommodate it, constructor theory does so naturally. Just as the impossibility of cloning a set of non-orthogonal quantum states is an exact property [14], I shall express unpredictability exactly as a consequence of the impossibility of cloning certain sets of states under superinformation theories ( §4). This distinguishes it from (apparent) randomness, which, as I shall explain, requires a quantitative explanation.

(c) The appearance of stochasticity
Another key finding of this paper is a sufficient set of conditions for superinformation theories to support a generalization of the decision-theory approach to probability, thereby explaining the appearance of stochasticity, namely that repeated identical measurements not only have different unpredictable outcomes but are also, to all appearances, random. Specifically, consider the frequencies of each observed outcome 2 x in multiple measurements of a quantum observableX on N systems each prepared in a superposition or mixture ρ ofX-eigenstates |x . The appearance of stochasticity is that, for sufficiently large N, the frequencies do not differ significantly (according to some a priori fixed statistical test) from the numbers Tr{ρ|x x|} (equality occurring in the limiting case of an ensemble ( §6)).
To account for this, the Born Rule states that the probability that x is the outcome of any individualX-measurement is Tr{ρ|x x|}, thus linking, by fiat, Tr{ρ|x x|} with the frequencies in finite sequences of experiments. In unitary quantum theory no such link can, prima facie, exist, as all possible outcomes occur in reality. How can that theory inform an expectation about finite sequences of experiments, as its Born-Rule-endowed counterpart can?
The decision-theoretic approach claims to explain how [1,[8][9][10][15][16][17]. It models measurements as deterministic games of chance:X is measured on a superposition or mixture ρ ofX-eigenstates; the reward is equal (in some currency) to the observed outcome. Thus, the problem is recast as that of how unitary quantum theory can inform decisions of a hypothetical rational player of that game, satisfying only non-probabilistic axioms of rationality. The decision-theory argument shows that the player, knowing unitary quantum theory (with no Born Rule) and the state ρ, reaches the same decision, in situations where the Born Rule would apply, as if they were informed by a stochastic theory with Born-Rule probabilities Tr{ρ|x x|}. This explains how Tr{ρ|x x|} can inform expectations in single measurements under unitary quantum theory. One must additionally prove, from this, that unitary quantum theory is as testable as its Born-Rule-endowed counterpart [1,15,18].
Thus, the decision-theoretic approach claims to explain the appearance of stochasticity in unitary quantum theory without invoking stochastic laws (or axioms), rather as Darwin's theory of evolution explains the appearance of design in biological adaptations without invoking a designer. It has been challenged, especially in regard to testability [4,19], and defended in, for example, [1,15,18]. Note that this work is not a defence of that approach; rather, it aims at clarifying and illuminating its assumptions (showing that most of them are physical) and at broadening its domain of applicability to more general theories than quantum theory.
In my generalized version of the decision-theoretic approach, I shall define a game of chance under superinformation theories ( §7) and then identify a sufficient set of conditions for them to support decisions (under that approach) in the presence of unpredictability ( §6). These conditions define the class of decision-supporting superinformation theories (including unitary quantum theory). Specifically, they include conditions for superinformation theories to support the generalization f x of the numbers Tr{ρ|x x|} ( §5) corresponding to Born-Rule probabilities. That is to say, my version of the decision-theory argument explains how the numbers f x can inform decisions of a player satisfying non-probabilistic rationality axioms under certain superinformation theories ( §7). Those theories would account for the appearance of stochasticity at least as adequately as unitary quantum theory.

(d) Summary of the main results
Switching to constructor theory yields three interrelated results: (1) The unpredictability of measurements in superinformation theories is exactly distinguished from the appearance of stochasticity, and proved to follow from the constructor-theory generalization of the quantum no-cloning theorem ( §4). (2) A sufficient set of conditions for superinformation theories to support the decision-theoretic argument ( § §5-7) is provided, defining a class of decision-supporting superinformation theories, including unitary quantum theory. Constructor theory emancipates the argument from formalisms and concepts specific to (Everettian) quantum theory-such as 'observed outcomes' or 'relative states'. (3) Most premises of the decision-theory argument are no longer controversial decisiontheoretic axioms, as in existing formulations, but follow from physical properties implied by exact principles of constructor theory.
In § §2 and 3, I summarize as much of constructor theory as is needed; in §4, I present the criterion for unpredictability; in § §5 and 6, I give the condition for superinformation theories to permit the constructor-theoretic generalization of the numbers f x = Tr{ρ|x x|}; in §7, I present the decision-theory argument in constructor theory.

Constructor theory
In constructor-theoretic physics the primitive notion of a 'physical system' is replaced by the slightly different notion of a substrate-a physical system some of whose properties can be changed by a physical transformation. Constructor theory's primitive elements are tasks (as defined below), which intuitively can be thought of as the specifications of physical transformations affecting substrates. Its laws take the form of conditions on possible/impossible tasks on substrates allowed by subsidiary theories. As tasks involving individual states are rarely fundamental, more general descriptors for substrates are convenient.

(a) Attributes and variables
The subsidiary theory must provide a collection of states, attributes and variables, for any given substrate. These are physical properties of the substrate and can be represented in several interrelated ways. For example, a traffic light is a substrate, each of whose eight states (of three lamps, each of which can be on or off) is labelled by a binary string (σ r , σ a , σ g ): σ i ∈ {0, 1}, ∀i ∈ {r, a, g}, where, say, σ r = 0 indicates that the red lamp is off, and σ r = 1 that it is on. Similarly, for i = a (amber) and i = g (green). Thus, for instance, the state where the red lamp is on and the others off is (1, 0, 0).
An attribute is any property of a substrate that can be formally defined as a set of all the states in which the substrate has that property. So, for example, the attribute red of the traffic light, denoted by r, is the set of all states in which the red lamp is on: An intrinsic attribute is one that can be specified without referring to any other specific system. For example, 'having the same colour lamp on' is an intrinsic attribute of a pair of traffic lights, but 'having the same colour lamp on as the other one in the pair' is not an intrinsic attribute of either of them. In quantum theory, 'being entangled with each other' is, likewise, an intrinsic attribute of a qubit pair; 'having a particular density operator' is an intrinsic attribute of a qubit that has, for instance, undergone an entangling interaction. The rest of the quantum state, in the Heisenberg picture, describes entanglement with the other systems that the qubit has interacted with, and so is not an intrinsic attribute of the qubit.
A physical variable is defined in a slightly unfamiliar way as any set of disjoint attributes of the same substrate. In quantum theory, this includes not only all observables (which are representable as Hermitian operators), but many other constructs, such as any set {x, y} where x and y are the attributes of being, respectively, in distinct non-orthogonal states |x and |y of a quantum system-i.e. the eigenvalues of two non-commuting observables. Whenever a substrate is in a state in an attribute x ∈ X, where X is a variable, we say that X is sharp (on that system), with the value x-where the x are members of the set X of labels 3 of the attributes in X. As a shorthand, 'X is sharp in a' shall mean that the attribute a is a subset of some attribute in X. In the case of the traffic light, 'whether some lamp is on' is the variable P = {off , on}, where I have introduced the attributes off = {(0, 0, 0)} and on, which contains all the states where at least one lamp is on. So, when the traffic light is, say, in the state (1, 0, 0) where only the red lamp is on, we say that 'P is sharp with value on'. Also, we say that P is sharp in the attribute r (red, defined above), with value on-which means that r ⊆ on. In quantum theory, a substrate can be a quantum spin-1/2 particle-e.g. an electron. The z-component of the spin is a variable, represented as the set of two intrinsic attributes: that of the z-component of the spin being 1/2 and −1/2. That variable is sharp when the qubit is in a pure eigenstate of the observable corresponding to the z-component of the spin and is non-sharp otherwise.

(b) Tasks
A task is the abstract specification of a physical transformation on a substrate, which is transformed from having some physical attribute to having another. It is expressed as a set of ordered pairs of input/output attributes x i → y i of the substrates. I shall represent it as: 4 A = {x 1 → y 1 , x 2 → y 2 , . . .}. 3 I shall always define symbols explicitly in their contexts, but for added clarity I use the convention: small Greek letters (γράμματα) denote states; small italic boldface denotes attributes; CAPITAL ITALIC BOLDFACE denotes variables; small italic denotes labels; CAPITAL ITALIC denotes sets of labels; CAPITAL BOLDFACE denotes physical systems; and capital letters with arrow above (e.g. C) denote constructors.
The {x i } are the legitimate input attributes; the {y i } are the output attributes. A constructor for the task A is defined as a physical system that would cause A to occur on the substrates and would remain unchanged in its ability to cause that again. Schematically, Input attribute of substrates Constructor − −−−−−− → Output attribute of substrates, where constructor and substrates jointly are isolated. This scheme draws upon two primitive notions that must be given physical meanings by the subsidiary theories, namely: the substrates with the input attribute are presented to the constructor, which delivers the substrates with the output attribute. A constructor is capable of performing A if, whenever presented with the substrates with a legitimate input attribute of A (i.e. in any state in that attribute), it delivers them in some state in one of the corresponding output attributes, regardless of how it acts on the substrate with any other attribute. A task on the traffic light substrate is {on → off }; and a constructor for it is a device that must switch off all its lamps whenever presented when any of the states in on. In the case of the task {off → on}, it is enough that, when the traffic light as a whole is switched off (in the state (0, 0, 0)), it delivers some state in the attribute on-say by switching on the red lamp only, delivering the state (1, 0, 0)-not necessarily all of them. In quantum information, for instance, a unitary quantum gate can be thought of as implementing a one-to-one possible task on the qubits that it acts on-its substrates. The physical system implementing the gate and the substrates constitute an isolated system: the gate is the same after the task as before. (Impossible tasks do not have constructors and thus cannot be thought of as corresponding to gates obeying quantum theory.)

(c) The fundamental principle
A task T is impossible if there is a law of physics that forbids it being carried out with arbitrary accuracy and reliability by a constructor. Otherwise, T is possible, which I shall denote by T . This means that a constructor capable of performing T can be physically realized with arbitrary accuracy and reliability (short of perfection). Catalysts and computers are examples of approximations to constructors. So, 'T is possible' means that T can be brought about with arbitrary accuracy, but it does not imply that it will happen, as it does not imply that a constructor for it will ever be built and presented with the right substrate. Conversely, a prediction that T will happen with some probability would not imply T's possibility: that 'rolling a seven' sometimes happens when shooting dice does not imply that the task 'roll a seven under the rules of that game' can be performed with arbitrarily high accuracy.
Non-probabilistic, counterfactual properties-i.e. about what does not happen, but could-are central to constructor theory's mode of explanation, as expressed by its fundamental principle: I. All (other) laws of physics are expressible solely in terms of statements about which tasks are possible, which are impossible, and why.
The radically different mode of explanation employed by this principle permits the formulation of new laws of physics (e.g. constructor information theory's ones). Thus, constructor theory differs in motivation and content from existing operational frameworks, such as resource theory [11], which aims at proving theorems following from subsidiary theories, allowing their formal properties to be expressed in the resource-theoretic formalism. Constructor theory, by contrast, proposes new principles, not derivable from subsidiary theories, to supplement them, elucidate their physical meaning and impose severe restrictions ruling out some of them. In addition, while resource theory focuses on allowed/forbidden processes under certain dynamical laws, constructor theory's main objects are impossible and possible tasks; the latter are not just allowed processes: they require a constructor to be possible. However, resource theory might be applicable to express certain constructor-theoretic concepts. For instance, as remarked, the notion of a chemical catalyst, as recently formalized in resource theory [20], is related to that of a constructor. A constructor is distinguished among general catalyst-type objects in that it is required to be capable of performing the task reliably, repeatedly and with perfect accuracy. Hence an ideal constructor is not itself a physical object, but the limiting case of an infinite sequence of possible physical objects. Each element of the sequence is an approximation to a constructor, performing the task to some finite accuracy. For example, the task of copying a string of letters is a possible task; a perfect copier is never realized in reality; but one can in principle build increasingly accurate approximations to it: a template copier is inaccurate; for higher accuracies, one would need to include some error-correction mechanism. Thus, that a task is possible is a manner of speaking about the realizability of each element of such a sequence, except for the limiting, perfect constructor that is never physically realizable, because of the inevitability of errors and deterioration under our physical laws.
Hence principle I requires subsidiary theories to have two crucial properties (holding in unitary quantum theory): (i) they must support a topology on the set of physical processes they apply to, which gives a meaning to a sequence of approximate constructions, converging to an exact performance of T; (ii) they must be non-probabilistic-as they must be expressed exclusively as statements about possible/impossible tasks. For instance, the Born-Rule-endowed versions of quantum theory, being probabilistic, do not obey the principle.
(d) Principle of locality S 1 ⊕ S 2 is the substrate consisting of substrates S 1 and S 2 . Constructor theory requires subsidiary theories to provide the following support for such a combination. First, if subsidiary theories designate any task as possible which has S 1 ⊕ S 2 as the input substrate, they must provide a meaning for presenting S 1 and S 2 to the relevant constructor as the substrate S 1 ⊕ S 2 . Second, they must conform to Einstein's [21] principle of locality in the form: II. There exists a mode of description such that the state of S 1 ⊕ S 2 is the pair (ξ , ζ ) of the states 5 ξ of S 1 and ζ of S 2 , and any construction undergone by S 1 and not S 2 can change only ξ and not ζ .
Unitary quantum theory satisfies principle II, as is explicit in the Heisenberg picture [22,23]. In that picture, the state of a quantum system is, at any one time, a minimal set of generators for the algebra of observables of that system, plus the Heisenberg state [24,25]. As the latter never changes, it can be abstracted away when specifying tasks: any residual 'non-locality' in that state [26] does not prevent quantum theory from satisfying principle II.
The parallel composition A ⊗ B of two tasks A and B is the task whose net effect on a substrate M ⊕ N is that of performing A on M and B on N. When A ⊗ T is possible for some task T on some generic, naturally occurring substrate (as defined in [6]), A is possible with side effects, which is written A . (T represents the side effect.) Note that in quantum theory the constructor performing a possible task A ⊗ B on M ⊕ N may generate entanglement between the two substrates. For example, consider two tasks A and B including only one pair of input/output attributes on some qubit, and let a be the output attribute of A, b that of B. LetP a be the quantum projector for a qubit to have the attribute a andP b for attribute b. The constructor performing A ⊗ B is required to deliver the two qubits in a quantum state in the +1-eigenspace of the projectorP a ⊗P b (where the small ⊗ denotes here the tensor product between quantum operators). Such eigenspaces in general include states describing entangled qubits.

Constructor theory of information
I shall now summarize the principles of the constructor theory of information [6]. These express exactly the properties required of physical laws by theories of (classical) information, computation and communication-such as the possibility of copying-as well as the exact relation between what has been called informally 'quantum information' and 'classical information'. 6 First, one defines computation media. 7 A computation medium with computation variable V (at least two of whose attributes have labels in a set V) is a substrate on which the task Π (V) of performing a permutation Π defined via the labels V is possible (with or without side effects), for all Π . Π (V) is a reversible computation. 8 Information media are computation media on which additional tasks are possible. Specifically, a variable X is clonable if for some attribute namely cloning X is possible (with or without side effects). 9 An information medium is a substrate with at least one clonable computation variable, called an information variable (whose attributes are called information attributes). For instance, a qubit is a computation medium with any set of two pure states, even if they are not orthogonal [6]; with orthogonal states it is an information medium. Information media must also obey the principles of constructor information theory, which I shall now recall.

(a) Interoperability
Let X 1 and X 2 be variables of substrates S 1 and S 2 , respectively, and X 1 × X 2 be the variable of S 1 ⊕ S 2 whose attributes are labelled by the ordered pair (x, x ) ∈ X 1 × X 2 , where X 1 and X 2 are the sets of labels of X 1 and X 2 , respectively, and × denotes the Cartesian product of sets. The interoperability principle is elegantly expressed as a constraint on the composite system of information media (and on their information variables): III. The combination of two information media with information variables X 1 and X 2 is an information medium with information variable X 1 × X 2 .

(b) Distinguishing and measuring
These are expressed exactly in constructor theory as tasks involving information variableswithout reference to any a priori notion of information. A variable X is measurable if a special case of the distinguishing task (3.2) is possible (with or without side effects)-namely, when the original source substrate continues to exist 11 in some attribute y x and the result is stored in a target substrate: where x 0 is a generic, 'receptive' attribute and 'X' = {'x': x ∈ X} is an information variable of the target substrate, called the output variable (which may, but need not, contain x 0 ). When X is sharp on the source with any value x, the target is changed to having the information attribute 'x', meaning S had attribute x .
A measurer of X is any constructor capable of performing the task (3.3) for some choice of its output variable, labelling and receptive state. 12 Thus, it also is a measurer of other variables: for example, it measures any subset of X, or any coarsening of X (a variable whose members are unions of attributes in X). Two notable coarsenings of X 1 × X 2 are: X 1 + X 2 , where the attributes (x 1 , x 2 ) are re-labelled with numbers x 1 + x 2 (and combined accordingly), and X 1 X 2 , where the attributes (x 1 , x 2 ) are re-labelled with numbers x 1 x 2 (and likewise combined). I shall consider only non-perturbing measurements, i.e. y x ⊆ x in (3.3). Whenever the output variable is guaranteed to be sharp with a value 'x', I shall say, with a slight abuse of terminology, that the measurer of X "delivers a sharp output 'x'".

(c) The 'bar' operation
Given an information attribute x, define the attributex (x-bar) as the union a : a⊥x a of all attributes that are distinguishable from x. With this useful tool one can construct a Boolean information variable, defined as {x,x} (which, as explained below, is a generalization of quantum projectors). Also, for any variable X, define the attribute u X .
= x∈X x. The attributeū X is the constructor-theoretic generalization of the subspace spanned by a set of quantum states. For example, consider an information variable X = {0, 1} where 0 and 1 are the attributes of being in particular eigenstates of a non-degenerate quantum observableX (which also has other eigenstates). Then,ū X is the attribute of being in any of the possible superpositions and mixtures (prepared by any possible preparation 13 ) of those two eigenstates ofX.

(d) Consistency of measurement
In quantum theory repeated measurements of physical properties are consistent in the following sense. Consider the variable X = {0, 1} defined above. Let 2 be the attribute of being in a particular eigenstate ofX orthogonal to both 0 and 1. All measurers of the variable Z = {u X , 2} are then also measurers of the variable Z = {ū X , 2}, so that all measurers of the former, when given any attribute a ⊆ū X , will give the same sharp output 'u X '. The principle of consistency of measurement requires all subsidiary theories to have this property: IV. Whenever a measurer of a variable Z would deliver a sharp output when presented with an attribute a ⊆ū Z , all other measurers of Z would too.
It follows [6] that they would all deliver the same sharp output.

(e) Observables
As (from the definition of 'bar')x ≡x, attributes x with x =x have a useful property, whence the following constructor-theoretic generalization of quantum information observables: an information observable X is an information variable such that whenever a measurer of X delivers a sharp output 'x' the input substrate really has the attribute x. 14 A necessary and sufficient condition for a variable to be an observable is that x =x for all its attributes x. For example, the above-defined variable Z = {u X , 2} is not an observable (a Z-measurer delivers a sharp output 'u X ' even when presented with a state ξ ∈ū X \u X , where '\' denotes set exclusion), but {ū X , 2} is.

(f) Superinformation media
A superinformation medium S is an information medium with at least two information observables, X and Y, that contain only mutually disjoint attributes and whose union is not an information observable. Y and X are called complementary observables. For example, any pair of orthogonal states of a qubit constitutes an information observable, but no union of two or more such pairs does: its members are not all distinguishable. Superinformation theories are subsidiary theories obeying constructor theory and permitting superinformation media. From that simple property it follows that superinformation media exhibit all the most distinctive properties of quantum systems [6]. In particular, the attributes y in Y are the constructor-theoretic generalizations of what in quantum theory is called 'being in a superposition or mixture' of states in the complementary observable X.

(g) Generalized mixtures
Consider an attribute y ∈ Y and define the observable X y . = {x ∈ X : x ⊥y}. (In quantum theory X could be the photon number observable in some cavity, |1 1| + 2|2 2| + · · · , and y the attribute of being in some superposition or mixture of some of its eigenstates, e.g. (1/ √ 2)(|0 + |1 ). In that case X y would contain two attributes, namely those of being in the states |0 0| and |1 1|, respectively.) One proves [6] that: (1) X is non-sharp in y as x ∩ y = o, ∀x ∈ X (where ' o' denotes the empty set), and X y contains at least two attributes. (2) Some coarsenings of X are sharp in y, just as in quantum theory-where the state (1/ √ 2)(|0 + |1 ) is in the +1-eigenspace of the projector |0 0| + |1 1|. The observable {ū X y ,ū X y }, u X y = x∈X y x, is the constructor-theoretic generalization of such a projector, and it is sharp in y, with valueū X y . As in quantum theory, any measurer of X presented with y, followed by a computation whose output is «whether the outcome was one of the 'x' with x ⊥y», will provide a sharp output 'ū X y ', corresponding to «yes». (I adopt the convention that a 'quoted' attribute is the one that would be delivered by a measurement of the un-quoted one, with suitable labelling. Likewise for variables.) In summary:

Quantum Theory
Constructor Theory |y is an eigenstate of an observablê   (1) and (2) as y does, as follows: Let 'X' y . = {'x' ∈ 'X': x ∈ X y }. (i) 'X' y is not sharp in b y . (If it were, with value 'x', that would imply, via the property of observables, that y ⊆ x, contrary to the defining property that , then y could be distinguished from x, contrary to assumption). (iii) b y ⊆ū 'X' y . For a measurer of {ū 'X' y ,ū 'X' y } applied to the target substrate of an X-measurer is also a measurer of {ū X y ,ū X y }; hence, when presented with y, it must deliver a sharp output 'ū 'X' y '. By the property of observables, {ū 'X' y ,ū 'X' y } must be sharp in b y , with valueū 'X' y . By the same argument, X y is not sharp in a y , i.e. a y ∩ x = o; also a y ⊥x, ∀x ∈ X y ; and {ū X y ,ū X y } is sharp in a y with valueū X y .

(h) Intrinsic parts of attributes
The attributes a y and b y are not intrinsic, for each depends on the history of interactions with other systems. (In quantum theory, S a and S b are entangled.) However, because of the principle of locality, given an information observable X, one can define the X-intrinsic part [a y ] X of the attribute a y as follows. Consider the attribute (a y , b y ) prepared by measuring X on system S a using some particular substrate S b as the target substrate. In each such preparation, S a will have the same intrinsic attribute [a y ] X , which I shall call the X-intrinsic part of a y , which is therefore the union of all the attributes preparable in that way. The same construction defines the 'X'-intrinsic part It follows, from the corresponding property of a y : that {ū X y ,ū X y } is sharp in [a y ] X with valuē u X y ; that [a y ] x ∩ x=φ; that [a y ] X ⊥x, ∀x ∈ X y . Similarly for the 'quoted' variables and attributes. In quantum theory, [a y ] X and [b y ] 'X' are attributes of having the reduced density matrices on S a and S b . Unlike in [30], they are not given any probabilistic interpretation. They are merely local descriptors of locally accessible information (defined deterministically in constructor theory [6]).

(i) Successive measurements
In unitary quantum theory, the consistency of measurement (see above) is the feature that when successive measurers ofX are applied to the same source initially in the state (α|0 + β|1 ), with the projector for «whether the two target substrates hold the same value» is sharp with value 1. In constructor theory, the generalization of that property is required to hold. Define a useful device, the X-comparer 16 C X . It is a constructor for the task of comparing two instances of a substrate in regard to an observable X defined on each: where The fact that a quantum C X would deliver a sharp «yes» if presented with the target substrates of successive measurements of an observable on the same source is what makes 'relative states' and 'universes' meaningful in Everettian quantum theory, because it makes the notion of 'observed outcome in a universe' meaningful even when the input variable X of the measurer is not sharp. The same holds in superinformation theories (figure 2).

Unpredictability in superinformation media
I can now define unpredictability exactly in constructor theory, and show how it arises in superinformation media.

(a) X-predictor
An X-predictor for the output of an X-measurer whose input attribute z is drawn from some variable Z (in short: 'X-predictor for Z') is a constructor for the task: where P = {p z } is an information observable whose attributes p z -each representing the prediction «the outcome of the X-measurer will be 'x' given the attribute z as input»-are required to satisfy the network of constructions in figure 3. B first prepares S a with the information attribute z ∈ Z specified by some information attribute s z ; then the X-measurer M X is applied to S a ; and then its target S b and the output of the predictor, p z , are presented to an 'X'-comparer C 'X' . If that delivers a sharp «yes», the prediction p z is confirmed. If it would be confirmed for all z ∈ Z, then P X is an X-predictor for Z. The exact definition of unpredictability is then: A substrate exhibits unpredictability if, for some observable X, there is a variable Z such that an X-predictor for Z is impossible.
Hence unpredictability is the impossibility of an X-predictor for a variable Z. Note the similarity to 'no-cloning', i.e. the impossibility of a constructor for the cloning task (3.2) on the variable Z.

(b) No-cloning implies unpredictability
Indeed, I shall now show that superinformation theories (and thus unitary quantum theory) exhibit unpredictability as a consequence of the impossibility of cloning certain sets of attributes.
Consider two complementary observables X and Y of a superinformation medium and define the variable Z = X y ∪ {y}. I show that there cannot be an X-predictor for Z. For suppose there were. The predictor's output information observable P would have to include the observable 'X' y . For, if z = x for some x ∈ X y , 'X' y has to be sharp on the target S b of the measurer with value 'x'; so the 'X'-comparer yielding a sharp «yes» would require p x = 'x'. (See §3: C 'X' is a measurer of {'x', 'x'} when 'X' is sharp on one of its sources with value 'x'.) When z = y, the X-predictor's output attribute p y must still cause the X-comparer to output the sharp outcome «yes»; also, P = 'X' y ∪ {p y } is required to be an information variable: hence either p y = 'x' for some 'x' ∈ 'X' y ; or 'x' ∈ū 'X' y . In the former case, again by considering C 'X' as a measurer of {'x', 'x'}, 'X' y would have to be sharp on the target S b of the X-measurer, with the value 'x'; whence y ⊆ x, contrary to the definition of superinformation. In the latter, as y ⊂ū X y , S b would have the attributeū 'X' y ( §3) so that the 'X'-comparer would have to output a sharp «no», again contradicting the assumptions. So, there cannot exist an X-predictor for Z, just as there cannot be a cloner for Z, because Z is not an information variable.
Thus, unpredictability is predicted by the superinformation theory's deterministic 17 laws. Its physical explanation is given by the subsidiary theory. In Everettian quantum theory, it is that there are different 'observed outcomes' across the multiverse. But constructor theory has emancipated unpredictability from 'observers', 'relative states' and 'universes', stating it as a qualitative information-theoretic property, just as no-cloning is.

X-indistinguishability equivalence classes
Quantum systems exhibit the appearance of stochasticity, which is more than mere unpredictability. Consider a quantum observableX of a d-dimensional system S, with eigenstates |x and eigenvalues x. Successive measurements ofX on N instances of S, each identically prepared in a superposition or mixture ρ ofX-eigenstates, display the following convergence property: (i) for large N, the fraction of replicas delivering the observed outcome 18 'x' whenX is measured can be expected not to differ significantly (according to some a priori fixed statistical test) from the number Tr{ρ|x x|}; (ii) in an ensemble (infinite collection) of such replicas, each prepared in state ρ (a 'ρ-ensemble', for brevity), the fraction of instances that would give rise to an observed outcome 'x' equals Tr{ρ|x x|} [31].
But what justifies the expectation in (i)? A frequentist approach to probability would simply postulate that the number Tr{ρ|x x|} from (ii) is the 'probability' of the outcome x whenX is measured on ρ-which would imply (via ad hoc methodological rules; e.g. [18,32]) that Tr{ρ|x x|} could inform decisions about finitely many measurements. By contrast, in unitary quantum theory, it is the decision-theoretic approach that establishes the same conclusionwithout probabilistic assumptions. Absent that argument, the numbers Tr{ρ|x x|} are just labels of equivalence classes within the set of superpositions and mixtures of the states {|x }. I shall now give sufficient conditions on superinformation theories for a generalization of those equivalence classes, which I shall call X-indistinguishability classes, to exist on the set of all generalized mixtures of attributes of a given observable X. One of the conditions for a superinformation theory to support the decision-theoretic argument will be that they allow such classes ( §6).
In quantum theory, the equivalence classes are labelled by the d-tuple [f x ] x∈X , where f x = Tr{ρ|x x|}. As the 'trace' operator need not be available in superinformation theories, to construct such equivalence classes I shall deploy fictitious ensembles. This is a novel mathematical construction; properties of such ensembles will be used to define properties of single systems without, of course, any probabilistic or frequentist interpretation.
At this stage, the f x are only labels of equivalence classes. Additional conditions will therefore be needed for the f x to inform decisions in the way that probabilities are assumed to do in stochastic theories (including traditional quantum theory via the Born Rule). I shall give these in §7 via (a) X-indistinguishability classes I denote by S (N) a substrate N instances S ⊕ S ⊕ · · · S, consisting of N replicas of a substrate S. Let us fix an observable X of S, whose attributes I suppose with no relevant loss of generality to be labelled by integers: . , x N ): x i ∈ X} be the set of strings of length N whose digits can take values in X, each denoted by s . = (s 1 , s 2 , s 3 , . . . , s N ): s i ∈ X. X (N) = {s : s ∈ X (N) } is an observable of S (N) . In quantum theory, supposing thatX is an observable of a d-dimensional system S, X (N) might beX (N) =X 1 + dX 2 + d 2X 3 + · · · + d N−1X N , whose non-degenerate eigenstates are the strings of length N: |s . = |s 1 |s 2 · · · |s N |s i ∈ X.
Fix an N. For any attribute x in X, I define a constructor D x to define attributes of the substrate S (N) , whose limit for N → ∞ will be used to define the X-indistinguishability classes.
Consider the observable X It follows from the consistency of measurement ( §3) that: Therefore, crucially, for a given x, F(x) (N) can be sharp even if the observable X (N) is not. In the example above, for N = 3 and x = 0, 0 2/3 ∈ F(0) (3) is the set of all superpositions and mixtures of the eigenstates ofX (3) contained in X 0,2/3 , {|001 , |010 , |100 }:X (3) is not sharp in most such mixtures and superpositions. The observable F(x) (N) is key to generalizing quantum theory's convergence property, for the latter is due to the fact that there exists the limit of the sequence of attributes x f i (N) for N → ∞. Let me now recall the formal expression of the convergence property in quantum theory [31]. 19 Consider a state |z = x∈X c x |x with the property that |z ⊗N = c s 1 c s 2 · · · c s N |s is a superposition of states |s = |s 1 |s 2 · · · |s N |s i ∈ X each having a different f (x; s) = f i (N) . The convergence property is that for any positive, arbitrarily small ε: it answers «yes». Thus, the proportion of instances delivering the observed outcome 'x' when X is measured on the ensemble state |z ∞ = lim N→∞ |z ⊗N is equal to Tr{ρ z |x x|} = |c x | 2 (where ρ z . = |z z|). Therefore, all states |z ⊗N with the property Tr{ρ z |x x|} = |c x | 2 will be grouped by x in the same set, as N → ∞, which can thus be labelled by Tr{ρ z |x x|}. This set is an attribute of a single system, containing all quantum states with the property that Tr{ρ z |x x|} = |c x | 2 = f x . The set of all superpositions and mixtures of eigenstates of X is thus partitioned equivalence classes, labelled by the d-tuple A sufficient condition on a superinformation theory for a generalization of these X-indistinguishability classes to exist under it, on the set of all generalized mixtures of attributes of a given observable X, is that it satisfies the following requirements: (N) , there exists the attribute of the ensemble of replicas of S defined as Φ is the limiting set of Φ (N) -which must exist, and its elements, which are real numbers, must have the property f ∞ ∈Φ f ∞ = 1, 0 ≤ f ∞ ≤ 1.
The existence of the limit implies that those attributes do not intersect, i.e. the set F(x) (∞) = {x f ∞ : f ∞ ∈ Φ} is a (formal) variable of the ensemble-a limiting case of F(x) (N) , generalizing its quantum analogue.
Given an attribute z of S, define z (N) . = N terms (z, z, . . . z) (in quantum theory, this is the attribute of being in the quantum state |z ⊗N ); introduce the auxiliary variable x∈X the X-partition of unity for the attribute z. 20 If z has such a partition of unity, it must be unique because of (E1). An X-indistinguishability equivalence class is defined as the set of all attributes with the same X-partition of unity: any two attributes within that class cannot be distinguished by measuring only the observable X on each individual substrate, even in the limit of an infinite ensemble. In quantum theory, x f contains all states ρ with Tr{ρ|x x|} = f . A superinformation theory admits X-partitions of unity (on the set of generalized mixtures of attributes in X) if conditions (E1) and (E2) are satisfied.
A key innovation of this paper is showing how the mathematical construction of an abstract infinite ensemble (culminating in the property (E1)) can define structure on individual systems (via property (E2))-the attributes x f and the X-partition of unity-without recourse to the frequency interpretation of probability or any other probabilistic assumption.

(b) X-partition of unity of the X-intrinsic part
Consider now the attribute of being in the quantum state |z = c 0 |0 + e iφ c 1 |1 whose X-partition of unity is [|c 0 | 2 , |c 1 | 2 ]. In quantum theory, the reduced density matrices of the source and target substrate as delivered by an X-measurer acting on |z still have the same partition of unity. The same holds in constructor theory. Consider the X-intrinsic part [a z ] X ( §4) of the attribute a z 20 The existence of x f ∞ does not require there to be any corresponding attribute of the single system for finite N: x , prepending a measurer of X to each of the input substrates of D (N) x will still give a D (N) x , with the same labellings. Thus, the construction that would classify z as being in a certain X-partition of unity can be reinterpreted as providing a classification of [a z ] X , under the same labellings: the two classifications must coincide. If y has a given X-partition of unity, the X-intrinsic part [a z ] X of a z must have the same one. Likewise for the intrinsic part [b z ] X of b z (obtained as the output attribute on the target of the X-measurement applied to y): the 'X'-partition of unity of [b z ] X is numerically the same as the X-partition of unity of z.

Conditions for decision-supporting superinformation theories
For a given observable X and generalized mixture z of attributes in X, the labels [f (z) x ] x∈X of the X-partition of unity defined in §5 are not probabilities. Even though they are numbers between 0 and 1, summing to unity, they need not satisfy other axioms of the probability calculus: for instance, in quantum interference experiments they do not obey the axiom of additivity of probabilities of mutually exclusive events [33]. It is the decision-theory argument ( §7) that explains under what circumstances the numbers [f (z) x ] x∈X can inform decisions in experiments on finitely many instances as if they were probabilities. I shall now establish sufficient conditions on superinformation theories to support the decision-theory argument, thus characterizing decision-supporting superinformation theories.
I shall introduce one of the conditions via the special case of quantum theory. Consider the x-and y-components of a qubit spin,X andŶ. There exist eigenstates ofX, i.e. |x 1 , |x 2 , and of Y, i.e. |y ± = (1/ √ 2)[|x 1 ± |x 2 ] that are 'equally weighted', respectively, in the x-and y-basisin other words, |x 1 , |x 2 are invariant under the action of a unitary that swaps |y + with |y − ; and |y ± are invariant under a unitary that swaps |x 1 with |x 2 . Moreover, there exist quantum states on the composite system of two qubits, which are likewise 'equally weighted' and have the special property that [|y + |y + ± |y − |y − ]. (6.1) I shall now require that the analogous property holds in superinformation theories. While in quantum theory it is straightforward to express this via the powerful tools of linear superpositions, in constructor theory expressing the same conditions will require careful definition in terms of 'generalized mixtures'.
The conditions for decision-supporting information theories is that there exist two complementary information observables X and Y such that: (T1) The theory admits X-partitions of unity (on the information attributes of S that are generalized mixtures of attributes of X) and X a + X b -partitions of unity (on the information attributes of the substrate S a ⊕ S b that are generalized mixtures of the attributes in the observable X a + X b ).    (2) There exist attributes x 1 , x 2 ∈ X that are generalized mixtures of the attributes {y + , y − }; and attributes y + , y − ∈ Y that are generalized mixtures of attributes {x 1 , x 2 } with the property that: There exists an attribute q that is a generalized mixture of attributes in S x = {(x 1 , x 1 ), (x 2 , x 2 )} and a generalized mixture of attributes in S y = {(y + , y + ), (y − , y − )}, such that S x 1 x 1 ,x 2 x 2 (q) ⊆ q, S y + y + ,y − y − (q) ⊆ q. In quantum theory, y ± are two distinguishable equally-weighted quantum superpositions or mixtures of the attributes in X y , such as |y ± . Similarly for |x 1 , |x 2 . The principle of consistency of measurement ( §3) implies that if an attribute y has an X-partition of unity with element f (y) x , for any permutation Π on X, Π (y) has X-partition of unity such that f (Π (y)) x = f (y) Π(x) . This is because presenting Π (y) to D (N) x (defined in §5) is equivalent to presenting y to the constructor (measurer) obtained prepending the computation Π to D (N) Let the attributes [a y± ] X ⊇ a y ± be the X-intrinsic parts of the attributes a y ± , where (a y ± , b y ± ) is the attribute of S a ⊕ S b prepared by measuring X on S a holding the attribute y ± , with S b as a target.

R3. [a y+
This requirement is satisfied in quantum theory: a measurer of the observableX acting on a substrate S a in the state |y ± generates the states: whose reduced density operators on S a are the same, (1/2)[|x 1 x 1 | + |x 2 x 2 |], and are still 'equally weighted', thus invariant under swapping |x 1 and |x 2 .
Consider now the observables of S ⊕ S: R4. There exists an attribute q that is both a generalized mixture of attributes in S x and a generalized mixture of attributes in S y : with the property that: S x 1 x 1 ,x 2 x 2 (q) ⊆ q and S y + y + ,y − y − (q) ⊆ q, (6.2) where the swap S on a pair of substrates acts in parallel in each separately. In quantum theory, q is the attribute  See table 1 for a summary, and footnote 26 in appendix A.) They are satisfied by quantum theory, as I said, via the existence of states such as those for which (6.1) holds.

Games and decisions with superinformation media
The key step in the decision-theoretic approach in quantum theory is to model the physical processes displaying the appearance of stochasticity as games of chance, played with equipment obeying unitary (hence non-probabilistic) quantum theory. In the language of the partition of unity, applied to quantum theory, the game is informally characterized as follows. An X-measurer is applied with an input attribute y whose X-partition of unity is [f (y) x 1 , . . . f (y) x d ]. The gamedevice is such that, if the observable X is sharp in y with value x (so, under quantum theory, y is an eigenvalue-x eigenstate ofX), the reward is 'x' (in some currency); otherwise unpredictability arises ( §4). The player may pay for the privilege of playing the game, knowing y, the rules of the game and unitary quantum theory (i.e. a full description of the physical situation). One proves that a player satisfying the same non-probabilistic axioms of rationality as in classical decision theory [35], but without assuming the Born Rule or any other rule referring to probability, would place bets as if he had assumed both the identification between the f x and the Born-Rule probabilities, and the ad hoc methodological rule connecting probabilities with decisions [18,32].
I shall now recast the decision-theoretic approach in constructor theory. The key differences from previous versions are as follows. (i) They involved notions of observed outcomes and relative states, which are powerful tools in Everettian quantum theory. In constructor theory, instead, none of those will be relied upon. (ii) Some axioms that were previously considered decision-theoretic follow from properties of information media and measurements, as expressed in the constructor theory of information.
(a) Games of chance From now on, I shall assume that we are dealing with a decision-supporting superinformation theory-i.e. a superinformation theory satisfying the conditions given in §6, with complementary observables X and Y. For simplicity, I shall assume X and Y to be real-valued, whereby X and Y are the set of real numbers. I shall require a slightly more detailed model of a 'game of chance', whose centrepiece is an X-adder. It is defined as the constructor Σ X performing the following computation on an information medium S ⊕ S p ⊕ S p : x∈X,p∈X p {(x, p, 0) → (x, p, x + p)}. (7.1) An X-adder is a measurer of the observable X + X p of S ⊕ S p labelled so that, when X is sharp in input with value x and X p is sharp in input with value 0, X p is sharp on the output pay-off substrate with value x. For any fixed p, this is also a measurer (as defined in (3.3)) of the observable X on S. 21 In quantum theory, it is realized by a unitary U : |x |p |0 → |x |p |p + x , ∀x, p.
An X-game 22 of chance G X (z) is a construction defined as follows: (G1) The game substrate S (e.g. a die), with game observable X = {x : x ∈ X}, is prepared with some legitimate game attribute z, defined as any information attribute that admits an X-partition of unity (where X may be non-sharp in it) . (G2) S is presented to an adder together with two other pay-off substrates S p , representing the player's records of the winnings, with pay-off observable X p = {x : x ∈ X}. The first instance of S p (the input pay-off substrate) contains a record of the initial (pre-game) assets, in some x' Figure 5. The composition of two games G X (z) G X (z ) (top) is the game G X a + X b ((z, z )) (bottom).
units; the other instance (the output pay-off substrate), initially at 0, contains the record of the winnings at the end of the game and it is set during the game to its pay-off under the action of the adder. Keeping track of both of those records is an artefact of my model, superfluous in real life, but making it easier to analyse composite games. (G3) The composition of two games of chance G X (z) G X (z ), with game substrates S a and S b , is defined as the construction where the output pay-off substrate of G X (z) is the input payoff substrate of G X (z ) (figure 5, top). In other words, the composite game G X (z) G X (z ) is an (X a + X b )-adder realized by measuring the observable X a + X p and the observable X b + X p separately. Thus, the composition of two games G X (z) G X (z ) is the game G X a +X b ((z, z )) with game substrate S a ⊕ S b (figure 5, bottom).

(b) The player
I model the player for G X (z) as a programmable constructor (or automaton) Γ whose legitimate inputs are: the specification of the game attribute z, the (deterministic) rules of the game (with the game observable X) and the subsidiary theory. Its program must also satisfy the following axioms: A1. Ordering. Given z and z', the automaton orders any two games G X (z) and G X (z )-the ordering is transitive and total.
In this constructor-theoretic version of the decision-theory argument, this is the only classical decision-theoretic axiom required of the automaton. It corresponds to the transitivity of preferences in [1,8]. Its effect is to require Γ to be a constructor for the task of providing a real number V{G X (z)} ∈ X-the value of the game. Specifically, I define the value of the game G X (z), V{G X (z)}, as the unique v z ∈ X with the property that Γ is indifferent between playing the game G X (z) and the game G X (v z ). As the reader may guess, the key will turn out to be that attributes with the same partition of unity have the same value.
A2. Game of chance. The only observables allowed to condition 23 the automaton's output are: (i) the observable for whether the rules of the game are followed; (ii) the observable D X p , defined as the difference between the observable X p of the first reward substrate before the game and the observable X p of the second reward substrate after the game, as predicted by the automaton (given the specification of the input attribute and the subsidiary theory Thus, whether other observables than the ones of axiom 2 may be sharp is irrelevant to the automaton's output. Otherwise, those observables would have to be mentioned in the program to condition the automaton's output (thus specifying a player for a different game). This fact shall be repeatedly used in §7c.
In classical decision theory, any monotonic rescaling of the utility function causes no change in choices; so, without losing generality, I shall assume that, whenever the observable D X p is predicted by the automaton to be sharp on the pay-off substrate with value x, the automaton outputs a substrate holding a sharp v x = x.
(c) Properties of the value function As I promised, axiom 1 and 2 imply crucial properties, which were construed as independent decision-theoretic axioms in earlier treatments, but here follow from the other axioms and the principles of constructor theory, under decision-supporting superinformation theories. Namely: (P1) Γ 's preferences must be constant in time. Otherwise, the observable 'elapsed time' would have to condition the program, violating axiom A2. 25 (P2) Substitutability of games [8,35]. The value of a game G X (z) in isolation must be the same as when composed with another game G X (z ). Otherwise, again, the automaton's program would have to be conditioned on the observable 'what games G X (z) is composed with', violating axiom A2. (P3) Additivity of composition. Setting V{G X (z)} = v z and V{G X (z )} = v z , P2 implies: 2) (P4) Measurement neutrality [10]. Games where the X-adder (under the given labellings) has physically different implementations have the same value-otherwise the observable 'which physical implementation' would have to be included in the automaton's program, violating axiom A2. Measurement neutrality, in turn, implies the following additional properties: a. The game G X (Π (z))-where the computation Π (for any permutation Π over X) acts on S immediately before the adder (figure 6)-is a Π (X)-adder, where Π (X) denotes the re-labelling of X given by Π . Hence, it can be regarded as a particular physical implementation of the game G Π(X) (z). Therefore, measurement neutrality implies: ((z a , z b )) (figure 5), by measurement neutrality the values of games on composite substrates must equal that of composite games: (The last step follows from (7.2).) c. Let [a y ] X and [b y ] X be the X-intrinsic parts [a y ] X ⊇ a y and [b y ] X ⊇ b y of the attributes a y and b y ( §4), where (a y , b y ) is the attribute of S a ⊕ S b prepared by measuring X on S a in the attribute y, with S b as a target. By prepending a measurer of X to the Xadder in the game G X (y), so that the measurer's source is the input of the X-adder (figure 7, bottom right), one still has the same X-adder. Thus, the X-game G X ([a y ] X ) (figure 7, top) has the same physical implementation as the X-game G X (y) (figure 7,  bottom left). Similarly, by concatenating a measurer of X to an 'X'-adder so that the former's target is the game substrate of the latter, one obtains an X-adder overall, whereby the 'X'-game G 'X' ([b y ] X ) has the same physical implementation as the Xgame G X (y). The same applies if one considers G Π(X) (y), for any permutation Π over X. Measurement neutrality implies: d. Shift rule [8]. Define the uniform shift as the permutation T k = x∈X {x → x + k}, for real k. The game G X a +X b (z, k) (figure 8, right) is a particular implementation of the game G T k (X) (z) ( figure 8, left), where the T k (X)-adder is realized by measuring X a + X b on S a ⊕ S b in the attribute (z, k), for a fixed k ∈ X. By measurement neutrality and (7.4): The 'equal value' property. In quantum theory, a superposition or mixture of orthogonal quantum states, such that X-games played with them have the same value v, has value v [8]. A generalization of this property holds under decision-supporting superinformation theories. A crucial difference from the argument in [8] is that here there is no need to invoke observed outcomes or relative states to prove this property. having an X-partition of unity, with the property that q is a generalized mixture of Hwhereby q ⊆ū H . I shall now prove that the X-game with game attribute q also has value v: V{G X (q)} = v. In the trivial case that q = h 1 or q = h 2 , V{G X (q)} = v. Consider the case where, instead, q is a non-trivial generalized mixture of H = {h 1 , h 2 }: q ⊆ū H , q ∩ h i = o, q ⊥ h i , ∀i = 1, 2. In quantum theory, q is the attribute of being in a superposition or mixture of distinguishable quantum states |h 1 , |h 1 : h 1 |h 2 = 0-e.g. the state |q = (α|h 1 + β|h 2 ) for complex α, β. I shall show that G X (q) is equivalent to another game with value v, obtained by applying the following procedure on the game substrate S a . First, measure H on S a holding the attribute q, with an ancillary system S b as a target of the H-measurer. This delivers the substrate S a ⊕ S b in the attribute (a q , b q ). By the properties of the intrinsic part ( §3), [a q ] X still is a generalized mixture of the attributes in H: for any Π over X. Hence, by the additivity of composition (property (P3)), it also follows that: 7) for any permutations Π , Π over X. Consider the game G R(X) a +X b (q) where q satisfies conditions R4 ( §6) and the reflection R is the permutation over X defined by R = x∈X {x → −x}. The adder in the game is a measurer of R(X) a + X b , which, in turn, is an X-comparer (as defined in (3.4)) on the substrates S a , S b : it measures the observable «whether the two substrates hold the same value x», where the output 0 corresponds to «yes». By the consistency of successive measurements of X on the same substrate with attribute y ( §3), the output variable of that measurer is sharp with value 0 when presented with an attribute which, like q, has the property that q ⊆ū S x -where S x . = {(x 1 , x 1 ), (x 2 , x 2 )}. In quantum theory, as I said, the attribute q is that of being in the quantum state (1/ √ 2)[|x 1 |x 1 + |x 2 |x 2 ], which is a 0-eigenstate of the observable −X a +X b . Thus, by definition of value, the value of G R(X) a +X b (q) must be 0. On the other hand, as q ⊆ū S y , where S y . = {(y + , y + ), (y − , y − )}, and, by (7.7), both G R(X) a +X b ((y − , y − )) and G R(X) a +X b ((y + , y + )) have the same value, by the 'equal-value' property (e), one has: V{G R(X) a +X b (q)} = V{G R(X) a +X b ((y + , y + ))}. Hence: 0 = V{G R(X) a +X b (q)} = V{G R(X) a +X b ((y + , y + ))} = V{G R(X) (y + ) G X (y + )}, where the last step follows by (7.4). Finally, one obtains, as promised: V{G R(X) (y + )} = −V{G X (y + )}. thereby showing that a program satisfying the axioms in §7.1 is possible, under decisionsupporting superinformation theories. The automaton with that program must value the game using the f (y) x as if they were the probabilities of outcomes, without assuming (or concluding!) that they are. It places bets in the same way as a corresponding automaton would, if programmed with the same axioms of classical decision theory plus additional, ad hoc probabilistic axioms connecting probabilities with decisions (e.g. that the 'prudentially best option' maximizes the expected value of the gain [32]) and some stochastic theory with the f (y) x as probabilities of outcomes x (e.g. quantum theory with the Born Rule). My argument broadly follows the logic of Deutsch's or Wallace's formulation; however, individual steps rest on different constructor-theoretic conditions and do not use concepts specific to quantum theory or the Everett interpretation-e.g. subspaces, observed outcomes, relative states, universes and instances of the player in 'universes' or 'branches of the multiverse'.
By comparison with (7.11): V{G X (y + )} = (x 1 + x 2 ) 2 , (7.12) which is a special case of (7.10) when the game attribute y + has an X-partition of unity whose non-null elements are f (y + ) x 1 = f (y + ) x 2 = 1/2. This, the central result proved by Deutsch and Wallace, is now proved from the physical, constructor-theoretic properties of measurements. The general, 'unequal weights', case follows by an argument analogous to [8], with some conceptual differences (see appendix A). What elements of reality the elements of a partition of unity in (7.10) represent depends on the subsidiary theory in question: it is not up to constructor theory (nor the decision-theory argument) to explain that. The decision-theoretic argument only shows that decisions can be made, when informed by a decision-supporting superinformation theory, under a non-probabilistic subset of the classical axioms of rationality, as if it were a stochastic theory with probabilities given by the elements in the partition of unity. In particular, in unitary quantum theory that argument is not (as it is often described) a 'derivation of the Born Rule', but an explanation of why, without the Born Rule, in the situations where it would apply, one must use the moduli squared of the amplitudes (and only those) to inform decisions. I have shown that the equivalent holds for any constructortheory-compliant subsidiary theory. Thus, such theories can, like unitary quantum theory, account for the appearance of stochastic behaviour without appealing to any stochastic law.

Conclusion
I have reformulated the problem of reconciling unitary quantum theory, unpredictability and the appearance of stochasticity in quantum systems, within the constructor theory of information, where unitary quantum theory is a particular superinformation theory-a non-probabilistic theory obeying constructor-theoretic principles. I have provided an exact criterion for unpredictability, and have shown that superinformation theories (including unitary quantum theory) satisfy it, and that unpredictability follows from the impossibility of cloning certain sets of states, and is compatible with deterministic laws. This distinguishes it from randomness.
I have exhibited conditions under which superinformation theories can inform decisions in games of chance, as if they were stochastic theories-by giving conditions for decision-supporting superinformation theories. To this end, I have generalized to constructor theory the Deutsch-Wallace decision-theoretic approach, which shows how unitary quantum theory can inform decisions in those games as if the Born Rule were assumed.
My approach improves upon that one in that (i) its axioms, formulated in constructorinformation-theoretic terms only, make no use of concepts specific to Everettian quantum theory, thus broadening the domain of applicability of the approach to decision-supporting superinformation theories; (ii) it shows that some assumptions that were previously considered as purely decision-theoretic, and thus criticized for being 'subjective' (namely, measurement neutrality, diachronic consistency, the zero-sum rule), follow from physical properties of superinformation media, measurers and adders, as required by the principles of constructor theory.
So the axioms of the decision-theory approach turn out not to be particular, ad hoc axioms necessary for quantum theory only, as it was previously thought; instead, they are either physical, information-theoretic requirements (such as the principle of consistency of measurement, interoperability and the conditions for decision-supporting superinformation theories) or general methodological rules of scientific methodology (such as transitivity of preferences) required by general theory testing. Deutsch [18] has shown how all decision-supporting superinformation theories (as defined in this paper) are testable in regard to their statements about repeated unpredictable measurements.
This paper and Deutsch's, taken together, imply that it is possible to regard the set of decisionsupporting superinformation theories as a set of theoretical possibilities for a local, non-probabilistic generalization of quantum theory (alternative to, for example, 'generalized probabilistic theories' [34]), thus providing a new framework where the successor of quantum theory may be sought.