Black-boxing and cause-effect power

Reductionism assumes that causation in the physical world occurs at the micro level, excluding the emergence of macro-level causation. We challenge this reductionist assumption by employing a principled, well-defined measure of intrinsic cause-effect power–integrated information (Φ), and showing that, according to this measure, it is possible for a macro level to “beat” the micro level. Simple systems were evaluated for Φ across different spatial and temporal scales by systematically considering all possible black boxes. These are macro elements that consist of one or more micro elements over one or more micro updates. Cause-effect power was evaluated based on the inputs and outputs of the black boxes, ignoring the internal micro elements that support their input-output function. We show how black-box elements can have more common inputs and outputs than the corresponding micro elements, revealing the emergence of high-order mechanisms and joint constraints that are not apparent at the micro level. As a consequence, a macro, black-box system can have higher Φ than its micro constituents by having more mechanisms (higher composition) that are more interconnected (higher integration). We also show that, for a given micro system, one can identify local maxima of Φ across several spatiotemporal scales. The framework is demonstrated on a simple biological system, the Boolean network model of the fission-yeast cell-cycle, for which we identify stable local maxima during the course of its simulated biological function. These local maxima correspond to macro levels of organization at which emergent cause-effect properties of physical systems come into focus, and provide a natural vantage point for scientific inquiries.


Author summary
We challenge the reductionist assumption by studying causal properties of physical systems across different spatiotemporal scales. The result is that, contrary to reductionist views, causal power can emerge at macro scales. Rather than relying on the traditional notion of coarse-grains (averages), we introduce the notion of functional black boxes that are defined based on their input-output relationship. Using a sequence of examples, our work demonstrates that black boxes are particularly well suited to capture the heterogeneous and specialized nature of components in biological systems. While the emergence of coarse-grained systems relies on increased specificity, black-boxing reveals the importance of structure and integration. Our framework is mathematically rigorous and fully [...]

Introduction
Reductionist approaches in science usually assume that the optimal causal model of a physical system is at the finest possible scale. Coarser causal models are seen as convenient approximations due to limitations in measurement accuracy or computational power [1,2]. The reductionist view is based on the conjecture that the micro level of causal interaction is causally complete, leaving no room for additional causation at a macro level [3]. The reductionist assumption is most obvious in fields such as particle physics [4], neuroscience [5], and nanotechnology [6], but it can also be found in the social sciences [7], where researchers endeavor to 'look inside the black box'.
A case has been made for the occurrence of genuine emergence at various macro levels [8,9], such as the emergence of mind above and beyond the individual neurons (or atoms) that constitute the brain [10], and for the autonomy of the special sciences, such as chemistry [11] and biology [12,13], above and beyond the underlying physics. However, arguments in favor of emergence have often been vague, or they have focused on the possibility that macro variables may have greater descriptive power than micro variables, rather than greater causal power [14,15,16].
Inspired by statistical physics, macro-level descriptions of a system are typically taken to be coarse-grainings, i.e. averages over micro elements and micro time steps. The reductionist assumption has been challenged by the introduction of explicit measures of cause-effect power, which were used to show that such coarse-grainings can indeed have greater cause-effect power at the macro level [17,18]. In simulated examples of simple logic gate systems, we coarse-grained (nearly) identical elements ('neurons') into groups ('neuronal groups') and averaged over their states. We demonstrated that, under certain conditions involving degeneracy and/or indeterminism, a macro-level system of coarse-grained elements can "beat" the micro-level system in terms of cause-effect power [17,18].
However, moving beyond statistical physics to biology, the macro elements of interest cannot be obtained by coarse-graining, because they are constituted of heterogeneous micro elements that are often compartmentalized and have highly specific functions, which would be muddled by averaging (see Box 1). For example, take the neuron, considered as the fundamental unit in much of neuroscience. Clearly, a neuron cannot be represented by a coarse-grained macro element, because it is constituted of a great diversity of specific molecules, organized in highly specific and hierarchical ways, performing highly specific functions. Indeed, it is the very specificity of the internal micro elements that makes the reductionist assumption seem inevitable in these cases: while we can treat a neuron as a black box for ease of understanding and for convenience, it would seem that its full causal power can only be captured by considering all the molecules that constitute the black box, in exquisite and specific detail [19].

Box 1. Black-boxing and coarse-graining
A discrete, finite physical system can be considered at various spatiotemporal levels. At the most fine-grained scale, it is constituted of a set S_m of micro elements, each having at least two states. Supervening, physical macro-level systems S_M can be obtained by a mapping M: S_m → S_M that groups disjoint subsets of S_m into non-overlapping macro elements. A physical macro element is thus constituted of one or more micro elements, operating over one or more micro time steps and can be manipulated, observed, and partitioned. For each macro element, M defines how the states of its constituting micro elements are mapped onto the possible states of the macro element. In previous work [17,18], we demonstrated the emergence of cause-effect power in 'coarse-grained' macro-level systems with average-based state mappings. Here, we extend these results to 'black-box' macro elements with an output-based state mapping (Fig 1).

Coarse-graining:
Coarse-graining corresponds to the notion of a macro state in statistical physics. In coarse-graining, the state mapping is a function that depends only on the average of the micro states of the micro elements constituting the macro element, without reference to the identity of individual micro elements [17,18]. This means that all micro states with the same average have to be mapped onto the same macro state, e.g., [...]

Black-boxing:
Black boxes correspond to the typical notion of macro elements in the special sciences, such as cells or organisms in biology. In black-boxing, the state of a macro element is determined by the state of its output (micro) elements at a specific (micro) time step, without reference to the states of its internal micro elements. A possible mapping for the schematic system shown in Fig 1 (left) in which 5 micro elements form a black box is, e.g., s_m(t_3) = {XXXX0} → s_M = 'OFF' and s_m(t_3) = {XXXX1} → s_M = 'ON'. This means that, given an input at time t_0, the macro state of the black box corresponds to the micro state of the output element at time t_3, while the states of the hidden elements are ignored.

Increasing intrinsic cause-effect power:
In recent work, we showed that coarse-grained physical systems can, under certain conditions, 'beat' the corresponding micro-level system in terms of measures of effectiveness [17] and intrinsic cause-effect power (Φ) [18]. As done in this study, we simulated simple physical systems constituted, at the micro level, of collections of logic gates. The main factor enabling higher intrinsic cause-effect power through coarse-graining is a reduction in indeterminism and degeneracy at the macro level [17,18]. Determinism and degeneracy affect the selectivity of a system in its current state. In a non-degenerate and deterministic system, the current system state constrains with maximum selectivity both the cause repertoire (only one past state is possible: no degeneracy) and the effect repertoire (only one future state is possible: no indeterminism). In a degenerate system, multiple past states could lead to the current state of the system. In a non-deterministic system, multiple future states could follow the current state. Grouping noisy or degenerate micro elements into less degenerate and more deterministic macro elements may lead to a gain in the selectivity of the system's mechanisms. Everything else being equal, more selective mechanisms have higher intrinsic cause-effect power φ (see Methods), which translates to higher Φ at the system level and thus may lead to emergence of macro-level cause-effect power in coarse-grained systems [18,25].
In general, coarse-graining micro systems, in the sense of averaging over subsets of them, may increase intrinsic cause-effect power when the constituting micro elements are all roughly of the same kind and all their inputs and outputs can be treated as equivalent. However, in system architectures constituted of heterogeneous micro elements, with highly specific functions, which are typical for biological and electronic systems, averaging across micro states may blur rather than enhance cause-effect power. It is these types of modular system architectures for which black-boxing is particularly suited to bring about emergent cause-effect properties at the macro level: in the results section, we demonstrate that black-boxing may reveal high-order macro mechanisms that are not present at a micro scale. In turn, these support a more integrated cause-effect structure and higher Φ values at the macro level.
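The difference between the two state mappings described in Box 1 can be illustrated with a minimal sketch. The function names, the 0.5 threshold for the average-based mapping, and the choice of the last element as output are assumptions made for illustration only; they are not taken from the original analysis.

```python
from typing import Sequence


def coarse_grain_state(micro_state: Sequence[int], threshold: float = 0.5) -> str:
    """Average-based mapping: depends only on the mean of the micro states.

    The threshold is a hypothetical choice; any function of the average would do.
    """
    mean = sum(micro_state) / len(micro_state)
    return "ON" if mean >= threshold else "OFF"


def black_box_state(micro_state: Sequence[int], output_index: int) -> str:
    """Output-based mapping: depends only on the designated output element."""
    return "ON" if micro_state[output_index] == 1 else "OFF"


# Two micro states with the same average but different output-element states.
state_a = (1, 1, 0, 0, 0)
state_b = (0, 0, 0, 1, 1)

# Coarse-graining cannot tell them apart ...
print(coarse_grain_state(state_a), coarse_grain_state(state_b))
# ... whereas black-boxing (output = last element) ignores the hidden elements.
print(black_box_state(state_a, 4), black_box_state(state_b, 4))
```

The coarse-grained mapping treats all micro elements as interchangeable, while the black-box mapping singles out one element and discards the identity of the rest.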
Here we further challenge the reductionist assumption by generalizing the causal analysis employed for coarse-graining to black-boxing [20]: we first analyze a system of heterogeneous, specific micro elements at the micro level; then we repeat the analysis at the macro level by grouping subsets of those micro elements inside black boxes (macro elements). Black boxes are characterized exclusively by their overall input-output function [21,22]. The heterogeneous micro elements inside the black box are hidden inside a macro element, rather than averaged as with coarse-graining (Fig 1). As an example of a black box, Fig 2 shows, on the left, a simple, schematic neuron constituted of a number of specific micro elements (synapses S, cell body C, and axon hillock A) that interact internally in specific ways. On the right, the neuron is treated as a single macro element, a black box, that receives inputs (spike or no spike for each input), produces a single output (spike or no spike), and conceals its micro elements inside.
Taken together, mapping a finer-grained system into a coarser, macro-level system may increase intrinsic cause-effect power either through coarse-graining (possible increase in selectivity) or through black-boxing (possible increase in integration). Which mapping is more suited to bring about emergent cause-effect properties depends on the type of system architecture. Ultimately, we can consider a continuum of possible macro elements combining the two complementary approaches as the general case, where black boxes with one output for all micro elements of a box at a particular micro time-step and coarse grains with an output for each micro element are the extremes.
In what follows, we assume that the causal power of a system is quantified by its intrinsic cause-effect power as previously defined [23,24]. While reductionism assumes implicitly that causal power resides exclusively with micro elements, we assess causal power explicitly, as intrinsic cause-effect power, and determine the spatiotemporal levels at which new cause-effect properties emerge. Such emergent cause-effect properties may include an increase in the overall intrinsic cause-effect power of the system, but also specific relationships between elements within the system ("mechanisms") that only become apparent at the macro level. To quantify intrinsic cause-effect power and system mechanisms at the micro level and all possible black-boxed macro levels, we use the interventional and counterfactual causal framework of integrated information theory [23,24]. As a measure of intrinsic cause-effect power, integrated information (Φ) captures several aspects that are often overlooked in causal accounts [23]: the dependence of cause-effect power on the specific state the system is in (state-dependency); how cause-effect power of the system is structured (composition); whether the whole system is causally irreducible to its parts (integration); and what defines the system's borders and grain (exclusion). These features make Φ particularly suited for assessing the cause-effect power intrinsic to a system, independent of external observers. As demonstrated through several examples, including the Boolean network model of the fission yeast cell cycle, the Φ value of systems of black-box macro elements can increase when going from finer to coarser spatiotemporal grains and lead to emergent cause-effect properties at macro scales.

Methods
Integrated information (Φ) measures the intrinsic cause-effect power of a physical system [23,25] by evaluating five requirements: the system's capacity to make a difference to itself (intrinsicality), composition, information, integration, and exclusion. Loosely defined, Φ quantifies to what extent a system's cause-effect structure, which specifies how all the system's parts constrain each other's past and future states, is integrated, that is, irreducible to subsystems (more below). The measure Φ, which was developed as part of integrated information theory (IIT) [...]
Fig 2. A schematic neuron considered as a number of 'micro' elements (left), or as a black box (right). At the micro scale, the neuron receives inputs at its synapses (S), which are passed on to the cell body (C) and then to the axon hillock (A), which outputs to other neurons. Cause-effect power is assessed by perturbing each element (small hands) and observing the effects, while irreducibility is assessed by partitioning the elements (dashed red line). At the macro scale, there is only the black-box element (neuron), which receives three inputs and generates an output. Cause-effect power is assessed by perturbing the output of the black box (big hand) and observing its effects without constraining the constituent micro elements; however, its irreducibility is still assessed by partitioning between micro elements (dashed red line).
We formally define a physical system as a set of elements, for example neurons in the brain or logic gates in a computer, such that each element has at least two states, inputs that can influence these states, and outputs that in turn are influenced by these states. Furthermore, it must be possible to manipulate, observe, and partition among elements, in order to evaluate their cause-effect power. To fully characterize the cause-effect properties of a physical system, we first randomly perturb its elements into all possible states according to a maximum entropy distribution and observe their subsequent state transitions. Through this process, one obtains the transition probability matrix (TPM) for the physical system. During the perturbations, elements outside the physical system under consideration are held fixed; the states of these elements are considered "background conditions" [23]. By fixing the background conditions we control external influences and use the system's TPM to calculate its intrinsic cause-effect properties, including Φ (see S1 Text).
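The perturbation procedure can be sketched for a small deterministic system. The following is a minimal illustration, not the implementation used in this work: the three-element XOR network, the function names, and the ordering of states are all assumptions. Setting the system into every possible state once corresponds to a maximum entropy (uniform) distribution over perturbations; background conditions are not modeled because the toy system has no external inputs.

```python
from itertools import product


def xor_network(state):
    """Hypothetical update rule: each element is the XOR of the other two."""
    a, b, c = state
    return (b ^ c, a ^ c, a ^ b)


def build_tpm(update, n_elements):
    """Perturb the system into every state and record the observed transition.

    Returns the list of states and a state-by-state matrix whose entry
    [i][j] is the probability of moving from state i to state j.
    """
    states = list(product((0, 1), repeat=n_elements))
    index = {s: i for i, s in enumerate(states)}
    tpm = [[0.0] * len(states) for _ in states]
    for s in states:
        # Deterministic system: all probability mass on a single next state.
        tpm[index[s]][index[update(s)]] = 1.0
    return states, tpm


states, tpm = build_tpm(xor_network, 3)
for s, row in zip(states, tpm):
    print(s, "->", states[row.index(1.0)])
```

For an indeterministic system, each row would instead hold a probability distribution over next states, estimated from repeated perturbations.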
Given the TPM of a system, the next step is to identify all its mechanisms: the subsets of the system which, in their current state, have irreducible cause-effect power within the system itself (intrinsicality). To this end, we test the entire power-set of system elements as candidate mechanisms (composition). To have irreducible cause-effect power, a set of elements in its current state must selectively constrain the potential past and future states of the system (information). This is evaluated using the conditional probability distribution of past or future states given the current state of the set of elements. A mechanism can be composed of one or more elements, as long as it constrains the past and future states of the system above and beyond its parts (integration). The degree to which a mechanism in its current state is irreducible is measured by φ, which quantifies the irreducible cause-effect power of the mechanism within the system [23,24,25,28,29]. In the following, we distinguish between mechanisms consisting of a single element (first-order mechanisms) and those composed of multiple elements (high-order mechanisms), which play an essential role in integrating the whole system. Note that a set of elements that fails to irreducibly constrain the system's past state does not have any potential causes within the system, and a set of elements that fails to constrain the system's future state irreducibly does not have any potential effects within the system; in both cases φ = 0 and neither is an intrinsic mechanism of the system.
The set of all mechanisms within a system defines its cause-effect structure. If a candidate mechanism in its current state has a value of φ = 0, then it is reducible, and does not contribute to the cause-effect structure of the system. The intrinsic cause-effect power of the system is quantified by its integrated information Φ [23,24,25,28,29], which captures the irreducibility of the cause-effect structure: the degree to which the system's cause-effect structure is changed by partitioning the system (eliminating constraints among parts). For Φ to be high, every possible partition must affect many mechanisms that constrain the system in a highly selective, irreducible manner (having high φ). If Φ = 0, then there is at least one part of the system that remains unconstrained by the mechanisms of the rest: from the intrinsic perspective, there is no unified system, even though an external observer can treat it as one.
Finally, from the intrinsic perspective, the set of elements that form a system must be definite. In other words, it must have a self-defined causal border with its environment that identifies the elements within the border as part of the system, while elements outside the border belong to the system's environment. Even though many subsets and supersets of elements may have Φ > 0, only sets of elements that specify a local maximum of Φ have well defined borders from the intrinsic perspective (exclusion). A system's border is thus defined by the intrinsic cause-effect structure of its elements, such that adding or removing a single element will result in a decrease of cause-effect power.
This exclusion principle also applies across spatiotemporal scales: from the intrinsic perspective, the set of elements that form a system must have a definite spatiotemporal grain. As with the system's borders, it is the intrinsic cause-effect structure that self-defines its spatiotemporal scale, which is one that is a local maximum of Φ. Local maxima of Φ identify those scales at which cause-effect properties emerge: any finer or coarser grains necessarily result in a reduction of cause-effect power and a blurring of intrinsic cause-effect properties. To evaluate intrinsic cause-effect power at macro scales and identify the definite scales at which new cause-effect properties emerge, micro elements can be grouped either by coarse-graining as in [17,18] or, more generally, by black-boxing, as will be demonstrated here.

Black-boxing
In typical usage, a black box is an object into which inputs impinge and from which outputs emerge, but its internal workings are not available for inspection [21,22]. For our purposes, a 'black-box element' is a physical macro element that can be manipulated, observed, and partitioned, which is constituted of several micro elements (spatial), operating over several micro time steps (temporal). To qualify as a black box, it must satisfy the following conditions:
(i) It must have at least one input, one output, and two or more (macro) states that can be read from its output (element).
(ii) The micro elements and micro updates within the black box are hidden (black box condition).
(iii) The micro elements contribute causally to the black box's output (integration).
(iv) There cannot be any overlap between the micro elements of multiple black boxes (exclusion).
Specifically:
(i) The inputs and outputs of a black box are defined in terms of the internal micro elements that receive direct input from other elements/black boxes (e.g., synapses S in Fig 2) and directly output to other elements/black boxes (e.g., the axon hillock A in Fig 2). For this work we allow for inputs to arrive at multiple micro elements, but restrict outputs to leave from only a single micro element within the black box. Furthermore, the inputs are taken to arrive at the beginning of the macro time step, while the outputs are taken to depart at the end of the macro time step. In principle, this framework could be extended to multiple output elements and to a more general treatment of time steps by allowing macro elements with different temporal grains.
(ii) The state of a black-box element is taken to be the state of its (micro) output element at its (micro) output time step. The transition probabilities associated with a black-box element are determined as usual by causal analysis, perturbing the inputs of the black box into all possible states according to a maximum entropy distribution. At the end of the macro update, the state of the black box is observed from its output element (see Fig 3). In this way, one can determine the cause-effect power that the inputs (i.e., outputs from other black-box elements) have on the state of the black-box element over the respective macro update. In line with the notion of black boxes, the micro elements within the black box are "hidden" from other black boxes within the system, meaning they do not directly contribute to the intrinsic cause-effect power of the system, but only indirectly through their black box's output. Any other direct micro interactions are not considered intrinsic to the macro-level system and therefore do not contribute to its cause-effect power at all (see S3 Text). Crucially, for the duration of the macro update, the internal elements are allowed to evolve unperturbed; however, to discount the cause-effect power of micro elements when evaluating Φ, the initial states of micro elements and any micro connections leaving the black box, other than its designated output element at the designated output time step, are noised during the perturbation analysis. A consequence of this perturbation procedure is that potential causes and effects must be direct (i.e. between two black boxes), and that potential causes and effects that are mediated by a third black box are 'screened off' and do not contribute to cause-effect power (see Figure A in S3 Text).
(iii) The requirement that every constituent micro element must causally contribute to the output of its black box is mandated by the integration principle that cause-effect power must be irreducible. Even at the macro level, a system can only be integrated if its micro level is integrated. Moreover, it is not meaningful to consider a black-box element as a single physical element if it is reducible to two or more unrelated elements. The requirement of micro integration is satisfied implicitly when assessing models using integrated information; any physical system that violates it will be found to be reducible and thus have Φ = 0, as even for macro systems, Φ is evaluated by partitioning between micro elements. This implies that it is not possible to take a non-integrated system of micro elements and to black-box it in such a way as to create an integrated system of macro elements (see Figure B in S3 Text).
(iv) The requirement for no overlap among the constituents of different black boxes (or equivalently that a micro element cannot be a constituent of more than one black-box element) is a consequence of causal exclusion. A physical (macro) element must be definite, meaning that it has a well-defined border which separates it from other macro elements. The importance of the exclusion condition has been independently recognized in the theory of computation: it is only meaningful to say that a physical system implements a computation if the system is constituted of distinct, non-overlapping elements [30]. If black-box elements were permitted to overlap, then every open physical system could be said to implement any computation [30,31].
Together, the above requirements allow us to specify the inputs and outputs of each black-box element, to define its macro state, to include within each black box only micro elements that are integrated and contribute to its input-output function, and to draw 'borders' around each black-box element that exclude any overlap with other black boxes (Figure C in S3 Text).
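Some of these requirements can be checked structurally before any causal analysis is run. The sketch below is a hypothetical helper, not part of the published method: it tests that boxes do not overlap (exclusion), that each designated output belongs to its box, and that every micro element has a path to the output inside its box. The last test is only a necessary structural proxy for the integration requirement, which in the framework itself is decided by whether Φ > 0 under partitioning.

```python
def validate_black_boxes(boxes, outputs, edges):
    """Return a list of structural problems (empty if none are found).

    boxes:   list of sets of micro-element labels, one set per black box
    outputs: list with the designated output element of each box
    edges:   dict mapping a source element to the set of its target elements
    """
    problems = []
    seen = set()
    for i, (box, out) in enumerate(zip(boxes, outputs)):
        if seen & box:
            problems.append(f"box {i} overlaps an earlier box")
        seen |= box
        if out not in box:
            problems.append(f"box {i}: output element is not in the box")
            continue
        # Grow the set of elements that reach the output via internal connections.
        reaches = {out}
        changed = True
        while changed:
            changed = False
            for src in box - reaches:
                if edges.get(src, set()) & reaches:
                    reaches.add(src)
                    changed = True
        for element in sorted(box - reaches):
            problems.append(f"box {i}: {element} has no internal path to the output")
    return problems


# Schematic neuron of Fig 2: synapses S1-S3 -> cell body C -> axon hillock A.
neuron_edges = {"S1": {"C"}, "S2": {"C"}, "S3": {"C"}, "C": {"A"}}
neuron = {"S1", "S2", "S3", "C", "A"}
print(validate_black_boxes([neuron], ["A"], neuron_edges))
```

For the schematic neuron with the axon hillock as output, no problems are reported; choosing a synapse as the output element would flag the remaining elements as having no path to the output.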

Local maxima of cause-effect power
Only systems that support local maxima of Φ, both in terms of constitution and spatiotemporal grain, are definite and have intrinsic cause-effect power. A system of elements is a local maximum if there are no 'neighboring' systems with a higher value of Φ. When only micro elements are considered, such as in [32], it is natural to define a neighbor as any system that differs in constitution by only a single micro element, that is, any system that can be made by either adding or removing a single element. However, to determine whether two systems at different spatiotemporal grains are neighbors, several distance measures have to be taken into account. For the present purposes, we consider three different distances between systems to establish whether two systems are neighbors in this general context. The first is the constitutional distance between two systems, which is the number of micro elements that must be added to or removed from one system to transform it into the other. Next is the temporal distance between two systems, which is the difference in the number of micro updates that make up the corresponding macro updates. Finally, the spatial distance between two systems is the distance between the partitions that group micro elements into macro elements. In the current work we use the maximum matching distance between partitions [33], which is essentially the number of micro elements that must be moved from one grouping to another. If the sum of the constitutional, temporal and spatial distances between two systems is equal to 1, then those systems are neighbors, i.e., two systems are neighbors if they differ by a single step in exactly one of the three distances.
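The neighbor criterion can be sketched as follows. This is an illustrative reading of the three distances, not the code used in this work. A system is represented here as a pair (partition, number of micro updates per macro update), which is an assumed encoding. The maximum matching distance is computed by brute force over block matchings, which is only practical for small systems. Evaluating the spatial distance on the micro elements shared by both systems is a further assumption, made so that adding a single element counts as one step.

```python
from itertools import permutations


def matching_distance(p, q):
    """Maximum matching distance between two partitions of the same set:
    the number of elements left unmatched by the best block-to-block matching."""
    p, q = [set(b) for b in p], [set(b) for b in q]
    n = sum(len(b) for b in p)
    size = max(len(p), len(q))
    p += [set()] * (size - len(p))  # pad with empty blocks
    q += [set()] * (size - len(q))
    best = max(
        sum(len(p[i] & q[j]) for i, j in enumerate(perm))
        for perm in permutations(range(size))
    )
    return n - best


def are_neighbors(system_a, system_b):
    """True if constitutional + temporal + spatial distance equals 1."""
    (part_a, steps_a), (part_b, steps_b) = system_a, system_b
    part_a = [set(b) for b in part_a]
    part_b = [set(b) for b in part_b]
    elements_a = set().union(*part_a)
    elements_b = set().union(*part_b)
    constitutional = len(elements_a ^ elements_b)
    temporal = abs(steps_a - steps_b)
    shared = elements_a & elements_b
    restricted_a = [b & shared for b in part_a if b & shared]
    restricted_b = [b & shared for b in part_b if b & shared]
    spatial = matching_distance(restricted_a, restricted_b)
    return constitutional + temporal + spatial == 1


micro = ([{1}, {2}, {3}], 1)
print(are_neighbors(micro, ([{1, 2}, {3}], 1)))   # one element moved
print(are_neighbors(micro, ([{1, 2, 3}], 1)))     # two elements moved
print(are_neighbors(micro, ([{1}, {2}, {3}], 2)))  # one extra micro update
```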
Given a set of micro elements, we evaluate all possible systems (sets of black-box elements) to determine which systems have intrinsic cause-effect power, at which spatiotemporal grain (the set of black-box elements that define the system), and what their borders are (the set of micro elements that constitute the system). Evaluating all possible sets of black-box elements includes all possible groupings of micro elements into macro elements. Then, for every grouping, all possible elements of each black box are considered as its output element. Finally, cause-effect power is evaluated over all possible macro time steps of each black-box system. Note that not all micro elements must be grouped into black boxes when searching for maxima of intrinsic cause-effect power. It may be that adding a specific micro element to any black-box element within the system would in fact reduce cause-effect power. In this case, such micro elements are held fixed as background conditions of the macro system (see S3 Text).
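The size of this search space can be made concrete with a short enumeration sketch. The function names and the upper bound `max_steps` on the macro time step are assumptions for illustration. For brevity, the sketch groups all given elements and does not enumerate subsets of elements or elements held fixed as background conditions, nor does it filter out candidates that violate the black-box requirements.

```python
from itertools import product


def set_partitions(elements):
    """Yield every grouping of `elements` into non-overlapping blocks."""
    elements = list(elements)
    if not elements:
        yield []
        return
    first, rest = elements[0], elements[1:]
    for partition in set_partitions(rest):
        # Either `first` forms a block of its own ...
        yield [[first]] + partition
        # ... or it joins one of the existing blocks.
        for i in range(len(partition)):
            yield partition[:i] + [[first] + partition[i]] + partition[i + 1:]


def candidate_black_box_systems(elements, max_steps):
    """Yield (grouping, output element per box, macro time step) candidates."""
    for partition in set_partitions(elements):
        for outputs in product(*partition):  # one output element per box
            for steps in range(1, max_steps + 1):
                yield partition, outputs, steps


candidates = list(candidate_black_box_systems("ABC", max_steps=2))
print(len(list(set_partitions("ABC"))), "groupings,", len(candidates), "candidates")
```

Even for three micro elements and two temporal grains there are 20 candidates, and the number of groupings grows with the Bell numbers, which is why exhaustive evaluation is restricted to small systems.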

Results
In the following, we demonstrate black-boxing and its importance for revealing macro-level cause-effect properties based on a set of simple proof-of-principle examples before we apply the framework to a biological model of the fission-yeast cell-cycle. Crucially, we demonstrate that systems of black-box macro elements can have higher intrinsic cause-effect power than their corresponding micro systems, and support local maxima of Φ that reveal emergent functional properties. For the purposes of this work, we take binary elements as micro elements that cannot be further reduced or split, and the time scale of their state transitions as a micro time step. Time is implicit in the TPM, as micro elements are synchronously updated at discrete micro time steps. In principle, integrated information is defined for any discrete system of elements. The full mathematical details of the Φ calculation are described elsewhere; we recommend [23], but details are also available in [18,24,29]. Full example analyses are presented in S1 Text. All calculations in this work were performed using the PyPhi software package in Python [34], which includes a documented example for a black-box analysis.

How macro beats micro: Composition and integration
An intuitive example in which black-boxing may be appropriate is propagation delay, the amount of time between the output of one element and its effect on another element. Such delays are largely ignored in functional analyses and are taken to be an implicit aspect of the element of interest, i.e., they are black-boxed. In the context of logic gates, for example, NOR logic is commonly described as "universal" in the sense that any other logic can be built strictly from NOR gates. However, building, say, an XOR gate from NOR gates in fact requires a propagation delay as an implicit part of the circuit.
In the following example, we explicitly model such propagation delays as (one or more) COPY elements that take a single input and then output the same value. Fig 3 shows the micro structure of an XOR element with a one-step propagation delay, along with the corresponding macro element, a black box with XOR logic.
Consider a system of three interconnected XOR elements with a one-step propagation delay. At the micro level, this system is constituted of nine micro elements-six COPY and three XOR, which can be black-boxed over two time steps into a macro system of three interconnected XOR elements (see Fig 4). The current state of all elements is OFF.
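The black-boxing of this system over two micro time steps can be sketched directly, without computing Φ. The wiring below is an assumption inferred from the description: each XOR receives the outputs of the other two XOR elements through two dedicated COPY elements, and each black box consists of one XOR (the output element) plus its two input COPYs. Following the perturbation procedure described in Methods, the initial states of the hidden COPY elements are noised (all values are tried), the micro system runs unperturbed for the macro update, and the macro state is read from the XOR outputs.

```python
from itertools import product


def micro_update(state):
    """One micro step. State order: (A, B, C, BA, CA, AB, CB, AC, BC), where
    A, B, C are XOR elements and XY is the COPY of X that feeds Y."""
    a, b, c, ba, ca, ab, cb, ac, bc = state
    return (ba ^ ca, ab ^ cb, ac ^ bc,  # each XOR reads its two COPY inputs
            b, c, a, c, a, b)           # each COPY reads the XOR of another box


def macro_transitions(macro_state, steps=2):
    """Set the three outputs to `macro_state`, noise the six hidden COPYs,
    run `steps` micro updates, and collect the observed macro states."""
    outcomes = set()
    for hidden in product((0, 1), repeat=6):
        state = tuple(macro_state) + hidden
        for _ in range(steps):
            state = micro_update(state)
        outcomes.add(state[:3])
    return outcomes


for macro_state in product((0, 1), repeat=3):
    print(macro_state, "->", sorted(macro_transitions(macro_state)))
```

Under these assumptions, every macro state has a single successor over two micro steps, namely the XOR of the other two outputs, regardless of the hidden COPY states. Over a single micro step (`steps=1`), the outputs depend only on the noised COPY states, so the macro state carries no constraint at that grain.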
Assessing the cause-effect structure of the micro system, we find that there are only three first-order mechanisms and no high-order mechanisms. The three XOR elements each specify a mechanism with φ = 0.5: by being in the OFF state, each XOR specifies that its two inputs must have been either (OFF, OFF) or (ON, ON) and that its outputs, the COPY elements, must be OFF in the future (Fig 4, top-right). All other sets of elements do not have cause-effect power, or are reducible, so φ = 0 (see Fig 5). Recall that from the intrinsic perspective, a set of elements must constrain both the system's past and future irreducibly to be a mechanism for the system (see Methods). The six COPY elements, taken individually, lack any potential effect within the system: by being in the OFF state, a COPY by itself does not constrain the future state of its XOR output, which is still equally likely to be ON or OFF depending on the state of its other input (Fig 5, top). On the other hand, two COPY elements in the state (OFF, OFF) that input to the same XOR element do irreducibly constrain the system's future states, since together they specify that the XOR element they output to will be OFF. Nonetheless, these pairs of COPY elements do not form a second-order mechanism in the system since their constraint on the system's past state is reducible: in the OFF state, the two COPY elements taken individually already specify that their inputs must have been OFF, leaving no room for additional second-order constraints (Fig 5, bottom). The lack of either irreducible past or future constraints thus prevents the COPY elements from specifying first- or high-order mechanisms in the system. The integrated information of the micro physical system is Φ = 0.25 (see S1 Text).
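The constraints described above can be reproduced with elementary conditional probabilities. The sketch below is illustrative only and does not compute φ, which additionally requires comparing the repertoires against partitioned versions with a distance measure. Function names are assumptions; unconstrained inputs are noised uniformly, in line with the maximum entropy perturbations described in Methods.

```python
from itertools import product


def xor(x, y):
    return x ^ y


def cause_repertoire(gate, output, n_inputs):
    """Distribution over past input states given the gate's current output,
    assuming a uniform (maximum entropy) prior over inputs."""
    states = list(product((0, 1), repeat=n_inputs))
    consistent = [s for s in states if gate(*s) == output]
    if not consistent:
        raise ValueError("output state is unreachable")
    return {s: (1 / len(consistent) if s in consistent else 0.0) for s in states}


def prob_output_on(gate, fixed, n_inputs):
    """P(output = ON) when the inputs in `fixed` (index -> value) are set
    and all remaining inputs are noised uniformly."""
    free = [i for i in range(n_inputs) if i not in fixed]
    total = 0
    for values in product((0, 1), repeat=len(free)):
        inputs = dict(fixed)
        inputs.update(zip(free, values))
        total += gate(*(inputs[i] for i in range(n_inputs)))
    return total / 2 ** len(free)


# An XOR that is OFF: its inputs were (OFF, OFF) or (ON, ON), equally likely.
print(cause_repertoire(xor, 0, 2))
# A single COPY that is OFF leaves the XOR it feeds unconstrained ...
print(prob_output_on(xor, {0: 0}, 2))
# ... whereas both COPYs being OFF specify that the XOR will be OFF.
print(prob_output_on(xor, {0: 0, 1: 0}, 2))
```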
The macro-level physical system with black-box elements also has three mechanisms with φ = 0.5, but they are second-order mechanisms specified by pairs of XOR elements. By being in the state (OFF, OFF), each pair of XOR elements specifies that the past state of the entire model must have been either (OFF, OFF, OFF) or (ON, ON, ON), and that the future state of their common output must be OFF (Fig 4, bottom-right). Neither of the XOR elements in this high-order mechanism can specify these constraints on its own. Individual XOR elements lack potential effects in the system for the same reason as the individual micro COPY gates above. At the macro level, the collection of mechanisms (cause-effect structure) is more integrated than that of the micro level, with a value of Φ = 1.875. Although the system has the same number of mechanisms and the same φ values at both the micro and the macro level, the black-boxed system has higher Φ because a system partition impacts the macro level cause-effect structure more than the micro level cause-effect structure. The black-box system "wins" by having more overlap in its mechanisms, both in terms of the elements they are composed of and the constraints they impose. The high-order mechanisms of the black-box system have overlapping constraints, with each mechanism constraining all elements within the system, whereas the first-order mechanisms of the micro system only constrain their respective COPY inputs and outputs, without overlap. A system partition at the micro level thus only affects a single micro mechanism, whereas a system partition at the black-box level affects all of the mechanisms in the system, resulting in higher integration (see S1 Text). Consequently, there is irreducible cause-effect power that emerges at this macro level of the physical system. Concealing the COPY elements inside the black boxes reveals the high-order interactions between the XOR gates over two time steps.
Note also that, while the causal analysis is state-dependent, in this example the irreducibility of the micro and black-box cause-effect structures (their Φ values), and thus the relationship between levels, is the same for all possible system states.
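The joint constraint specified by a pair of macro XOR elements can be reproduced by enumerating past states. The following is a minimal sketch, assuming each macro element is the XOR of the other two; `macro_update` and `compatible_pasts` are hypothetical helper names, and the code only lists compatible states rather than computing φ.

```python
from itertools import product


def macro_update(state):
    """Each macro XOR element reads the other two elements."""
    a, b, c = state
    return (b ^ c, a ^ c, a ^ b)


def compatible_pasts(constraint):
    """Past macro states whose successor matches `constraint` (index -> value)."""
    return [
        s for s in product((0, 1), repeat=3)
        if all(macro_update(s)[i] == v for i, v in constraint.items())
    ]


single = compatible_pasts({0: 0})        # one XOR element is OFF
pair = compatible_pasts({0: 0, 1: 0})    # two XOR elements are (OFF, OFF)

# Effect side: with the first two elements OFF, their common output (the
# third element) is OFF next, whatever the third element's current state.
common_output = {macro_update((0, 0, c))[2] for c in (0, 1)}

print(single, pair, common_output)
```

A single OFF element leaves four past states possible, while the pair narrows them to (OFF, OFF, OFF) and (ON, ON, ON), a constraint that neither element specifies alone.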

Finding local maxima of intrinsic cause-effect power
In a second example, we consider a larger micro system constituted of 55 elements that all implement NOR logic. By testing all possible black-boxings, we establish three local maxima of cause-effect power which reveal the organizational hierarchy of the system. Fig 6 demonstrates how a group of 11 elements implementing NOR logic can be connected so as to produce AND/OR logic or MAJORITY logic at coarser spatiotemporal scales.
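One way in which 11 NOR elements can realize AND/OR and MAJORITY logic is sketched below. The specific wiring is an assumption made for illustration (three AND boxes of three NOR elements each, plus one OR box of two NOR elements); it is consistent with the element counts and time scales given in the text, but it is not copied from Fig 6.

```python
from itertools import product


def nor(*inputs):
    """NOR gate: ON only if every input is OFF."""
    return int(not any(inputs))


def and_box(a, b):
    # Assumed: 3 NOR elements, two updates deep (two inverters feed one NOR).
    return nor(nor(a), nor(b))


def or_box(x, y, z):
    # Assumed: 2 NOR elements, two updates deep (a NOR followed by an inverter).
    return nor(nor(x, y, z))


def majority_box(a, b, c):
    # 3 AND boxes + 1 OR box = 11 NOR elements, four updates deep.
    return or_box(and_box(a, b), and_box(a, c), and_box(b, c))


NOR_COUNT = 3 * 3 + 2
truth_table = {bits: majority_box(*bits) for bits in product((0, 1), repeat=3)}

for bits, out in truth_table.items():
    print(bits, out)
```

The same micro elements thus support NOR logic at one update, AND/OR logic at two updates, and MAJORITY logic at four updates.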
The 55-element system is arranged into five interconnected groups of 11 elements, with each group organized according to Fig 6 so that the system exhibits different functions at different spatiotemporal scales. Each group of 11 elements receives inputs from three other groups and has a single element that outputs to three other groups (Fig 7, top left). We consider the system state in which each of the 55 NOR micro elements is ON. In the following, we focus on the cause-effect structures of the system levels shown in Fig 6: the micro physical system of NOR elements, a black-boxed system of AND/OR elements, and a black-boxed system of MAJORITY elements. These systems are shown in Fig 7 (top row) ordered according to the average spatial grain of their elements. Many other possible black-boxing schemes were also evaluated. At the micro level, the system's cause-effect structure consists of 55 first-order mechanisms, one for each micro element, with φ = 0.239 on average, and no high-order mechanisms. The integrated information of this micro physical system is Φ = 0.453 (see S1 Text).

Fig 5. On top is a COPY element that does not specify a mechanism. By being OFF in the current state, the COPY element constrains its input to be OFF in the previous state, but it does not constrain the future state of its output element, because the state of the XOR element still completely depends on the unknown state of its other input (shown here in grey). The bottom panel is a set of COPY elements which do not specify a high-order mechanism because they do not have an irreducible cause (the red line partitions the cause in two with no loss of information). Taking each COPY element independently fully constrains the past state of its input to be OFF.
The macro-level AND/OR black-boxed system with an average spatial grain of 2.75 (Fig 7, top middle) has 20 macro elements, 15 implementing AND logic and 5 implementing OR logic, operating over two time steps. Similar to the micro level, its cause-effect structure is composed of 20 first-order mechanisms (one for each black-box element) but no high-order mechanisms, with φ = 0.112 on average. This black-boxing reduces the number of first-order mechanisms, but does not reveal high-order mechanisms or overlapping constraints, so the macro system is no more integrated than the micro system. Moreover, this black-boxing in fact reduces the integrated information of the first-order mechanisms in the system compared to the micro level (φ values are 0.127 lower on average), leading to lower integrated information for the system (Φ = 0.080).
The macro-level black-boxed system with an average spatial grain of 11 (Fig 7, top right) is defined by considering black-box elements implementing MAJORITY logic over four time steps. Compared to the macro level with an average spatial grain of 2.75, this additional black-boxing step further reduces the number of elements, but increases the average φ to 0.216 (φ values are still 0.023 lower than the micro level on average). However, this macro system is endowed not only with first-order mechanisms, but with all possible second-, third- and fourth-order mechanisms. In total, its cause-effect structure includes 30 of 31 possible mechanisms from the power set of black-box elements, resulting in high integration, with Φ = 2.333, more than the micro level. Fig 7 also shows additional black-box systems with Φ = 0. One of these black-box systems with an average spatial grain of 1.57 has 20 black-box OR elements over two time steps and 15 micro NOR elements. A second black-box system with average spatial grain of 3.66 has 10 black-box AND elements over two time steps and 5 black-box AND elements over four time steps. For both of these systems (and many others not shown), the integrated information is Φ = 0, because there is no common temporal scale over which all the elements in the system have effects on other elements within the system. For any specific temporal scale, there will be elements that do not causally contribute, and thus the system is not integrated. In summary, this example demonstrates how evaluating cause-effect power over many different spatial and temporal scales of black boxes identifies local maxima of cause-effect power and reveals emergent cause-effect properties.
For this example, the analysis reveals functional relationships between elements; local maxima of cause-effect power occur specifically at the micro level of NOR elements (average spatial grain of 1, Φ = 0.453), at an intermediate macro level of AND/OR elements (average spatial grain of 2.75, Φ = 0.080) and at a coarser macro level of MAJORITY elements (average spatial grain of 11, Φ = 2.333). These spatial grains reveal emergent levels of organization at which the system exhibits intrinsic cause-effect power and which shed light on its cause-effect properties. The vast majority of systems of black-box elements, on the other hand, yield Φ = 0.
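The bookkeeping behind the reported grains and mechanism counts can be reproduced in a few lines. This is a sketch that assumes the average spatial grain is the number of micro elements divided by the number of elements in the (macro) system, a definition that reproduces the reported values of 2.75 and 11; the dictionary keys are labels chosen here for convenience.

```python
from itertools import combinations

N_MICRO = 55


def average_grain(n_elements):
    """Assumed definition: micro elements per element of the system."""
    return N_MICRO / n_elements


grains = {
    "NOR": average_grain(55),
    "AND/OR": average_grain(15 + 5),
    "MAJORITY": average_grain(5),
    "OR + micro NOR": average_grain(20 + 15),
    "mixed AND": average_grain(10 + 5),
}

# Candidate mechanisms: nonempty subsets of the 5 MAJORITY black boxes.
candidates_by_order = {
    k: len(list(combinations(range(5), k))) for k in range(1, 6)
}
total_candidates = sum(candidates_by_order.values())

print(grains)
print(candidates_by_order, total_candidates)
```

The two Φ = 0 systems come out at 55/35 ≈ 1.571 and 55/15 ≈ 3.667, which the text reports as 1.57 and 3.66. The subset counts (5, 10, 10, 5, 1) sum to 31, so a structure containing all first- through fourth-order mechanisms accounts for the reported 30 of 31.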

Boolean network model of the fission yeast cell cycle
As a demonstration of black-boxing in biological systems, we apply the framework to the Boolean network model of the fission-yeast cell-cycle [35]. The model consists of nine Boolean ("micro") elements representing the state of crucial proteins expressed during cell division. Each element implements linear threshold logic, and the connections between elements are weighted, with each connection being either excitatory (+1) or inhibitory (-1) in nature (see Fig 8A). One element, "SK", only provides input to the system and receives no feedback. This element acts as a catalyst for cell division: when it is activated while the network is in its biological attractor state, the remaining eight elements cycle through a sequence of nine states, eventually returning to the initial attractor state (see Fig 8B). This cycle of states is called the "biological sequence" of the model, and captures the specific sequence of protein expressions that occur during the cell-division cycle.
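A linear threshold update with ±1 weights can be sketched as follows. The three-element weight matrix is hypothetical and is not the cell-cycle wiring of Fig 8A. The tie rule (an element keeps its state when its summed input equals the threshold) and the default threshold of 0 are also assumptions made for this illustration.

```python
def threshold_update(state, weights, thresholds=None):
    """Synchronous update; weights[i][j] is the connection from i to j."""
    n = len(state)
    thresholds = thresholds or [0] * n
    new_state = []
    for j in range(n):
        drive = sum(weights[i][j] * state[i] for i in range(n))
        if drive > thresholds[j]:
            new_state.append(1)
        elif drive < thresholds[j]:
            new_state.append(0)
        else:
            new_state.append(state[j])  # assumed tie rule: keep current state
    return tuple(new_state)


# Hypothetical network: element 0 excites 1, 1 excites 2, 2 inhibits 0.
W = [
    [0, 1, 0],
    [0, 0, 1],
    [-1, 0, 0],
]

trajectory = [(1, 0, 0)]
for _ in range(4):
    trajectory.append(threshold_update(trajectory[-1], W))

print(trajectory)
```

Starting from a single active element, this toy network passes through a short transient and settles into a fixed point, loosely analogous to a trajectory that ends in an attractor state.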
Since the element SK receives no feedback from the rest of the cell-cycle network, any system that includes SK will necessarily be reducible (Φ = 0). Only when SK is fixed as a background condition can we potentially identify systems with Φ > 0. Furthermore, if we consider the remaining eight elements (excluding SK) as a system, one of the states of the biological sequence (t2, see Fig 8B) has no cause (potential past state) within the system (it is caused by the catalyst element SK, which initializes cell division from outside the system). For this reason, the cause-effect structure of this system is undefined in state t2. In what follows, we refer to the cell-cycle network as the eight strongly connected elements that contain both inputs and outputs (not including SK) and its biological sequence as the eight states (t1, t3-t9) with well-defined cause-effect structures.
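The structural point about SK can be illustrated with a strong-connectivity check. The sketch below uses a hypothetical four-node graph (the node names other than SK are placeholders, not the cell-cycle proteins). It tests only whether every element can reach and be reached by every other element; it does not compute Φ.

```python
def reachable(adj, start):
    """Set of nodes reachable from `start` by following directed edges."""
    seen, stack = {start}, [start]
    while stack:
        node = stack.pop()
        for nxt in adj[node]:
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen


def is_strongly_connected(adj):
    """True if every node reaches every other node along directed edges."""
    nodes = list(adj)
    if not nodes:
        return True
    reverse = {n: [] for n in nodes}
    for src, targets in adj.items():
        for dst in targets:
            reverse[dst].append(src)
    root = nodes[0]
    return len(reachable(adj, root)) == len(nodes) == len(reachable(reverse, root))


# Hypothetical wiring: SK drives X but receives no input from the cycle X-Y-Z.
with_sk = {"SK": ["X"], "X": ["Y"], "Y": ["Z"], "Z": ["X"]}
without_sk = {"X": ["Y"], "Y": ["Z"], "Z": ["X"]}

print(is_strongly_connected(with_sk), is_strongly_connected(without_sk))
```

An element without inputs from the rest of the system breaks strong connectivity, which is why such a system can be cut without loss. Passing this check is not by itself a guarantee of Φ > 0.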
Previous work analyzing the cause-effect structure of the cell-cycle model demonstrated that the cell-cycle network constitutes a stable local maximum of integrated information across all states of the biological sequence [32]. However, this previous work only analyzed the cell-cycle model at the micro level, considering all possible subsets of micro elements. In the current work, we extend this analysis by considering the cell-cycle network at macro spatiotemporal scales. Specifically, we consider all possible groupings of the cell-cycle network into black-box macro elements, at time scales of 2, 3 and 4 micro updates (greater time scales may reveal additional local maxima and emergent cause-effect properties).
There are 4140 ways to group the eight micro elements in the cell-cycle network into any number of black-box elements, and for each grouping there are on average 10 different ways to define the output elements of the black boxes. Considering three different time scales for each set of black-box elements results in a total of 124,176 macro systems to analyze. Across all states of the biological sequence, there are 2224 macro systems with Φ > 0, an average of 278 per state, or roughly 0.22% of all possible systems.
Among the 2224 macro systems with Φ > 0, we identify 33 unique local maxima (some others are duplicates due to symmetries in the network). The majority of these local maxima are transient, occurring in an average of 2.5 out of 8 states in the biological sequence. However, 5 of the local maxima are stable over all states of the biological sequence. The micro system is one example of a stable local maximum, confirming that the results of [32] hold even when considering macro systems. The remaining four local maxima occur at macro spatiotemporal scales, one at a time scale of three micro updates, and the others at a time scale of four micro updates (see Fig 9). Note that the intrinsic cause-effect power of a system is state dependent, and stability across subsequent time steps is not assumed at any point in the analysis. That the cell-cycle supports stable local maxima of macro cause-effect power is a feature of this biological system that is revealed by the causal analysis, rather than a requirement imposed by the framework.
Our analysis moreover reveals that one element in particular (Slp1) serves as a black box's output in every stable local maximum. This indicates that Slp1 may play a crucial role in stabilizing and integrating the network over longer time scales during the process of cell division, a property that could not be identified from its micro level interactions [32,36].
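The count of 4140 groupings is the number of set partitions of eight labeled elements (the Bell number B8), and the remaining figures follow by arithmetic. A minimal sketch, using a standard Bell-triangle recurrence; the variable names are chosen here for illustration.

```python
def bell_number(n):
    """Number of set partitions of n labeled elements (Bell triangle)."""
    row = [1]
    for _ in range(n - 1):
        new_row = [row[-1]]
        for value in row:
            new_row.append(new_row[-1] + value)
        row = new_row
    return row[-1]


groupings = bell_number(8)            # ways to group eight micro elements
total_systems = 124_176               # reported total over three time scales
time_scales = 3

# Implied average number of output assignments per grouping.
avg_outputs = total_systems / (time_scales * groupings)

per_state = 2224 / 8                  # systems with nonzero Phi, per state
percent_nonzero = 100 * per_state / total_systems

print(groupings, round(avg_outputs, 3), per_state, round(percent_nonzero, 2))
```

The implied average of just under 10 output assignments per grouping and the fraction of about 0.22% both agree with the values stated in the text.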

Discussion
In this work we expand the framework for evaluating the cause-effect power of physical systems at multiple spatiotemporal scales, to include biologically motivated black-box macro elements defined by their input-output function. We then use this framework to explore the cause-effect power of simple systems of elements considered both at the micro level and after black-boxing, at a macro level. The cause-effect power of these systems was assessed using integrated information (Φ), a measure of the cause-effect power that is intrinsic to a physical system. To properly capture cause-effect power from the intrinsic perspective of the system itself, Φ considers composition, specificity, irreducibility, and exclusion [23,25]. We show how macro systems based on black boxes can have higher intrinsic cause-effect power than any neighboring systems (including in some cases their micro element counterparts). This result complements and extends previous work that showed how intrinsic cause-effect power can increase when macro elements are defined by coarse-graining micro elements [18]. While coarse-graining may reduce degeneracy and/or indeterminism in a system, black-boxing may increase a system's intrinsic cause-effect power by increasing its integration.
Reductionist accounts of causation assume that all causal power resides with micro elements and time steps, excluding all macro levels [2]. We argue that reductionist accounts of causation conflate the necessity of micro elements as constituents with their cause-effect power within the system. As shown in Fig 5, a single micro element within a system may completely lack the power to constrain the system's future states; taken individually, it does not make any difference to the system. Yet, the high-order mechanism with irreducible cause-effect power shown in Fig 4 would not exist without the individual micro elements to support it. Thus micro elements may play a role as a constituent of a high-order mechanism or a macro element with cause-effect power. The current work reveals the possibility that causal power may emerge at macro spatiotemporal scales, requiring only that a system is definite, with self-defined borders and spatiotemporal grain (by being a local maximum of Φ). In such a case, the micro elements support the macro level as constituents, the macro level still supervenes upon the micro level, yet there are cause-effect properties that are only revealed at this particular macro level.

Fig 9. All stable local maxima of macro cause-effect power for the cell-cycle network over the course of its biological sequence. Stable local maxima are identified at two different time scales (over 3 or 4 micro updates) and with groupings of the eight micro elements into either two or three macro elements. The output element for each black box is marked by a green outline; one common feature among all of the stable maxima is that element Slp1 acts as an output element of one black box. Note that connections between black boxes that do not originate from output elements are not shown in the figure because they do not contribute to the cause-effect structure (see S3 Text).

Limitations and future work
In the current work, we use intrinsic cause-effect power as a quantification of causal power, and demonstrate several examples of systems of black-box macro elements with higher intrinsic cause-effect power than the corresponding micro systems. To the extent that the notion of causal power is appropriately captured and quantified by intrinsic cause-effect power, our results refute the reductionist assumption that causal power resides exclusively at the micro level. The value of our characterization of cause-effect power has been previously demonstrated in a number of contexts [25,28,32], and will continue to be evaluated in the future.
A limitation on the practical application of this framework is the computational demand of exhaustively evaluating intrinsic cause-effect power. Currently, cause-effect properties can only be fully explored for very small systems (< 10 micro elements; propagation delay example, cell-cycle example) or by exploiting symmetries in the system (local maxima example). Future work will extend the PyPhi software for evaluating intrinsic cause-effect power [34] by including, for example, approximations based on the connectivity matrix. However, practical applications inevitably will have to use a targeted approach and only assess the intrinsic cause-effect power of a predetermined set of macro-level systems instead of evaluating all possible black-box systems. Theoretical investigations like the current work (see below) as well as previous exploration of coarse-grained macro elements [18,37] will be crucial to define the criteria that will guide such a targeted approach.

Black-boxing reveals high-order mechanisms and joint constraints
The two main requirements for high Φ are that a physical system is differentiated (many specific mechanisms) and integrated (mechanisms with overlapping constraints). Typically, whenever a lower level system is mapped into a higher macro system, there is reduced state differentiation, i.e., the macro system has fewer elements and a smaller state space. This decrease in differentiation means fewer potential mechanisms and thus less potential integrated information [29]. In order for a macro level system to have higher cause-effect power (Φ) than a finer grained system over the same elements, the macro system must increase cause-effect power either by having more specific mechanisms, or a more integrated set of mechanisms.
Degeneracy and indeterminism are two factors that influence the specificity of a mechanism. Everything else being equal, decreasing degeneracy and indeterminism leads to an increase in the cause-effect power of mechanisms within the system. In [17,18] we demonstrated that coarse-graining (averaging) micro elements into macro elements can lead to an increase in intrinsic cause-effect power that can overcome the inherent loss of differentiation in macro systems. An increase in intrinsic cause-effect power through reduction of degeneracy is also possible through black-boxing, as shown in S2 Text.
The particular asset of black-boxing is that it may reveal high-order mechanisms and joint constraints between mechanisms at macro spatiotemporal scales. As demonstrated by the propagation delay example, the macro can even beat the micro level through increased integration. This may occur when elements with few potential effects are concealed within black-box elements, and micro elements with many potential effects serve as the outputs of black-box elements, resulting in a more densely interconnected set of macro elements, where groups of macro elements share common inputs and common outputs. If creating common inputs and common outputs among elements leads to additional, joint constraints on the possible past and future system states, elements may form high-order mechanisms, resulting in a more integrated cause-effect structure and higher Φ. Being a part of high-order mechanisms, or being constrained by multiple mechanisms, gives an element additional ways to contribute to the cause-effect structure; when an element contributes in multiple ways, cutting that element has a greater effect on the cause-effect structure, making the system more irreducible. Being more irreducible means having higher intrinsic cause-effect power (Φ) and may thus lead to a causally emerging macro level. This suggests that black-boxing is most beneficial when there are "causal bottlenecks" in the micro system, that is, when a micro element with a single or few outputs connects to a micro element with a single or few inputs. In such cases, it is impossible for these micro elements to contribute to high-order mechanisms, and such elements represent a "weak link" in the integration of the system. More generally, black-boxing should be particularly appropriate in systems with local modular interactions whose results are distributed across the system, such as molecular interactions within neurons in the brain, or electrical interactions within computer networks.
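The notion of a causal bottleneck can be given a simple graph-based reading. The sketch below is an illustration only and not part of the analysis described here: it flags connections whose source has at most `max_out` outputs and whose target has at most `max_in` inputs. Both thresholds, the function name `find_bottlenecks`, and the example graph are hypothetical.

```python
def find_bottlenecks(adj, max_out=1, max_in=1):
    """Edges from a node with few outputs to a node with few inputs."""
    in_degree = {node: 0 for node in adj}
    for targets in adj.values():
        for dst in targets:
            in_degree[dst] += 1
    return [
        (src, dst)
        for src, targets in adj.items()
        if len(targets) <= max_out
        for dst in targets
        if in_degree[dst] <= max_in
    ]


# Hypothetical micro system: hub H fans out to A, B, C; A relays through R.
ADJ = {
    "H": ["A", "B", "C"],
    "A": ["R"],
    "B": ["H"],
    "C": ["H"],
    "R": ["H"],
}

bottlenecks = find_bottlenecks(ADJ)
print(bottlenecks)
```

In this toy graph only the connection from A to R is flagged, since A has a single output and R a single input. Such pairs are natural candidates for concealment inside one black box when choosing a targeted set of macro systems to evaluate.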

Local maxima of intrinsic cause-effect power
Evaluating cause-effect power of black-box systems across many spatiotemporal scales shows that, in general, there can be several local maxima of macro cause-effect power, between which integrated information decreases or falls to zero. In Fig 7, the local maxima capture emergent functional roles of black-box macro elements, corresponding to the different descriptions of the system as sets of NOR, OR/AND, or MAJORITY elements. Importantly, even within a given spatiotemporal grain, there will generally be several local maxima corresponding to overlapping subsets of elements, such that adding or subtracting an element reduces integrated information [18,23]. These local maxima of intrinsic cause-effect power across and within levels correspond to organizational macro levels and systems having emergent cause-effect properties. These are natural levels and systems for the special sciences to investigate.
Biological systems are a prime example, since they contain many highly specialized components which are required to perform their function. In biology we can study the molecules within an individual cell, the interactions between networks of cells (nervous system), individual organs (liver, kidneys), whole organisms (animals, humans), and communities of organisms (swarms, societies). The Boolean network model of the fission yeast cell cycle is one example of a simulated biological system which contains many heterogeneous micro elements that perform specific functions in order to accomplish cell division. Applying the black-boxing framework reveals several macro local maxima that are stable throughout the biological sequence of the network model, and highlights the role of element Slp1 in stabilizing the cycle. Note that the typical approach of studying biological systems at a particular (macro) spatiotemporal scale is precisely to treat its next-lower level components as black boxes. Here we have proposed a theoretical framework to evaluate cause-effect power and the cause-effect properties of such a black-box system. If an organizational level corresponds to a local maximum of integrated information, then there will be cause-effect properties that emerge at that level, and there is knowledge to be gained by studying the system accordingly.
Finally, while local maxima reveal cause-effect properties to an investigator studying the system, the global maximum specifies the set of elements and spatiotemporal grain at which the system has most cause-effect power upon itself, from its own intrinsic perspective.
According to integrated information theory, a set of elements at the spatiotemporal grain that defines the global maximum of intrinsic cause-effect power corresponds to a physical substrate of consciousness [23,24].