A key component of the flexibility and complexity of the brain is its ability to dynamically adapt its functional network structure between integrated and segregated brain states depending on the demands of different cognitive tasks. Integrated states are prevalent when performing tasks of high complexity, such as maintaining items in working memory, consistent with models of a global workspace architecture. Recent work has suggested that the balance between integration and segregation is under the control of ascending neuromodulatory systems, such as the noradrenergic system, via changes in neural gain (in terms of the amplification and non-linearity in stimulus-response transfer function of brain regions). In a previous large-scale nonlinear oscillator model of neuronal network dynamics, we showed that manipulating neural gain parameters led to a ‘critical’ transition in phase synchrony that was associated with a shift from segregated to integrated topology, thus confirming our original prediction. In this study, we advance these results by demonstrating that the gain-mediated phase transition is characterized by a shift in the underlying dynamics of neural information processing. Specifically, the dynamics of the subcritical (segregated) regime are dominated by information storage, whereas the supercritical (integrated) regime is associated with increased information transfer (measured via transfer entropy). Operating near to the critical regime with respect to modulating neural gain parameters would thus appear to provide computational advantages, offering flexibility in the information processing that can be performed with only subtle changes in gain control. Our results thus link studies of whole-brain network topology and the ascending arousal system with information processing dynamics, and suggest that the constraints imposed by the ascending arousal system constrain low-dimensional modes of information processing within the brain.
Higher brain function relies on a dynamic balance between functional integration and segregation. Previous work has shown that this balance is mediated in part by alterations in neural gain, which are thought to relate to projections from ascending neuromodulatory nuclei, such as the locus coeruleus. Here, we extend this work by demonstrating that the modulation of neural gain parameters alters the information processing dynamics of the brain regions of a biophysical neural model. Specifically, we find that subcritical dynamics in the phase space of neural gain parameters are characterized by high Active Information Storage, whereas supercritical dynamics in this phase space are associated with an increase in inter-regional Transfer Entropy. Our results suggest that the modulation of neural gain via the ascending arousal system may fundamentally alter the information processing mode of the brain, which in turn has important implications for understanding the biophysical basis of cognition.
Citation: Li M, Han Y, Aburn MJ, Breakspear M, Poldrack RA, Shine JM, et al. (2019) Transitions in information processing dynamics at the whole-brain network level are driven by alterations in neural gain. PLoS Comput Biol 15(10): e1006957. https://doi.org/10.1371/journal.pcbi.1006957
Editor: Hermann Cuntz, Ernst-Strungmann-Institut, GERMANY
Received: March 11, 2019; Accepted: September 2, 2019; Published: October 15, 2019
Copyright: © 2019 Li et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data underlying the results is produced via simulations as described in the manuscript; code to reproduce the simulations is freely available at https://github.com/macshine/gain_topology as linked from the manuscript.
Funding: MJA was supported through a Queensland Government Advance Queensland Innovation Partnership grant AQIP12316-17RD2 - https://advance.qld.gov.au/investors-universities-and-researchers/innovation-partnerships. JL was supported through the Australian Research Council DECRA grant DE160100630 - https://www.arc.gov.au/grants/discovery-program/discovery-early-career-researcher-award-decra. JMS was supported through a University of Sydney Robinson Fellowship and NHMRC Project Grant 1156536 - https://nhmrc.gov.au/funding/find-funding/project-grants. JMS and JL were supported through The University of Sydney Research Accelerator (SOAR) Fellowship program - https://sydney.edu.au/research/our-researchers/sydney-research-accelerator-fellows.html. High performance computing facilities provided by QIMR Berghofer Medical Research Institute and The University of Sydney (artemis) have contributed to the research results reported within this paper. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Although there is a long history relating individual brain regions to specific and specialized functions, regions in isolation cannot perform meaningful physiological or cognitive processes . Instead, starting at a lower scale, a few prominent features of the brain’s basic mechanisms stand out. Firstly, neurons exist in vast numbers, each acting as an individual element with a similar set of rules. Secondly, the response of individual neurons to stimuli are far from linear—small changes in the surrounding milieu can lead to abrupt changes in neural dynamics . Thirdly, all neurons interact with other neurons through synapses, and hence form a network that spans the central nervous system . Furthermore, this structural backbone supports coherence of physiological activity at larger scales, giving rise to distributed functional networks . Therefore, in every regard, the brain is a complex system whose computational power stems from the emergent properties of coordinated interactions between its components [5, 6]. Understanding how the topology and dynamics of these networks give rise to its function is one of the most central questions that computational neuroscience aims to address.
From comparing a range of physical and mathematical systems, it is known that complex systems can exist in multiple distinct phases. For instance, groups of water molecules can exist as a solid, liquid or gas, depending on the surrounding temperature and pressure. By altering one or more tuning parameters (e.g. temperature in the water example), the system can cross clearly defined critical boundaries in the parameter space. These critical transitions are typically abrupt and often associated with qualitative shifts in the function of a system (e.g. consider the stark differences between ice and liquid water). They are often of great interest due to their ubiquity and the implications for systemic flexibility .
Empirical observations in neural cultures, EEG and fMRI recordings provide evidence that the brain operates near criticality [8–16]—one form of which is a transition between two distinct states in the functional network topology  (see Fig 1c). At one extreme, different regions of the brain are highly segregated, and each region prioritizes communication within its local topological neighbourhood. At the other pole, the whole brain becomes highly integrated, and cross-regional communication becomes far more prominent. Experimentally, the resting brain is found to trace a trajectory between the two states, and can transition abruptly into the highly integrated state when the subject is presented with a cognitively challenging task .
(a) The effect of neural gain (σ) and excitability (γ), the two tuning parameters being varied in our neural mass model (see Methods), on the response of individual neurons to stimuli are shown schematically. Each input stimulus to a target region in the model contributes an effect to the rate of change of the target via a sigmoid function. Arrows in the figures indicate how the sigmoid function defining these effects changes with increases in these gain parameters (with σ increasing nonlinearity of response and γ increase amplification). (b) Previous results from  (adapted under Creative Commons Attribution License CC BY 4.0) showing that varying neural gain and excitability may cause abrupt changes in the mean phase synchrony of the brain from modelled fMRI BOLD recordings, implying the existence of a critical boundary between a segregated phase (“S”, low phase synchrony) and an integrated phase (“I”, high phase synchrony) in the brain. (c) Schematic diagram of functional brain networks in the segregated and integrated phases, and how changing neural gain and excitability may lead to transitions between the two. (d) Schematic diagram of the concept of active information storage and transfer entropy, and how they may be affected by phase transitions. Qualitatively, active information storage (green arrow) describes information on the next instance Xn+1 (blue sample) of a time series X provided by its own history (, green samples), whereas transfer entropy (orange arrow) describes that provided by the past (, orange samples) of another time series Y in the context of the target’s history. See further details on these measures in Methods.
As described in Methods regarding , this transition across a critical boundary can be achieved in a neural mass model by tuning two neural gain parameters: the neural gain σ and excitability γ. (From this point onwards, if “neural gain” refers only to the σ parameter rather than the two parameters collectively, then this is indicated by including σ in such text). Fig 1a shows how the gain parameter σ increases each region’s signal-to-noise ratio by altering the shape (or non-linearity) of the input-output curve, while the excitability γ scales the magnitude or amplification of the response. Biologically, the control parameter could plausibly be implemented through subtle alterations in the concentration of ionotropic and metabotropic neuromodulatory neurotransmitters at the level of neural circuits [19, 20]. Of these neurochemicals, dynamic changes in noradrenaline, mediated by ascending noradrenergic projections from the pontine locus coeruleus, have been shown to play a particularly important role in the modulation of the precision and responsivity (i.e., the ‘neural gain’) of targeted neurons [7, 21, 22]. The result of tuning neural gain parameters in the neural mass model can be characterized via temporal measures of the activity in each region, such as the average phase synchrony order parameter. As shown in Fig 1b, the model displays two distinct dynamic states in modelled fMRI BOLD recordings, one exhibiting high phase synchrony between the dynamics of all brain regions and the other exhibiting low phase synchrony in these dynamics. Fig 1b also shows that the sharp transition in mean synchrony requires only a small change in gain parameters, which is an illustration of the critical behaviour of the network in that region of parameter space. From another perspective, the result of tuning neural gain parameters can be seen in significant alterations to the functional network topology of the brain, as characterized by graph theoretical parameters such as the mean participation coefficient. These functional network measures provide the interpretation of integrated (high phase synchrony) versus segregated (low phase synchrony) states alluded to above (Fig 1c).
A question then arises: why might it be favourable for the brain to be in a near or quasi-critical state between these regimes in the first place? Specifically, are there computational advantages accompanying this structure? Many have proposed that this signature may reflect an evolutionary optimisation, allowing for both an effective balance between long- and short-range interactions between neural regions, as well as rapid transitions between segregated and integrated states [23, 24]. For instance, a highly segregated brain cannot communicate effectively to share information across different sub-networks, while a highly integrated state results in homogeneity in the flow of signals and a reduction in meaningful interactions, as in the case of epilepsy . Hence, an optimal state for the brain is likely a flexible balance between the two extremes. Indeed, systems poised near criticality are well-known to exhibit other distinct characteristics which could be usefully exploited in the brain, such as increased autocorrelation times and variance [25–27], increased coupling across the system [28, 29] and maximal sensitivity to tuning parameters .
Prompted by early conjecture , there is evidence from neural recordings and studies of other complex systems that phase transitions are often related to changes in the information processing structure of a system. Shew et al.  demonstrated maximal information capacity (via entropy) and sharing of information (via mutual information) near critical transitions in dynamics of neural cultures, with the transitions investigated by manipulating excitation-inhibition ratios. Although the study referred to the latter measure as “information transmission”, the mutual information remains a measure of statically shared information, and information transmission and processing in general are more appropriately modelled with measures of dynamic state updates . These measures of “information dynamics” model the interacting entities in the system as computational units, translating the intrinsic dynamics of their state updates into statistical models of how information is stored within or transferred between these entities as they update their state in time [32, 33]. Such measures have provided more direct evidence of changes in information processing structure associated with phase transitions in the brain, in preliminary results of Priesemann et al. , and in other complex systems. In artificial recurrent neural networks for example, both information transfer and storage were observed to be maximized close to a critical phase transition (with respect to perturbation propagation in reservoir dynamics) , suggesting that these intrinsic information processing advantages underpinned the known  higher performance of similar networks near the critical point on various computational tasks. These changes in information processing structure can also explain some of the aforementioned characteristics near the critical point, such as increased autocorrelation times (as a result of elevated information storage) and coupling (as a result of increased information transfer). Similar results are seen in the well-known phase transition with respect to temperature in the Ising model, with information transfer maximized near the critical regime . Furthermore, the dynamics of Boolean networks (models for gene regulatory networks ) exhibiting order-chaos phase transitions are dominated by information storage in the ordered low-activity phase, and information transfer in the high-activity chaotic phase [39, 40]. At the critical regime, networks exhibit a balance by combining relatively strong capabilities of both information storage and transfer. This transition in Boolean networks can be triggered either by directly altering the level of activity in the dynamical rules of the nodes, or by sweeping the randomness in network structure starting with a regular lattice network (low-activity) through small-world  and onto random structure (high-activity). The dynamics of the brain, being a highly analogous system to these exhibiting phase transitions between functional segregation and integration, may exhibit similar patterns in information storage and transfer capabilities near the critical regime, and on both sides of the critical boundary. Hence, we aim to examine the quantitative changes in the information processing properties of the brain under the framework of information theory.
From a Shannon information-theoretic perspective, we measure information as the reduction in uncertainty about an event with an unknown outcome . For a given time series process within a larger system, such as the blood oxygen level dependent (BOLD) data for a single voxel in the brain (i.e. the smallest identifiable region in an fMRI scan), the sources of information regarding the next event in the process include the history of the time series of the process itself, and the history of other processes in the system as inputs, such as the time series of other voxels. Here, within the framework of information dynamics  we model the amount of information storage as that provided from within a time series process using the active information storage (AIS) . We model information transfer as that provided by another source to a target process, in the context of the target past, using transfer entropy (TE) . Fig 1d provides a simple illustration of this concept.
By using computational modelling to examine the behaviour of these two information-theoretic measures across a parameter space of varying values of neural gain and excitability, we aim to address two main questions: firstly, are there differences in the information processing structure as a function of alteration of neural gain parameters? And secondly, does a quasi-critical state provide computational information processing benefits? Given the properties of the previously determined topological measures (Fig 1b) and how they relate to previous results on information processing around critical regimes, we predicted differential information processing structures across the parameter space of gain and excitability, and hypothesized that: i. the active information storage across the system should be maximized in the subcritical region before the critical boundary, whereas ii. the transfer entropy would be maximized after the boundary in the supercritical region, and iii. that storage and transfer should be relatively balanced at the critical transition. A change in information processing near criticality may allow for rapid alterations in the balance between states dominated by information storage in the subcritical phase and information transfer in the supercritical phase, hence providing flexibility for the dynamical structure of the brain to quickly adapt to and complete a wide range of tasks.
Regional time series of neuronal dynamics were generated by a 2-dimensional neural oscillator model with stochastic noise  built on top of a weighted, directed white matter connectome , simulated with the Virtual Brain toolbox . The properties of inter-regional coupling were systematically adjusted using the parameters for gain (σ) and excitability (γ) (see Methods for more details).
In contrast to the previous study which used the same underlying generative model , we did not transform the raw data into a simulated BOLD signal. Instead, information-theoretic measures were calculated directly on each region’s average membrane voltage V (which is monotonically related to the neural firing rate, see Methods)—sampled at a rate of 2 kHz. This allowed us to construct a model of information processing that was more closely linked with the underlying dynamics of the neural system. For each point in the two dimensional σ-γ parameter space, information storage was calculated for each region from its own generated time series process. Similarly, information transfer was calculated for each directed pair of regions from their own generated time series processes at each point in the σ-γ space. The fast sampling rate obliged us to apply the information-theoretic measures treating the data as arising from a continuous-time process (see Methods).
Information storage peaks in the subcritical region at intermediate γ
The active information storage of a process measures the extent to which one can model the next sample of a time series as being computed from (a time-delay phase space embedding of) its past history  (see Methods). High active information storage implies that the past states of a process are strongly predictive of the next observation. For this experiment, we measured the active memory utilization rate (AM rate), which is a formulation of active information storage suitable for continuous time processes  (see Methods).
Fig 2a plots the active memory rate (averaged over all regions) with respect to the σ-γ space. The time-delay embedding parameters for the past history were set to an embedding dimension of k = 25 with embedding delay of τ = 12 (see details in Methods). Active memory rate peaks at what was previously identified as the subcritical or segregated regime  (compare to regime identification in Fig 1b). We also see two types of transitions that divide the space up into four qualitative regions. One transition occurs over variations in γ: the highest information storage occurs at intermediate values of γ, with a sharp dropoff on either side. Within this band of intermediate γ there is also a transition in σ, with the highest information storage occurring at small values of σ, again with a sharp dropoff across the critical transition.
(a) Active memory utilization rate. (b) and (c) Mean active memory rate across σ and γ phase boundaries. (d) Network motifs supporting information storage in the dynamics of node a. (e) Correlation of AM rate to local network support (weighted motif counts). (f) Correlation of AM rate to normalized within-region synaptic connection weight. Matching colour scale is used for (e) and (f). By convention we use blue-white-red color scale for correlation plots to emphasise the positive-negative distinction, and default blue-yellow scale for other plots.
This correspondence with the previously identified regions can be observed more clearly from Fig 2b and 2c, which plots the average values across the σ and γ boundaries, respectively, based on the synchronization order parameter of the model from . A qualitative change is observed at these boundaries, where the active memory rate is highest (and peaking) in the subcritical regime with respect to σ, whilst still exhibiting sharp transitions to higher values in the supercritical regime with respect to γ.
Correlation of information storage to motif counts suggests distributed memory in supercritical regime
Information storage in time-series dynamics of network-embedded processes is known to be supported both by mechanisms internal to a node (i.e. self-loops) as well as by distributed network effects [49, 50]. The distributed network effects supporting information storage include recurrent or loop motif structures within directed structural networks, such as low order feed-forward and feedback loops, and under simple coupled Gaussian dynamics an exact relationship can be derived . For the dynamics used with this model, we do not have an analytic derivation of the exact relationship, but use a heuristic to approximate the relative “local network support” provided for information to be stored in the time-series process at a particular region. The local network support for a given region is computed, taking inspiration from , as a linear weighted path sum of specific types of motifs (of length larger than 1) involving that region. Full details are provided in Methods; example motifs supporting information storage (i.e. the low-order terms used in the local network support heuristic are shown in Fig 2d. We compute a (Pearson) correlation of this local network support for each region with its active memory rate, at each point in the parameter space, in order to infer where in the parameter space the distributed network effects are a strong factor in the variation of information storage across the regions. As a contrast we also compute a correlation of AM rate to normalized within-region synaptic connection weight (self loop weights Aii in (3) in Methods), in order to infer where in the parameter space internal (non-network) effects within each region are a strong factor in the variation of information storage (across the regions).
Fig 2e shows a high correlation between the active memory rate and local network support in the supercritical phase. This fits with established results of  that assume a noise-driven system. This can be contrasted with the correlation of AM rate to normalized within-region synaptic connection weight in Fig 2f, which peaks in the high γ subcritical regime. This suggests that primarily synaptic connections internal to a region are the strongest information storage mechanism in the segregated dynamics of the subcritical regime, whilst during the supercritical regime we observe the engagement of network effects via longer motifs to support memory largely via more distributed mechanisms in the integrated dynamics. Interestingly, we note that the subcritical regime with strongest information storage (intermediate γ, low σ) appears to have support from both within-region synaptic connections and longer motifs.
Information transfer is maximized in the supercritical region
Information transfer from one process to another is modelled by the transfer entropy  as the amount of information which a source provides about a target’s next state in the context of the target’s past (see Methods).
For this experiment on continuous-time processes, we measure the transfer entropy rate . Unless otherwise stated, we constrain the information sources considering only those which are causal information contributors to the target over their source-target time delay (following [39, 53, 54]); these are known from the directed structural connectome and delays used in the simulation (see Methods).
For each point in the parameter space of σ and γ, we calculate all the pairwise transfer entropy rates to targets from each of their causal parents. (There are an average of 19.7 causal parents per source, with a standard deviation of 7.65). These values are averaged across all such directed pairs to give the mean (pairwise) transfer entropy rate across the network at each σ and γ pair. Fig 3a shows a clear separation of the parameter space into the subcritical and supercritical regions (as defined in the previous work , see Fig 3d and 3e), with information transfer occurring almost exclusively in the supercritical or integrated regime. An additional trend within the supercritical region can be seen, with TE rate rising as σ and γ increases.
(a) Average transfer entropy rate over causal edges (those connected source → target by the directed connectome). (b) Conditional transfer entropy rate over causal edges. (c) Collective transfer entropy rate of causal edges. (d) and (e) Mean TE rate across σ boundary and γ boundaries.
Higher order terms for information transfer can also be calculated in addition to the pairwise components. Conditional transfer entropy [55, 56] (see Methods) adds the history of a third process or a collection of processes to be conditioned on in addition to the history of the target itself. For this experiment, we calculate the conditional transfer entropy of the causal source-target relationships, conditioned on all the other causal parents of the target (from the directed structural connectome), and average this across all directed causal pairs. In comparison to the pairwise TE, by conditioning on all the other causal parents the conditional TE captures only the unique information component which this source is able to provide that the others do not, and adds in synergistic or multivariate information about the target that it provides only in conjunction with the set of other sources. Furthermore, any information it holds about the target which is redundant with the other sources is removed.
The collective transfer entropy  (see Methods) captures the total transfer of information from a group of sources to a target. We examine the total information from the full set of causal parents to a particular target, and average this across all target processes. The collective TE captures all of the information provided by the sources about the target, whether that information is provided uniquely by any single source, or redundantly or synergistically by some set of them. Importantly, it does not “double count” information held redundantly across multiple sources.
Overall, the mean conditional TE rate in Fig 3b and the mean collective TE rate in Fig 3c show qualitatively similar trends to the pairwise TE rate in Fig 3a, with the same strong mean transfer in the supercritical region and trend towards high γ and σ within that region. From comparing the peak values of the different figures, it can be seen that substantial redundancies exist in the information held about the target between the different sources. That is, the conditional transfer entropy rate is an order of magnitude lower than the simple pairwise measure—indeed, at these low levels, the conditional TE is far less distinguished in the supercritical region from that in the subcritical region. The comparison to pairwise TE suggests that each causal parent is not able to provide much additional information beyond that already apparent from the other parents. The collective transfer entropy rate, on the other hand is an order of magnitude higher than the simple pairwise measure. This may, however, simply be due to the effect of adding up the transfer entropy rate from the different sources, and the collective transfer entropy divided by the number of sources is not actually higher than the pairwise measure. It is difficult to conclude from this whether there are network level effects that give rise to synergies beyond looking at each region in pairwise fashion (see Methods).
The behaviour of information transfer (Fig 3) was observed to be complementary to the patterns of information storage (Fig 2). This paints a picture of a distinct mode in the dynamics of information processing that switches abruptly as the system moves between the supercritical and subcritical phases. The subcritical or segregated phase effectively contains only information storage dynamics, whereas the supercritical or integrated phase contains a significant level of information transfer. (Subtleties within these phases are described in the Discussion). However, it should be noted that the values for transfer entropy rate are still smaller than the active memory rate by one or two orders of magnitude, even in the collective case. This is partially due to the relative simplicity of our neural mass model, and particularly to the regularity of the oscillations which they produce. The relative changes in storage and transfer separately across the phases though are far more important in understanding the dynamics than the difference in scale when the two are compared.
Correlation of information transfer with in-degree shows change in behaviour at the phase boundaries
Fig 4a examines the correlation of the pairwise transfer entropy rate, averaged across all outgoing directed connections in the network for each source, with the in-degree of the source. This correlation is expected in general (having been observed in [57, 58] and somewhat related to [59–62]) because sources with higher in-degree have greater diversity of inputs, and so potentially more available information to transfer. The expected effect is observed in the supercritical phase, suggesting integration of the information from the different source inputs. Fig 4b shows the correlation of conditional transfer entropy rate to source in-degree (again with the conditional TE rate measure averaged for all outgoing connections for each source); the trend is far less pronounced there, only being observed around the critical boundary, due largely to the much smaller values of conditional TE than pairwise in Fig 3.
(a) Correlation between TE rate and source in-degree. (b) Correlation between conditional TE rate and source in-degree. (c) Correlation between collective TE rate and target in-degree. Matching scale is used across all subfigures.
Fig 4c shows the correlation of the collective transfer entropy rate for each target to target in-degree. The target in-degree is used instead of the source because one value of collective transfer entropy is produced for each target, while the contribution over all sources is combined. Because the collective TE rate captures the combined effect from all sources, it can be expected that this will increase with the in-degree of the target, and so the correlation should be quite strong. This is observed across the phase space, though is weaker at the critical boundary (which on inspection appears to be due in part to larger non-linearities in the TE-degree relationship there).
Inter-hemisphere information transfer is high in the supercritical region
The transfer entropy can be calculated solely for the causal edges which link regions between the two hemispheres. Only 38 links are inter-hemispheric (out of the total of 1494, not counting self loops). The weights of these connections are also relatively low, with an average weight of 1.07 (standard deviation of 0.86) compared to an average weight of 1.91 (standard deviation of 0.63) over all links, not counting self loops. Despite this, however, the average transfer entropy rate of these inter-hemisphere links in Fig 5a follows the same pattern as the standard pairwise transfer entropy rate seen in Fig 3a and the peak values are only 50% lower. This suggests that information transfer across hemispheres is significant, especially since Fig 5a favours the high γ, high σ part of the supercritical region, which may help explain why this trend is seen in Fig 3a.
(a) TE rate over interhemisphere causal edges. (b) Proportion of significant TE rate measurements occuring between interhemisphere source and target.
Fig 5b shows the outcome of a second test which again highlights the importance of inter-hemisphere information transfer in the supercritical phase. This figure looks beyond causal links to compare the proportion of total statistically significant pairwise transfer entropy which occurs between hemispheres. The transfer entropy rate is first calculated for all pairwise combinations of source and target (whether they are linked in the directed connectome or not), at each point in the parameter space. However, pairs which do not give a level of transfer entropy statistically different to zero are ignored (see Methods). The remaining pairs are used to calculate the proportion of total pairwise transfer entropies that are accounted for by inter-hemisphere transfer, for each point in the parameter space. (Note that at points in σ, γ space where only a single pair had significant TE, the proportion is set to 0 in order to avoid noise in the plot where the proportion cannot be well determined). This proportion is close to half in the supercritical phase, showing that there is a large indirect effect of the information transferred between hemispheres. Even though there are only a few causal links between hemispheres, the information transferred by these “long” links is novel and becomes redistributed within the hemisphere, underpinning the higher levels of integration observed in the supercritical phase.
Using an information-theoretic decomposition, we extend previous work  by demonstrating that a gain-mediated phase transition in functional network topology is associated with a fundamental alteration in the information processing capacity of the whole brain network. Importantly, during this transition the underlying coupling strength and connectivity matrix are kept constant: the local dynamics are altered due to changes in the neural gain and excitability parameters, which then leads to changes in the effective connectivity (being a result of both local dynamics and large-scale structural connectivity). By modelling the distributed computation of the neural system in terms of information storage and information transfer, our results suggest that the shift from segregated to integrated states confers a computational alteration in the brain, which may be advantageous for certain cognitive tasks . We thus reinterpret the gain-mediated transition in the functional configuration of the network in terms of the effective influence that neural regions can have over one another within a complex, adaptive, dynamical system . Namely, subtle alterations in the neural gain control parameters lead to large transitions within the state space of functional topology, even within the constraints imposed by a hard-wired structural scaffold, with the resulting modulation of information processing capacity of the brain represented in different patterns of neural effective connectivity .
In previous work , we identified a distinct boundary that was mediated by alterations in neural gain parameters, which have long been linked to the functioning of the ascending (noradrenergic) arousal system . Although we focus here on the noradrenergic system, recent work  has suggested that the effects of gain-mediated arousal in the brain may encompass a more distributed system of neuromodulatory nuclei. As such, the results of this study should be examined through this more distributed lens in future work. Extending this previous work (as currently modelled) into the domain of information processing, we here observed a qualitative shift in regional computational capacity on either side of the gain-mediated phase transition. Our information processing perspective models the interacting entities in the system as computational units, creating statistical models of how information is stored within or transferred between these entities during their intrinsic state updates [32, 33]. Here, we apply this perspective to study the information storage and transfer in the time-series activity of the membrane voltages Vi of each region, which interact as shown in (1)–(4). The qualitative shift in information processing we observed was that information storage (Fig 2a) was maximal in the subcritical region (at intermediate γ), whereas information transfer (Fig 3) was effectively non-existent in the subcritical region before it peaked in the supercritical region. This result is strongly aligned with the observed transition in phase synchrony observed in previous related studies. In a system of Kuramoto oscillators, Ceguerra et al.  showed that the synchronization process can be modelled as a distributed computation, with larger and increasing transfer entropy associated with more strongly synchronized or integrated network states. In the case of our model, we expect that maintaining synchronisation in the face of noise requires strong ongoing transfer between the relevant regions.
Despite the strong qualitative effects observed at intermediate γ, the relationship between gain parameters and information processing was distinctly non-monotonic. By tracking information-theoretic measures across the parameter space, we were able to distinguish six unique zones with qualitative differences in information processing dynamics (Fig 6). For example, Zone 4 contained globally-synchronized oscillations which were relatively large, and also (compared to Zone 3) exhibited the strongest information transfer values. Note that this is not directly because the absolute range of the variables is larger (information measures on continuous-valued variables are scale independent ) but specifically due to variations in the relationships between dynamics of the regions. The differences between Zone 5 and Zone 6—both of which occur at high γ but have distinct between-hemispheric TE (TE6 ≫ TE5, Fig 5b) and AM rate (Fig 2a, including different dependencies on local network structure and self-loops in Fig 2e and 2f)—are also of interest, as they suggest that there may be distinct information processing signatures related to increasing multiplicative and response gain  to maximal levels, as in the case of epileptic seizures . Future work should attempt to determine whether these categories are consistent across generative models, or perhaps relate to individual differences in topological recruitment across diverse cognitive tasks .
A transparent figure of the TE rate from Fig 3a is shown behind for comparison. Dotted lines represent a looser boundary, which are not observed in all measures.
As a general framework for understanding distributed computation within complex systems, the translation of the previous results into the language of information storage and information transfer allows their comparison to other systems whose information dynamics have been shown to undergo phase transitions, including artificial neural networks , random Boolean networks [39, 40] as models for gene regulatory networks, the Ising model  and indeed Kuramoto oscillators  as mentioned above. There appears to be substantial universality among the results from these systems, with similar patterns of information storage and transfer often observed around critical phase transitions—and crucially these patterns are echoed here in transitions driven by alterations to neural gain parameters in our neural mass model. Across all of these systems, we consistently observe that dynamics of subcritical states are dominated by information storage operations underpinning higher segregation, whilst information transfer amongst the components of the networks plays a much more significant role in the dynamics of supercritical states leading to higher integration. In contrast to both, the critical state exhibits intermediate or strong values of both operations of information storage and transfer, achieving something of a balance in dynamics—a result which we emphasize was specifically observed again for the neural gain driven transitions examined here.
These insights allow us to address the question posed in our Introduction: are there computational advantages for the brain to operate in a near-critical state? In alignment with these results from other systems, and hypothesised as discussed earlier [13, 15, 16], the balance reached with both of these operations strongly exhibited near the critical state could be expected to support a wide range of general purpose cognitive tasks (requiring both types of operations), as well as in allowing flexibility for rapid transitions to either sub- or supercritical behaviour in order to alter the computational structure and dynamics as required. Indeed it is straightforward to identify situations that would require rapid transitions away from criticality toward more segregated or integrated operation. A relatively segregated, modular architecture is comprised of regions with high information storage, suggesting that situations in which a more segregated architecture is beneficial to cognitive performance—such as a motor-learning task  or visual vigilance —may retain their capacity for improved performance by promoting heightened information storage. In contrast, cognitive states associated with integration—such as working memory  or attention [68, 69]—may reflect heightened inter-regional influence, and hence, information transfer between the diverse specialist regions housed within distinct locations in the cortex and subcortex . The flexibility inherent in operating near a critical state would be expected to be crucial in supporting rapid transitions to support either broad type of task, and indeed the timescales that such transitions could be achieved in (near the critical state or otherwise) is an important area for future investigations.
The above interpretations align with a broader conjecture regarding utility of critical dynamics, such as in the “edge of chaos” hypothesis [31, 38] as well as more specific considerations regarding the utility of operating near criticality (but not directly at the “edge of chaos”) for the brain . This convergence of results across the aforementioned systems suggests that the rules governing the organisation of whole brain dynamics may share crucial homology with other complex systems, in both biology and physics. However, inferring direct algorithmic correspondence will require more focused, direct comparison between the different systems. Furthermore, work remains to explain conditions leading to subtle differences in the patterns exhibited across the systems, for example the additional maximization of information storage and transfer capabilities near criticality in some transitions (e.g. Ising model ) but only a crossover without maximization in others (such as Kuramoto model  and the neural dynamics here).
The approach of information dynamics also provides a computational description of the dynamics of the system as they unfold at a local or point-wise level through time and across space . Such descriptions provide quantitative insights regarding Marr’s “algorithmic” level  of how entities are represented within and operated on by a neural system . In this study, we have not focused on the temporal dynamics of any particular task, but instead have examined the distribution of information processing signatures across the network. In particular, we have identified how the informational signature of brain dynamics relates to network structure as we transition across the neural gain parameter space. While the underlying network structure does not change, we seek to understand how its impact on the dynamics varies across the parameter space. Our approach allowed us to tease apart the relative importance of local network-supported versus internal mechanisms for information storage (Fig 2), where the local network support explained much of the storage (as suggested for different dynamics ) except for within the strongly segregated regime. We also found that source regions with large in-degrees (i.e. hubs) tended to be stronger information transfer sources, again for much of the parameter space except the strongly segregated regime. This aligned with our hypotheses and findings in other systems [57, 58], as well as related results such as that the degree of a node is correlated to the ratio of (average) outgoing to incoming information transfer from/to it in various dynamical models (including Ising network dynamics on the human connectome) [59, 60]. Finally, we compared the information transfer between hemispheres with the information transfer within hemisphere. Large proportions of information transfer could be apparently observed between non-directly linked regions across hemispheres in the supercritical regime, suggesting that the relatively large information transfer on the small number of inter-hemispheric causal edges supports significant global integration in this regime. These local views of network structure were thus linked to whole-brain macroscopic topology in important ways. The extent to which the patterns triggered by changes in neural gain parameters are targeted or global is a crucial question for future research, particularly given the recent appreciation of the heterogeneity of firing patterns within the locus coeruleus .
We note that the measures of information processing were estimated here using a Gaussian model, assuming linear interactions between the variables. This estimator was selected for efficient performance on the large data set and parameter space. Although such estimators may not directly capture strongly non-linear components of the interactions, they nevertheless provide a useful descriptive statistic even when the linear-Gaussian model is violated. We note that the linear component often dominates (e.g. ), and indeed the larger embeddings such estimators support provide additional terms to indirectly model non-linear components in AIS and TE.
In the present study, we simulated changes in population gain through two manipulations to a standard, symmetric sigmoid curve—modifying its height and changing its nonlinearity. The sigmoid-response curve is a first order approximation to the heterogenous and occasionally non-monotonic response curves of individual neurons and small-scale populations that rests upon the “diffusion approximation”—namely that more complex response curves at small scales merge into a homogenous sigmoid response curve at large-scale under the central-limit-theorem which holds as long as their states are only weakly correlated at large-scales [74–76]. In scenarios where this does not hold true, more complex effects—such as post-inhibitory rebound firing—can be accommodated by introducing asymmetries into the firing rate response curve, or adding additional dynamical variable, such as slow, voltage-dependent calcium states [77, 78]. These important effects are the subject of ongoing work in many groups, including our own.
The motivation for the previous study  was an attempt to explain the mechanistic basis of fluctuations in functional network topology that were observed in empirical BOLD data , which were hypothesized to be functionally related to ongoing dynamics in the ascending arousal system [17, 79]. However, the sluggish temporal nature of the haemodynamic response typically clouds the interpretation of causal or indeed effective connectivity between brain signals [80, 81]. In particular, variable delays between neural activity and peak haemodynamic response around the brain means that temporal precedence in the BOLD response does not necessarily imply neuronal causality. While approaches have been suggested to address this issue [82, 83], we instead investigated information-theoretic signatures on simulated neural data, which has a much higher effective sampling frequency than BOLD, and is also relatively unaffected by the temporal convolution that masks neural activity in the BOLD response. In doing so, we highlight important multi-level organisation within the simulated neural time series, in which whole-brain topological signatures (measured using BOLD) overlap with specific signatures of regional (neural) effective connectivity. It remains an open question whether this relationship holds in empirical data; increasing availability of intracranial human sEEG data will allow this to be tested more directly than with BOLD, however it bears mention that the direct comparison of empirical data with the results of simulations may in turn require the implementation of more biologically realistic models in order to capture the nuances present in biological data . In any case, our approach certainly holds promise for advancing our interpretation of fluctuations in global network topology across cognitive states [18, 85, 86].
In conclusion, we have shown that modulating neural gain parameters in a biophysical model of brain dynamics leads to a shift in the computational signature of regional brain activity, in which the system shifts from a state dominated by self-referential information storage to one distinguished by significant inter-regional effective connectivity. These results provide a crucial algorithmic foundation for understanding the computational advantage of whole-brain network topological states, while simultaneously providing a plausible biological mechanism through which these changes could be instantiated in the brain—namely, alterations in the influence of the ascending arousal system over inter-regional connectivity.
Materials and methods
Simulation of neural activity
Neural activity was modelled (as per ) as a directed network of brain regions, with each region represented by an oscillating 2-dimensional neural mass model  derived by mode decomposition from the Fitzhugh-Nagumo single neuron model . Directed coupling between 76 regions was derived from the CoCoMac connectome  with axonal time delays between regions computed from the length of fiber tracts estimated by diffusion spectrum imaging . The model was simulated by stochastic Heun integration  using the open source framework The Virtual Brain .
The neural mass model at each region is given by the Langevin Eqs 1 and 2, which express the dynamics of local mean membrane potential (V) and the slow recovery variable (W) at each regional node i: (1) (2)
Here, all ξi and ηi are independent standard Wiener noises, and I is the synaptic current, given by (3) where Aij is the connection weight from j to i in the directed connectivity matrix and τij is the corresponding time delay from j to i (estimated as described above). Note that the network contains 1560 directed connections (66 being self-links), with τij on non-self links having an average of 19.8 ms (standard deviation 8.32 ms). A sigmoid activation function was used to convert membrane potentials to normalized firing rates Si, where m = 1.5 was chosen to align the sigmoid with its typical input: (4)
Using this model, we modulate the inter-regional coupling by varying the parameters for gain (σ in (4)) and excitability (γ in (1)) over a range of values between 0 and 1. At each parameter combination, membrane voltage (Vi(t)) over time for each region was recorded as the time series input for the analysis of information dynamics. A sample length of 100,000 values per time series was used in the following analysis, corresponding to 50 seconds of one sample per 500 microseconds. The 2 kHz sampling rate was selected as described regarding the transfer entropy measure below. Each iteration was started from a different random initial condition.
Measures of information dynamics
The framework of information dynamics uses information-theoretic measures built on Shannon entropy to model the storage, transfer and modification of information within complex systems. It considers how the information in a variable Xn+1 at time n + 1 can be modelled as being computed from samples of this and other processes at previous times. Information modelled as being contributed from the past of process X is labelled as information storage, while information modelled as contributed from other source processes Y is interpreted as information transfer.
The Java Information Dynamics Toolkit (JIDT)  was used to calculate these measures empirically using the time series of neuronal membrane voltage from the 76 regions. For each combination of σ and γ parameter values, the active memory rate was calculated for each region, and the transfer entropy rate was calculated for each combination of two regions. Collective and conditional transfer entropy rates were also calculated. Each of these measures is explained in the following sections. The linear-Gaussian estimator in JIDT was utilized in these calculations (which models the underlying processes as multivariate Gaussians with linear coupling). As per our Discussion, this remains a useful descriptive statistic even when the assumed model is violated.
Active information storage.
Active Information Storage (AIS)  models the contribution of information storage to the dynamic state updates of a process X by measuring how much information from the past of X is observed in its next observation Xn+1. It is defined as the expected mutual information between realizations of the past state at time n and the corresponding realizations xn+1 of the next value Xn+1 of process X : (5)
Formally, the states are Takens’ embedding vectors  with embedding dimension k and embedding delay τ, which capture the underlying state of the process X for Markov processes of order k. In general, an embedding delay of τ ≥ 1 can be used, which may help to better empirically capture the state from a finite sample size. (Note that non-uniform embeddings can be used ).
The determination of these embedding parameters followed the method of Garland et al.  finding the values which maximize the AIS, with the important additional inclusion of bias correction (because increasing k generally serves to increase bias of the estimate) . For several sample σ, γ pairs in both the sub- and supercritical regimes we examined these parameter choices across all regions (up to k, τ ≤ 30), and found the optimal choices to be consistently close to k = 25 and τ = 12 for all variables, for the sampling interval Δt = 0.5 ms with 10000 samples. For example, for σ = 0.3, γ = 0.5 (subcritical), the mean of the optimal k across regions was 23.8, with standard deviation 6.7, and median 25; whilst the optimal τ across regions was 13.6 with standard deviation 4.8 and median 12. Similarly for σ = 0.6, γ = 0.5 (supercritical), the mean of the optimal k across regions was 24.1, with standard deviation 3.2, and median 25; whilst the optimal τ across regions was 13.1 with standard deviation 5.3 and median 13. As such, k = 25 and τ = 12 were used for all investigations. The total period of history covered by this embedding (300 time points, or 150 ms) corresponds to approximately 1.5 periods of the underlying oscillations in the subcritical regime and slightly under 1 period in the supercritical regime.
Note that while a larger AIS is likely to give rise to a larger auto-correlation time, there are significant differences between the two measures which make AIS much more powerful, and directly relevant for modelling the utilisation of information storage (unlike autocorrelation). Primarily these differences stem from AIS examining the relationship between multiple past values (as the embedded past) to the current value of the time series, taking into consideration whether those past values are providing the same information redundantly or unique information, or indeed are synergistically providing more when they are considered together. Auto-correlation values in contrast only ever examine relationships from one past value to the current, and are unable to resolve such complexities in the process. (As an information-theoretic measure, AIS can also capture non-linear interactions, although in this study we only use a linear estimator). This leads to the AIS providing very different values to auto-correlation, and indeed much richer insights. For example, significant reductions were observed in AIS in multiple regions of Autism Spectrum Disorder (ASD) subjects versus controls , indicating significantly reduced use or precision of priors in dynamic state updates of ASD subjects. In contrast, no such differences were observable using auto-correlation times or signal power.
Because of the fast sample rate (Δt = 0.5 ms) of the neuronal time series, proper analysis requires using a formulation of the information-theoretic measures suitable for continuous time processes. In general, this means that information storage and information transfer are conceptualized as measures that accumulate over some finite time interval at an associated rate . Both the accumulated and rate measures, however, diverge in the limit as the time step approaches zero. Intuitively, for continuous processes such as those here, this is because all information about the next time step can be captured by the previous time step in the limit as the two samples become essentially identical. These divergent properties can be circumvented by decomposing active information storage into components, , comprising :
- the instantaneous predictive capacity which measures the information storage from the immediately previous time step, IX = I(Xn; Xn+1), and
- the active memory utilisation rate (AM rate) which measures the additional accumulation rate of information storage from time steps before that, .
The instantaneous predictive capacity inherits the divergent nature of the active information storage, while the active memory utilization rate takes on the intuitive representation of memory as a rate . Crucially, converges to a limiting value as Δt → 0 for well-behaved continuous processes such as those considered here unlike AX and IX (see full details in ), and thus only is used in our investigations here. As a rate, the units of measurement of are in bits per second. As recommended by Spinney et al. [48, 52], Δt = 0.5 ms was selected on confirming that the transfer entropy rate (see next subsection) and are stable to Δt in this regime and appear to have converged to a limiting value as Δt → 0.
Transfer entropy [44, 97] models the contribution of information transfer from a source process Y to the dynamic state updates of a destination (or target) process X by measuring the amount of information that Y provides about the next state of process X in the context of the destination’s past. This perspective of modelling of information transfer after first considering storage from the past contrasts the two operations, and ensures that no information storage is attributed as having been transferred . The transfer entropy has been used to model information flow from neural time-series recordings, using various recording types and experiments, e.g. [98–103].
Quantitatively, the transfer entropy is the expected mutual information from realizations of the state of a source process Y over a delay u to the corresponding realizations xn+1 of the next value Xn+1 of the destination process X, conditioned on realizations of its previous state : (6)
The target embedding parameters k and τ are set to the same values as determined for the previous information storage calculations, as is standard when the two operations are being considered. In general, an embedding of the source state with l > 1 could be considered as this would allow Y to be a Markovian process where multiple past values of Y in addition to yn−u+1 are information sources to xn+1. However for this analysis only l = 1 previous time step of the source process is used (denoted TY→X(k, τ, u)), in line with the known contribution of a single value of the source time series to the target in the neural model in (1)–(4) (as is standard in this situation ).
In order to best model information transfer, the source processes for TE measurements are constrained to the known causal information contributors . In this case these are the upstream parents of the target in the structural connectivity matrix. Further, the source-target delays u are set to the number of discrete time steps aligning with (or rather, being the smallest integer of discrete steps giving a time delay larger than) the known source-target delays used in the model (3), as is best practice [33, 54]. Where, the TE is estimated for pairs that are not directly causally linked in the model, the time delay is estimated in the same way from corresponding fiber tract lengths.
Finally, note that transfer entropy estimations can be non-zero even where the source and destination processes have no directional relationship, due to estimator variance and bias (see summary in [97, Sec. 4.5.1]). As such, one can make a statistical test of whether a transfer entropy estimate is statistically different from the null distribution of values that would be observed for source and destination processes with similar properties but no directed relationship. As described in [91, App. A.5], the null transfer entropies are created from surrogate time series generated by permutation resampling of the source embeddings : these retain the memory in the target and the source distribution p() but not the source-target transition relationship . We perform a test of statistical significance for each directed pair of processes in producing Fig 5b, retaining there only the transfer entropies for pairs that were determined to be statistically significant against the null distribution against a p-value threshold of α = 0.05. This test is carried out analytically for the Gaussian estimator, as per  (summarised in [91, App. A.5]), with a Bonferroni correction for all directed pairs that are tested.
Conditional and collective transfer entropy.
Higher order terms of information transfer can capture the multivariate effects from multiple sources to a single target. Two higher order terms which were calculated are the conditional and collective transfer entropies.
Conditional transfer entropy [55, 56, 106] extends the basic form of transfer entropy by conditioning on the history of another source process, Z. This captures the mutual information between the past of source Y and the next value of target X, conditioned on both the history of X and the history of conditional source Z: (7)
Of course, the above may in general incorporate embeddings for both Y and Z, and a delay Z to X, and can be extended to condition on several other sources Z (excluding Y) at once. It should be noted that a conditioned transfer entropy can be either larger or smaller than the unconditioned measure, in the same way that a conditional mutual information can both increase due to the addition of synergistic information that can only be decoded with knowledge of both the source and conditional, as well as decrease due to a removal of redundant information provided by both the source and conditional. The conditional transfer entropy thus includes unique information from the source but not the conditional, and synergistic information provided by the source and conditional together, in the context of the past of the target [55, 107]. These components cannot be pulled apart using the tools of traditional information theory, but efforts are being made by approaches of Partial Information Decomposition (PID) [108–110].
At the same time, collective transfer entropy [56, 100] models the total information transfer from a set of sources to a target, capturing unique information from each source, avoiding double-counting redundant information across the sources, and capturing multivariate synergistic effects. Given a multivariate set of sources Y, this refers to the measure TY→X(k, τ, u) (again ignoring possible embeddings on the Y processes, or different delays for each Y in Y).
For these experiments, we compute conditional transfer entropy rate and collective transfer entropy rate, similar to the pairwise transfer entropy rate. Only the highest order conditional transfer entropy is calculated (known specifically as complete transfer entropy [55, 56]). This means that for each causal source, the conditional transfer entropy to a particular target involved conditioning on all the other causal sources to that target, as identified from the directed connectome. Also, we calculated the collective transfer entropy to a given target from all causal sources to that target region, as identified from the directed connectome. The delays from each source to the target, as well as for each conditional source to the target, are determined so as to match those for the model in (3) as per the pairwise transfer entropy above.
The 76 regions of this model are connected by a directed, weighted network Aij (see (3)) derived from the CoCoMac connectome . The information storage of each region is expected to be related to both the strength of self-loops internal to that region as well as distributed network effects, since both intuitively provide mechanisms for the past activity of a region to influence its future dynamics [49, 50]. The support for storage provided by network effects will depend on the number (and weight) of certain network motifs which provide feedforward and feedback loops involving that region. For linearly-coupled Gaussian processes the active information storage can be calculated as a function of weighted counts of these motifs . However, for the more complex dynamics of this system involving non-linearities in (1) and (4), we cannot derive an exact relationship. Instead, we generate heuristics to approximate the relative weights of self-loops and (relevant parts of the) distributed network structure in supporting storage at a particular node or region a, in comparison to all other regions. Next, we correlate (via Pearson correlation) these heuristics for each region to their active memory, at each point in the parameter space (across all nodes for a given σ, γ), in order to infer which mechanism (local or network effects) appears to be a more relevant factor in supporting information storage across the network at each point in the parameter space. Details of these heuristics are as follows.
We approximate the relative capacity of network structure for information storage for a given region a (in comparison to other regions) in the local network support heuristic, which is a weighted linear combination of certain motif counts involving a. As above, we cannot derive the precise capacity for information storage provided by these motifs under these dynamics, so our heuristic simply counts the (weighted) number of such motifs that are known to be relevant in general for information storage at node a. The motifs which were considered (based on those identified in ) incorporated feedback loops including the target node a and feedforward loops terminating at node a (see Fig 2d). The weighting given to each motif depends on the number of incoming links to a and their edge weights (which are derived from the coupling strengths Aij). First the edge weight of each link is normalized by the total incoming edge weight for each target (excluding self loops) to generate C = D−1(A − diag(A1,1, A2,2, …, A76,76)), where D = diag(d1, d2…, d76) with di = ∑j≠i Aij. This normalization takes inspiration from generation of normalized Laplacians , and is intended to weight the contribution along each edge on a path as relative to other contributions into the same target node (i.e. the more incoming edges one edge competes with, the smaller its relative contribution to the target’s dynamics will be). The local network support Ψa at node a is then computed as a linear sum of the relevant motifs at node a using the normalized weights C: (8)
Note that the four weighted motif counts in the equation for Ψa correspond respectively to the four motifs shown in Fig 2d to contribute to storage in the dynamics of node a. The contribution from longer motifs diminishes with length due to the normalisation, and so we limit (8) to the shortest two contributing feedback and feedfoward motifs (except for any self-loop at a). We emphasise again that the heuristic Ψa does not precisely capture the extent to which information storage is supported by the local network structure at node a; by simply counting the (weighted) relevant motifs for storage at a it is intended to approximately indicate relative capacity provided for network-supported information storage at different parts of the system.
As described above, the self-loops were ignored in the local network support measure in order to consider relative support for memory from distributed network effects only in Ψa. The relative contribution of self-loops Aii (i.e. synaptic connections between neurons within the same brain region) to memory was instead analysed separately so as to compare the two effects. A similar weighting was applied to self loops in order to normalize their contribution with respect to total incoming edge weights (this time including the self loop). Here, we first computed F = G−1A, where G = diag(g1, g2…, g76), gi = ∑j Aij. Then, in order to evaluate the relative strength of contribution of self-loops to memory across the parameter space, we correlated the Fii with the observed active memory rate across all nodes for each given σ, γ pair. Note that we focus here on the synaptic connections between neurons within the same brain region modelled by Aii, which is mediated by the neural gain parameters. This analysis does not include the feedback terms in V and W in (1) and (2) which correspond to self-coupling in the oscillatory dynamics of individual neurons that make up the population. We do not include those terms because they are constant across regions and are not moderated by the neural gain parameters.
- 1. Friston KJ. Functional and effective connectivity: a review. Brain connectivity. 2011;1(1):13–36. pmid:22432952
- 2. Breakspear M. Dynamic models of large-scale brain activity. Nature Neuroscience. 2017;20(3):340. pmid:28230845
- 3. Swanson LW. Brain architecture: understanding the basic plan. Oxford University Press; 2012.
- 4. Bullmore E, Sporns O. Complex brain networks: graph theoretical analysis of structural and functional systems. Nature Reviews Neuroscience. 2009;10:186. pmid:19190637
- 5. Varela F, Lachaux JP, Rodriguez E, Martinerie J. The brainweb: Phase synchronization and large-scale integration. Nature Reviews Neuroscience. 2001;2:229. pmid:11283746
- 6. Deisboeck TS, Kresh JY. Complex Systems Science in Biomedicine. Boston, MA: Springer Inc; 2006.
- 7. Shine JM, Aburn MJ, Breakspear M, Poldrack RA. The modulation of neural gain facilitates a transition between functional segregation and integration in the brain. eLife. 2018;7:e31130. pmid:29376825
- 8. Cocchi L, Gollo LL, Zalesky A, Breakspear M. Criticality in the brain: A synthesis of neurobiology, models and cognition. Progress in Neurobiology. 2017;158:132–152. pmid:28734836
- 9. Beggs JM, Plenz D. Neuronal avalanches in neocortical circuits. Journal of Neuroscience. 2003;23(35):11167–11177. pmid:14657176
- 10. Shew WL, Yang H, Yu S, Roy R, Plenz D. Information capacity and transmission are maximized in balanced cortical networks with neuronal avalanches. Journal of Neuroscience. 2011;31(1):55–63. pmid:21209189
- 11. Priesemann V, Wibral M, Valderrama M, Pröpper R, Le Van Quyen M, Geisel T, et al. Spike avalanches in vivo suggest a driven, slightly subcritical brain state. Frontiers in Systems Neuroscience. 2014;8(108). pmid:25009473
- 12. Priesemann V, Valderrama M, Wibral M, Le Van Quyen M. Neuronal Avalanches Differ from Wakefulness to Deep Sleep—Evidence from Intracranial Depth Recordings in Humans. PLOS Computational Biology. 2013;9(3):e1002985. pmid:23555220
- 13. Wilting J, Dehning J, Neto JP, Rudelt L, Wibral M, Zierenberg J, et al. Dynamic Adaptive Computation: Tuning network states to task requirements. arXiv preprint arXiv:180907550. 2018;.
- 14. Wilting J, Priesemann V. Inferring collective dynamical states from widely unobserved systems. Nature Communications. 2018;9(1):2325. pmid:29899335
- 15. Deco G, Jirsa VK. Ongoing Cortical Activity at Rest: Criticality, Multistability, and Ghost Attractors. Journal of Neuroscience. 2012;32(10):3366–3375. pmid:22399758
- 16. Hahn G, Ponce-Alvarez A, Monier C, Benvenuti G, Kumar A, Chavane F, et al. Spontaneous cortical activity is transiently poised close to criticality. PLOS Computational Biology. 2017;13(5):1–29.
- 17. Shine JM, Bissett PG, Bell PT, Koyejo O, Balsters JH, Gorgolewski KJ, et al. The Dynamics of Functional Brain Networks: Integrated Network States during Cognitive Task Performance. Neuron. 2016;92(2):544–554. pmid:27693256
- 18. Shine JM, Poldrack RA. Principles of dynamic network reconfiguration across diverse brain states. NeuroImage. 2017;180:396–405. pmid:28782684
- 19. Lee SH, Dan Y. Neuromodulation of Brain States. Neuron. 2012;76(1):209–222. pmid:23040816
- 20. Marder E. Neuromodulation of Neuronal Circuits: Back to the Future. Neuron. 2012;76(1):1–11. pmid:23040802
- 21. Servan-Schreiber D, Printz H, Cohen J. A network model of catecholamine effects: gain, signal-to-noise ratio, and behavior. Science. 1990;249(4971):892–895.
- 22. Fazlali Z, Ranjbar-Slamloo Y, Adibi M, Arabzadeh E. Correlation between Cortical State and Locus Coeruleus Activity: Implications for Sensory Coding in Rat Barrel Cortex. Frontiers in Neural Circuits. 2016;10:14. pmid:27047339
- 23. Beggs J, Timme N. Being Critical of Criticality in the Brain. Frontiers in Physiology. 2012;3(163). pmid:22701101
- 24. Chialvo DR. Emergent complex neural dynamics. Nature Physics. 2010;6:744.
- 25. Scheffer M, Bascompte J, Brock WA, Brovkin V, Carpenter SR, Dakos V, et al. Early-warning signals for critical transitions. Nature. 2009;461(7260):53–59. pmid:19727193
- 26. Kuehn C. A mathematical framework for critical transitions: Bifurcations, fast–slow systems and stochastic dynamics. Physica D: Nonlinear Phenomena. 2011;240(12):1020–1035.
- 27. Aburn MJ, Holmes CA, Roberts JA, Boonstra TW, Breakspear M. Critical Fluctuations in Cortical Models Near Instability. Frontiers in Physiology. 2012;3:331. pmid:22952464
- 28. Matsuda H, Kudo K, Nakamura R, Yamakawa O, Murata T. Mutual information of ising systems. International Journal of Theoretical Physics. 1996;35(4):839–845.
- 29. Ribeiro AS, Kauffman SA, Lloyd-Price J, Samuelsson B, Socolar JES. Mutual information in random Boolean models of regulatory networks. Physical Review E. 2008;77(1):011901.
- 30. Prokopenko M, Lizier JT, Obst O, Wang XR. Relating Fisher information to order parameters. Physical Review E. 2011;84:041116.
- 31. Langton CG. Computation at the edge of chaos: Phase transitions and emergent computation. Physica D: Nonlinear Phenomena. 1990;42(1):12–37.
- 32. Lizier JT. The Local Information Dynamics of Distributed Computation in Complex Systems. Berlin/Heidelberg: Springer; 2013.
- 33. Lizier JT. In: Wibral M, Vicente R, Lizier JT, editors. Measuring the Dynamics of Information Processing on a Local Scale in Time and Space. Berlin/Heidelberg: Springer; 2014. p. 161–193.
- 34. Priesemann V, Lizier J, Wibral M, Bullmore ET, Paulsen O, Charlesworth P, et al. Self-organization of information processing in developing neuronal networks. BMC Neuroscience. 2015;16(Suppl 1):P221+.
- 35. Boedecker J, Obst O, Lizier JT, Mayer NM, Asada M. Information processing in echo state networks at the edge of chaos. Theory in Biosciences. 2012;131(3):205–213. pmid:22147532
- 36. Bertschinger N, Natschläger T. Real-Time Computation at the Edge of Chaos in Recurrent Neural Networks. Neural Computation. 2004;16(7):1413–1436. pmid:15165396
- 37. Barnett L, Lizier JT, Harré M, Seth AK, Bossomaier T. Information Flow in a Kinetic Ising Model Peaks in the Disordered Phase. Physical Review Letters. 2013;111(17):177203. pmid:24206517
- 38. Kauffman SA. The Origins of Order: Self-Organization and Selection in Evolution. New York: Oxford University Press; 1993.
- 39. Lizier JT, Pritam S, Prokopenko M. Information dynamics in small-world boolean networks. Artificial Life. 2011;17(4):293–314. pmid:21762020
- 40. Lizier JT, Prokopenko M, Zomaya AY. The information dynamics of phase transitions in random Boolean networks. In: Bullock S, Noble J, Watson R, Bedau MA, editors. Proceedings of the Eleventh International Conference on the Simulation and Synthesis of Living Systems (ALife XI), Winchester, UK. Cambridge, MA: MIT Press; 2008. p. 374–381.
- 41. Watts DJ, Strogatz SH. Collective dynamics of ‘small-world’ networks. Nature. 1998;393(6684):440. pmid:9623998
- 42. Cover TM, Thomas JA. Elements Of Information Theory 2nd Ed. Wiley; 2006.
- 43. Lizier JT, Prokopenko M, Zomaya AY. Local measures of information storage in complex distributed computation. Information Sciences. 2012;208:39–54.
- 44. Schreiber T. Measuring information transfer. Physical Review Letters. 2000;85(2):461. pmid:10991308
- 45. Stefanescu RA, Jirsa VK. Reduced representations of heterogeneous mixed neural networks with synaptic coupling. Physical Review E. 2011;83(2):026204.
- 46. Bakker R, Wachtler T, Diesmann M. CoCoMac 2.0 and the future of tract-tracing databases. Frontiers in Neuroinformatics. 2012;6:30. pmid:23293600
- 47. Sanz Leon P, Knock S, Woodman M, Domide L, Mersmann J, McIntosh A, et al. The Virtual Brain: a simulator of primate brain network dynamics. Frontiers in Neuroinformatics. 2013;7(10). pmid:23781198
- 48. Spinney RE, Lizier JT. Characterizing information-theoretic storage and transfer in continuous time processes. Physical Review E. 2018;98(1):012314. pmid:30110808
- 49. Wibral M, Lizier JT, Vögler S, Priesemann V, Galuske R. Local active information storage as a tool to understand distributed neural information processing. Frontiers in Neuroinformatics. 2014;8:1. pmid:24501593
- 50. Zipser D, Kehoe B, Littlewort G, Fuster J. A spiking network model of short-term active memory. The Journal of Neuroscience. 1993;13(8):3406–3420. pmid:8340815
- 51. Lizier JT, Atay FM, Jost J. Information storage, loop motifs, and clustered structure in complex networks. Physical Review E. 2012;86(2):026110.
- 52. Spinney RE, Prokopenko M, Lizier JT. Transfer entropy in continuous time, with applications to jump and neural spiking processes. Physical Review E. 2017;95(3):032319. pmid:28415203
- 53. Lizier JT, Prokopenko M. Differentiating information transfer and causal effect. The European Physical Journal B. 2010;73(4):605–615.
- 54. Wibral M, Pampu N, Priesemann V, Siebenhühner F, Seiwert H, Lindner M, et al. Measuring Information-Transfer Delays. PLoS ONE. 2013;8(2):e55809. pmid:23468850
- 55. Lizier JT, Prokopenko M, Zomaya AY. Local information transfer as a spatiotemporal filter for complex systems. Phys Rev E. 2008;77:026110.
- 56. Lizier JT, Prokopenko M, Zomaya AY. Information modification and particle collisions in distributed computation. Chaos: An Interdisciplinary Journal of Nonlinear Science. 2010;20(3):037109.
- 57. Ceguerra RV, Lizier JT, Zomaya AY. Information storage and transfer in the synchronization process in locally-connected networks. In: 2011 IEEE Symposium on Artificial Life (ALIFE); 2011. p. 54–61.
- 58. Lizier JT, Prokopenko M, Cornforth DJ. The information dynamics of cascading failures in energy networks. In: Proceedings of the European Conference on Complex Systems (ECCS); 2009. p. 54+.
- 59. Marinazzo D, Pellicoro M, Wu G, Angelini L, Cortés JM, Stramaglia S. Information Transfer and Criticality in the Ising Model on the Human Connectome. PLoS ONE. 2014;9(4):e93616. pmid:24705627
- 60. Marinazzo D, Wu G, Pellicoro M, Angelini L, Stramaglia S. Information flow in networks and the law of diminishing marginal returns: evidence from modeling and human electroencephalographic recordings. PLoS ONE. 2012;7(9):e45026. pmid:23028745
- 61. Timme NM, Ito S, Myroshnychenko M, Nigam S, Shimono M, Yeh FC, et al. High-Degree Neurons Feed Cortical Computations. PLOS Computational Biology. 2016;12(5):e1004858. pmid:27159884
- 62. Faber SP, Timme NM, Beggs JM, Newman EL. Computation is concentrated in rich clubs of local cortical networks. Network Neuroscience. 2019;3(2):384–404. pmid:30793088
- 63. Lovett-Barron M, Andalman AS, Allen WE, Vesuna S, Kauvar I, Burns VM, et al. Ancestral circuits for the coordinated modulation of brain state. Cell. 2017;171(6):1411–1423. pmid:29103613
- 64. Thiele A, Bellgrove MA. Neuromodulation of Attention. Neuron. 2018;97(4):769–785. pmid:29470969
- 65. Pietersen ANJ, Cheong SK, Munn B, Gong P, Martin PR, Solomon SG. Relationship between cortical state and spiking activity in the lateral geniculate nucleus of marmosets. The Journal of Physiology. 2017;595(13):4475–4492. pmid:28116750
- 66. Bassett DS, Wymbs NF, Porter MA, Mucha PJ, Carlson JM, Grafton ST. Dynamic reconfiguration of human brain networks during learning. Proceedings of the National Academy of Sciences of the United States of America. 2011;108(18):7641–7646. pmid:21502525
- 67. Sadaghiani S, Poline JB, Kleinschmidt A, D’Esposito M. Ongoing dynamics in large-scale functional connectivity predict perception. Proceedings of the National Academy of Sciences. 2015;112(27):8463–8468.
- 68. Shine JM, Koyejo O, Poldrack RA. Temporal metastates are associated with differential patterns of time-resolved connectivity, network topology, and attention. Proceedings of the National Academy of Sciences. 2016;113(35):9888–9891.
- 69. Ekman M, Derrfuss J, Tittgemeyer M, Fiebach CJ. Predicting errors from reconfiguration patterns in human brain networks. Proceedings of the National Academy of Sciences. 2012;109(41):16714–16719.
- 70. Marr D. Vision: A Computational Investigation into the Human Representation and Processing of Visual Information. New York, NY, USA: Henry Holt and Co., Inc.; 1982.
- 71. Wibral M, Lizier JT, Priesemann V. Bits from brains for biologically inspired computing. Frontiers in Robotics and AI. 2015;2:5.
- 72. Totah NK, Neves RM, Panzeri S, Logothetis NK, Eschenko O. The Locus Coeruleus Is a Complex and Differentiated Neuromodulatory System. Neuron. 2018;99(5):1055–1068.e6. pmid:30122373
- 73. Hlinka J, Paluš M, Vejmelka M, Mantini D, Corbetta M. Functional connectivity in resting-state fMRI: Is linear correlation sufficient? NeuroImage. 2011;54(3):2218–2225. pmid:20800096
- 74. Marreiros AC, Daunizeau J, Kiebel SJ, Friston KJ. Population dynamics: Variance and the sigmoid activation function. NeuroImage. 2008;42(1):147–157. pmid:18547818
- 75. Deco G, Jirsa VK, Robinson PA, Breakspear M, Friston K. The Dynamic Brain: From Spiking Neurons to Neural Masses and Cortical Fields. PLOS Computational Biology. 2008;4(8):e1000092. pmid:18769680
- 76. Breakspear M, Terry JR, Friston KJ. Modulation of excitatory synaptic coupling facilitates synchronization and complex dynamics in a biophysical model of neuronal dynamics. Network: Computation in Neural Systems. 2003;14(4):703–732.
- 77. Langdon AJ, Breakspear M, Coombes S. Phase-locked cluster oscillations in periodically forced integrate-and-fire-or-burst neuronal populations. Physical Review E. 2012;86(6):061903.
- 78. Coombes S. Dynamics of synaptically coupled integrate-and-fire-or-burst neurons. Physical Review E. 2003;67(4):041910.
- 79. Shine JM, van den Brink RL, Hernaus D, Nieuwenhuis S, Poldrack RA. Catecholaminergic manipulation alters dynamic network topology across cognitive states. Network Neuroscience. 2018;2(3):381–396. pmid:30294705
- 80. Smith SM, Miller KL, Salimi-Khorshidi G, Webster M, Beckmann CF, Nichols TE, et al. Network modelling methods for FMRI. NeuroImage. 2011;54(2):875–891. pmid:20817103
- 81. Ramsey JD, Hanson SJ, Hanson C, Halchenko YO, Poldrack RA, Glymour C. Six problems for causal inference from fMRI. NeuroImage. 2010;49(2):1545–1558. pmid:19747552
- 82. Wu GR, Liao W, Stramaglia S, Ding JR, Chen H, Marinazzo D. A blind deconvolution approach to recover effective connectivity brain networks from resting state fMRI data. Medical Image Analysis. 2013;17(3):365–374. pmid:23422254
- 83. Rangaprakash D, Wu GR, Marinazzo D, Hu X, Deshpande G. Hemodynamic response function (HRF) variability confounds resting-state fMRI functional connectivity. Magnetic Resonance in Medicine. 2018;80(4):1697–1713. pmid:29656446
- 84. Stefanescu RA, Jirsa VK. A Low Dimensional Description of Globally Coupled Heterogeneous Neural Networks of Excitatory and Inhibitory Neurons. PLOS Computational Biology. 2008;4(11):e1000219. pmid:19008942
- 85. Cohen JR, D’Esposito M. The Segregation and Integration of Distinct Brain Networks and Their Relationship to Cognition. The Journal of Neuroscience. 2016;36(48):12083–12094. pmid:27903719
- 86. Shine JM, Breakspear M, Bell PT, Ehgoetz Martens KA, Shine R, Koyejo O, et al. Human cognition involves the dynamic integration of neural activity and neuromodulatory systems. Nature Neuroscience. 2019;22(2):289–296. pmid:30664771
- 87. FitzHugh R. Impulses and physiological states in theoretical models of nerve membrane. Biophysical Journal. 1961;1(6):445–466. pmid:19431309
- 88. Kotter R. Online Retrieval, Processing, and Visualization of Primate Connectivity Data From the CoCoMac Database. Neuroinformatics. 2004;2(2):127–44. pmid:15319511
- 89. Rüemelin W. Numerical treatment of stochastic differential equations. SIAM Journal on Numerical Analysis. 1982;19(3):604–613.
- 90. Shine JM. Gain_topology; 2018. https://github.com/macshine/gain_topology.
- 91. Lizier JT. JIDT: An Information-Theoretic Toolkit for Studying the Dynamics of Complex Systems. Frontiers in Robotics and AI. 2014;1:11.
- 92. Takens F. Detecting strange attractors in turbulence. In: Rand D, Young LS, editors. Dynamical Systems and Turbulence, Warwick 1980. vol. 898 of Lecture Notes in Mathematics. Berlin / Heidelberg: Springer; 1981. p. 366–381. Available from: http://dx.doi.org/10.1007/bfb0091924.
- 93. Faes L, Nollo G, Porta A. Information-based detection of nonlinear Granger causality in multivariate processes via a nonuniform embedding technique. Physical Review E. 2011;83:051112.
- 94. Garland J, James RG, Bradley E. Leveraging information storage to select forecast-optimal parameters for delay-coordinate reconstructions. Physical Review E. 2016;93:022221. pmid:26986345
- 95. Erten EY, Lizier JT, Piraveenan M, Prokopenko M. Criticality and Information Dynamics in Epidemiological Models. Entropy. 2017;19(5):194.
- 96. Brodski-Guerniero A, Naumer MJ, Moliadze V, Chan J, Althen H, Ferreira-Santos F, et al. Predictable information in neural signals during resting state is reduced in autism spectrum disorder. Human Brain Mapping. 2018;39(8):3227–3240. pmid:29617056
- 97. Bossomaier T, Barnett L, Harré M, Lizier JT. An Introduction to Transfer Entropy: Information Flow in Complex Systems. Cham, Switzerland: Springer International Publishing; 2016. Available from: http://dx.doi.org/10.1007/978-3-319-43222-9.
- 98. Vicente R, Wibral M, Lindner M, Pipa G. Transfer Entropy–a Model-free Measure of Effective Connectivity for the Neurosciences. Journal of Computational Neuroscience. 2011;30(1):45–67. pmid:20706781
- 99. Wibral M, Rahm B, Rieder M, Lindner M, Vicente R, Kaiser J. Transfer entropy in magnetoencephalographic data: quantifying information flow in cortical and cerebellar networks. Progress in Biophysics and Molecular Biology. 2011;105(1-2):80–97. pmid:21115029
- 100. Lizier JT, Heinzle J, Horstmann A, Haynes JD, Prokopenko M. Multivariate information-theoretic measures reveal directed information structure and task relevant changes in fMRI connectivity. Journal of Computational Neuroscience. 2011;30(1):85–107. pmid:20799057
- 101. Ito S, Hansen ME, Heiland R, Lumsdaine A, Litke AM, Beggs JM. Extending Transfer Entropy Improves Identification of Effective Connectivity in a Spiking Cortical Network Model. PLoS ONE. 2011;6(11):e27431. pmid:22102894
- 102. Stramaglia S, Wu GR, Pellicoro M, Marinazzo D. Expanding the transfer entropy to identify information subgraphs in complex systems. In: 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE; 2012. p. 3668–3671.
- 103. Nigam S, Shimono M, Ito S, Yeh FC, Timme N, Myroshnychenko M, et al. Rich-Club Organization in Effective Connectivity among Cortical Neurons. Journal of Neuroscience. 2016;36(3):670–684. pmid:26791200
- 104. Barnett L, Barrett AB, Seth AK. Granger causality and transfer entropy are equivalent for Gaussian variables. Physical Review Letters. 2009;103(23):238701. pmid:20366183
- 105. Barnett L, Seth AK. Detectability of Granger causality for subsampled continuous-time neurophysiological processes. Journal of Neuroscience Methods. 2017;275:93–121. pmid:27826091
- 106. Vakorin VA, Krakovska OA, McIntosh AR. Confounding effects of indirect connections on causality estimation. Journal of Neuroscience Methods. 2009;184(1):152–160. pmid:19628006
- 107. Williams PL, Beer RD. Generalized Measures of Information Transfer. arXiv preprint arXiv:11021507. 2011;.
- 108. Williams P, Beer R. Decomposing multivariate information. arXiv preprint arXiv:10042515. 2010;.
- 109. Lizier JT, Bertschinger N, Jost J, Wibral M. Information Decomposition of Target Effects from Multi-Source Interactions: Perspectives on Previous, Current and Future Work. Entropy. 2018;20(4):307.
- 110. Finn C, Lizier JT. Pointwise partial information decomposition using the specificity and ambiguity lattices. Entropy. 2018;20(4):297.
- 111. Atay FM, Karabacak O. Stability of Coupled Map Networks with Delays. SIAM Journal on Applied Dynamical Systems. 2006;5(3):508–527.