Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Molecular insights into heart field-specific cardiomyocyte differentiation - A computational study

  • Ricco Zeegelaar ,

    Contributed equally to this work with: Ricco Zeegelaar, Georgios Argyris

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    ☯ These authors contributed equally to this work.

    Affiliation Quantitative Biology Lab, Developmental BioEngineering, Faculty of Science & Technology, University of Twente, Enschede, The Netherlands

  • Georgios Argyris ,

    Contributed equally to this work with: Ricco Zeegelaar, Georgios Argyris

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    ☯ These authors contributed equally to this work.

    Affiliation Quantitative Biology Lab, Developmental BioEngineering, Faculty of Science & Technology, University of Twente, Enschede, The Netherlands

  • Janine N. Post

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Software, Supervision, Writing – review & editing

    j.n.post@utwente.nl

    Affiliation Quantitative Biology Lab, Developmental BioEngineering, Faculty of Science & Technology, University of Twente, Enschede, The Netherlands

Abstract

Understanding the mechanisms underlying cardiomyocyte (CM) differentiation is essential for the accurate generation of the different types of heart cells in vitro. This study advances current models of CM differentiation by introducing a gene regulatory network (GRN) model that integrates early heart field formation with downstream differentiation of committed cardiomyocytes into atrial and ventricular subtypes. The model is implemented using Boolean logic, enabling qualitative simulation of cardiac regulatory dynamics. Attractor analysis identifies steady states corresponding to first and second heart field derived atrial and ventricular cardiomyocytes. The model reveals the mechanism of WNT and BMP signaling in heart field determination and shows how RA regulation of NR2F2 decisively determines atrial versus ventricular cardiomyocyte cell fate. The model reproduced published knockout and overexpression experiments, and probabilistic simulations estimate differentiation efficiencies under varying signaling inputs. The unified Boolean model provides a foundation for generating heart-field-specific cardiomyocytes with precise atrial or ventricular identities, supporting efforts in directed differentiation and targeted heart cell therapies.

Introduction

The heart is composed of several different cell types, including endothelial cells, fibroblasts, and cardiomyocytes (CMs), which all arise from mesodermal progenitors during embryogenesis [1]. Early cardiac development initiates with the formation of two distinct heart fields: the first heart field (FHF) and the second heart field (SHF). These heart fields guide early heart morphogenesis and structural compartmentalization of the heart [2,3]. Both heart fields contribute to the formation of cardiomyocytes that populate the atrial and ventricular chambers [4,5] (Fig 1). Atrial and ventricular cardiomyocytes (aCMs and vCMs, respectively) are not only functionally distinct but also exhibit differences in gene expression profiles, including differential activation of key transcriptional regulators [5,6].

thumbnail
Fig 1. Developmental origins of cardiomyocyte subtypes from cardiac mesoderm.

Early cardiac mesoderm is induced by transcription factor MESP1 Early cardiac progenitors initially express ISL1 and give rise to both the first heart field (FHF) and the second heart field (SHF). These lineages are distinguished by differential marker gene expression: FHF progenitors downregulate ISL1 and express TBX5, while SHF progenitors maintain ISL1 expression and express TBX1. The FHF contributes primarily to left ventricular CMs and a subset of atrial cardiomyocytes, while the SHF gives rise to right ventricular CMs, the outflow tract and additional atrial CMs MESP1 [7,8]. In this article, atrial and ventricular cardiomyocyte identities are defined based on model outputs: atrial cardiomyocytes by expression of COUP-TFII and MYL7, and ventricular cardiomyocytes by expression of IRX4 and MYL2.

https://doi.org/10.1371/journal.pone.0340054.g001

Many congenital and acquired heart diseases affect specific regions of the heart [1]. For instance, myocardial infarction commonly results in irreversible loss of cardiomyocytes primarily localized to the left ventricle (LV), a region predominantly populated by cells originating from the FHF [9]. For cell replacement therapies to be effective, transplanted cardiomyocytes must integrate with host tissue. FHF-derived ventricular cardiomyocytes are optimal candidates for treating LV cell loss, as their shared developmental origins enhance electrical coupling and reduce the risk of post-remodeling arrhythmias [10,11]. This lineage-dependent therapeutic potential highlights the need to distinguish not only between atrial and ventricular subtypes, but also between cardiomyocytes of distinct heart field origin.

Gene regulatory networks (GRNs) control cell fate decisions during differentiation by coordinating the expression of key genes in response to internal and external signals. In the context of cardiomyocyte differentiation, numerous regulators have been identified, including transcription factors NKX2.5, TBX5, HAND1, HAND2, ISL1, and TBX1, as well as signaling pathways FGF, BMP, retinoic acid (RA) and WNT [12,13]. The critical roles that these factors individually play in establishing atrial and ventricular identities are well established. There however is limited understanding of the emergent lineage decision mechanisms that these factors within GRNs give rise to, particularly governing heart field specification and cardiomyocyte subtype differentiation. Especially how early lineage decisions influence the final phenotype of the cell, remains incompletely characterized. This gap in understanding limits the ability to generate CMs with defined heart field and chamber-specific identities in vitro, thereby constraining the development of targeted regenerative therapies.

Computational modeling offers a means to investigate the emergent behavior of GRNs. A variety of modeling formalisms is available for modeling biological systems, such as Boolean networks, Bayesian networks, agent-based models and rule-based models to name a few (as reviewed in [14]). The modeling strategy to use is dependent on the system to model and the type of data available and can be classified between mechanistic and empirical approaches. Mechanistic models can simulate the dynamic behavior of regulatory systems but typically depend on detailed kinetic parameters and large-scale quantitative datasets, which are often unavailable in developmental systems. Empirical models infer network structure from experimental data but mainly capture associations between network components without their direct regulatory relationships [15,16]. Boolean network (BN) models offer a practical alternative by representing gene regulatory interactions with binary states and logical rules [15,16]. We can use these Boolean networks to make predictions of cardiac differentiation trajectories and to predict the molecular conditions required for generating heart-field-specific cardiomyocytes, without requiring extensive quantitative data.

A previous Boolean network model by Hermann et al. [4] simulated early cardiac development by modeling regulatory interactions governing FHF and SHF specification. This model identified two steady states corresponding to distinct heart field identities, but did not incorporate the downstream differentiation of committed cardiomyocytes into atrial and ventricular subtypes. Extending this framework to include later stages of differentiation is essential for capturing the full progression of cardiomyocyte development and for enabling predictive simulations of cardiomyocyte subtype generation in the context of targeted regenerative therapy.

This study presents an extended Boolean network modeling framework that integrates early heart field specification with the subsequent differentiation of committed cardiomyocytes into atrial and ventricular subtypes. By combining the heart field identity model from Herrmann et al. [4] with a novel cardiomyocyte subtype differentiation network, the model captures how key signaling pathways govern sequential stages of development. The unified network identifies four steady states corresponding to heart-field-specific cardiomyocyte subtypes. Simulations demonstrate the model’s ability to predict differentiation outcomes and efficiency based on combinations of signaling inputs. This framework provides a mechanistic basis for the coupling between early and late-stage molecular regulation that determines cardiomyocyte subtype identity and enables prediction of signaling conditions required to generate heart field-specific cardiomyocyte subtypes in vitro.

Materials and methods

To investigate the regulatory mechanisms of heart-field-specific cardiomyocyte differentiation, a new Boolean network (BN) model was constructed to describe the differentiation of committed cardiomyocytes into atrial and ventricular subtypes. This model was integrated with a previously published BN representing heart field specification [4], resulting in a unified gene regulatory network (GRN) capable of simulating heart-field-specific cardiomyocyte subtype formation. The following sections describe the construction of the new model, the integration strategy, and simulation approaches used to analyze and validate network dynamics.

Network construction

The cardiomyocyte subtype Boolean network was constructed through literature-based identification of key regulatory factors involved in atrial and ventricular cardiomyocyte differentiation. Transcription factors and signaling molecules identified from mouse and human studies were represented as nodes, and experimentally supported regulatory interactions were implemented as directed edges, categorized as either activating or inhibitory. Only interactions with direct experimental support in cardiomyocytes – derived from either mouse or human studies – were included in the model. The network was gradually expanded, and interactions were refined until simulations reproduced known gene expression dynamics during cardiomyocyte specification, and then reduced to a minimal structure that preserved this behavior. A complete list of included interactions and their literature sources is provided in S1 Table.

The cardiomyocyte subtype network was integrated with the Boolean model of heart field identity from Herrmann et al. [4], which exhibits two steady states corresponding to the first heart field (TBX5+) and the second heart field (ISL1 + TBX1+). The integration was achieved by linking output nodes of the heart field model – specifically GATA4/6 and NKX2.5 – to corresponding input nodes in the cardiomyocyte subtype differentiation model. These connections were incorporated as direct inputs in the update functions of the downstream nodes, including IRX4 and NR2F2, reflecting experimentally supported regulatory dependencies. The logical rules were adjusted to include the upstream inputs, while preserving the behavior of the original network. The update functions of the cardiomyocyte subtype model, the adapted heart field network, and the unified model network with added interactions are provided in S2AS2C Table.

Model dynamics

The nodes of the model are equipped with Boolean dynamics, each taking a value of 0 or 1 to indicate whether the node is active or inactive. The state of the network at a given time point is represented by a vector containing the activity of all nodes. Node states are updated according to logical update functions, which determine the value of each node based on the current values of its regulators. For each consecutive time step, the network will therefore transition into a new state.

Repeated application of the update functions generates a sequence of states and transitions, referred to as a simulation (in literature, such sequences are also termed solutions, orbits, and trajectories, particularly in continuous-time dynamical systems). These simulations form the basis to analyzing network dynamics.

All simulations were encoded into a State Transition Graph (STG) which contains all possible states and all transitions between the states. For a BN with n nodes, the STG contains 2n states. The updates of the STG can be performed synchronous – where all variables are updated at the same time – or asynchronous, in which only one variable is non-deterministically (randomly) selected and updated as exemplified in Fig 2. In this study, the asynchronous update mode was used, as it is one of the most biologically plausible update modes due to the different timescales of biological processes [17].

thumbnail
Fig 2. (Left) Schematic example of a Boolean network (BN) with update functions of three nodes (x,y and z). (Right) The corresponding state transition graph (STG) generated by the asynchronous update mode wherein only one variable is non-deterministically selected and updated at each time step. Self-transitions are not displayed for clarity.

https://doi.org/10.1371/journal.pone.0340054.g002

When no specific time delays are defined, as in this study, asynchronous updating generates all possible transitions between states. Over time, the network will transition into a state where there is no successor state distinct from itself. These are referred to as steady or attractor states. In BNs involving cell differentiation processes, the steady states generally correspond to distinct cell types.

Model validation

Boolean networks of gene regulatory networks can be validated internally and externally. When a GRN is generated using data, internal validation is performed by determining the quality of the data, and the performance of the inference algorithm [18]. The networks presented in this paper are constructed from published node interactions (see S1 Table and Herrmann et al. [4]), causing included interactions to already be internally validated. The model can also be validated externally by comparing the model topology to a data-generated BN model, or by comparing model predictions with biological data or observations [18]. A data-generated BN can for example be generated using the R package BoolNet [19]. However, this requires a dataset with inputs and readouts that match the model, which does not exist for the network presented in this paper. Additionally, a data-generated BN is not expected to exhibit biologically plausible steady states that capture interactions across two separate time steps, which is necessary for comparison to the presented BN. The best method for externally validating the presented BN is therefore to compare model predictions to external information not used during the modeling process [18]. The model will be qualitatively validated to see if it can predict expression outcomes of published knockout and overexpression experiments, with the methodology exemplified in the section “Knockouts and overexpression for validation”.

Model predictions are made by computing the attractors of the asynchronous BN using the tool Boolsim [20], incorporated in the CoLoMoTo Notebook [21]. All code and simulations are available on:

  1. github.com/GeorgiosArg/Beyond-Committed-cardiomyocytes.

Knockouts and overexpression for validation.

Because no dataset directly matches the model’s inputs with information on node activity, the model topology was qualitatively validated. Literature on knockout and overexpression experiments that influence cardiac development was gathered. These perturbations were then replicated in the model to see if the model predictions still match experimental outcomes.

Gene knockouts were simulated by perturbing the corresponding node value to zero (off-state), while constitutive expression of a gene was modeled by fixing the node to one (on-state). For each perturbation, the model was simulated under the same signaling conditions as in the wild-type simulations (WNT, RA; ON or OFF). It is then identified if the model can still reach the attractor states (which corresponds to a cell type) with the perturbation. These results are compared to the published experimental outcomes. A perturbation is considered to be successfully reproduced by the model if the presence or absence of an attractor state matches the (wet lab) experimentally observed presence or absence of a cardiomyocyte cell type.

Probabilistic simulations for differentiation efficiency.

To investigate how many cells in a heterogeneous cell population will reach a specific cell state, a probabilistic version of the Boolean network was employed. In this implementation, each transition from a predecessor state to the immediate successor state occurs through the random selection of a single node for updating. For instance, in a network with three variables (as illustrated in Fig 2), each variable has one-third probability to be updated at each time step, resulting in a Markov chain as shown in Fig 3. Repeated simulations from randomized initial conditions will allow estimation of the probability of cells reaching a specific steady state.

thumbnail
Fig 3. Markov chain of the BN shown in Fig 2.

At each time step, one variable is selected at random for updating. Each state has exactly 3 outgoing transitions (for clarity, the self-transitions of the states are not displayed). In contrast to the non-deterministic STG of Fig 2, where the BN can occupy several states at a particular time point, here the BN is in exactly one state but with a specified probability.

https://doi.org/10.1371/journal.pone.0340054.g003

The same probabilistic simulation approach was applied for the knockout and overexpression perturbations. The resulting steady-state distributions were compared to reported experimental observations of the corresponding genetic perturbations. By comparing the simulated proportion of cells reaching the steady state associated with specific heart-field-specific cardiomyocyte subtypes to experimentally observed cell-type yields under the same conditions, the model’s predictive capacity was further evaluated.

Transformation from Boolean to real dynamics for robustness testing.

To assess the robustness of differentiation outcomes under continuous input variations, the BN model was extended to a Boolean Network Extension (BNE), following the methodology of Grieb et al. [22]. In this framework, node activity is represented by continuous values in the interval [0, 1], replacing the original binary (0 or 1) states. Boolean logic update functions from the original BN were transformed into real-valued update functions using the transformation rules shown in Equation 1.

(1)

Equation 1: The transformation of the Boolean rules into real-valued functions. The equivalences provide identical results to their Boolean counterparts when variables are binary. When variables are initialized in the continuous interval [0, 1], the resulting real valued functions exhibit additional behaviors not captured by the original Boolean rules. Application of these transformations yields a discrete-time dynamical system known as a Boolean Network Extension (BNE), as described by [22].

The extension was used to assess the stability of the atrial and ventricular cardiomyocyte states under varying levels of Retinoic Acid (RA) signaling. The RA pathway was selected due to its decisive role in directing atrial versus ventricular specification during cardiac development [13,23]. To assess robustness, RA input levels were systematically varied across the full range [0, 1], while maintaining other signals relevant for early heart development at constant activation levels, consistent with biological conditions. This analysis enabled the characterization of signaling thresholds required for stable differentiation into atrial or ventricular cardiomyocyte states and provided additional validation of the network’s predictive capability under continuous input variations.

Results

A gene regulatory network of cardiomyocyte specification

The cardiomyocyte subtype Boolean model captures the regulatory pathways and transcription factors involved in the differentiation of committed cardiac precursors into ventricular and atrial cardiomyocytes (Fig 4). The key nodes in the network play crucial roles in guiding CM differentiation. GATA4/6 and the NOTCH signaling pathway are modeled as constitutively active in precursor cells, as they are essential for early-stage development of both atrial and ventricular CMs [4,2426]. COUP-TFII (encoded by NR2F2) is regulated by retinoic acid (RA) and has been identified as a molecular switch that determines whether the committed cell becomes an atrial or ventricular CM. Multiple biomarkers for these cell fates have been identified, which are mostly coregulated and co-expressed [6]. We therefore model these clusters of biomarkers with expression of two representative genes; MYL7 for atrial identities and MYL2 for ventricular identities. When RA is present, NR2F2 expression is activated and represses ventricular markers like MYL2 while activating expression of atrial markers such as MYL7, which steers the cell toward atrial differentiation. Conversely, when RA is absent, NR2F2 is not expressed, which allows the differentiation towards ventricular CMs [13,23].

thumbnail
Fig 4. Key gene regulatory network underlying cardiomyocyte subtype specification.

The schematic shows key signaling interactions involved in atrial and ventricular cardiomyocyte differentiation. Nodes represent signaling inputs, transcription factors and cell identity markers, with colors for easier interpretation. Edges indicate experimentally supported regulatory interactions, with green arrows showing activation and red T-bars marking inhibition. MYL7 (blue) is an atrial biomarker and MYL2 (yellow) is a ventricular biomarker. GATA4/6 and NOTCH (green) are modeled as constitutively active to reflect early developmental conditions. Sources for all interactions are provided in S1 Table, and the Boolean update functions are listed in S2B Table. The steady states of the cardiomyocyte subtype GRN are provided in Table 1.

https://doi.org/10.1371/journal.pone.0340054.g004

thumbnail
Table 1. Simulations of the cardiomyocyte subtype BN result in two steady states.

https://doi.org/10.1371/journal.pone.0340054.t001

The steady states obtained from attractor analysis of the cardiomyocyte specification model are presented in Table 1. The states align with established marker gene profiles for ventricular and atrial CMs [13,27]. The BN exhibits two steady states: one steady state with activity of ventricular cell fate indicators IRX4, HEY2, and MYL2, and a second steady state where atrial cell fate indicator COUP-TFII and MYL7 are active. The outcome of the simulation depends on the initial conditions of the network, particularly the activity of retinoic acid (RA), which drives the system toward either atrial or ventricular specification.

Cardiomyocyte subtype differentiation under varying RA signaling

The Boolean formalism only allows for binary states. In cells however, signaling strength can vary and is dose-dependent. To investigate if the cardiomyocyte specification model can predict what happens under varying signaling strengths, the Boolean network was extended into a Boolean network Extension (BNE), following the methodology of [22]. In this framework, binary states (0 or 1) are converted to continuous values in the interval [0, 1], and logical update functions are transformed into real-valued expressions. The signaling inputs GATA4/6 (γ), RA (ρ), and NOTCH (ν) were modeled as continuous parameters to assess their effect on downstream gene activity and differentiation outcomes. The activity of nodes is then expressed as functions of ρ, to see how varying levels of RA signaling influences cardiomyocyte identity.

Following transformation to the continuous model, some nodes stabilized to expressions dependent solely on the input parameters. The node of NR2F2 was directly stabilized according to RA with the update function . The update function of xHEY2 takes the form , indicating dependency on NOTCH, GATA4/6, and inversely RA signaling. The atrial marker MYL7 stabilized to the function , which showcases the switch-like behavior of atrial specification dependent on RA signaling intensity.

In contrast, the nodes HAND2, IRX4 and MYL2 form a coupled linear system of update functions (Equations 2). These nodes do not stabilize independently but rather depend on each other’s steady state values.

(2)

Solving this system at steady state (where Xi (t + 1) = Xi (t)), yields expressions at steady states for each variable as a function of the input parameters (Equations 3). These solutions provide insight into how RA, GATA4/6 and NOTCH signaling regulate the balance between atrial and ventricular specification via the identified gene regulatory network.

(3)

Below a RA signaling threshold, MYL2 becomes more active, indicating ventricular specification. The relationship between RA signaling intensity and marker activity is shown in Fig 5. These results highlight the robustness of cardiomyocyte subtype predictions and support the role of RA signaling as a switch-like regulator in controlling the differentiation towards atrial and ventricular cardiomyocyte identities.

thumbnail
Fig 5. Steady state activity levels of key network nodes as a function of retinoic acid (RA) signaling intensity.

The X-axis represents the input value of RA (ρ). The Y-axis indicates the steady state activity of nodes, derived from the Boolean Network Extension (BNE). The plot shows that RA orchestrates steady state node activity and therefore cardiomyocyte subtype identity in a signaling intensity-dependent manner. This is consistent with in vivo and in vitro observations, where cardiac subtype fates are dependent on RA concentration [23].

https://doi.org/10.1371/journal.pone.0340054.g005

Unified Boolean network reveals the path to heart field-specific cardiomyocytes

To construct the unified Boolean network, the previously published model by Herrmann et al. [4], which describes gene interactions during early murine cardiac development, was adapted. This model distinguishes between the FHF and SHF based on specific gene expression profiles. The adapted model, shown in Fig 6, includes key signaling pathways implicated in early heart field identity formation such as BMP, FGF and WNT, as well as key transcription factors determining heart field identity like ILS1, TBX5, and NKX2.5. In the original model, multiple exogenous nodes were used to account for the timed activation of BMP2 (exBMP2) and WNT (exWNT) signaling by neighboring tissues. In the adapted version, the structure is adjusted by using a single exogen to represent the effects of these signals without considering timing or origin.

thumbnail
Fig 6. Gene regulatory network underlying heart field specification, adapted from [4].

The network models differentiation of cardiac precursor cells into FHF (with central regulatory node TBX5 shown in blue) and SHF (with central regulatory nodes ISL1 and TBX1 shown in pink) fates guided by BMP and WNT signaling. Nodes are regulatory components including signaling molecules and transcription factors, with activating interactions shown as green arrows, and inhibitory interactions as red T-bars. The BN corresponding to this network is shown in S2A Table. The steady states of the heart field specification GRN are provided in Table 2.

https://doi.org/10.1371/journal.pone.0340054.g006

thumbnail
Table 2. Steady states of the heart field specification Boolean network shown in Fig 6.

https://doi.org/10.1371/journal.pone.0340054.t002

The unification of the two BNs was achieved by merging the GATA nodes from both the heart field and the cardiomyocyte subtype model and introducing an additional regulatory link between NKX2.5 and IRX4, as supported by experimental evidence [6]. The unified network, shown in Fig 7, incorporates both networks with the additional regulatory interactions, enabling the simulation of cardiomyocyte subtype differentiation downstream of heart field specification. The corresponding update functions are provided in S2C Table.

thumbnail
Fig 7. The unified Gene regulatory network for heart field identity and cardiomyocyte subtype.

The nodes in the blue shaded area (left part of Fig 7) constitute the heart field GRN adapted from [4], while the nodes in the magenta shaded area are the nodes of the Cardiomyocyte GRN. The nodes in the overlapping purple area (GATA4/6 and NKX2-5) connect the models with each other. In this model, expression of TBX5 promotes FHF identity; ISL1 and TBX1 promote SHF identity, MYL7 expression indicates aCM differentiation and MYL2 expression indicates vCM differentiation. The steady states of the unified GRN are provided in Table 3.

https://doi.org/10.1371/journal.pone.0340054.g007

The steady states of the unified model are derived through attractor analysis, as shown in Table 3. Importantly, since the Boolean network integrates regulatory interactions from multiple stages of differentiation, node activity in steady states should be interpreted in the context of the differentiation stage in which the node is involved, rather than as a direct indicator of transcriptional activity in the final stage of differentiation. For instance, the activity of nodes associated with heart field specification reflects their activity at the heart field specification stage of differentiation and does not necessarily correspond to their activity in the final cardiomyocyte subtype. Based on this interpretation, the steady states reveal which pathways and transcription factors were active at key regulatory checkpoints during differentiation, influencing the final cell fate. To support interpretation, nodes have been annotated with their associated cell type, clarifying their stage specific roles in cardiomyocyte subtype specification.

The steady states shown in Table 3 (FHF aCM, FHF vCM, SHF aCM, SHF vCM) are consistent with biological evidence, suggesting that both atrial and ventricular CMs can originate from either heart field [4,5]. Cells of the first heart field primarily populate the left ventricle, while cells of the second heart field populate the right ventricle with both fields contributing to the atria [5]. In the model, the steady state representing FHF-derived ventricular CMs (column FHF vCM) exhibits TBX5 activity, consistent with left ventricular identity [12,22]. Conversely, SHF-derived ventricular CMs (column SHF vCM) exhibit TBX1 expression, corresponding to right ventricular identity [12,22].

Additionally, the model highlights the differential activity of key signaling pathways (WNT, BMP, RA, and NOTCH) across the different steady states. In vivo and in vitro, correct temporal activation of these pathways is crucial for guiding cardiomyocyte differentiation. For instance, the simulation results suggest that SHF-derived right ventricular CMs can be generated from precursor CMs, by initial activation of WNT while inhibiting BMP, followed by NOTCH activation and RA inhibition during maturation. These findings are consistent with experimental observations [8].

The presence of the “null” steady state in which GATA4/6 is inactive reflects its essential role in cardiomyocyte specification [26]. The model suggests that there is a state possible where GATA4/6 is not activated, and no cardiac markers get upregulated. These cells with a “null” state identity are therefore speculated to either be non-differentiated precursor mesoderm cells, or cells that may adopt an alternative non-cardiac cell fate not accounted for in the current network structure.

Probabilistic simulations to predict differentiation efficiency

Probabilistic simulations of the unified model were conducted to evaluate differentiation efficiency in a heterogeneous cell population of cardiac progenitor cells. The unified BN has 21 variables, and each simulation step involved the asynchronous selection of one variable for updating, resulting in transition probabilities of 1/21 across the corresponding Markov chain. A total of 400,000 independent simulations were performed, with 100,000 runs for each of the 4 different activity combinations of the input variables WNT and RA (as shown in the first column of Table 4). The initial values of all the other nodes were assigned with a random binary value in {0, 1}.

thumbnail
Table 4. Simulations of cardiomyocyte differentiation efficiency under varying WNT and RA input combinations.

https://doi.org/10.1371/journal.pone.0340054.t004

The results of the conducted simulations are shown in Table 4. A total of 363,695 out of 400,000 simulated cells (90.92%) follow the expected differentiation lineage based on the combination of the input variables (WNT, RA) that each cell is treated with. When WNT signaling was inactive (WNT = 0; 1st and 3rd row of Table 4), the network primarily produced FHF cells, as well as cells reaching the null state, suggesting a non-cardiac lateral plate mesoderm derived cell type like endothelial or smooth muscle cells [28]. Among the FHF cells, the cells exposed to RA (RA = 1) gave rise FHF aCMs. In contrast, when WNT was active (WNT = 1; 2nd and 4th row of Table 4), the model yielded SHF cells which, in absence of RA signaling (RA = 0) differentiated to SHF vCMs, in line with predicted lineage trajectories.

Qualitative validation of the unified Boolean network

Known genetic perturbations from experimental studies were simulated using the Boolean network to evaluate the model’s predictive capabilities. Knockouts were adapted to the BN by fixing the value of that node to 0 (inactive). Similarly, constitutive expression (overexpression) was modeled by setting the value of the associated node to 1 (active). The effects of each perturbation were evaluated by analyzing changes in the model’s steady states and checking if resulting cell types matched experimentally observed outcomes. A summary of perturbations and their simulation results is provided in Tables 5 and 6.

thumbnail
Table 5. Overview of cardiomyocyte differentiation perturbations in literature.

https://doi.org/10.1371/journal.pone.0340054.t005

thumbnail
Table 6. Simulation results for in silico perturbations compared to experimentally observed results.

https://doi.org/10.1371/journal.pone.0340054.t006

In the simulations, the knockout of COUP-TFII (ii) correctly prevented atrial cardiomyocyte formation; the double knockout of NKX2.5 and HAND2 (v) successfully eliminated all ventricular cardiomyocytes; and HEY2 overexpression (vii) accurately suppressed atrial gene expression. These results align with experimental findings and demonstrate the ability of the model to mimic key gene regulatory interactions.

HAND2 knockout (i) and NKX2.5 knockout (iv), which are expected to result in the loss of right and left ventricular cardiomyocytes respectively, were only partly reproduced by the model. This can be attributed to model simplifications, particularly the exclusion of HAND1, a transcription factor strongly enriched in the left ventricle [29]. HAND1 and HAND2 exhibit functional redundancy during cardiomyocyte differentiation, with HAND1 compensating for HAND2 loss in the FHF to ensure left ventricular development in vivo [30]. Including HAND1 would therefore be redundant for modeling the general mechanisms behind the differentiation, but as a result, the model can not distinguish between right- and left ventricle-specific effects. Similarly, the effects of NKX2.5 knockout, disrupting left ventricle formation through upstream regulation of HAND1 [29], were also only partially reproduced. Despite the model’s inability to fully capture the effects of these perturbations, the overall impact on ventricular cardiomyocyte loss was successfully reproduced.

Together, these results demonstrate that the unified Boolean network captures the regulatory logic of heart-field-specific cardiomyocyte differentiation. In addition to reproducing perturbation outcomes, the model also highlights how signaling combinations determine the cardiac lineage trajectories. In summary, the simulations suggest that cardiomyocyte cell fate is orchestrated by a small but highly influential set of factors, primarily WNT and BMP during heart field specification followed by RA during the cardiomyocyte subtype commitment, consistent with in vivo and in vitro observations. These findings show the robust logic behind the cardiac developmental system.

Discussion

This paper presents a Boolean GRN that integrates heart field specification with the differentiation of committed cardiomyocytes (CMs) to atrial and ventricular subtypes. The unified model incorporates key signaling pathways and transcription factors implicated in both cardiomyocyte differentiation stages. Attractor analysis identifies four steady states corresponding to heart-field-specific cardiomyocyte identities: FHF-derived atrial CMs, FHF-derived ventricular CMs, SHF-derived atrial CMs, and SHF-derived ventricular CMs. This work expands upon earlier models focused exclusively on heart field formation [4] by simulating downstream CM subtype commitment.

Regulatory robustness and switch-like behavior revealed through Boolean network extension

The attractors generated by the model reproduce known gene expression profiles for atrial and ventricular fates, including MYL7 and COUPTFII for atrial identity and MYL2 and HEY2 for ventricular identity. Retinoic acid (RA) signaling acts as the key regulatory input driving the switch between these identities, consistent with in vivo and in vitro differentiation studies [13,23].

To investigate the concentration dynamics of this regulatory switch, the cardiomyocyte subtype Boolean network was converted into a Boolean network extension. In this framework, node activity takes values across the continuous interval [0, 1], and update functions are reformulated as real-valued expressions. Analysis of the Boolean network Extension revealed that the decision between atrial and ventricular specification depends on the RA activity crossing a threshold. This supports the idea that varying RA concentration can instruct distinct cardiac subtype fates, consistent with its proposed morphogenic function in cardiomyocyte differentiation [23]. There are several co-factors and enzymes that influence RA signaling and RA-dependent gene expression, including COUPTFII [23]. Therefore, the RA concentration to cross the signaling threshold in vitro should be calibrated based on the experimental context.

Model validation using knockout and overexpression perturbations

To validate the predictions of the network, the model was tested against seven perturbation experiments described in the literature. Three out of seven perturbations were fully reproduced, such as the suppression of atrial fate upon COUP-TFII knockout [34] and repression of atrial gene expression under HEY2 overexpression [6,37,38]. Two perturbations were partly reproduced, and two perturbations were not captured. These results support the model’s capacity to capture core regulatory dependencies driving CM subtype identity.

The discrepancies in some of the model predictions reflect either missing regulatory links or simplifications inherent in a Boolean logic framework. For instance, in the case of TBX5 knockout [35], the model predicted no change in steady states except the deactivation of TBX5 itself. This result is consistent with experimental observations showing that MYL7 expression persists in TBX5-deficient mice, but it fails to capture the reduction in atrial cardiomyocyte output mediated by loss of TBX5-regulated genes such as ANF and CX40 [35]. In the context of this knockout, MYL7 can therefore not represent atrial genes as it does for the wild type.

The HEY2 knockout simulation also deviated from biological expectations. HEY2 loss has been shown to induce ectopic MYL7 expression in ventricular cells [6], but this behavior was not reproduced. The discrepancy arises from the Boolean logic constraints, which prevent a steady state in which MYL2 and MYL7 are both active. In contrast, the BNE analysis, which allowed continuous activation values, did permit intermediate MYL7 activity in ventricular states, and may therefore have represented the effects of this knockout more accurately.

Other mismatches likely result from the simplified network topology. In these cases, the mechanisms underlying the effects of a perturbation are not captured because the relevant regulatory factors are not included in the model. Incorporating such factors in future versions could improve the model’s predictive powers, especially under more complex or combined perturbations. However, expanding the network could cause redundancy of nodes and feedback loops that enhance the robustness of the system. This would make it harder to learn from the model, as simulations may be less responsive to perturbations like knockdown or overexpression. In effect, expanding the model may paradoxically lead to a reduction of predictive power and model sensitivity [16].

Justification for Boolean modelling approach

Differential equation-based models can describe time-dependent processes and concentration-dependent interactions, but they require detailed kinetic parameters and quantitative data. Such information is rarely available for large gene regulatory networks involved in embryonic development [4,16]. Boolean models provide a simplified yet effective alternative for modeling gene regulatory networks when mainly qualitative information is available [15,16]. In this study, a Boolean framework was applied to simulate the differentiation of heart-field-specific cardiomyocyte subtypes. All regulatory interactions in the model were derived from published experimental studies, without requiring parameter estimation or assumptions about reactions kinetics. This approach enables qualitative predictions of regulatory behavior using a binary representation of activity.

The Boolean logic approach did however have limitations for modelling heart-field-specific cardiomyocyte differentiation. Two networks that biologically are consecutive were united and steady states were computed asynchronously. Because of this, nodes determining cardiomyocyte identity are already interacting while the heart field decision is still being made. As a result, cardiomyocyte regulators that are heart field specific (e.g., HAND1) cannot be included in the model. Using Boolean logic also caused effects of some knockouts to not be captured correctly. Some knockouts biologically cause ectopic expression of marker genes in other tissues (e.g., MYL7 expression in MYL2-expressing ventricular cardiomyocytes after HEY2 knockout). The Boolean logic does not allow for a steady state to exist where two separate markers are both expressed in the same cell, and these knockouts will therefore not be successfully represented. Although the Boolean modelling approach allows for insights in the system without detailed data on parameters, it lacks implementation of timing and concentrations which are important properties during differentiation.

Boolean networks have previously been used for understanding the regulatory mechanisms behind differentiation of multiple different cell types. This is done either by inferring GRN structure of a system from a single cell or bulk dataset (RNA-seq or qRT-PCR) with an algorithm (e.g., [3941]), or by manually assigning Boolean rules to literature-reported observations (e.g., [4,15,42,43]). The approach in this manuscript is similar to the second category, but extends it in two ways. Instead of modelling a single lineage decision, we couple two consecutive differentiation stages in one Boolean Network, to see how heart-field-specific cardiomyocytes can be generated. We also couple our GRN to external signaling inputs to gain additional insights on how the GRN activities can be influenced to steer differentiation towards a specific cell type. Other modelling modalities can also be used to predict signaling combinations with additional information on the effects of signaling strength (such as timed automata, e.g., [44]; and ODE-based modelling, e.g., [45,46]), something Boolean network predictions lack. Still, these methods require a tremendous amount of data for parameter optimization, whereas here it’s shown that Boolean Networks can already provide applicable insights from a simple dynamic approach.

Broader implications and future directions

The model presented provides an integrated Boolean Network that connects early heart field specification with differentiation towards atrial and ventricular subtypes. The dual-stage integration shows how early signals influence the subtype identities, providing a foundation as a hypothesis-generating tool for understanding the emergence of heart-field-specific cardiomyocytes. Insights from the model can aid the development of in vitro differentiation protocols aimed at producing specific CM types.

The model reproduces known expression patterns and perturbation phenotypes but could be further enhanced through additional validation steps. Quantitative comparison of the model predictions against a transcriptomic dataset requires an optimized experiment that uses the same set of signaling factors as in the model, applied to the corresponding cell types. Setting up and optimizing such an experiment might be timely and costly but would significantly strengthen conclusions from the model. New knockout experiments can be essential in extending the model to enhance its predictive accuracy. Such validations would support the refinement of the network into a better hypothesis-generating tool.

Ultimately, the model offers a promising foundation for understanding the control of cardiac development. The ability to reproduce key regulatory behaviors with minimal assumptions is especially valuable in a data-limited context. With integration of new datasets and modelling formalisms, the model can be progressively refined to capture more dynamic, spatial and mechanical aspects of heart development, to bring it closer to applications in disease modeling and cell therapy.

Supporting information

S1 Table. Extended evidence of the connections constituting the cardiomyocyte subtype GRN.

The listed sources form the basis of the interactions that make up Fig 4.

https://doi.org/10.1371/journal.pone.0340054.s001

(DOCX)

S2 Table. Boolean network description of the models.

The Table includes the description of the S1A) Heart Field specification S1B) Cardiomyocyte specification and S1C) Unified differentiation network.

https://doi.org/10.1371/journal.pone.0340054.s002

(DOCX)

Acknowledgments

This work is part of work conducted within the Heart Engine consortium involving the groups of Prof. Dr. J. Leijten, Prof Dr. R. Passier, and Dr. Ing. J. N. Post (University of Twente, The Netherlands), and Prof. Dr. G.J.C. Veenstra (Radboud University, The Netherlands). The study has benefited from discussions with the following: Stefano Schivo from the Open Universiteit, Heerlen, the Netherlands, Lucas F. Jansen Klomp, Xinqi Yan from the University of Twente, the Netherlands, and Rebecca R. Snabel, Gert Jan C. Veenstra from Radboud University, the Netherlands. This manuscript is a revised and expanded version of the preprint “Molecular mechanisms of heart field specific cardiomyocyte differentiation- a computational modeling approach” previously uploaded on bioRxiv (https://doi.org/10.1101/2024.12.19.629328). The description of the materials, methods, and core results is based on that original draft.

References

  1. 1. Doyle MJ, Lohr JL, Chapman CS, Koyano-Nakagawa N, Garry MG, Garry DJ. Human induced pluripotent stem cell-derived cardiomyocytes as a model for heart development and congenital heart disease. Stem Cell Rev Rep. 2015;11(5):710–27. pmid:26085192
  2. 2. Brade T, Pane LS, Moretti A, Chien KR, Laugwitz K-L. Embryonic heart progenitors and cardiogenesis. Cold Spring Harb Perspect Med. 2013;3(10):a013847–a013847.
  3. 3. Srivastava D. Making or breaking the heart: from lineage determination to morphogenesis. Cell. 2006;126(6):1037–48.
  4. 4. Herrmann F, Groß A, Zhou D, Kestler HA, Kühl M. A boolean model of the cardiac gene regulatory network determining first and second heart field identity. PLoS One. 2012;7(10):e46798. pmid:23056457
  5. 5. Litviňuková M, Talavera-López C, Maatz H, Reichart D, Worth CL, Lindberg EL, et al. Cells of the adult human heart. Nature. 2020;588(7838):466–72. pmid:32971526
  6. 6. Ng SY, Wong CK, Tsang SY. Differential gene expressions in atrial and ventricular myocytes: insights into the road of applying embryonic stem cell-derived cardiomyocytes for future therapies. Am J Physiol Cell Physiol. 2010;299(6):C1234-49. pmid:20844252
  7. 7. Protze SI, Lee JH, Keller GM. Human pluripotent stem cell-derived cardiovascular cells: from developmental biology to therapeutic applications. Cell Stem Cell. 2019;25(3):311–27.
  8. 8. Zhao M-T, Shao N-Y, Garg V. Subtype-specific cardiomyocytes for precision medicine: where are we now? Stem Cells. 2020;38(7):822–33. pmid:32232889
  9. 9. Pezhouman A, Nguyen NB, Shevtsov A, Qiao R, Ardehali R. In vitro generation of heart field specific cardiomyocytes. In: Kannan N, Beer P, editors. Stem cell assays: methods and protocols. New York (NY): Springer US; 2022. p. 257–67.
  10. 10. Shiba Y, Fernandes S, Zhu W-Z, Filice D, Muskheli V, Kim J, et al. Human ES-cell-derived cardiomyocytes electrically couple and suppress arrhythmias in injured hearts. Nature. 2012;489(7415):322–5. pmid:22864415
  11. 11. St John Sutton M, Lee D, Rouleau JL, Goldman S, Plappert T, Braunwald E, et al. Left ventricular remodeling and ventricular arrhythmias after myocardial infarction. Circulation. 2003;107(20):2577–82. pmid:12732606
  12. 12. Paige SL, Plonowska K, Xu A, Wu SM. Molecular regulation of cardiomyocyte differentiation. Circ Res. 2015;116(2):341–53. pmid:25593278
  13. 13. Wu S, Cheng C-M, Lanz RB, Wang T, Respress JL, Ather S, et al. Atrial identity is determined by a COUP-TFII regulatory network. Dev Cell. 2013;25(4):417–26. pmid:23725765
  14. 14. Machado D, Costa RS, Rocha M, Ferreira EC, Tidor B, Rocha I. Modeling formalisms in systems biology. AMB Express. 2011;1:45. pmid:22141422
  15. 15. Kerkhofs J, Roberts SJ, Luyten FP, Van Oosterwyck H, Geris L. Relating the chondrocyte gene network to growth plate morphology: from genes to phenotype. PLoS One. 2012;7(4):e34729. pmid:22558096
  16. 16. Morris MK, Saez-Rodriguez J, Sorger PK, Lauffenburger DA. Logic-based models for the analysis of cell signaling networks. Biochemistry. 2010;49(15):3216–24. pmid:20225868
  17. 17. Radulescu O, Gorban AN, Zinovyev A, Noel V. Reduction of dynamical biochemical reactions networks in computational biology. Front Genet. 2012;3:131. pmid:22833754
  18. 18. Delgado FM, Gómez-Vela F. Computational methods for Gene Regulatory Networks reconstruction and analysis: a review. Artif Intell Med. 2019;95:133–45. pmid:30420244
  19. 19. Müssel C, Hopfensitz M, Kestler HA. BoolNet--an R package for generation, reconstruction and analysis of Boolean networks. Bioinformatics. 2010;26(10):1378–80. pmid:20378558
  20. 20. Garg A, Xenarios I, Mendoza L, DeMicheli G. An efficient method for dynamic analysis of gene regulatory networks and in silico gene perturbation experiments. Annual International Conference on Research in Computational Molecular Biology; 2007. p. 62–76.
  21. 21. Naldi A, Monteiro PT, Müssel C, Consortium for Logical Models and Tools, Kestler HA, Thieffry D, et al. Cooperative development of logical modelling standards and tools with CoLoMoTo. Bioinformatics. 2015;31(7):1154–9. pmid:25619997
  22. 22. Grieb M, Burkovski A, Sträng JE, Kraus JM, Groß A, Palm G, et al. Predicting variabilities in cardiac gene expression with a Boolean network incorporating uncertainty. PLoS One. 2015;10(7):e0131832. pmid:26207376
  23. 23. Stefanovic S, Zaffran S. Mechanisms of retinoic acid signaling during cardiogenesis. Mech Dev. 2017;143:9–19. pmid:28007475
  24. 24. Ihara D, Watanabe Y, Seya D, Arai Y, Isomoto Y, Nakano A, et al. Expression of Hey2 transcription factor in the early embryonic ventricles is controlled through a distal enhancer by Tbx20 and Gata transcription factors. Dev Biol. 2020;461(2):124–31. pmid:32035085
  25. 25. Miao L, Li J, Li J, Tian X, Lu Y, Hu S, et al. Notch signaling regulates Hey2 expression in a spatiotemporal dependent manner during cardiac morphogenesis and trabecular specification. Sci Rep. 2018;8(1):2678. pmid:29422515
  26. 26. Xin M, Davis CA, Molkentin JD, Lien C-L, Duncan SA, Richardson JA, et al. A threshold of GATA4 and GATA6 expression is required for cardiovascular development. Proc Natl Acad Sci U S A. 2006;103(30):11189–94. pmid:16847256
  27. 27. Schwach V, Verkerk AO, Mol M, Monshouwer-Kloots JJ, Devalla HD, Orlova VV, et al. A COUP-TFII human embryonic stem cell reporter line to identify and select atrial cardiomyocytes. Stem Cell Rep. 2017;9(6):1765–79. pmid:29173897
  28. 28. Prummel KD, Nieuwenhuize S, Mosimann C. The lateral plate mesoderm. Development. 2020;147(12):dev175059. pmid:32561665
  29. 29. Biben C, Harvey RP. Homeodomain factor Nkx2-5 controls left/right asymmetric expression of bHLH gene eHand during murine heart development. Genes Dev. 1997;11(11):1357–69. pmid:9192865
  30. 30. McFadden DG, Barbosa AC, Richardson JA, Schneider MD, Srivastava D, Olson EN. The Hand1 and Hand2 transcription factors regulate expansion of the embryonic cardiac ventricles in a gene dosage-dependent manner. Development. 2005;132(1):189–201. pmid:15576406
  31. 31. Yamagishi H, Yamagishi C, Nakagawa O, Harvey RP, Olson EN, Srivastava D. The combinatorial activities of Nkx2.5 and dHAND are essential for cardiac ventricle formation. Dev Biol. 2001;239(2):190–203.
  32. 32. Srivastava D, Thomas T, Lin Q, Kirby ML, Brown D, Olson EN. Regulation of cardiac mesodermal and neural crest development by the bHLH transcription factor, dHAND. Nat Genet. 1997;16(2):154–60.
  33. 33. Srivastava D, Cserjesi P, Olson EN. A subclass of bHLH proteins required for cardiac morphogenesis. Science. 1995;270(5244):1995–9. pmid:8533092
  34. 34. Pereira FA, Qiu Y, Zhou G, Tsai M-J, Tsai SY. The orphan nuclear receptor COUP-TFII is required for angiogenesis and heart development. Genes Dev. 1999;13(8):1037–49.
  35. 35. Bruneau BG, Nemer G, Schmitt JP, Charron F, Robitaille L, Caron S, et al. A murine model of holt-oram syndrome defines roles of the T-box transcription factor Tbx5 in cardiogenesis and disease. Cell. 2001;106(6):709–21.
  36. 36. Koibuchi N, Chin MT. CHF1/Hey2 plays a pivotal role in left ventricular maturation through suppression of ectopic atrial gene expression. Circ Res. 2007;100(6):850–5. pmid:17332425
  37. 37. Kathiriya IS, King IN, Murakami M, Nakagawa M, Astle JM, Gardner KA, et al. Hairy-related transcription factors inhibit GATA-dependent cardiac gene expression through a signal-responsive mechanism. J Biol Chem. 2004;279(52):54937–43. pmid:15485867
  38. 38. Xin M, Small EM, van Rooij E, Qi X, Richardson JA, Srivastava D, et al. Essential roles of the bHLH transcription factor Hrt2 in repression of atrial gene expression and maintenance of postnatal cardiac function. Proc Natl Acad Sci U S A. 2007;104(19):7975–80. pmid:17468400
  39. 39. Moignard V, Woodhouse S, Haghverdi L, Lilly AJ, Tanaka Y, Wilkinson AC, et al. Decoding the regulatory network of early blood development from single-cell gene expression measurements. Nat Biotechnol. 2015;33(3):269–76. pmid:25664528
  40. 40. Offermann B, Knauer S, Singh A, Fernández-Cachón ML, Klose M, Kowar S, et al. Boolean modeling reveals the necessity of transcriptional regulation for bistability in PC12 cell differentiation. Front Genet. 2016;7:44. pmid:27148350
  41. 41. Hamey FK, Nestorowa S, Kinston SJ, Kent DG, Wilson NK, Göttgens B. Reconstructing blood stem cell regulatory network models from single-cell molecular profiles. Proc Natl Acad Sci U S A. 2017;114(23):5822–9. pmid:28584094
  42. 42. Krumsiek J, Marr C, Schroeder T, Theis FJ. Hierarchical differentiation of myeloid progenitors is encoded in the transcription factor network. PLoS One. 2011;6(8):e22649. pmid:21853041
  43. 43. Collombet S, van Oevelen C, Sardina Ortega JL, Abou-Jaoudé W, Di Stefano B, Thomas-Chollier M, et al. Logical modeling of lymphoid and myeloid cell specification and transdifferentiation. Proc Natl Acad Sci U S A. 2017;114(23):5792–9. pmid:28584084
  44. 44. Schivo S, Khurana S, Govindaraj K, Scholma J, Kerkhofs J, Zhong L, et al. ECHO, the executable CHOndrocyte: a computational model to study articular chondrocytes in health and disease. Cell Signal. 2020;68:109471. pmid:31837466
  45. 45. Schroer AK, Ryzhova LM, Merryman WD. Network modeling approach to predict myofibroblast differentiation. Cell Mol Bioeng. 2014;7(3):446–59. pmid:33072223
  46. 46. Martínez-Sosa P, Mendoza L. The regulatory network that controls the differentiation of T lymphocytes. Biosystems. 2013;113(2):96–103.