Figures
Abstract
Thoracic aortic aneurysms (TAAs) stem from diverse mechanical and mechanobiological disruptions to the aortic wall that can also increase the risk of dissection or rupture. There is increasing evidence that dysfunctions along the aortic mechanotransduction axis, including reduced integrity of elastic fibers and loss of cell-matrix connections, are particularly capable of causing thoracic aortopathy. Because different insults can produce distinct mechanical vulnerabilities, there is a pressing need to identify interacting factors that drive progression. In this work, we employ a finite element framework to generate synthetic TAAs arising from hundreds of heterogeneous insults that span a range of compromised elastic fiber integrity and cellular mechanosensing. From these simulations, we construct localized dilatation and distensibility maps throughout the aortic domain to serve as training data for neural network models to predict the initiating combined insult. Several candidate architectures (Deep Operator Networks, UNets, and Laplace Neural Operators) and input data formats are compared to establish a standard for handling future subject-specific information. We further quantify the predictive capability when networks are trained on geometric (dilatation) information alone, which mimics current clinical guidelines, versus training on both geometric and mechanical (distensibility) information. We show that prediction errors based on dilatation data are significantly higher than those based on dilatation and distensibility across all networks considered, highlighting the benefit of obtaining local distensibility measures in TAA assessment. Additionally, we identify UNet as the best-performing architecture across all training data formats. These findings demonstrate the importance of obtaining full-field measurements of both dilatation and distensibility in the aneurysmal aorta to identify the mechanobiological insults that drive disease progression, which will advance personalized treatment strategies that target the underlying pathologic mechanisms.
Author summary
We investigated causes of thoracic aortic aneurysms (TAAs), which are local enlargements of the aorta that can lead to life-threatening rupture. TAAs result from a combination of structural and cellular disruptions in the aortic wall, but need not pose the same risk even if appearing similar in size. We created computer models of aneurysms caused by different combinations of known risk factors, including tissue damage and cellular dysfunction. Using these simulations, we trained multiple neural network models with information on TAA shape (dilatation) and mechanics (distensibility) to see if they could recover the original causes. We found that predictions based on both dilatation and distensibility were significantly better than using dilatation alone, which represents the current clinical standard. Among the models tested, we also identified a best performing architecture, UNet, for these applications. These results suggest that measuring the full shape and mechanics of the aneurysmal aorta could lead to improved personalized diagnoses and treatment for TAAs.
Citation: Li DS, Goswami S, Cao Q, Oommen V, Assi R, Humphrey JD, et al. (2025) Importance of localized dilatation and distensibility in identifying determinants of thoracic aortic aneurysm with neural operators. PLoS Comput Biol 21(10): e1013550. https://doi.org/10.1371/journal.pcbi.1013550
Editor: Hamidreza Mortazavy Beni, Islamic Azad University Arsanjan Branch, IRAN, ISLAMIC REPUBLIC OF
Received: May 11, 2025; Accepted: September 22, 2025; Published: October 9, 2025
Copyright: © 2025 Li et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The data and scripts used in this study are available through GitHub (https://github.com/Centrum-IntelliPhysics/Mixed-Insults-Aortic-Aneurysm).
Funding: Funding was provided by the National Institutes of Health (R01 HL168473 to RA and GK and R01 HL163811 to JDH). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Treatment planning for individuals with thoracic aortic aneurysm (TAA), a localized dilatation resulting from underlying microstructural damage, continues to be based primarily on aortic size and growth rate [1], with maximum size being the de facto predictor. Nevertheless, life-threatening aortic events occur below established thresholds [2,3]. Importantly, there is increasing evidence that dysfunction can occur at several points along the mechanotransduction axis [4–8] that informs aortic cells responsible for maintaining healthy mural structure and function. These may include reduced or dysfunctional fibrillin-1 that stabilizes elastic fibers in the aortic wall [9,10], loss of cell-matrix connections (microfibrils and integrin binding sites) required for accurate assessment of wall stress and assembly of extracellular matrix [11,12], disrupted deposition or organization of fibrillar collagens [13], aberrant transforming growth factor-beta signaling [14], and altered cellular contractility [15,16]. Many of these effects have been shown to result in aneurysmal dilatation; yet, because different insults may lead to lesions of similar size but vastly different mechanical strength and vulnerability, there is a critical need to understand the underlying biomechanical and mechanobiological mechanisms that drive disease progression in a patient-specific manner. Moreover, since mechanobiological function (and its compromise) cannot be directly evaluated in vivo, such insight must be gained using often-limited, minimally-invasive clinical information following diagnosis.
Toward this end, we recently assessed the performance of neural network models in identifying factors that could contribute to TAA in a synthetic dataset resulting from finite element (FE) simulations, using metrics that can be derived simply from three-dimensional medical image reconstructions [17]. Briefly, a simulation platform for aortic growth and remodeling (G&R) [18] was used to generate aneurysms from randomly distributed, localized insults to structural constituents and mechanobiological mechanisms in a healthy aortic model. Local dilatation (normalized inner radius) and distensibility (normalized difference in radius between systole and diastole) fields from the resulting TAAs were used as training data for a deep operator network (DeepONet) [19] in the form of two-dimensional maps, which revealed the efficacy of a convolutional neural network (CNN)-based DeepONet architecture in predicting the associated initiating insult profile for a given dilatation-distensibility pair with a high degree of accuracy, thereby establishing the feasibility of evaluating mechanobiological compromise using image-based information.
Yet, experimental observations from disease models usually cannot be captured with a computational framework without consideration of multi-contributor insults [20,23]. For instance, recent work elucidating mechanobiological drivers of TAA in the genetic condition Marfan syndrome [21,22] identified several interconnected effects in its natural history, including both structural (e.g., elastic and collagen fibers) and biological (e.g., cellular mechanosensing and mechanoregulation of matrix) impairments in severe disease [10,23]. It is clear that synthetic data for training surrogate models must similarly consist of combined effects to yield clinically useful predictions. To more closely represent the scope of in vivo pathologies leading to TAA, we now build on this established framework by introducing additional factors to our synthetic data generation pipeline, including parameterization of the model to ascending aortic biomechanical properties of a mouse model of TAA (prone to dissection and rupture), as well as, importantly, focusing on aneurysms resulting from multiple superimposed initiators.
Furthermore, looking beyond the DeepONet framework, we wish to evaluate a range of candidate neural operators varying in type and architecture (including UNets and Laplace Neural Operators) that may be best suited for these applications. We compare the performance of several architectures in identifying these multi-contributor insult profiles from maps of local dilatation and distensibility. Importantly, to bracket the range of predictive capability in the lower limit of available clinical information, each network is also trained with knowledge of dilatation alone. We show that both dilatation and distensibility information are necessary for accurate estimation of combined insult profiles, especially for TAAs that exhibit similar maximum dilatations. When assessing the predictive accuracy of these models when trained on both dilatation and distensibility data versus dilation data only, we find that the prediction accuracy is significantly lower when relying solely on dilatation across all networks. Additionally, models based on convolutional neural networks, particularly UNets, exhibit the best performance in determining mechanobiological insult magnitude and distribution, serving as a promising tool for predicting TAA determinants and gaining insight into patient risk.
Materials and methods
Generation of synthetic data
Computational studies on TAA have investigated the propensity of multiple localized perturbations in aortic mechanics and mechanobiology to initiate and propagate dilatations in the aorta, including loss of elastic fiber integrity [20,24,25], reduced collagen cross-linking [26,27], dysfunctional mechanosensing [4], and impaired mechanoregulation of collagen fibers [28]. While these effects have largely been studied individually, with some recent effort devoted toward combining them to capture experimental observations [23,29], neural network models have yet to be trained on data comprised of these interacting contributors. We leverage our existing pipeline described elsewhere [18,30–33], built upon the notion of mechanobiological homeostasis and its loss in the aorta, with key extensions detailed below. Further information may be found in S1 Supporting information.
Contributing insults.
We focus on two primary classes of contributing factors to TAA [17], namely reduced integrity of elastic fibers in the aortic wall and compromised mechanosensing of intramural cells responsible for modulating the mechano-response of the aorta. Both effects have been shown to play critical roles in aneurysmal progression, leading to localized dilatation, abberant elastic energy storage, and increased circumferential stiffness [18,23,34]. As in previous work, insults to elastic fibers take the form of prescribed reductions in the stiffness of the elastin-dominated extracellular matrix (Table A in S1 Supporting information), particularly impacting the energy storage capability of the aorta required for the windkessel effect in normal function. Mechanosensing insults are characterized by scaling the deviation in the intramural stress that is “sensed” by aortic cells, where σ is a scalar measure and
is the homeostatic “set-point” that the cells work to maintain by modulating cell and matrix turnover in the aortic wall. Specifically, a coefficient
is introduced in the expression to give
(with
being perfect mechanosensing in the normal aorta), capturing impairment of the ability of the aorta to effectively modify the surrounding matrix constituents in response to elevated intramural stress [12,18,23].
FE simulation and dilatation/distensibility maps.
To generate synthetic training data, we employed a well-established computationally efficient FE model for determining the long-term, mechanobiologically equilibrated evolution of TAAs [18], with baseline parameters estimated to reproduce the in vivo behavior of a non-dilated ascending aorta of a mouse model of Marfan syndrome under normotensive conditions [23]. We used our recently developed method for mapping spatially heterogeneous perturbations generated via Gaussian random fields to produce randomly distributed insult profiles (Fig 1a), which indicate the local normalized severity of mechanobiological compromise throughout the initial aortic domain. Each profile defines spatially varying multi-contributor insults by superimposing defects in elastic fiber integrity
and mechanosensing
(Fig 1b). Each contributor pair was provided as input to the FE TAA simulation (Fig 1c) to compute the steady-state evolved geometry, at which point the G&R evolution was halted, and we simulated changes in luminal pressure over a cardiac cycle. Dilatation d was defined as the local inner radius r from the aortic centerline normalized by the average inner radius at the vessel ends, and distensibility
was defined as the normalized change in inner radius between systolic (S) and diastolic (D) loading (Fig 1d, cf. [17]), both of which can be computed from a cardiac gated medical image. The magnitudes of the insult contributors
were constrained to produce maximum dilatations of approximately 1.5 (defined as aneurysmal), with closely matched dilatation values across all insult combinations to focus attention on detecting the underlying pathologic mechanism at the time that a dilatation first reaches aneurysmal status, a critical time in clinical treatment planning (Fig A in S1 Supporting information displays the differential effects of a wider range of combinations of
on dilatation and distensibility). We note that, although TAA growth was simulated under systolic loading, we selected maps of dilatation at diastolic pressure for training data, with the aim to emulate what can easily be computed from a standard clinical image (tending to capture anatomy around end-diastole). We generated 100 unique spatial distributions (Fig B in S1 Supporting information), assigning 5 combinations of compromised elastic fiber integrity and mechanosensing to each profile, yielding 500 final dilatation-distensibility map pairs.
(a) Insult distributions along the circumferential (θ) and axial (z) directions are randomly generated with Gaussian random field (GRF) methods to define normalized insult profiles () that are mapped to the initial loaded aortic geometry to define the insult region (see S1 Supporting information for further details). (b) For each profile,
delineates multiple cases of mechanobiological insults defined by combinations of compromised integrity of elastic fibers (
) and dysfunctional mechanosensing (
) as inputs to (c) the nonlinear FE simulation to compute the long-term evolved state of the TAA. (d) Maps for dilatation (d) and distensibility (
) are obtained from the final geometry under multiple in vivo loading conditions (diastolic and systolic pressures), either processed as heat maps or converted to 8-bit grayscale maps, to serve as training data for the neural networks.
combinations are constrained to produce closely matched maximum dilatations (
) for each case to focus attention on detecting the underlying pathologic mechanism at the time when a dilatation first reaches aneurysmal status.
Numerical experiments
Knowledge of the underlying mechanical and mechanobiological insults for a given TAA could provide valuable insight into aortic vulnerability and optimal interventions. With this goal of predicting the relative contributions of both initiating insults based only on imaging-derived quantities, we compared several candidate neural operator architectures (two forms of a Deep Operator Network, UNet, and Laplace Neural Operator, described below) and input data formats to establish a standard for handling subject-specific dilatation and distensibility information.
Dilatation only vs. dilatation and distensibility.
In order to minimize radiation dose or imaging time, information is typically not acquired at multiple phases over the cardiac cycle; thus, only one image per clinical time point is available. Analogous to the current standard for surgical evaluation, we compared the performance of each network in predicting elastic fiber integrity and mechanosensing insults when trained only on dilatation maps (clinical standard) versus both dilatation and distensibility maps (requiring data over a cardiac cycle).
Heat maps vs. grayscale maps.
We recently developed competing data preprocessing methods for constructing dilatation and distensibility maps. We interpolated d and , evaluated on a nodal basis from the FE simulations in the θ–z domain, to
uniform grids either as the physical quantities (here named “heat maps”) or after normalizing and converting to 8-bit integer ([0 255]) intensity fields (here named “grayscale maps”), with dilatation maps sharing a preset range across all cases and distensibility maps each sharing a separate range across all cases. This approach was taken to allow straightforward integration with diverse neural operator architectures (discussed below), where heat maps are well-suited for feed-forward neural networks and grayscale maps for convolutional networks, as done previously [17]. Importantly, we also sought to evaluate our former best-performing architecture, based on grayscale map inputs, against alternative network designs.
Deep Operator Network (DeepONet).
DeepONet consists of branch networks to encode the input data and a trunk network to define the output domain, allowing a resolution-independent representation of input and output functions [19]. Building on our previous framework, we compared the performance of two DeepONets that differ in branch net architecture, with dilatation and distensibility maps serving as inputs, encoded via either fully connected convolutional neural networks (CNNs) (Fig 2a) or feed-forward neural networks (FNNs) (Fig 2b). In both DeepONets, the trunk net received initial positions of the aortic domain
as input (as cylindrical coordinates), using an FNN. We defined separate solution operators for each insult contributor that served as inputs to the combined loss function with a single set of trainable parameters (mean squared error), minimization of which allowed estimation of the insult profiles for elastic fiber integrity and mechanosensing.
Each branch net is a (a) CNN or (b) FNN that embeds the dilatation (d) and distensibility () maps as either grayscale or heat maps. Each trunk net takes the coordinates
to define the output domains of the corresponding insult contributor. The solution operators for each insult (
(
)) are formed from element-wise dot products of the outputs of the branch and trunk networks, with shared learnable parameters (
). Minimization of the loss function (
), defined as the combination of both operator outputs, determines the optimal parameters that enable estimation of the insult profiles and contributors (
).
UNet.
UNets are U-shaped fully convolutional neural networks, originally developed for biomedical image segmentation applications [35]. UNets are also the building blocks of the diffusion models [36] used in generative AI packages like DALL.E [37]. The extensive applications of UNets are rooted in their ability to learn the mapping from input to output signals through latent representations at varying degrees of coarseness, with coarser representations responsible for learning low-frequency components of the solution and finer representations responsible for high-frequency components. The multigrid representation learning makes the UNet effective in extracting spatiotemporal correlations entangled in the solutions of PDEs [39,40], subsequently leading to widespread applications from materials science [41] to turbulence modeling [42]. We provided the dilatation and distensibility information as inputs to a UNet that learned the mapping to the corresponding insult profiles (Fig 3).
Maps of dilatation (d) and distensibility () are encoded via successive layers of two-dimensional convolution (Conv2D), group normalization, and Gaussian Error Linear Unit (GELU) activation. Down- and up-sampling the input by factors of 2 is achieved through two-dimensional max-pooling operations (Maxpool2D) and two-dimensional transpose convolutional operations (Conv2DTranspose), respectively. Finally, skip connections are implemented to propagate information from earlier layers.
Laplace Neural Operator (LNO).
The Laplace Neural Operator is an architecture that performs operator learning in the Laplace domain for solving ordinary and partial differential equations [43]. This architecture leverages the solution of PDEs represented by the integral of Green’s function and the kernel integral is transformed and calculated in the Laplace domain. A key innovation in LNO is its Laplace layer, which employs an analytical pole-residue operation to establish a physically interpretable and meaningful mapping between the input and output functions in the Laplace domain. By independently learning the steady-state response, transient response with zero initial conditions, and transient response under nonzero initial conditions, LNO effectively captures true system dynamics. This enables it to achieve better approximation accuracy compared to other neural operators for extrapolation circumstances and dynamical systems. In this study, four Laplace layers were chosen, with width and modes for each layer being 32 and 8, respectively (Fig 4).
The dilatation (d) and distensibility () are lifted to a higher dimension via shallow neural network
, to which Laplace layers are applied, each yielding the output
, where
is the Laplace transform of the kernel integral transformation and V(s) is the Laplace transform of the lifted input function. Each layer contains pole-residue methods to obtain the transient and steady-state responses (utr(t) and ust(t), respectively), with residues
, coefficients
, and response poles
and
. Finally, the outputs (insult profiles) are projected back into the target dimension using the shallow network
.
Training
The 500 FE simulations were randomly categorized into 450 training and 50 testing samples. We then evaluated the prediction accuracy of each architecture-input data combination with a relative error computed over all testing samples, with separate errors for compromised elastic fiber integrity and mechanosensing. We also computed point-wise absolute errors in each predicted insult profile (
,
, where
is the predicted insult profile and
is the ground truth) to assess the ability of each network to reproduce the insult spatial distributions.
For the DeepONets, the initial architecture, learning rate, and number of training epochs were selected based on commonly used configurations reported in prior work [17]. Building on this baseline, we conducted a grid search over activation functions, network depth, and width, while monitoring loss curves to ensure convergence and mitigate overfitting. The final architecture was chosen based on the configuration that yielded the lowest validation loss within a predefined number of epochs (200,000), resulting in an Adam optimizer with the learning rate set at 0.001. In the cases of dilatation-only training, the branch net encoding distensibility was omitted from both architectures. For the UNet, the batch size was set to 250 and trained for 100,000 epochs. Following [39], we used GELU activations [38] and a cosine-annealing-based learning rate scheduler with an initial learning rate of 0.0001. For the LNO, the dilatation and distensibility information were provided as the inputs with dimensions , and the two insults were the outputs with dimensions
, where 20 is the batch size, 41 is the 2D resolution of the map, and 2 is the number of channels. Both the inputs and outputs were normalized by min-max normalization during training. The relative
error between the predicted insults and the true insults was used as the loss function. We first adopted commonly used settings from similar works [43] and performed a series of experiments to ensure convergence without overfitting, yielding 10,000 epochs and an exponential decay learning rate schedule with an initial learning rate of 0.001. The best model during these 10,000 epochs was chosen as the final surrogate model. Relevant network parameters are summarized in Table 1.
Note that the number of weight updates does not correspond to the number of epochs.
Results
Effects of insult contributors on dilatation and distensibility
FE simulations of TAAs initiated from combinations of compromised elastic fiber integrity and mechanosensing equilibrated with normalized inner radius ranging from 0.973 (at the vessel ends) to 1.657 (at the apex of the insult region), with a mean maximum dilatation of 1.496 ± 0.0476. Insult profiles having a greater span in the circumferential direction generated greater levels of dilatation, consistent with previous investigations [17]. The initial non-aneurysmal aorta exhibited a distensibility of 0.05442 (i.e., 5.4%), while aneurysmal dilatation decreased distensibility within the insult region, with a mean minimum distensibility of 0.0344 ± 0.0038 (i.e., 3.4%), corresponding to a ∼37% reduction associated with the ∼50% dilatation.
We further observed differential effects on the maximum dilatation and minimum distensibility depending on the dominating contributor to the overall insult. Although all combinations of compromised elastic fiber integrity and mechanosensing resulted in significant dilatation and decreased distensibility, mechanosensing-dominated insults tended to have consistently high dilatation, while elastic fiber integrity-dominated insults corresponded to lower distensibility within the insult region. Regions opposite the location of minimum distensibility also exhibited reduced distensibility despite not directly experiencing a prescribed insult, which occurred for all combined insults considered. Representative dilatation and distensibility profiles corresponding to the full range of combined insult magnitudes can be seen in Fig A in S1 Supporting information.
Overall prediction accuracy
Following training with the same 450 simulations, the four neural operator architectures were used to predict the remaining 50 insult fields of compromised elastic fiber integrity and dysfunctional mechanosensing based on inputs of dilatation only or dilatation and distensibility, each provided as either grayscale or heat maps. Most networks were able to predict the initiating insults within 10% relative error for both elastic fiber integrity (Fig 5a) and mechanosensing (Fig 5b). In particular, CNN-DeepONet and UNet architectures consistently predicted both insult profiles with approximately 5% error, even when trained on dilatation only. However, the relative
error for the LNO in dilatation only cases exceeded 15%, and the FNN-DeepONet in the dilatation only heat map case rose above 10% for both insult contributors. Table 2 shows the corresponding prediction errors.
Relative errors are reported separately for compromised (a) elastic (eln) fiber integrity and (b) mechanosensing over all testing cases considered: dilatation (d) only in grayscale and heat map formats, and dilatation and distensibility (d &
) in grayscale and heat map formats.
The best results (lowest error) are highlighted by boldface in both cases.
Figs 6 and 7 show representative examples of insult profile predictions for each network. Between the DeepONet designs, the FNN-DeepONet exhibited worse overall performance than the CNN-DeepONet, with the greatest difference in the dilatation-only heat map case. The CNN-DeepONet was able to achieve a similar prediction accuracy as the UNet when both were trained with dilatation only, whereas its performance appeared similarly competitive with the LNO when trained with both dilatation and distensibility. LNO, while better than both DeepONets at predicting the insult profiles when trained on both dilatation and distensibility, showed the worst prediction error in dilatation-only cases. Overall, UNet achieved the greatest prediction accuracy, particularly with dilatation and distensibility input data (Figs C and D in S1 Supporting information show additional comparisons).
(a) The ground truth combined insult field consisted of contributions of both compromised elastic fiber integrity () and dysfunctional mechanosensing (
) superimposed in the FE simulation to generate the TAA with dilatation and distensibility profiles shown in (b–c). Predictions (
) and absolute errors (
(
)) are shown for CNN-DeepONet, FNN-DeepONet, UNet, and LNO trained on (b) dilatation grayscale maps only and (c) dilatation and distensibility grayscale maps.
Similar to Fig 6 except for heat map inputs. (a) The ground truth combined insult field consisted of contributions of both compromised elastic fiber integrity () and dysfunctional mechanosensing (
) superimposed in the FE simulation to generate the TAA with dilatation and distensibility profiles shown in (b–c). Predictions (
) and absolute errors (
(
)) are shown for CNN-DeepONet, FNN-DeepONet, UNet, and LNO trained on (b) dilatation heat maps only and (c) dilatation and distensibility heat maps.
The greatest prediction error concentrated in the insult region with dilatation-only training; however, both DeepONet frameworks further yielded noticeable prediction errors away from the insult region, while UNet and LNO frameworks represented the non-insult region with high accuracy. Testing samples in which contributions of elastic fiber integrity and mechanosensing insults were more evenly balanced tended to be predicted with greater accuracy, while combined insults dominated by one factor were predicted less well.
Effects of distensibility maps
The inclusion of distensibility data in training reduced the prediction errors for all four networks by roughly a factor of 2 (Fig 5). As can be seen in Fig 6 and Fig 7, the prediction errors for both insult contributors effectively vanished upon the use of distensibility maps, especially for the high prediction errors of the FNN-DeepONet and LNO, and the network performances became comparable to one another. Nevertheless, there remained persistent small prediction errors with similar patterning in the dilatation-only cases, with DeepONet errors throughout the model domain and UNet and LNO errors clustered around the insult region.
Effects of input data format
Interestingly, use of grayscale versus heat maps for training did not significantly impact the predictions of the networks based on CNNs (Fig 8a–8b) or the LNO; for most networks, we observed only modest gains in prediction accuracy when using heat maps, and UNet performance was virtually the same regardless of data format (Fig 8e–8f). Yet, heat maps slightly improved the performance of the FNN-DeepONet in dilatation-distensibility cases (Fig 5), but their use in dilatation-only cases resulted in worse prediction errors (Fig 8c–8d).
All networks were trained on dilatation (d) maps only. (a) Ground truths for elastic fiber integrity and mechanosensing contributions. Predictions () and computation of absolute errors (
(
)) are shown for (b) CNN-DeepONet, (c) FNN-DeepONet, (d) UNet, and (e) LNO.
Discussion
In previous work, we established the use of DeepONets comprised of CNN-based branch networks to estimate insults to the aortic wall resulting from either compromised elastic fiber integrity or dysfunctional mechanosensing based on dilatation and distensibility information. We found that, for single-contributor insults with arbitrary spatial distributions, full-field measurements of dilatation and distensibility in grayscale map format led to more robust predictions compared to sparse sensor point grids, demonstrating the gain in prediction accuracy with two-dimensional knowledge of the geometry and mechanical properties of the aneurysmal aorta rather than measuring only at sparse locations around the point of maximum dilatation [17]. Herein, we extended this prior approach by generating new synthetic data that superimposes multiple insults to both mechanical properties and mechanobiological processes, while contrasting multiple neural network architectures and formats for data input. Combined with parameterization of the initial model to a diseased murine ascending aorta, this allowed the synthetic data to further close the gap between hypothetical in silico and in vivo cases. Moreover, to bracket the potential loss in predictive accuracy when the gold standard of input data cannot be reached, we investigated an intermediate case in which only dilatation measurements were available to train the model. Finally, with recent advancements in neural network applications to biomedical image analysis, we assessed the performance of UNets and LNOs as alternative approaches in performing the same tasks.
UNet as the preferred architecture for estimating combined insults
Reasonable predictions of combined insults were achieved with all networks considered in this investigation. Our previous DeepONet-based architecture continued to perform well for multi-contributor TAAs, including an additional framework comprised of FNNs, confirming our initial choice of network. Nevertheless, these architectures were still prone to modest prediction errors throughout the model domain, even far from the actual insult region (this can be seen in Fig E in S1 Supporting information, where despite the good predictions achieved by all networks, spatial distributions of error varied widely by network type). Alternatively, both UNet and LNO predictions consistently captured the boundaries of the insult region with much greater accuracy than the DeepONets, owing to their ability to capture low-order and high-order features in the input data (in contrast, fully connected CNNs are considered better with texture mapping than with feature detection in encoding images). Even significant UNet and LNO errors remained concentrated at the insult region. This difference is especially prominent in dilatation-only testing samples in which all networks yielded poor predictions (Fig 6). LNO performed the worst in these scenarios, though this is due to incorrectly estimating the magnitude of insult contributors and not because of failing to correctly identify the spatial distribution of the insult. We emphasize that, for assessing the functional state of the aorta, correct prediction of the insult magnitudes is paramount, especially if the clinical focus is the region of greatest dilatation.
Because predictions away from the aneurysm apex may be unrealistic to validate in clinical cases, we also reevaluated the error estimates on only the regions where the normalized insult was 50% or above (), concentrating on the most affected regions in each testing sample. Even after this filtering, the overall performances of each network exhibited the same trends as when evaluated on the whole domain, with one exception that UNets trained on heat maps rather than grayscale maps became the best performing architecture (Table B in S1 Supporting information). Overall, these results suggest for dilatation-only scenarios, both CNN-DeepONet and UNet could be appropriate choices, with the caveats of small, diffuse prediction errors arising from the DeepONet and the requirement that input data be interpolated onto square domains. In dilatation-distensibility cases, UNet and LNO outperformed the DeepONet-based frameworks in predicting the combined insult profiles, with UNet exhibiting the best prediction accuracy overall regardless of the input data format. Therefore, for predictions based on two-dimensional full-field maps, UNet is the preferred choice due to its accuracy and multi-scale processing capabilities, even if distensibility is not available.
Importance of distensibility in mixed insult data
We observed correlations between maximum dilatation and minimum distensibility depending on the relative contributions of elastic fiber integrity and mechanosensing insults. All combined insults decreased distensibility, though arising from different causes. In the case of compromised elastic fiber integrity, the prescribed reduction in mechanical properties of the elastin-dominated extracellular matrix resulted in both a transferal of load from the more compliant elastin to the stiffer collagen as well as further stimulation of production of collagen fibers; this double-hit contributed to the decreased distensibility. On the other hand, dysfunctional mechanosensing left the properties of the elastic fibers intact while governing the turnover of smooth muscle cells and especially collagen fibers to stiffen the wall. Being that elastic fibers play a dominant role in the ability of the aorta to distend with blood pressure changes, it is not surprising that the most dramatic effects are observed in elastic fiber integrity-dominated insults. This can be seen in Fig F in S1 Supporting information, in which synthetic TAAs that exhibit mostly the same average dilatation distributions fall into distinct distensibility patterns as a function of elastic fiber loss. We hypothesize that this differentiation in distensibility profile is what allows the networks to achieve accurate predictions when provided distensibility maps in either grayscale or heat map formats, as well as why dilatation-only cases proved more difficult to predict well.
Finally, examining the computed stress distributions within the synthetic data reveals dramatically different profiles depending on the dominating contributor (Fig G in S1 Supporting information). In particular, mechanosensing-dominated combined insults associated with significantly higher circumferential () and axial (
) stresses within the aneurysmal region. Estimating the intramural shear stress as
, which could serve as an analog for understanding vulnerability to dissection, is shown to be sensitive to mechanosensing dysfunction. This underscores that the mechanisms driving aortic dilatation may have a profound impact on the mechanical stability of the aneurysm while presenting nearly identically based on geometry alone. We thus reiterate that the ability to distinguish underlying contributors to TAA may provide critical insight into the potential mechanical strength of the aortic wall within the aneurysmal region.
Limitations
This study focused on combined insults of compromised elastic fiber integrity and dysfunctional mechanosensing in a model of a single initial geometry and material parameter set. Other studies suggest that additional effects could be considered, including altered mechanoregulation of matrix and compromised collagen cross-linking, in the synthetic data. We submit nonetheless that this work constitutes a viable potential pipeline demonstrating that dilatation and distensibility maps are essential to accurately estimate superimposed insults. Additionally, future work will need to incorporate increased variability in vessel dimensions, material properties, and loading conditions from diverse experimental investigations.
Next, the FE simulation platform used a computationally efficient implementation of equilibrated aortic growth and remodeling, yielding TAA geometries considered to be mechanobiologically stable, meaning no further growth of the TAA would be expected. Although this may be appropriate for a portion of TAA patients, these synthetic datasets do not address a TAA that would undergo unstable growth or develop a dissection or rupture. Furthermore, uniform cylindrical initial geometries were used to generate dilatation and distensibility maps; while effort was made to simulate TAAs of appropriate size for the initial vessel, the curvature of the ascending aorta was not considered. This simplification in original geometry allowed us to assess effects of combined insults, the utility of possible down-sampling, and so on, while targeting a level of dilatation (1.5-fold increase) that is clinically important independent of effects of curvature of the ascending aorta. Therefore, future work will need to consider both stable and unstable growth of TAAs based on combined insults applied to image-based aortic geometries, in which the UNet and LNO will be expected to have particular relevance. Looking to the future, we note that, if provided synthetic data of a longitudinal, time-evolving nature, different performances for these networks may reveal an alternative preferred choice of network, given that LNO is better suited for temporal applications and DeepONet is not as well suited.
Finally, there remains a compelling need to adapt this synthetic data pipeline to incorporate experimental data from actual human TAAs. With recent advancements in rapid three-dimensional reconstruction of the aorta from medical images [44], obtaining biomechanical properties from human samples [45], and using machine learning surrogate models to augment or complement high-cost biomechanical simulations [46], the potential to generate robust training data from clinically relevant growth and remodeling models may soon be a reality, at which point these pre-trained neural operators can help improve clinical decision-making.
Conclusion
In summary, using machine learning approaches for accurate prediction of mechanobiological insults that give rise to similarly sized TAAs promises to better discriminate mechanical vulnerability, thereby helping to inform subject-specific clinical treatment planning. Toward this end, we performed a systematic comparison of multiple neural networks in predicting the spatial distribution of synthetic multi-contributor TAA initiators when trained on dilatation information alone versus both dilatation and distensibility. Our findings highlight that:
- Full-field measurement of the ascending aortic domain facilitates localized assessment of the underlying aortic insult, advancing beyond the current clinical standard of evaluating a singular maximum aortic size.
- Localized dilatation fields have substantial predictive value in estimating mechanisms of TAA progression; however, corresponding knowledge of local distensibility is essential to accurately determine the relative contributions of multiple disease drivers.
- UNet-based neural networks are an efficient and promising tool for performing these predictions based on image-derived aortic quantities.
Ultimately, as progress continues in machine learning-assisted automatic segmentation of medical images and generation of biomechanical FE models, the prospect of obtaining these scalar-field measurements in a rapid, patient-specific manner could yield a minimally invasive, image-based approach for improved TAA risk assessment and management.
Supporting information
S1. Supporting information
Supplemental Tables A & B and Figs A–G.
https://doi.org/10.1371/journal.pcbi.1013550.s001
(PDF)
Acknowledgments
The authors acknowledge the computing support provided by the Advanced Research Computing at Hopkins (ARCH) core facility at Johns Hopkins University and the Rockfish cluster, as well as the computational resources and services at the Center for Computation and Visualization at Brown University, where all numerical experiments were carried out. The ARCH core facility (rockfish.jhu.edu) is supported by the National Science Foundation (OAC1920103).
References
- 1. Senser EM, Misra S, Henkin S. Thoracic aortic aneurysm: a clinical review. Cardiol Clin. 2021;39(4):505–15. pmid:34686263
- 2. Kim JB, Spotnitz M, Lindsay ME, MacGillivray TE, Isselbacher EM, Sundt TM. Risk of aortic dissection in the moderately dilated ascending aorta. Journal of the American College of Cardiology. 2016;68(11):1209–19.
- 3. Mansour AM, Peterss S, Zafar MA, Rizzo JA, Fang H, Charilaou P, et al. Prevention of aortic dissection suggests a diameter shift to a lower aortic size threshold for intervention. Cardiology. 2018;139(3):139–46. pmid:29346780
- 4. Humphrey JD, Milewicz DM, Tellides G, Schwartz MA. Cell biology. Dysfunctional mechanosensing in aneurysms. Science. 2014;344(6183):477–9. pmid:24786066
- 5. Humphrey JD, Schwartz MA, Tellides G, Milewicz DM. Role of mechanotransduction in vascular biology: focus on thoracic aortic aneurysms and dissections. Circ Res. 2015;116(8):1448–61. pmid:25858068
- 6. Karimi A, Milewicz DM. Structure of the Elastin-contractile units in the thoracic aorta and how genes that cause thoracic aortic aneurysms and dissections disrupt this structure. Can J Cardiol. 2016;32(1):26–34. pmid:26724508
- 7. Yamashiro Y, Yanagisawa H. The molecular mechanism of mechanotransduction in vascular homeostasis and disease. Clin Sci (Lond). 2020;134(17):2399–418. pmid:32936305
- 8. Creamer TJ, Bramel EE, MacFarlane EG. Insights on the pathogenesis of aneurysm through the study of hereditary aortopathies. Genes (Basel). 2021;12(2):183. pmid:33514025
- 9. López-Guimet J, Andilla J, Loza-Alvarez P, Egea G. High-resolution morphological approach to analyse elastic laminae injuries of the ascending aorta in a murine model of Marfan syndrome. Sci Rep. 2017;7(1):1505. pmid:28473723
- 10. Cavinato C, Chen M, Weiss D, Ruiz-Rodríguez MJ, Schwartz MA, Humphrey JD. Progressive microstructural deterioration dictates evolving biomechanical dysfunction in the Marfan aorta. Front Cardiovasc Med. 2021;8:800730. pmid:34977201
- 11. Li S, Van Den Diepstraten C, D’Souza SJ, Chan BMC, Pickering JG. Vascular smooth muscle cells orchestrate the assembly of type I collagen via alpha2beta1 integrin, RhoA, and fibronectin polymerization. Am J Pathol. 2003;163(3):1045–56. pmid:12937145
- 12.
Turlo KA, Noel ODV, Vora R, LaRussa M, Fassler R, Hall-Glenn F, et al. An essential requirement for
1 integrin in the assembly of extracellular matrix proteins within the vascular wall. Dev Biol. 2012;365(1):23–35. pmid:22331032
- 13. Weiss D, Rego BV, Cavinato C, Li DS, Kawamura Y, Emuna N, et al. Effects of age, sex, and extracellular matrix integrity on aortic dilatation and rupture in a mouse model of Marfan syndrome. Arterioscler Thromb Vasc Biol. 2023;43(9):e358–72. pmid:37470181
- 14.
Ramirez F, Caescu C, Wondimu E, Galatioto J. Marfan syndrome; a connective tissue disease at the crossroads of mechanotransduction, TGF
signaling and cell stemness. Matrix Biol. 2018;71–72:82–9. pmid:28782645
- 15. Chung AWY, Au Yeung K, Sandor GGS, Judge DP, Dietz HC, van Breemen C. Loss of elastic fiber integrity and reduction of vascular smooth muscle contraction resulting from the upregulated activities of matrix metalloproteinase-2 and -9 in the thoracic aortic aneurysm in Marfan syndrome. Circ Res. 2007;101(5):512–22. pmid:17641224
- 16. Milewicz DM, Trybus KM, Guo D-C, Sweeney HL, Regalado E, Kamm K, et al. Altered smooth muscle cell force generation as a driver of thoracic aortic aneurysms and dissections. Arterioscler Thromb Vasc Biol. 2017;37(1):26–34. pmid:27879251
- 17. Goswami S, Li DS, Rego BV, Latorre M, Humphrey JD, Karniadakis GE. Neural operator learning of heterogeneous mechanobiological insults contributing to aortic aneurysms. J R Soc Interface. 2022;19(193):20220410. pmid:36043289
- 18. Latorre M, Humphrey JD. Numerical knockouts-In silico assessment of factors predisposing to thoracic aortic aneurysms. PLoS Comput Biol. 2020;16(10):e1008273. pmid:33079926
- 19. Lu L, Jin P, Pang G, Zhang Z, Karniadakis GE. Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators. Nat Mach Intell. 2021;3(3):218–29.
- 20. Weiss D, Latorre M, Rego BV, Cavinato C, Tanski BJ, Berman AG, et al. Biomechanical consequences of compromised elastic fiber integrity and matrix cross-linking on abdominal aortic aneurysmal enlargement. Acta Biomater. 2021;134:422–34. pmid:34332103
- 21. Pereira L, Lee SY, Gayraud B, Andrikopoulos K, Shapiro SD, Bunton T, et al. Pathogenetic sequence for aneurysm revealed in mice underexpressing fibrillin-1. Proc Natl Acad Sci U S A. 1999;96(7):3819–23. pmid:10097121
- 22. Judge DP, Biery NJ, Keene DR, Geubtner J, Myers L, Huso DL, et al. Evidence for a critical contribution of haploinsufficiency in the complex pathogenesis of Marfan syndrome. J Clin Invest. 2004;114(2):172–81. pmid:15254584
- 23. Li DS, Cavinato C, Latorre M, Humphrey JD. Computational modelling distinguishes diverse contributors to aneurysmal progression in the Marfan aorta. Proc R Soc A. 2023;479(2276).
- 24. Wilson JS, Baek S, Humphrey JD. Importance of initial aortic properties on the evolving regional anisotropy, stiffness and wall thickness of human abdominal aortic aneurysms. J R Soc Interface. 2012;9(74):2047–58. pmid:22491975
- 25. Laubrie JD, Mousavi SJ, Avril S. About prestretch in homogenized constrained mixture models simulating growth and remodeling in patient-specific aortic geometries. Biomech Model Mechanobiol. 2022;21(2):455–69. pmid:35067825
- 26. Wilson JS, Baek S, Humphrey JD. Parametric study of effects of collagen turnover on the natural history of abdominal aortic aneurysms. Proc Math Phys Eng Sci. 2013;469(2150):20120556. pmid:23633905
- 27. Watton PN, Hill NA, Heil M. A mathematical model for the growth of the abdominal aortic aneurysm. Biomech Model Mechanobiol. 2004;3(2):98–113. pmid:15452732
- 28. Irons L, Estrada AC, Humphrey JD. Intracellular signaling control of mechanical homeostasis in the aorta. Biomech Model Mechanobiol. 2022;21(5):1339–55. pmid:35867282
- 29. Jamaleddin Mousavi S, Jayendiran R, Farzaneh S, Campisi S, Viallon M, Croisille P, et al. Coupling hemodynamics with mechanobiology in patient-specific computational models of ascending thoracic aortic aneurysms. Comput Methods Programs Biomed. 2021;205:106107. pmid:33933713
- 30. Humphrey JD. Constrained mixture models of soft tissue growth and remodeling - twenty years after. J Elast. 2021;145(1–2):49–75. pmid:34483462
- 31. Latorre M, Humphrey JD. A mechanobiologically equilibrated constrained mixture model for growth and remodeling of soft tissues. Z Angew Math Mech. 2018;98(12):2048–71. pmid:30618468
- 32. Latorre M, Humphrey JD. Modeling mechano-driven and immuno-mediated aortic maladaptation in hypertension. Biomech Model Mechanobiol. 2018;17(5):1497–511. pmid:29881909
- 33. Murtada S-I, Kawamura Y, Li G, Schwartz MA, Tellides G, Humphrey JD. Developmental origins of mechanical homeostasis in the aorta. Dev Dyn. 2021;250(5):629–39. pmid:33341996
- 34. Bellini C, Bersi MR, Caulk AW, Ferruzzi J, Milewicz DM, Ramirez F, et al. Comparison of 10 murine models reveals a distinct biomechanical phenotype in thoracic aortic aneurysms. J R Soc Interface. 2017;14(130):20161036. pmid:28490606
- 35.
Ronneberger O, Fischer P, Brox T. U-Net: Convolutional networks for biomedical image segmentation. Medical image computing and computer-assisted intervention–MICCAI 2015 : 18th international conference, Munich, Germany, October 5-9, 2015, proceedings, part III. Springer; 2015. p. 234–41.
- 36. Ho J, Jain A, Abbeel P. Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems. 2020;33:6840–51.
- 37.
Ramesh A, Pavlov M, Goh G, Gray S, Voss C, Radford A. In: International conference on machine learning. PMLR; 2021. p. 8821–31.
- 38.
Hendrycks D, Gimpel K. Gaussian error linear units (gelus). arXiv preprint 2016. https://arxiv.org/abs/1606.08415
- 39.
Gupta JK, Brandstetter J. Towards multi-spatiotemporal-scale generalized pde modeling. arXiv preprint 2022. https://arxiv.org/abs/2209.15616
- 40.
Ovadia O, Turkel E, Kahana A, Karniadakis GE. DiTTO: Diffusion-inspired Temporal Transformer Operator. arXiv preprint 2023. https://arxiv.org/abs/2307.09072
- 41. Oommen V, Shukla K, Desai S, Dingreville R, Karniadakis GE. Rethinking materials simulations: blending direct numerical simulations with neural operators. NPJ Comput Mater. 2024;10(1):145.
- 42.
Oommen V, Bora A, Zhang Z, Karniadakis GE. Integrating neural operators with diffusion models improves spectral representation in turbulence modeling. arXiv preprint 2024. https://arxiv.org/abs/2409.08477
- 43. Cao Q, Goswami S, Karniadakis GE. Laplace neural operator for solving differential equations. Nat Mach Intell. 2024;6(6):631–40.
- 44. Burris NS, Bian Z, Dominic J, Zhong J, Houben IB, van Bakel TMJ, et al. Vascular deformation mapping for CT surveillance of thoracic aortic aneurysm growth. Radiology. 2022;302(1):218–25. pmid:34665030
- 45. Sommer G, Benedikt C, Niestrawska JA, Hohenberger G, Viertler C, Regitnig P, et al. Mechanical response of human subclavian and iliac arteries to extension, inflation and torsion. Acta Biomater. 2018;75:235–52. pmid:29859367
- 46. Motiwale S, Zhang W, Feldmeier R, Sacks MS. A neural network finite element approach for high speed cardiac mechanics simulations. Computer Methods in Applied Mechanics and Engineering. 2024;427:117060.