Figures
Abstract
The run-and-tumble behavior is a simple yet powerful mechanism that enables microorganisms to efficiently navigate and adapt to their environment. These organisms run and tumble alternately, with transition rates modulated by intracellular chemical concentration. We introduce a neural network-based model capable of identifying the governing equations underlying run-and-tumble dynamics. This model accommodates the nonlinear functions describing movement responses to intracellular biochemical reactions by integrating the general structure of ODEs that represent these reactions, without requiring explicit reconstruction of the reaction mechanisms. It is trained on datasets of measured responses to simple, controllable signals. The resulting model is capable of predicting movement responses in more realistic, complex, temporally varying environments. Moreover, the model can be used to deduce the underlying structure of hidden intracellular biochemical dynamics. We have successfully tested the validity of the identified equations based on various models of Escherichia coli chemotaxis, demonstrating efficacy even in the presence of noisy measurements. Moreover, we have identified the governing equation of the photo-response of Euglena gracilis cells using experimental data, which was previously unknown, and predicted the potential architecture of the intracellular photo-response pathways for these cells.
Author summary
Microscopic organisms like bacteria and algae often adjust their movement in response to changing environments, such as light or chemical signals. They do this using a behavior called run-and-tumble, alternating between straight swimming and reorientation. Understanding how internal cell processes drive this behavior is difficult, especially when we can’t directly observe the biochemical pathways involved. In this study, we present a machine learning approach that discovers the governing rules behind these behaviors using only observable input-output data. Our model, based on neural networks, learns from how cells respond to simple stimuli and can predict how they behave under more complex, realistic conditions, without needing to know the details of internal reactions. We validated the method using simulations of bacterial chemotaxis and real experimental data from Euglena gracilis, a microorganism that responds to light. Our model accurately predicted cell responses and revealed insights into possible internal signaling structures. This approach provides a powerful tool for studying biological behavior and could help uncover how other organisms process environmental information.
Citation: Lei S, Li Y, Ma Z, Zhang H, Tang M (2025) Identification of the governing equation of stimulus-response data for run-and-tumble dynamics. PLoS Comput Biol 21(8): e1013287. https://doi.org/10.1371/journal.pcbi.1013287
Editor: Calina Copos, Northeastern University, UNITED STATES OF AMERICA
Received: February 17, 2025; Accepted: June 30, 2025; Published: August 5, 2025
Copyright: © 2025 Lei et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The code and data required to replicate findings reported in the article are available at https://github.com/leiemily/NNsummary.
Funding: M.T. is supported by National Natural Sciences Fundation of China (Grants No. NSFC12031013). H. P. Z. is supported by the National Natural Science Foundation of China (Grants No. 12225410 and No. 12074243), the Ministry of Science and Technology Most China (Grant No. 2021YFA0910700). The sponsors or funders play no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
In the intricate natural environment, organisms have developed a variety of sensory receptors to perceive external stimuli, including chemoattractants [1–13], thermal changes [14], hydrodynamics [15–17], and light conditions [18–34]. These sensory systems are essential for the survival of organisms, enabling them to avoid harmful environments, locate food sources, find mates, and secure suitable habitats. Complicated information acquired from their sensors is processed through many biochemical and neural pathways. Studying these signal-processing pathways, which have evolved over millions of years, is crucial for understanding organisms’ complex searching strategies and adaptive capacities to thrive in diverse ecosystems.
A variety of proteins, ions, complex biological macromolecules, and neurons are involved in processing information and producing appropriate behavioral responses. Information extraction procedures can often be described using the time dynamics of intracellular biochemical reactions that determine how the internal states change under specific inputs or stimuli. One can denote the governing equation of J internal-state variables of interest by and write the evolutionary equation in the general ordinary differential equations (ODEs) form:
where s(t) denotes the external stimuli. denotes the set of functions that characterize the regulatory relations among
. The particular values of internal variables together with the value of external stimuli determine the behavioral response at the cellular level, which can be quantified by f through an output function F:
Here, the particular forms of H and F depend on the biological systems and species.
While the general formula above appears straightforward, the specific forms of H and F may be highly nonlinear. Determining the exact mathematical expressions for H and F requires both comprehensive knowledge of intracellular biochemical reaction networks and precise quantification of internal state variables. To date, quantitative understanding and the development of associated mathematical models for signal processing and directed motility have been primarily established for E. coli chemotaxis. In contrast, acquiring such measurements of internal reactions in more complex unicellular organisms remains particularly challenging, thereby limiting the applicability of traditional modeling frameworks to these systems.
A natural question arises: Is it possible to discern the governing equations from given stimulus-response data without explicit knowledge of the underlying intracellular biochemical reactions H and the behavioral response function F? How do we design the external signal and use its characteristic information (values and gradients) to infer the possible underlying structure of intracellular kinetics?
We introduce a novel framework for identifying the governing equations of stimulus-response data, thereby eliminating the need to know the specific forms of Eqs (1) and (2). By reformulating Eqs (1) and (2) using the chain rule, we obtained new equations that are satisfied by the stimulus-response data and their derivatives. The terms in these new equations can be represented by fully connected neural networks, which can be trained using appropriate loss functions. The trained neural network model can predict the cell’s temporal response to any type of external stimulus, provided that the stimuli do not change too rapidly. Furthermore, one can infer the number of necessary internal variables for different stimuli.
To validate the proposed approach, we first apply it to numerical experiments using established models of Escherichia coli chemotaxis. The results demonstrate that the model accurately captures stimulus-response behavior and effectively recovers key properties of the underlying system. Although the E. coli chemotaxis pathway is more complex than models using only one (J = 1) or two (J = 2) internal variables, when the stimulus does not change too rapidly, most proteins can be considered to be in a quasi-equilibrium state. Given our focus on investigating the quantitative relationship between stimulus inputs and behavioral outputs, previous studies [35–37] have shown that models with one (J = 1) or two (J = 2) internal variables suffice to reproduce the diverse behavioral patterns observed experimentally. Therefore, this study systematically examines these two cases (J = 1 and J = 2) as representative scenarios.
Following validation with E. coli chemotaxis models, we then applied our method to experimental data of Euglena gracilis phototaxis behavior. E. gracilis, a eukaryotic microorganism, plays a significant role in ecological systems. These organisms are known for their phototactic responses, which are crucial to ecological phenomena such as algal blooms and diel vertical migration. A wide range of photo-responsive behaviors have been reported for E. gracilis, including positive and negative phototaxis, polygonal swimming motion, and localized spinning [38]. When confined to a 2D environment, as in [39], E. gracilis exhibits a run-and-tumble pattern and uses phototaxis to redistribute itself based on light intensity. However, the complex photo-receptive systems of E. gracilis are difficult to measure directly, and current knowledge about the functional forms of H and F is limited. It remains uncertain how many internal variables are essential to describe their phototaxis behavior. Our work has achieved two key outcomes for the phototaxis response of E. gracilis: First, we derived the governing equation that can predict the response under any given stimulus (though there is a threshold for how rapidly the stimulus can change, due to limitations in the available experimental data). Second, we inferred the potential structure of the underlying intracellular biochemical dynamics.
The insights derived from this study are not accessible via conventional recurrent neural networks (RNNs) [40], neural controlled differential equations (Neural CDEs) [41], or the convolution response kernel method [42,43]. RNNs lack the interpretability required to elucidate the underlying intracellular biochemical dynamics and require equidistant sampling of data points. Furthermore, Neural CDEs encounter difficulties in managing integrals involving dual internal variables [41]. In contrast, our proposed method circumvents the complexities associated with path integrals and offers a general governing equation. While the convolution response kernel method [42,43] can delineate the relationship between external stimuli and output signals, it is challenging to identify a universal response kernel applicable to diverse external stimuli. More importantly, we are able to achieve good predictive performance with a relatively small training set.
Methods
The modeling framework
The main idea of our approach is to use chain rules to eliminate the internal variables. We establish a comprehensive functional relationship between the time derivatives of the stimulus and the tumbling fraction, which is then represented and trained by the neural network approximations.
Single internal variable model (SIVM). When there is only a single internal-state variable m in Eq (1), the equation becomes
After applying the chain derivative rule to the output function and substituting Eq (3), one has
where Fm and Fs denote the first-order partial derivative of F in (2) with respect to m and s respectively. Typically, the response F depends monotonically on m, as in various E. coli chemotaxis models [35,36]. Assume that F in (2) monotonically depends on m, then for given s and f, one can determine a unique m that satisfies . This indicates that one can identify a function M such that
Substituting Eq (5) into Eq (4) yields
The first term Fm(M, s)H(M, s) and the coefficient of the second term Fs(M, s) on the right side of (6) can be expressed as functions that depend only on f and s, which we denote as G1(f, s) and G2(f, s), respectively. Eq (6) can then be written as:
The above equation has no internal variable m and depends only on stimulus s(t) and response f(t).
Dual internal variable model (DIVM). When two internal states (m,n) are necessary, the dual internal variable reaction can be modeled by the following ODEs:
Similarly, if F(m,n,s) exhibits a monotonic dependence on m and n in the dual internal-state situation, as is valid for the E. coli chemotaxis pathway [4,44], one can assume that there exists a function M such that
can be determined. Applying the chain rule to the temporal derivative of and then using Eqs (8) and (9) yields
The Eq (10) describes a complex relationship among variables n, f, s, and their gradients and
, which indicates that one can find a function N that satisfies
Though it is impossible to determine the function N since the internal-state n is not measurable, (11) and (9) indicate that both m and n can be determined by .
We introduce the information of the second-order derivative of f and s. More precisely, applying the chain rule to the second-order derivative of f, one can obtain
where
Here Fm, Fn and Fs denote respectively the first-order partial derivative of f with respect to m, n and s. Fmm, Fnn, Fss, Fms, Fns and Fmn are the second-order partial derivatives. In Eq (12), the first and second-order time derivatives of m and n can all be considered as functions that depend on m, n, s, and . Thus, the right-hand side of (12) can be expressed as
where and
are two undetermined functions. By substituting Eq (9) and Eq (11) into Eq (13), one has
Eq (14) includes no internal variables m, n, and depends only on stimulus s(t) and response f(t).
The two models (7) and (14) give general forms that directly connect the signal s(t) and response f(t). However, the particular forms of G1(f, s), G2(f, s), and
need to be determined from data and we use deep neural networks (DNN) to represent them in the subsequent part.
Remark. The monotonicity of the response function F with respect to internal variables is essential for inverting the function. While this assumption holds for E. coli chemotaxis, its generalizability to other microorganisms can be reasoned as follows. The central idea of our reformulation is this: the quantities m and n are unknowns that could potentially represent the concentration of a specific protein or a function derived from it. This approach allows m and n to be defined flexibly, such that they do not inherently need to correspond to the concentration of a specific protein. We assume one key protein concentration is c, and define . By making no assumptions about the specific form of
, we can always posit the existence of a function
such that F is monotonic with respect to
, even if F is non-monotonic with respect to c itself.
Chemotaxis models in E. coli
A well-studied example of signal processing and behavioral response is the run-and-tumble E. coli chemotaxis system. E. coli can move toward favorable environments through alternating run-and-tumble patterns by rotating flagella, governed by an internal chemosensory system, which can be described in the form of (1) and (2):
- The intracellular mechanism of the E. coli chemosensory system. The internal biochemical reactions in E. coli is of the form (1) and can be highly complex. The forms and numbers of ODEs differ significantly in different models. One type of E. coli model considers the molecular details including receptor clusters and a coupled biochemical reaction network [45–53]. The other category considers only the system-level properties of the signaling pathway, which employs a receptor clustering-based framework and takes the signaling complex as the basic unit [35–37,54–58]. Some complex models can finely describe the dynamics of internal variables with up to 158 ODEs [58]. The signalling pathways that primarily govern cell motility can often be described by 1 to 2 ODEs [35,37].
- The behavioral response. Flagella of each E. coli rotate either counter-clockwise (CCW) or clockwise (CW). When most flagella rotate CCW, the bacterium runs straight; when most flagella rotate CW, the cell stops and changes its movement direction. The concentration of the response regulator protein affects the rotational direction of the flagella, thus the probability that the bacterium is in the running or tumbling phase. The behavioral response of E. coli cells can be described by a tumbling fraction (the probability that the cells are in the tumbling state) or tumbling frequency, which can be modeled by the equation of the form (2). These two quantities are also modeled by functions of various forms. For example, the tumbling fractions take the following forms: f = F(m1, s) in [35], f = F(m1) in [44], or
in [4]. The specific forms of F vary in the literature, depending on the experimental setup and bacterial strains.
In the past decades, great progress has been made in understanding the mechanism of the chemosensory system and behavioral response, leading to various mathematical models. Here, we present an exposition of three representative models in the literature: one reduced model with J = 1 and two comprehensive dynamical models with J = 2.
The well-studied E. coli chemosensory system mainly adapts to the relatively slow receptor methylation and demethylation processes catalyzed by two enzymes CheR and CheB, which can be described by one ODE. The slow dynamics of receptor methylation level, denoted as m, and tumbling fraction, denoted as f, can be generally modeled by the following single internal-state model I [36,44]:
Model I for m is typically coupled with equations for the downstream phosphorylation and dephosphorylation reactions of CheA, CheY, and CheB. Since these reactions occur much faster than the methylation and demethylation reactions, one can use quasi-steady state (QSS) approximations to get the simplified Model I. The specific forms and parameter values of and
can be found in Appendix A of S1 Text, and the validation of the monotonic dependence of
on m can be found in Fig E(A) in S1 Text.
The second model takes into account the excitation of the signal transduction pathway, in which CheY-P plays an important role. The changes in the methylation level (m) [35] and the CheY-P concentration (Yp) [53] can be modeled by the evolution equation of the following form with f determined only by Yp [44]:
The temporal scales of m and Yp differ significantly. The methylation level m changes on a much slower time scale (ranging from seconds to minutes) compared to the CheY-P concentration, which evolves on a timescale of less than 1 s [53]. The specific forms and parameter values of ,
and
can be found in Appendix A of S1 Text, and the validation of the monotonic dependence of
on Yp can be found in Fig E(B) in S1 Text.
A third model with dual internal states that do not have temporal scale differences can be found in [4]. Based on the findings that changes in CheY-P levels during adaptation lead to changes in FliM proteins, the dynamics of FliM proteins and receptor methylation level m in Model III are:
Here depends monotonically on m and
(see Fig E(C-D) in S1 Text). The temporal scales of m and
in Model III are similar.
These three models have connections, yet are distinct. As shown in Fig F in S1 Text, their different responses to a sudden change in s(t) are displayed. Model I is based on detailed biochemistry reactions, where the CheY-P level Yp is determined by m and s through QSS approximation. When s(t) changes fast, in a short time scale, the QSS approximation no longer holds, necessitating the use of in Model II to describe the dynamics of Yp. Model I and II can explain the “perfect adaptation” [59], which indicates that when cells are exposed to sudden environmental changes, they respond by temporarily changing their tumbling frequency, then gradually returning to their pre-stimulus tumbling frequency over time. On the other hand, Model III emphasizes the “overshoot” feature during the adaptation process, quantifying the degree of excessive response before the return to pre-stimulus behavior. The particular forms of
in [36],
in [35], and
in [4] are different in different works, but the last two are linear approximation of the first one. More details of Models I-III are included in Appendix A in S1 Text. Given the diversity of E. coli models, identifying the appropriate model to describe a new experimental dataset presents a challenge.
Due to the complexity of model selection and the difficulty of determining the number of internal states, it becomes essential to establish governing equations that give the relationship between stimuli and response signals without relying on information about internal reactions. Although the internal chemical reactions within cells may be highly complex, the key reactions affecting their movement behavior can be described using only 1-2 ODEs, as in the E. coli chemotaxis pathway.
Photo-response movement in algae
Photosynthetic microorganisms, such as microalgae, exhibit phototaxis—directional movement in response to light stimuli [22,25,26]. For example, E. gracilis adjusts its flagellar beating patterns in response to perceived light intensity, enabling navigation toward or away from light sources [23–25,29,33,60–63]. In microalgae, photoreceptors convert optical signals into electrical signals via electron/ion interactions, which are then transmitted to locomotor organelles like flagella.
- The possible intracellular mechanism of the E. gracilis photoreception system The photoreceptor has been revealed to be photoactivated adenylyl cyclase, which produces cyclic adenosine monophosphate (cAMP) upon blue light exposure. Intracellular cAMP levels rise sharply within 1 s of illumination before rapidly returning to baseline. This transient cAMP signal may activate a protein kinase, which could phosphorylate one or several flagellar proteins to alter beating patterns, ultimately regulating phototactic behavior of E. gracilis [64]. However, no model is available to quantify the intracellular signaling pathway.
- The behavioral response. When confined to a 2D environment, as reported in [39], E. gracilis exhibits a run-and-tumble movement pattern. Tumbling is defined as a decrease in translational velocity or a concurrent increase in angular velocity [6,39]. The behavioral response can be quantified by the tumbling fraction, which displays complex patterns in response to changes in light intensity.Details of the experimental setup and measurement method are in Appendix B in S1 Text.
Adaptive photo-responses have been observed in other algae species, such as Chlamydomonas reinhardtii [34,65] and Volvox carteri [21]. These species exhibit a similar response pattern: an initial rapid increase in the signal, followed by a slower recovery to the resting state. Models of the form given by Eqs (1) and (2) with J = 2 have been proposed in the literature to describe the photo-responses of Chlamydomonas reinhardtii [34] and Volvox carteri [21]. However, the light-response mechanisms of E. gracilis remain poorly understood. One can hypothesize that its photoresponse model shares a similar underlying structure.
Implementation
Experimental data typically contain noise, which can be modeled by adding a white noise 𝜖 of Gaussian type with zero mean and standard deviation . The measured data points yi are given by
Noisy data cannot be accurately differentiated and requires prior smoothing. One can smooth the data sequence using a spline method to get the reference solutions. We apply the cubic smoothing spline method to noisy data points, and the reference is the minimizer of
Here, the first term measures the error and the second term controls the roughness. denotes the second order derivative of the function
. One can adjust the smoothing parameter p to regulate the smoothness of the reference solution. For the discontinuous responses, such as those under step stimuli, the smoothing method will be used to smooth the noisy data for each segment of constant stimulation, rather than smoothing the entire data sequence. We use the “csaps” function in MATLAB to achieve the process. Following the analysis provided in [66], the discrepancy between the smoothed value
and the theoretical value fi is negligible. In our subsequent part, we adopt the smoothed data as the reference and denote points in this data set by fi.
Based on Eqs (7) and (14), deep neural networks ,
,
and
can be constructed and trained to approximate the corresponding functions G1(f, s), G2(f, s),
and
.
In the SIVM, we train the function based on the function values of the discrete-time points ti (). The stimulus and responses at these discrete time points are si and fi,
is the approximation of
. At all discrete-time ti,
and
should satisfy Eq (7). Therefore, the SIVM loss is defined as
Similarly, given , the DIVM loss based on Eq (14) is
Denote the time step by , one can approximate
,
in (16) and (17) by
and
by
It should be emphasized that, when constructing the losses, the idea is similar to that of physics-informed neural networks (PINN) [67–69], where physical laws expressed by differential equations and the chain rule are employed. The SIVM/DIVM loss resembles the PDE loss in PINN, but the and
are not derived from the automatic differentiation, and the time variable t does not appear explicitly in the neural network. Moreover, the time variable is only used to provide the approximations in (18) and (19). As far as one can get the time derivatives, there is no need to use equally spaced discrete time points, which differs from RNNs [40]. Therefore, during training, one can use data from different time series and cut the series freely.
The data are divided into three parts: 70% for training, 10-15% for validation, and the remainder for testing. Owing to the inherent noise in the experimental data of f, even after smoothing, attempting to approximate can lead to the amplification of error. This may lead to overfitting during the training, thereby affecting the prediction accuracy of f. To avoid overfitting, during training the functions
,
,
and
by the training data, one has to test the results on the validation set. The obtained functions are validated by the function values of f in the validation set. More precisely, given the initial values of f in the validation set, such that
one can iteratively obtain by the following equations:
Then the long-term responses can be iteratively given by
The mean squared errors are employed to evaluate the accuracy of the validation set, whose definition is
We terminate the training process until stops decreasing significantly. An illustrative schematic is shown in Fig 1.
The input dataset for SIVM and DIVM are respectively discrete pairs and quadruplets
(
). The neural networks
and
are trained through SIVM loss
, while
and
are constrained by DIVM loss
. During the training of neural networks, we assess the performance of the validation set through
between
and fi to avoid overfitting.
Upon completion of the training process, we use the same methodology in Eq (21) to predict the function values of on the test data set. The prediction accuracy is quantified using the relative mean squared error:
Results
Parameters of DNN are trained upon the losses defined by Eq (16) or Eq (17). The DNN architecture comprises feed-forward neural networks with 4 hidden layers and 20 neurons in each hidden layer. We use the tanh activation function and choose the Glorot uniform method for the weight initialization. For the training, we use an Adam optimizer with default hyperparameters and a learning rate of 10−3. in (22) is outputted every 50 or 100 training epochs during the training process.
Performance on the E. coli tumbling fraction
We first study the performance of our algorithms for the E. coli models when adding noise to the data in the training set. The training data are generated by using the following two different types of s(t):
- Piecewise constant (PWC) functions. s(t) across the temporal domain
is a piecewise constant function:
- where the time domain is randomly divided into
intervals
- and
- ck is a constant randomly selected between
.
- Linear combination of cosine (LCC) functions. The linear combination of cosine functions is defined as:
- where ap and bp are randomly chosen from a uniform distribution of the intervals
and
, respectively.
An illustration of the two types of stimulus is presented in Fig G(A, C) in S1 Text.
We generate tumbling responses fi using a forward Euler discretization. To represent experimental noise, we introduce Gaussian noise with a mean of zero and a standard deviation
through a given signal-noise ratio (
) which can be calculated by
For the j-th numerical solution with data points, the power
of the original signal can be achieved by calculating the mean square value of the original signal:
. We set
to a range of 20 to 22, which aligns with the measured E. gracilis experimental data, which we will discuss later.
Here, we use stimulus-response data obtained by simulating Model I-III and then adding some noise. The particular choices of ,
,
,
,
, and
relate to the time scale of the behavioral response. Due to the related experimental phenomena, different E. coli models have different response time scales, which play a role in the design of the data sets. That is, the stimuli in the test set can have different forms, but their magnitudes and rates of change have to be in a similar range to those in the training set. Detailed settings for each model’s training, validation, and test sets are in Appendix D in S1 Text.
It should be noted that, during training and testing, we only use data with and
that are finite. More precisely, we exclude those points near the jump discontinuities of s(t) and
. We train the NNs for each model using the SIVM and DIVM losses separately. After training, we determine that the NNs are not overfitted by monitoring the error decrease Eq (22) on the validation set.
We then enrich the testing set with smooth stimuli that do not belong to the two types of basic stimuli. Due to the fold-change detection of E. coli cells [70], in [3,48,53], exponential sine wave signal is used to investigate the cells’ response. We test our scheme’s performance using a generalization of the exponential sine signal and consider the following more complex stimuli:
- Exponential functions with exponents being a linear combination of cosines (ELCC):
- where aq, bq and cq are chosen randomly in a interval.
For testing, we compare the predicted dynamics inferred by both algorithms based on Eq (21), with the reference solution. The results are shown in Fig 2 and very good agreement can be observed. For ELCC signals used in Fig 2(A)–2(C), the range of aq, bq is consistent with that used in the LCC functions. cq is employed to adjust the magnitude of s(t) to satisfy
with
being the maximum (minimum) of the stimuli in the training set.
(A) Model I; (B) Model II; (C) Model III. Top row: the simplified topological relationships among external stimuli, internal variables, and tumbling fractions in the three different E. coli models. denotes that x influences y, which can be both excitation or inhibition. Second row: The outside signal s(t) of the form ELCC. The third row: the predictive performance of SIVM. The fourth row: the predictive performance of DIVM. Here, the response data (gray circles) are obtained by adding Gaussian noise to the numerical solutions of the corresponding models with
following the uniform distribution U(20,22). These noisy data, after smoothing, serve as the reference values (black solid line).
Model I is a model with a single internal variable. As shown in Fig 2(A), the predicted values obtained from both algorithms are very close to the reference values, indicating that both SIVM and DIVM can effectively handle the situation with a single internal variable. Model II has two internal variables, and there is a significant reaction time scale difference between the two variables. The results in Fig 2(B) demonstrate that DIVM performs exceptionally well in predicting the test data. At the same time, SIVM, although slightly less accurate, still provides relatively accurate predicted values. Unlike Model II, Model III considers two internal variables with no significant difference in reaction speeds. Fig 2(C) shows that the performance of SIVM is significantly inferior to that of DIVM. These results suggest that DIVM can accurately give good long-term predictions for signal patterns that do not appear in the training set, for single and dual internal variable models. On the other hand, SIVM works for data generated with a single internal variable model and sometimes dual internal variables as well.
The performance of SIVM and DIVM for different datasets offers a way to infer the required number of internal variables to model dynamics that produce specific response signals. By comparing the performance of SIVM and DIVM on datasets from Model I and Model III, we can deduce that if both algorithms perform well, the dataset can be modeled by the response of a single internal variable. Conversely, if only DIVM performs well, it suggests that the internal biochemical reactions corresponding to the dataset need to be described by two ODEs, as shown in Fig 2(C).
We further investigate our algorithms’ robustness to complex stimuli with stochastic fluctuations. Specifically, the noisy stimuli are given by where
represent noiseless PWL, LCC, and ELCC signals. We generate tumbling responses fi by simulating Model I-III based on these noisy stimuli (as shown in Fig B in S1 Text). The cubic smoothing spline method is applied to smooth the data sequences s(t) and f(t), from which the derivatives
,
,
, and
are approximated. During both training and testing, we utilize noisy stimulus-response data
. The performance of both algorithms in predicting response data using noisy stimuli is similar to that using smooth stimuli, indicating that our algorithms are robust to complex stimuli with stochastic fluctuations. Although noise levels can degrade model performance (see Table E in S1 Text), our algorithms guarantee stable long-term predictions of f(t) without significant noise-induced fluctuations.
Performance on the measured E. gracilis tumbling fraction
Next, we extend the application of our algorithms to experimental data of Euglena gracilis cells. E. gracilis move with similar run-and-tumble patterns with E. coli, as illustrated in Fig 3(A), while the biological pathways for their photo-responsive movement are not yet fully understood. During “runs”, the cell moves in a near-ballistic manner, whereas in “tumbles”, it spins before selecting a new direction for the subsequent “run”. Then, the fraction of tumbling particles during every second can be quantified (shown in Fig 3(B)–3(E)) under varied light-intensity stimuli s(t) (s(t) is the light intensity whose unit is ).
(A) Typical cell trajectory with uniform light intensity, where the red dots highlight tumble events. (B-C) Measured tumbling fraction with PWC and PWCL stimuli. (D-E) Performance of SIVM and DIVM for different LCC stimuli (yellow dotted line). Gray circles are the experimental data. (D) is for
and (E) is for
. Top row: the LCC stimuli; middle row: predictive performance of SIVM; bottom row: predictive performance of DIVM.
In the experiment, we have measured the corresponding tumbling fraction of E. gracilis for different s(t). Three different types of signal s(t) are used, PWC, LCC and
- Piecewise constant and linear (PWCL) functions. s(t) is a combination of piecewise constant and linear functions:
- where the time domain is randomly divided into
intervals
For, the linear functions are defined as
- where
The values ofencompass the gradient information of s(t). Therefore, the ranges of
in the training and test sets affect the precision. An illustration of this stimulus type is presented in Fig G(B) in S1 Text.
Given that the size of E. gracilis is significantly larger than that of E. coli, its movement is considerably slower. Consequently, it is crucial to control the variation rate of s(t), which can be quantified by , in the designed light signals. Two different datasets are used to test the performance of our proposed method, one with slow varying s(t) while the other includes both slow and relatively faster varying s(t). We use the absolute maximum values of the stimuli, denoted as
, to characterize these two different datasets. The stimulus types and
values used in the two training sets are shown in Table 1. The smoothing spline method is applied to the raw data to obtain the smoothed reference values. Subsequently, SIVM and DIVM are employed to identify the governing equation of the run-and-tumble dynamics through the data set composed of external stimuli s and responses f. The details for each experiment’s training and test sets are shown in Appendix D in S1 Text.
The predicted tumbling fractions under LCC signals are shown in Fig 3(D-E). Using slowly varying s(t), the prediction errors of both SIVM and DIVM are below , with DIVM outperforming SIVM. This suggests that when
, the governing equation can be obtained using SIVM, and a single ODE can describe the internal dynamics. However, when s(t) changes faster, the prediction error of SIVM is much higher than the acceptable 10% threshold, and SIVM fails to provide an accurate governing equation. After retraining with DIVM, the accuracy of prediction is significantly improved, as shown in Table 1, indicating that the internal dynamics of fast-changing environments with
require two ODEs for an accurate description.
Comparisons between the two datasets reveal that E. gracilis exhibits distinct photo-response patterns depending on the rate of change of external signals. In slowly changing environments, a single internal variable suffices to model the response, whereas in rapidly changing environments, two internal variables with clearly distinct reaction timescales are required. According to the biological signal transduction mechanisms of E. gracilis explored in [64], one can hypothesize that the fast internal variable may correlate with the rapid cAMP-mediated phosphorylation of flagellar proteins. The underlying mechanism for slow-scale responses remains unclear.
The good performance of SIVM in slowly varying environments is similar to the case of Model II in the test for E. coli models. Therefore, in the next subsection, we will generate a series of data sets with different based on Model II to further investigate the conditions under which SIVM or DIVM can be applied.
Relation between SIVM and DIVM and criteria for their selection
Notably, Model II includes two internal variables, but SIVM can predict the behavioral response. Further tests indicate that when remains below a certain threshold, SIVM can yield good predictions, but when
increases, the predictive efficacy of SIVM degrades. This can be understood by observing that the reaction rates of the two internal variables are significantly different in Model II. When s(t) changes slowly, one of the two internal variables is always in a quasi-equilibrium state. Hence, the model degenerates to a one-internal-variable model, and SIVM can predict the response. This observation raises a natural question: Is it possible to identify the threshold Sg, when
, SIVM can provide a good prediction, while when
, one has to use DIVM. In this subsection, we will explore this issue.
The threshold Sg can be derived using analytical approaches. Without loss of generality, we assume the variable n in Eq (8) is the fast variable and is the quasi-equilibrium state that satisfies
. The applicability of SIVM implies that
and one internal variable m suffices. Conversely, if n(t) deviates from
, we consider a small perturbation
around
such that
When the deviation satisfies
(where
is a specified small threshold), the theoretical expression for the threshold Sg is given by:
Details of the derivation are provided in Appendix C of S1 Text. For Model II, the threshold Sg can be computed by setting, for example, ,
and corresponding
near the reference methylation level, reducing the above expression to:
where s represents the characteristic timescale of Yp and details of all other parameters can be found in Appendix A of S1 Text. Thus, the threshold Sg is closely tied to the biological parameters of the underlying model, provided that the explicit formulations are known.
Alternatively, the threshold can be estimated numerically by testing the algorithm’s performance across varying rates of change in s(t) (denoted as ). We conduct a systematic training and test process with datasets that vary in
. More precisely, for given external signals s(t) belonging to LCC or ELCC, if the total time of a continuous segment is T, s(t) is progressively compressed by
with T1 < T, as shown in Fig 4(A). As T1 decreases,
increases. In Fig 4(B), one can observe that SIVM fails to give good predictions effectively, based on the data produced by Model II when
becomes large.
External stimuli are compressed in time, as shown in (A). We gradually compress the T = 200 s stimulus signal to T1 = {180,160,140,120,100} s. The absolute maximum value of the gradient is correlated with SIVM and DIVM test errors in (B). The error points from left to right correspond to T1 = {200,180,160,140,120,100} s respectively. (C) illustrates the SIVM (blue dashed line) and DIVM (red dashed line) prediction outcomes of three experiments with different T1. Here, ELCC stimuli are used in the test set; the subplots from left to right correspond to the three error points for T1 = 140 s, T1 = 120 s, and T1 = 100 s in (B).
We correlate SIVM and DIVM test errors with the absolute maximum values of of different data sets, as shown in Fig 4(B). SIVM test errors increase as
increases, whereas DIVM exhibits low test errors across all data sets. When
exceeds a certain threshold, the testing error increases significantly. Based on this, we numerically establish a threshold for
; exceeding this threshold suggests that SIVM is no longer suitable. By requiring that the relative errors in (23) of the test set predictions should be below 5%, we find this threshold is approximately
0.5
(For E. coli, the outside signal is the chemical concentration, whose unit is
.) as shown in Fig 4(B), which is aligns with the theoretical gradient threshold
0.59
. Fig 4(C) displays the SIVM predictions of three different T1. The smaller the T1 is, the larger
is.
for all three different T1s in Fig 4(C) are near the threshold. It can be seen that the long-term predictions become worse as T1 decreases.
For the experimental data of E. gracilis, as illustrated in Fig 3(D)–3(E) and Table 1, SIVM can model the photo-responsive movement of E. gracilis when
. On the other hand, DIVM can achieve excellent predictive results when
becomes larger. Based on the discussions of the degeneration condition of DIVM, this implies that the pathways of E. gracilis photo-response can be modeled by one internal variable when
, while when
reaches 10−2
, one has to use two internal variables to get the correct behavior. One may set
to be
, but to get a better estimate of Sg, more carefully designed experimental data are required.
One can further study the extrapolation capability of the algorithms, specifically the prediction accuracy when the value range and gradient range of test-set stimuli exceed those of the training set. We generate response data using Model II under modified stimulus signals and construct noiseless training and test datasets. Two different categories of test sets are constructed, which are associated with a relative maximum deviation of the absolute stimulus value Rs or a relative maximum deviation of the absolute stimulus gradient from the training set. The detailed method for constructing the two different categories of test sets is provided in Appendix F in S1 Text.
After training the neural networks using both SIVM and DIVM, we apply them to the two different categories of test sets described above. SIVM and DIVM test errors are correlated with the values of Rs and of different data sets, as shown in Fig C(B) and Fig D(B) in S1 Text. As the test data’s deviation from the training range increases (both in value and the gradient of stimuli), the test errors of both algorithms grow. When Rs = 0.49 (
), SIVM’s (DIVM’s) error exceeds 10%. Regarding extrapolation capability, our algorithms maintain accurate predictions even when the gradient range expansion extends significantly beyond the training set. DIVM consistently outperforms SIVM in handling value range expansion.
Design principles for stimuli
Most studies of signal processing and behavioral response are based on measuring responses to simple controllable stimuli. Recent biological experiments have measured behavioral response under varied external stimuli, including step functions (E. coli [71], algae [21,34,65]), ramp [65], exponential ramp [53], sine-wave [34,65,72], and exponentiated sine-wave stimuli with low/high frequency [53]. Inspired by these experiments, we design the above three types of stimulus signals s(t) for training dataset generation.
We further investigate the three types of stimulus signals used in the experiments to explore the impact of different signal combinations on NN’s training and predictive accuracy. The first type is the PWC, which is widely used to study the response behavior of unicellular organisms to sudden signal changes [46,47,49,51]. This signal has a zero first derivative almost everywhere except at the jump points. The second type is called the PWCL, where piecewise constant functions are connected by linear segments whose slopes may vary in different transition intervals. This signal has a zero-second derivative almost everywhere except at the turning points. The third type is the LCC, one of the classic smooth signals. The exact mathematical definitions for these three types of signals have been provided in the previous subsections.
For PWC signal, since at almost everywhere ,
,
and
are untrained. While for PWCL signal, at almost everywhere
,
is untrained.
Our algorithms are applied to the E. coli chemotaxis Model I, II, and III. For the three different types of signals PWC, PWCL, and LCC, the training set comprises one, two, or three of them, as shown in Table 2. stimulus-response pairs are used, with 70% allocated to the training set and the rest for testing.
For the SIVM, it is evident that when the training dataset contains only PWC signals, the neural network cannot be effectively trained. After training the neural network with datasets composed of different signal combinations, while keeping the number of data points and iterations the same, we evaluated the neural network’s performance on the same test set. The results show that using a combination of PWC and LCC signals as the training set achieves the best outcome while using only LCC signals as the training set results in the largest errors.
Similarly, within the DIVM framework, we conduct experiments using Models II and III. From Table 2, LCC signals are necessary for the training set; without them, the neural network cannot be trained and its predictive performance on the test set is very poor. The first and second derivatives of LCC signals offer abundant information, significantly enhancing the training efficacy of the networks. Among the three different signal combinations, when the training data size is the same, combining LCC signals with either PWC or PWCL signals significantly improves predictive accuracy, which is much better than using LCC signals alone. Utilizing all three types in the training set can improve predictive accuracy, but the enhancement is not significant, as shown in the last row of Table 2.
Discussion
Microorganisms adjust their motion behavior in response to external signal stimuli, thereby better adapting to environmental conditions. Traditionally, these dynamic changes are described by a system of ODEs for intracellular pathways and a nonlinear response function. However, measuring the temporal dynamics of internal states is challenging, which complicates model construction.
Hence, there is a need to construct models that directly relate external signal stimuli to behavioral responses, without explicit information of the intracellular signaling pathways. This study introduces a novel neural-network framework based on external stimulus-response data. The approach relies on applying the chain rule to eliminate internal variables that simulate intracellular reactions, thereby establishing functional relationships (7) and (14) between the time derivative of stimulus intensity and tumbling behavior. These relationships are then defined as the losses (16) and (17) for training the neural networks.
We have applied this algorithm to various E. coli chemotaxis models and experimental data of E. gracilis, validating its effectiveness in predicting motion responses under complex environmental conditions. Moreover, the algorithm can infer the potential structure of the response pathways, as has been realized using E. gracilis experimental data.
The framework introduced here does not require measurements of intracellular protein levels or knowledge of the specific forms of equations governing intracellular chemical reactions. Instead, it infers the potential structure of response pathways directly from movement data. We employ neural networks to learn representations, enabling the algorithm to generalize to other systems with unknown latent variables. Thus, we claim that this algorithm is a general-purpose, easy-to-implement, and accurate simulator for identifying the governing equations of run-and-tumble dynamics.
The current work is a first step in adapting neural network framework to identify governing equations from stimulus-response data. The two cell types in this study exhibit a similar run-and-tumble movement pattern, thus, the behavioral response data comprise tumbling fraction changes under different stimuli. Given that our model is derived in a general form, it should, in principle, be applicable to any input-output datasets. Therefore, future work could explore extending this approach to other types of response data to assess its broader applicability.
We focus exclusively on the two cases J = 1 and J = 2 in this work. Nevertheless, extending the method to systems with larger J requires information about higher-order derivatives of the stimulus and response data, which can be challenging when dealing with noisy experimental data. Moreover, since the time derivative is used in the model, only continuous-in-time data are considered here (but the time sequence can be short); extending the method to discontinuous stimulus and response data is necessary. Notably, several neural network architectures, such as RNNs and Neural CDEs, remain viable for processing long time series and performing prediction tasks. A promising future direction involves integrating our current algorithm with other neural network approaches, which may yield novel algorithms with biological interpretability, noise resistance, stable long-term prediction capabilities, and enhanced applicability to more complex systems.
Supporting information
S1 Text. The supplementary document provides Appendix A–F and three supplementary figures for the main text.
https://doi.org/10.1371/journal.pcbi.1013287.s001
(PDF)
References
- 1. Berg HC, Brown DA. Chemotaxis in Escherichia coli analysed by three-dimensional tracking. Nature. 1972;239(5374):500–4. pmid:4563019
- 2. Block SM, Segall JE, Berg HC. Impulse responses in bacterial chemotaxis. Cell. 1982;31(1):215–26. pmid:6760985
- 3. Shimizu TS, Tu Y, Berg HC. A modular gradient-sensing network for chemotaxis in Escherichia coli revealed by responses to time-varying stimuli. Mol Syst Biol. 2010;6:382. pmid:20571531
- 4. Min TL, Mears PJ, Golding I, Chemla YR. Chemotactic adaptation kinetics of individual Escherichia coli cells. Proc Natl Acad Sci U S A. 2012;109(25):9869–74. pmid:22679285
- 5. Saragosti J, Calvez V, Bournaveas N, Perthame B, Buguin A, Silberzan P. Directional persistence of chemotactic bacteria in a traveling concentration wave. Proc Natl Acad Sci U S A. 2011;108(39):16235–40. pmid:21918111
- 6. Masson J-B, Voisinne G, Wong-Ng J, Celani A, Vergassola M. Noninvasive inference of the molecular chemotactic response using bacterial trajectories. Proc Natl Acad Sci U S A. 2012;109(5):1802–7. pmid:22307649
- 7. Brumley DR, Carrara F, Hein AM, Yawata Y, Levin SA, Stocker R. Bacteria push the limits of chemotactic precision to navigate dynamic chemical gradients. Proc Natl Acad Sci U S A. 2019;116(22):10792–7. pmid:31097577
- 8. Friedrich BM, Jülicher F. Chemotaxis of sperm cells. Proc Natl Acad Sci U S A. 2007;104(33):13256–61. pmid:17686980
- 9. Swaney KF, Huang C-H, Devreotes PN. Eukaryotic chemotaxis: a network of signaling pathways controls motility, directional sensing, and polarity. Annu Rev Biophys. 2010;39:265–89. pmid:20192768
- 10. Alvarez L, Dai L, Friedrich BM, Kashikar ND, Gregor I, Pascal R, et al. The rate of change in Ca(2+) concentration controls sperm chemotaxis. J Cell Biol. 2012;196(5):653–63. pmid:22371558
- 11. Alvarez L, Friedrich BM, Gompper G, Kaupp UB. The computational sperm cell. Trends Cell Biol. 2014;24(3):198–207. pmid:24342435
- 12. Jikeli JF, Alvarez L, Friedrich BM, Wilson LG, Pascal R, Colin R, et al. Sperm navigation along helical paths in 3D chemoattractant landscapes. Nat Commun. 2015;6:7985. pmid:26278469
- 13. Zaferani M, Abbaspourrad A. Biphasic chemokinesis of mammalian sperm. Phys Rev Lett. 2023;130(24):248401. pmid:37390449
- 14. Paster E, Ryu WS. The thermal impulse response of Escherichia coli. Proc Natl Acad Sci U S A. 2008;105(14):5373–7. pmid:18385380
- 15. Zaferani M, Cheong SH, Abbaspourrad A. Rheotaxis-based separation of sperm with progressive motility using a microfluidic corral system. Proc Natl Acad Sci U S A. 2018;115(33):8272–7. pmid:30061393
- 16. Jing G, Zöttl A, Clément É, Lindner A. Chirality-induced bacterial rheotaxis in bulk shear flows. Sci Adv. 2020;6(28):eabb2012. pmid:32695880
- 17. Ohmura T, Nishigami Y, Taniguchi A, Nonaka S, Ishikawa T, Ichikawa M. Near-wall rheotaxis of the ciliate Tetrahymena induced by the kinesthetic sensing of cilia. Sci Adv. 2021;7(43):eabi5878. pmid:34669467
- 18. Hill NA, Plumpton LA. Control strategies for the polarotactic orientation of the microorganism Euglena gracilis. J Theor Biol. 2000;203(4):357–65. pmid:10736213
- 19. Kuznicki L, Mikołajczyk E, Walne PL, Hildebrand E Dr rern. Photobehavior of euglenoid flagellates: theoretical and evolutionary perspectives. Critic Rev Plant Sci. 1990;9(4):343–69.
- 20. Jékely G, Colombelli J, Hausen H, Guy K, Stelzer E, Nédélec F, et al. Mechanism of phototaxis in marine zooplankton. Nature. 2008;456(7220):395–9. pmid:19020621
- 21. Drescher K, Goldstein RE, Tuval I. Fidelity of adaptive phototaxis. Proc Natl Acad Sci U S A. 2010;107(25):11171–6. pmid:20534560
- 22. Giometto A, Altermatt F, Maritan A, Stocker R, Rinaldo A. Generalized receptor law governs phototaxis in the phytoplankton Euglena gracilis. Proc Natl Acad Sci U S A. 2015;112(22):7045–50. pmid:25964338
- 23. Hader DP. Polarotaxis, gravitaxis and vertical phototaxis in the green flagellate, Euglena gracilis. Arch Microbiol. 1987;147(2):179–83. pmid:11536573
- 24. Hader D-P, Colombetti G, Lenci F, Quaglia M. Phototaxis in the flagellates, Euglena gracilis and Ochromonas danica. Arch Microbiol. 1981;130(1):78–82.
- 25.
Hader DP, Iseki M. Photomovement in Euglena. In: Schwartzbach SD, Shigeoka S, editors. Euglena: biochemistry, cell and molecular biology. Cham: Springer; 2017. p. 207–35.
- 26. Hader D-P. Simulation of phototaxis in the flagellateEuglena gracilis. J Biol Phys. 1993;19(2):95–108.
- 27. Ozasa K, Won J, Song S, Tamaki S, Ishikawa T, Maeda M. Temporal change of photophobic step-up responses of Euglena gracilis investigated through motion analysis. PLoS One. 2017;12(2):e0172813. pmid:28234984
- 28. Arrieta J, Barreira A, Chioccioli M, Polin M, Tuval I. Phototaxis beyond turning: persistent accumulation and response acclimation of the microalga Chlamydomonas reinhardtii. Sci Rep. 2017;7(1):3447. pmid:28615673
- 29. Tsang ACH, Lam AT, Riedel-Kruse IH. Polygonal motion and adaptable phototaxis via flagellar beat switching in the microswimmer Euglena gracilis. Nature Phys. 2018;14(12):1216–22.
- 30. Perlova T, Gruebele M, Chemla YR. Blue light is a universal signal for Escherichia coli chemoreceptors. J Bacteriol. 2019;201(11):e00762-18. pmid:30858302
- 31. de Maleprade H, Moisy F, Ishikawa T, Goldstein RE. Motility and phototaxis of Gonium, the simplest differentiated colonial alga. Phys Rev E. 2020;101(2–1):022416. pmid:32168596
- 32. Yang S, Huang M, Zhao Y, Zhang HP. Controlling cell motion and microscale flow with polarized light fields. Phys Rev Lett. 2021;126(5):058001. pmid:33605769
- 33. Ramamonjy A, Dervaux J, Brunet P. Nonlinear phototaxis and instabilities in suspensions of light-seeking algae. Phys Rev Lett. 2022;128(25):258101. pmid:35802423
- 34. Leptos KC, Chioccioli M, Furlan S, Pesci AI, Goldstein RE. Phototaxis of Chlamydomonas arises from a tuned adaptive photoresponse shared with multicellular Volvocine green algae. Phys Rev E. 2023;107(1–1):014404. pmid:36797913
- 35. Si G, Wu T, Ouyang Q, Tu Y. Pathway-based mean-field model for Escherichia coli chemotaxis. Phys Rev Lett. 2012;109(4):048101. pmid:23006109
- 36. Xue C. Macroscopic equations for bacterial chemotaxis: integration of detailed biochemistry of cell signaling. J Math Biol. 2015;70(1–2):1–44. pmid:24366373
- 37. Jiang L, Ouyang Q, Tu Y. Quantitative modeling of Escherichia coli chemotactic motion in environments varying in space and time. PLoS Comput Biol. 2010;6(4):e1000735. pmid:20386737
- 38. Tsang ACH, Riedel-Kruse IH. Light-dependent switching between two flagellar beating states selects versatile phototaxis strategies in microswimmers. Proc Natl Acad Sci U S A. 2024;121(47):e2408082121. pmid:39536086
- 39. Li Y, Zhao Y, Yang S, Tang M, Zhang HP. Biased Lévy walk enables light gradient sensing in euglena gracilis. Phys Rev Lett. 2025;134(10):108301. pmid:40153620
- 40. Che Z, Purushotham S, Cho K, Sontag D, Liu Y. Recurrent neural networks for multivariate time series with missing values. Sci Rep. 2018;8(1):6085. pmid:29666385
- 41.
Kidger P, Morrill J, Foster J, Lyons T. Neural Controlled Differential Equations for Irregular Time Series. 2020. https://arxiv.org/abs/2005.08926
- 42. Clark DA, Grant LC. The bacterial chemotactic response reflects a compromise between transient and steady-state behavior. Proc Natl Acad Sci U S A. 2005;102(26):9150–5. pmid:15967993
- 43. Kafri Y, da Silveira RA. Steady-state chemotaxis in Escherichia coli. Phys Rev Lett. 2008;100(23):238101. pmid:18643546
- 44. Xue X, Xue C, Tang M. The role of intracellular signaling in the stripe formation in engineered Escherichia coli populations. PLoS Comput Biol. 2018;14(6):e1006178. pmid:29864126
- 45. Goldbeter A, Koshland DE Jr. Simple molecular model for sensing and adaptation based on receptor modification with application to bacterial chemotaxis. J Mol Biol. 1982;161(3):395–416. pmid:7154085
- 46. Segel LA, Goldbeter A, Devreotes PN, Knox BE. A mechanism for exact sensory adaptation based on receptor modification. J Theor Biol. 1986;120(2):151–79. pmid:3784578
- 47. Barkai N, Leibler S. Robustness in simple biochemical networks. Nature. 1997;387(6636):913–7. pmid:9202124
- 48. Block SM, Segall JE, Berg HC. Adaptation kinetics in bacterial chemotaxis. J Bacteriol. 1983;154(1):312–23. pmid:6339475
- 49. Bray D, Bourret RB, Simon MI. Computer simulation of the phosphorylation cascade controlling bacterial chemotaxis. Mol Biol Cell. 1993;4(5):469–82. pmid:8334303
- 50. Bray D, Bourret RB. Computer analysis of the binding reactions leading to a transmembrane receptor-linked multiprotein complex involved in bacterial chemotaxis. Mol Biol Cell. 1995;6(10):1367–80. pmid:8573792
- 51. Hauri DC, Ross J. A model of excitation and adaptation in bacterial chemotaxis. Biophys J. 1995;68(2):708–22. pmid:7696522
- 52. Levin MD, Morton-Firth CJ, Abouhamad WN, Bourret RB, Bray D. Origins of individual swimming behavior in bacteria. Biophys J. 1998;74(1):175–81. pmid:9449320
- 53. Tu Y, Shimizu TS, Berg HC. Modeling the chemotactic response of Escherichia coli to time-varying stimuli. Proc Natl Acad Sci U S A. 2008;105(39):14855–60. pmid:18812513
- 54. Clausznitzer D, Oleksiuk O, Løvdok L, Sourjik V, Endres RG. Chemotactic response and adaptation dynamics in Escherichia coli. PLoS Comput Biol. 2010;6(5):e1000784. pmid:20502674
- 55. Endres RG, Wingreen NS. Precise adaptation in bacterial chemotaxis through “assistance neighborhoods”. Proc Natl Acad Sci U S A. 2006;103(35):13040–4. pmid:16924119
- 56. Meir Y, Jakovljevic V, Oleksiuk O, Sourjik V, Wingreen NS. Precision and kinetics of adaptation in bacterial chemotaxis. Biophys J. 2010;99(9):2766–74. pmid:21044573
- 57. Mello BA, Tu Y. Quantitative modeling of sensitivity in bacterial chemotaxis: the role of coupling among different chemoreceptor species. Proc Natl Acad Sci U S A. 2003;100(14):8223–8. pmid:12826616
- 58. Xin X, Othmer HG. A “trimer of dimers”-based model for the chemotactic signal transduction network in bacterial chemotaxis. Bull Math Biol. 2012;74(10):2339–82. pmid:22864951
- 59. Othmer HG, Xin X, Xue C. Excitation and adaptation in bacteria-a model signal transduction system that controls taxis and spatial pattern formation. Int J Mol Sci. 2013;14(5):9205–48. pmid:23624608
- 60. Diehn B. Phototaxis and sensory transduction in Euglena. Science. 1973;181(4104):1009–15. pmid:4199225
- 61. Ozasa K, Lee J, Song S, Maeda M. Transient freezing behavior in photophobic responses of Euglena gracilis investigated in a microfluidic device. Plant Cell Physiol. 2014;55(10):1704–12. pmid:25074906
- 62. Ogawa T, Shoji E, Suematsu NJ, Nishimori H, Izumi S, Awazu A, et al. The flux of euglena gracilis cells depends on the gradient of light intensity. PLoS One. 2016;11(12):e0168114. pmid:28033336
- 63. Muku K, Yamashita H, Kamikubo T, Suematsu NJ, Iima M. Long-time behavior of swimming Euglena gracilis in a heterogenous light environment. Front Cell Dev Biol. 2023;11:1133028. pmid:36891510
- 64. Iseki M, Matsunaga S, Murakami A, Ohno K, Shiga K, Yoshida K, et al. A blue-light-activated adenylyl cyclase mediates photoavoidance in Euglena gracilis. Nature. 2002;415(6875):1047–51. pmid:11875575
- 65. Yoshimura K, Kamiya R. The sensitivity of chlamydomonas photoreceptor is optimized for the frequency of cell body rotation. Plant Cell Physiol. 2001;42(6):665–72. pmid:11427687
- 66. Ragozin DL. Error bounds for derivative estimates based on spline smoothing of exact or noisy data. J Approx Theory. 1983;37(4):335–55.
- 67. Raissi M, Perdikaris P, Karniadakis GE. Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J Comput Phys. 2019;378:686–707.
- 68. Yazdani A, Lu L, Raissi M, Karniadakis GE. Systems biology informed deep learning for inferring parameters and hidden dynamics. PLoS Comput Biol. 2020;16(11):e1007575. pmid:33206658
- 69. Lu L, Meng X, Mao Z, Karniadakis GE. DeepXDE: a deep learning library for solving differential equations. SIAM Rev. 2021;63(1):208–28.
- 70. Lazova MD, Ahmed T, Bellomo D, Stocker R, Shimizu TS. Response rescaling in bacterial chemotaxis. Proc Natl Acad Sci U S A. 2011;108(33):13870–5. pmid:21808031
- 71. Colin R, Rosazza C, Vaknin A, Sourjik V. Multiple sources of slow activity fluctuations in a bacterial chemosensory network. Elife. 2017;6:e26796. pmid:29231168
- 72. Josef K, Saranak J, Foster KW. Linear systems analysis of the ciliary steering behavior associated with negative-phototaxis in Chlamydomonas reinhardtii. Cell Motil Cytoskeleton. 2006;63(12):758–77. pmid:16986140