Skip to main content
Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Petri net–based model of the human DNA base excision repair pathway

  • Marcin Radom ,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Writing – original draft, Writing – review & editing

    ‡ These authors are joint first authors on this work.

    Affiliations Institute of Computing Science, Poznan University of Technology, Poznań, Poland, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznań, Poland

  • Magdalena A. Machnicka ,

    Roles Conceptualization, Investigation, Methodology, Validation, Writing – original draft, Writing – review & editing

    ‡ These authors are joint first authors on this work.

    Affiliations Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland, Institute of Informatics, University of Warsaw, Warsaw, Poland

  • Joanna Krwawicz,

    Roles Conceptualization, Investigation, Methodology, Validation, Writing – original draft, Writing – review & editing

    Affiliations Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Warsaw, Poland, Laboratory of Structural Biology, International Institute of Molecular and Cell Biology, Warsaw, Poland

  • Janusz M. Bujnicki ,

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Supervision, Validation, Writing – original draft, Writing – review & editing (PF); (JMB)

    Affiliations Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Poznań, Poland

  • Piotr Formanowicz

    Roles Conceptualization, Funding acquisition, Methodology, Supervision, Writing – original draft, Writing – review & editing (PF); (JMB)

    Affiliations Institute of Computing Science, Poznan University of Technology, Poznań, Poland, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznań, Poland


Cellular DNA is daily exposed to several damaging agents causing a plethora of DNA lesions. As a first aid to restore DNA integrity, several enzymes got specialized in damage recognition and lesion removal during the process called base excision repair (BER). A large number of DNA damage types and several different readers of nucleic acids lesions during BER pathway as well as two sub-pathways were considered in the definition of a model using the Petri net framework. The intuitive graphical representation in combination with precise mathematical analysis methods are the strong advantages of the Petri net-based representation of biological processes and make Petri nets a promising approach for modeling and analysis of human BER. The reported results provide new information that will aid efforts to characterize in silico knockouts as well as help to predict the sensitivity of the cell with inactivated repair proteins to different types of DNA damage. The results can also help in identifying the by-passing pathways that may lead to lack of pronounced phenotypes associated with mutations in some of the proteins. This knowledge is very useful when DNA damage-inducing drugs are introduced for cancer therapy, and lack of DNA repair is desirable for tumor cell death.


Living organisms as well as their functional parts such as organs, tissues, cells, etc. are very complex systems. These systems are composed of numerous basic building blocks connected by a dense network of interactions and mutual dependencies. A sum of such networks determines the structure and the functionality of the system. Therefore, in order to fully understand the nature of living organisms, it does not suffice to analyze properties of the building blocks, but it is also necessary to analyze the networks of interactions among them. Thus far, various computational methods for analyzing complex systems have been developed, and some of them can be applied in the analysis of biological systems [1, 2]. However, these systems have their own specificity, from which follows that in some cases it can be difficult to use methods developed earlier for other systems.

The first and necessary step in the analysis of a complex system is to build its formal model. This model can be expressed in a language of some branch of mathematics, and for instance, differential equations are often used for this purpose. Recently, Petri net theory has emerged as a promising approach for modeling and analysis of biological systems [1]. These nets have been described for the first time by C. A. Petri in the early 1960s in the context of theoretical computer science and for many years this was the main area of their applications [3, 4]. Since this time the theory of Petri nets has been intensively studied and many methods of the analysis of their properties have been developed. In the mid 1990s it was realized that Petri nets could also be used for modeling and analysis of biological systems [5, 6] and during the last two decades the possibilities and limitations of Petri net applications in this area were studied [1, 7, 8].

On the one hand, a strong advantage of Petri net-based models is their intuitive graphical representation. On the other hand, the nets can be analyzed using precise mathematical methods. Moreover, contrary to differential equations, they do not require the knowledge about precise values of some parameters of the modeled system, what can be a very important property in many cases since these values are often difficult to determine for biological phenomena.

The integrity of genetic information depends on the interplay between the DNA base modification (caused by enzymatic reactions), DNA damage (caused by cellular and environmental chemical reactions), and all DNA-binding proteins and readers of information that work together to make the whole system functional. The DNA damage repair is an exemplary complex biological system, in which a large number of elements (mostly enzymes that perform chemical reactions) cooperate with each other to protect the cell from consequences of DNA damage events. Base damage can be mutagenic and/or cytotoxic and can lead to mutagenesis or cell death, respectively, when unrepaired DNA lesions undergo replication or transcription. To avoid such consequences, correct repair of different types of DNA damage is assured by the existence of several DNA repair subsystems maintained by the cell in order to survive the DNA damage. In humans these subsystems, called pathways, include DNA damage signaling (DDS), direct reversal repair (DRR), base excision repair (BER), nucleotide excision repair (NER), mismatch repair (MMR), homologous recombination repair (HRR), nonhomologous end-joining (NHEJ) and translesion synthesis (TLS). Details concerning DNA damage and repair in several living organisms were collected and included in the public databases developed by our groups ( [9], [10]).

The DNA repair processes have already been a subject of computational modeling. A broad range of modeling approaches gave opportunities to gain a variety of insights into DNA damage and repair. Some of these models were built based on detailed kinetic data available from experimental studies. Examples include the stochastic model of NHEJ, which uses molecule numbers, reaction rate constants and kinetic rate constants to explain the dynamic response to damage induced by different levels of gamma irradiation in human fibroblasts [11]. Computational modeling was also conducted for some sub-pathways of BER, where e.g., differential equations were used to model the removal of 8-oxoguanine (8-oxoG) via base excision repair initiated by human OGG1 glycosylase [12]. In this case the availability of kinetic data for both wild type enzymes and their mutant variants allowed for the prediction of the kinetics and capacity of the whole sub-pathway in the presence of some single-nucleotide polymorphisms which have been associated with cancer. Other examples of kinetic models for BER include the works of [13] and [14]. Valuable insights into DNA repair processes were also delivered by models which did not require incorporation of very detailed data as in the cases discussed above. For example, Monte Carlo simulations of radiation-induced damage and its repair were used to investigate the repair of clustered damage via BER and NER pathways [15, 16]. These approaches allowed to estimate the probabilities of correct repair, fractions of damages converted into double-strand breaks (DSBs) and fractions of repairs ending with mutations.

In this work, we present an extensive study of a Petri net model of the BER repair pathways, which we have built based on available literature and from biological pathway databases. In contrast to models built by others, our approach involves modeling the repair of many different types of DNA damage through the activity of a collection of DNA glycosylases initiating BER and other enzymes performing further steps of the repair. With such a comprehensive model formal analysis becomes possible using methods provided by the Petri nets theory. Here we have performed a t-invariant based analysis, involving MCT sets and t-clusters, decomposing the model into precisely identified and named functional modules. Understanding their interaction provides valuable insights into the complex DNA repair process. We have also performed an in silico knockout analysis as a prediction of system behavior under a collection of disturbances.

The organization of the paper is as follows. Section 2 provides an overview of Petri nets, including the analytical analysis of t-invariants. In Section 3 the results of the net analysis are presented. The paper ends with discussion and conclusions given in Section 4.


Petri nets

A Petri net is a mathematical object based on a weighted directed bipartite graph. There are two disjoint sets of vertices called places and transitions. Places correspond to passive components of a modeled system (e.g., chemical compounds) while transitions model active components (e.g., chemical reactions or other elementary subprocesses). Places can be connected with transitions and transitions with places by arcs, which describe causal relations between elementary components of the system. The arcs are labeled by weights being positive integer numbers. Places can hold objects called tokens, which represent discrete quantities of the passive components. In a graphical representation of Petri nets places are denoted as circles, transitions as rectangles or bars, arcs as arrows and tokens as dots or positive integer numbers within places. If a weight of an arc is equal to one, usually it is not shown in the graphical representation; otherwise it is represented as a number labeling the arc.

The bipartite graph underlying a Petri net determines its structure (which should correspond to a structure of the modeled system), but most of the fundamental properties of nets of this type follow from their dynamics which is related to tokens. The flow of tokens through the net from one place to another via transitions corresponds to the flow of information, substances etc. through the modeled system. The distribution of tokens over all places, called marking, represents a state of the system.

The flow of tokens is governed by a transition firing rule. According to this rule, a transition is enabled if the number of tokens in each of its pre-places, i.e., the ones which directly precede this transition, is equal to at least the weight of the arc connecting such a place with the transition. An enabled transition may fire, which means that tokens flow from its pre-places to its post-places, i.e., the ones which are its immediate successors. The number of flowing tokens is equal to the weight of the arc. There are two exceptions to this rule, i.e., transitions without pre-places are continuously enabled while transitions without post-places do not produce tokens. These types of transitions can be used to model interactions of the system with its environment. A read arc is a special kind of bidirectional arc, corresponding to a pairs of arcs. Tokens in a place connected by a read arc to a transition are not consumed when firing this transition, but they are required to enable it.

Part of the results in the following paper comes from the analysis of the net simulation. In such a simulation one must set specific rules governing the firing of the transitions. Our model is based on a classical Petri net, where in general only three scenarios for transition firing are available: in a single step of the simulation only one of the enabled transitions can fire, all enabled transitions can fire simultaneously or, in a third case, only some, randomly chosen enabled transitions will fire. The latter scenario has been used in our simulations. Specifically, each enabled transition has 50% chance of firing and the sequence of enabled transitions selected for firing in a single simulation step is (each time) chosen randomly.

More complex firing scenarios could be available if, e.g., stochastic Petri net has been used. It would however require assigning a so called firing rates to each transition, what in general would be a difficult task requiring knowledge about more or less specific probability of each reaction involved in the modeled system. The assumption of 50% firing chance is therefore a trade-off, allowing simple yet still valuable stochastic simulations of the model.

As a summary one can say that in a single step some random enabled transitions (with 50% chance for firing) are randomly assigned to the firing list. One must note that they will not necessarily all fire, because competing transitions on such a list can share the same pre-places and consume “activation” tokens from each other depending on the firing sequence (i.e., when a transition is on that list, but because of firing of the other ones such a transition loses the necessary minimum of tokens in its pre-places, it stops being active and cannot fire). However, such a firing sequence is random in each simulation step, so a sufficient stochastic scenario is provided to study the dynamic behavior of the model.

More advanced analysis methods for simulation of a net would be available if a stochastic Petri net (SPN) or continuous Petri net (CPN) have been used to model the DNA repair process. The latter would require the formulation of ODEs for the reaction flows. For the SPN model, the so-called firing rates need to be assigned for each transition—in the stochastic Petri net they influence the transition chances of firing in the simulation. Both approaches have advantages (e.g., a more complex and accurate simulation) and disadvantages. As for the latter, it is still difficult to find specific and accurate values which could describe the ODE in the CPN model or firing rates for all transitions for the SPN model. Further, in a continuous Petri net an invariant, MCT and cluster-based analysis explained in the next section is not available, while in this work such an analysis allowed us to divide the Petri net model into some functional subnets, characterized by the common functionality.

t-invariants analysis

Besides the graphical representation of Petri nets also a more formal one, called incidence matrix, is used. Entries of such matrix A = [aij]n × m are integer numbers, and entry aij is equal to a difference between numbers of tokens in places pi before and after firing transition tj.

Among many properties of Petri nets, the ones related to t-invariants are especially important in the context of an analysis of biological systems models. A t-invariant is a vector x of integer numbers being a solution to the equation For every t-invariant x there is a set of transitions supp(x) = {tj:xj>0,j = 1,2,…,m} called its support.

t-invariants correspond to some subprocesses of the modeled biological system which do not change its state. Hence, analyzing relations among them may lead to discoveries of properties of the system [17]. The net is covered by t-invariants if every transition belongs to a support of at least one of them. In such a situation every transition contributes to at least one process of the modeled biological system [1820]. Therefore, when an analysis is based on t-invariants the net should be covered by them. In order to find relations between t-invariants they are grouped into sets called t-clusters, using standard clustering algorithms. Also transitions can be grouped into sets called maximal common transition sets (MCT sets) what helps in the analysis of t-invariants [17, 21]. Both t-clusters and MCT sets correspond to some functional modules of the biological system. A more precise description of a Petri net elements, invariants and MCT sets is available as Appendix in S1 Appendix.


The model of the human base excision repair pathway

The Petri net presented in this work is a model of the human base excision repair pathway. The model was built based on data from the publicly available databases: REPAIRtoire—a database of DNA repair pathways [9], DNAtraffic [10] and Reactome [22, 23]. Subsequently, the computational model was supplemented with several new aspects of the apurinic/apyrimidinic (AP) site processing which are not included in the above-mentioned databases but were reported in the literature:

  • displacement of OGG1 glycosylase at the AP site by AP endonuclease [2426] or NEIL1 glycosylase [27, 28],
  • displacement of NTH1 glycosylase by AP endonuclease [29],
  • Polβ-mediated LP-BER [30, 31],
  • mechanism of the repair initiated by NEIL2 and NEIL3 [32, 33].

The model has been built and analyzed using Holmes software [34] and is available in SPPED (Snoopy), XML (SMBL), .project (Holmes) and PDF formats as S1, S2, S3 and S4 Files respectively.

The model can be subdivided into 4 subnets, described below:

Subnet 1: Introduction of DNA damage.

This part of the model represents the creation of different types of DNA damage from undamaged DNA.

Subnet 2: AP site formation.

The biggest part of the model represents the recognition of DNA damage by DNA glycosylases, followed by the removal of damaged bases by cleavage. This part of the model can be further subdivided into 11 modules, each of which represents the activity of one DNA glycosylase (Table 1).

Table 1. Human DNA glycosylases studied in the Petri net model.

DNA glycosylases and their substrates are represented by places. The substrates were divided into groups, based on a subset of DNA glycosylases that recognize them (Table 2).

Table 2. Damages to the DNA taken into account in the model.

Each group of DNA damage is represented by a separate place. For each glycosylase-substrate pair this part of the model contains a transition representing the damage recognition process, which leads to the glycosylase-damaged DNA complex formation and cleavage of the damaged base, which in turn results in the AP site formation. Fig 1 presents the submodule for the OGG1 DNA glycosylase (full names of places and transitions of this submodule are given in Table 3).

Fig 1. The part of the BER Petri net model presenting AP site formation by the OGG1 glycosylase.

Recognition of a specific error group, the cleavage and formation of the DNA repair complex is shown for different error molecules. A general, schematic diagram of the modeled process is shown on the left. For more detailed description of places and transitions see Table 3. Shades of gray were used to mark steps of the process.

Table 3. The biological meaning of places and transitions for Fig 1, presenting AP site formation by the OGG1 glycosylase (subnet 2).

Subnet 3: AP site processing.

AP site (generated by damaged base removal) is incised either by AP-lyase activity of a bifunctional DNA glycosylase or by AP endonuclease (APE). AP-lyase activities of OGG1, NTH1, NEIL1, NEIL2 and NEIL3 are represented in the model by appropriate lyase activity transitions. In the case of OGG1, NTH1 and NEIL3, which perform β-elimination, the lyase activity results in the formation of DNA incision with 3ʹdRP (deoxyribose phosphate) end (Fig 2A). Full names of places and transitions for all three Fig 2 subnets are given in Table 4.

Fig 2.

Three parts (A-C) of the BER Petri net model presenting AP site processing by different glycosylases. From left to right: part A—an example of the lyase activity generating 3ʹdRP end, part B—an example of the β,δ-elimination reaction, part C—an example of the DNA incision after monofunctional glycosylase activity, D—A general, schematic diagram of the modeled processes. For a more detailed description of places and transitions see Table 4. Shades of gray were used to mark different AP site processing paths.

Table 4. The biological meaning of places and transitions for Fig 2, presenting AP site processing by different glycosylaes (subnet 3).

Only NEIL1 and NEIL2 generate DNA incision with 3ʹP end through β,δ-elimination (Fig 2B). In contrast, AP sites generated by monofunctional glycosylases (SMUG, MBD4, MPG, MYH, TDG, UDG) need additional processing and involvement of APE. This reaction results in DNA incision with 5ʹdRP end (Fig 2C).

The AP-lyase activity of OGG1 is weak ‒ after base cleavage, OGG1 can be substituted by APE, which catalyzes strand incision leading to 5ʹdRP end (Fig 2A). Similarly, APE can process AP sites generated by NTH1, and NEIL1 can catalyze β,δ-elimination at the AP sites generated by OGG1 (Fig 2A and 2C).

The 3ʹdRP and 3ʹP DNA termini (dirty ends) are refractory to DNA polymerase repair synthesis and need to be “cleaned” to 3ʹOH for BER to proceed. In human cells, APE and PNKP are responsible for the removal of 3ʹdRP and 3ʹP, respectively. 5ʹdRP end prevents the ligation process and its removal is primarily carried by DNA Polβ or FEN1.

Subnet 4: DNA repair synthesis and ligation.

The last step of BER is the replacement of the excised nucleoside by repair synthesis catalyzed by DNA polymerases (presented in Fig 3, full names of places and transitions are given in Table 5) and nick sealing by a DNA ligase. It can proceed either as SP- or LP-BER. Typically, SP-BER is performed by Polβ and LIG3-XRCC1 complex on the products of bifunctional glycosylases activity. Repair initiated by monofunctional glycosylases proceeds via LP-BER which requires activity and presence of Polβ, Polδ/Polε, PCNA, FEN1 and LIG1. However, the monofunctional glycosylase pathway can also proceed via SP-BER, after removal of 5ʹdRP by Polβ or FEN1. If Polβ removes 5ʹdRP than the final ligation step is performed by LIG3-XRCC1 complex, otherwise it is done by LIG1. DNA synthesis in LP-BER is usually performed either by Polδ or Polε, with Polδ being preferred in high concentrations of PCNA [66]. However, it was also shown that Polβ can conduct strand synthesis in LP-BER.

Fig 3. The part of the BER Petri net model presenting different repair synthesis pathways.

A general, schematic diagram of the modeled process is shown below the Petri net. For more detailed description of places and transitions see Table 5. Shades of gray were used to mark different paths.

Table 5. The biological meaning of places and transitions for Fig 3 presenting different repair synthesis pathways (subnet 4).

Mathematical analysis of the model

The presented Petri net model consists of 259 transitions and 179 places. Full names of places and transitions are given in Tables A and B in S1 Tables, respectively.

In the first step of the analysis of the model, some of its structural properties have been identified. The net is pure and ordinary, i.e., there are no read arcs, and all the arcs have a weight equal to 1. Therefore, the net is homogenous–all outgoing arcs of a given place have the same weight. The network is connected, but not strongly connected, meaning that all its nodes have non-directed connection(s) with the others, but there are pairs of nodes without a directed path between them. The net is not conservative, i.e., many transitions consume a different number of tokens from their pre-places than they produce in the post-places. There are also static conflicts–some transitions share the same pre-places. Finally, there are input and output transitions in the model, which correspond to the connections of the analyzed system with other processes.

An important part of the analysis of the net is based on the t-invariants. There are 245 t-invariants and the net is covered by them. In the presented model of DNA BER-repair process, there are 94 non-trivial MCT sets, i.e., containing more than one transition. Many of the MCT sets are responsible for almost the same functionality, but operating on different substrates. Description of the most important MCT sets is presented in Table 6.

Table 6. Maximum Common Transitions sets and their biological functions.

The most important basic functions of the modeled system correspond to eight MCT sets: m1 (short patch repair with Polβ and PNKP), m14 followed by m15 (short patch repair for DNA-APE complex), m16, m17, m40 (short patch (SP) and long patch (LP) repair in which Polβ and FEN1 are involved), finally m2 and m3 responsible for long patch repair using Polδ and Polε, respectively. Two very important sets are m13 and m14. The latter is crucial for starting one of the short patch repairs with Polβ. Set m13 is responsible for the preparation of the DNA complex for four out of six repair paths, including three long patch repairs (with Polβ, Polδ, and Polε). Overall, the results of analysis of the MCT sets suggest that in most cases the chosen repair sub-pathway is determined by the damage type at the very beginning of the process. However, in some cases alternative pathways are available. For example repair processes in which generation of 5′dRP by APE activity takes place, the repair can either proceed via SP-BER (m15) or via LP-BER, if PCNA level is high (m17, m40).

The next step of the model analysis concerns t-clusters computation. Such successfully obtained clusters can divide the Petri net into different subnets responsible for the similar or common functions in the whole DNA repair process. Such a division is based on the t-invariants set. The description of such subnets/clusters can use the previously computed MCT sets for simplification (by replacing some groups of transitions by such sets), yet one should have in mind that such a t-cluster analysis is based solely on the computed t-invariants sets.

In order to find the optimal number of clusters, many similarity measures and grouping algorithms had to be used and their results compared. The theory is that for each cluster (group) its t-invariants must be very similar to each other according to the chosen similarity measure, and as much as possible dissimilar to t-invariants belonging to different groups [67]. Three important choices needed to be made: choosing the proper similarity measure, the clustering algorithms and the number of clusters. Thorough comparative tests of different choices in these three variables have been performed. In the tests eight similarity measures have been used: Binary, Canberra, Euclidean, Manhattan, Maximum, Minkowski, uncentered and centered Person (the latter often called Correlated Pearson metric). Further, seven clustering algorithms have been tested: Centroid, Complete, McQuitty, Median, Single, UPGMA, and Ward. The evaluation concerned the number of clusters ranging from 2 to 30. In order to evaluate the results two indices have been used, i.e., Mean Split Silhouette (MSS) [68] and Caliński-Harabasz index [69]. For the presented Petri net the best clustering has been obtained using the UPGMA algorithm and correlated Pearson similarity measure. The best number of clusters according to the used evaluation measures is 27. However, 20 of them are trivial single t-invariants responsible for the fluctuations of the system components, like APE, Polβ, Polδ, Polε and the glycosylases responsible for damage detection. The obtained clustering (i.e., set of clusters) is presented in Table 7.

Clusters 21–27 from Table 7 are all responsible for different repair paths within the presented net. Table 8 presents a more detailed description of each path, providing information about e.g., glycosylases involved in damage detection or main polymerases involved in the repair process.

Table 9 contains the names of the most important transitions corresponding to the elementary processes within short and long patch repair pathways.

Analysis of the robustness of the BER pathway via systematic in silico knockout experiment

The impact of each modeled repair synthesis sub-paths: three short-patch (SP) ones (where DNA Polβ is always present) and three long-patch (LP) repairs (with polymerases δ, ε or β) were analyzed via systematic in silico knockout experiment. The six repair synthesis sub-paths defined for this analysis are as follows:

  • Path 1: SP(1), refers to t-invariants cluster 21, DNA synthesis performed by Polβ, includes PNKP,
  • Path 2: SP(2), refers to t-invariants cluster 22, DNA synthesis performed by Polβ,
  • Path 3: LP(1), refers to t-invariants cluster 23, DNA synthesis performed by Polβ,
  • Path 4: SP(4), refers to t-invariants cluster 27, DNA synthesis performed by Polβ, includes FEN1,
  • Path 5: LP(2), refers to t-invariants cluster 25, DNA synthesis performed by Polδ,
  • Path 6: LP(3), refers to t-invariants cluster 26, DNA synthesis performed by Polε.

Net simulations were used to perform a series of knockout experiments. Each simulation contained 20,000 steps and was repeated 20 times. In a single step, any enabled transition had a 50% chance for firing. Transition t169 (DNA_back_to_pool) was treated as a marker of successful DNA repair. Its chance of firing represents the chance of successful completion of the repair process. The results of the simulations for the non-disturbed model are summarized in Table 10. The chance of successful repair, represented by the firing of transition t169, equals on average 23.3%. The biggest proportion of successful repairs is processed via the SP(2) pathway.

Table 10. Involvement of the synthesis sub-paths in the successful DNA repair in the non-disturbed model.

The results of the knockout experiments are presented in Table 11. Transitions critical for each repair synthesis sub-paths have been identified, and their impact on the overall chance of successful repair was checked. The most prominent change in the repair success rate was observed when Path 2 was disabled. In this setup Paths 1 and 3–6 exhibit a slightly higher activity but they cannot fully recapitulate the repair. At the same time the lack of activity in Paths 3–6 did not result in an observable change in DNA repair efficiency. The high importance of Polβ in DNA repair is a well-known phenomenon—the lower repair rate is thus expected (see Discussion for more details). To the best of our knowledge, Polδ and Polε knockouts influence on BER efficiency and higher activity of other BER sub-pathways after Polβ knockout have not been reported yet.

We have also investigated influence of a glycosylase knockout on the modeled behavior of BER, taking the TDG glycosylase as an example. We found that TDG knockout, simulated by inactivation of transition t_206, which introduces TDG into the system, leads to: i) higher activity of glycosylases involved in processing of the same DNA damage substrates as TDG (namely MBD4, OGG1, NEIL1, NEIL2 and SMUG), ii) slight decrease in the overall DNA repair efficiency, iii) accumulation of DNA damage from group 11, which are recognized only by TDG and iv) decreased activity of the SP-BER pathway.


The BER process can be described as a unique DNA repair pathway, where very precise enzymes are used to recognize and remove specific DNA damage. Thus far eleven DNA glycosylases have been found in human cells (Table 1). All DNA glycosylases are divided into four structurally different superfamilies and share a similar ability to recognize the DNA damage. However, there are also several mismatches (e.g., T-C) or DNA damage types (e.g., 3-mA, 7-mG) which are removed only by one human enzyme (Table 2) [9, 10, 36, 65, 70].

Although BER has already been a subject of several computational modeling attempts, we provide the most comprehensive model, which includes the widest range of protein factors involved in the DNA repair process and the biggest collection of DNA damage types (Table 1). The models available to date concentrate mainly on the repair initiated by the OGG1 glycosylase [1214], while our computational model of human BER includes 68 different DNA damage types found in living cells and identified and repaired by 11 human DNA glycosylases. We have also included into our Petri net model several mechanisms that have not been considered in the available models, e.g., displacement of OGG1 at the AP site by AP endonuclease or NEIL1, displacement of NTH1 by AP endonuclease, Polβ-mediated LP-BER and the mechanism of the repair initiated by NEIL2 and NEIL3. However, we have included in our model mainly the major DNA damage types. As a consequence, some of the products of minor G base oxidative damage (e.g. 5-carboxamido-5-formamido-2-iminohydantoin, 2Ih) which can be removed from DNA by hNEIL1-3 glycosylases [71] are not present in the model. We have also omitted several bulky DNA damage products (from the minor fraction) which are used mostly for in vitro studies of DNA glycosylases activities.

BER can proceed via different sub-pathways, depending on the damage and the type of DNA glycosylase that detected the problem (Table 2). In general, the repair DNA synthesis can be performed by DNA Polβ (Polβ, used for both the so-called short-patch (SP) and long-patch (LP) base excision repairs). Two other major DNA polymerases δ and ε (Polδ and Polε) can repair DNA instead of Polβ, and they are always involved in the LP-BER pathway with PCNA involvement. Other possibilities for repairing DNA by the BER mechanism have also been included in our model. The analysis performed on the base of net t-invariants allowed to divide a structure of the net into meaningful biological units (MCT sets) and more importantly to group t-invariants which represent basic subprocesses of the model into clusters (Table 6). The clusters allow for separating different repairing processes, showing the glycosylases, polymerases and other compounds involved in the repair processes, as presented in Tables 7 and 8 and Figs 13.

Using in silico simulations, we showed that most efficient and successful DNA repair of the studied DNA damage was reached via SP-BER (Table 10). This result was consistent with biological data showing that the patch size in the nucleosomal BER was rather shorter (1 nt) than longer (2–12 nt) [72]. Our in silico knockout experiments showed that Polβ, the so-called repair polymerase (Table 11, Path 2) is necessary for BER. Lack of DNA Polβ in our model resulted in stopping the cleaning of 5’dRP moiety and gap filling (DNA synthesis) which was simultaneously manifested as a block of whole SP-BER. In living cells, the perturbation in BER can lead to genomic instability. Our results could be supported by the observation that somatic mutations in Polβ gene have been detected in various types of cancers (NCI Genomic Data Commons (GDC)). However, it is still not clear whether and how Polβ mutations and overexpression can be linked to cancer onset and its progression. Recently, it has been shown that human R152C Polβ mutant was impaired in BER activity and efficiency. Moreover, the mutant cells have displayed a high frequency of chromatid breakages [73]. Mice carrying a targeted disruption of the Polβ gene has shown growth retardation and died of a respiratory failure immediately after the birth [74]. Also, homozygous Polβ R137Q knock-in mice embryos were typically small in size and had a high mortality rate (21%). In this mutant the BER efficiency was impaired, which subsequently ended in double-strand breaks (DSBs) and chromosomal aberrations [75]. Recently it has been shown that a mouse model with decreased expression of Polβ (with Y265C mutation) developed systemic lupus erythematosus (SLE) [76].

It is not clear which of the two activities of Polβ is the most important in SP-BER. Both DNA synthesis and 5′-dRP-lyase activity can be replaced by other DNA polymerases or FEN1, respectively. If the 5′-dRP moiety is reduced or oxidized, the 5′-dRP-lyase of Polβ cannot remove the modified sugar residue, and LP-BER is initiated. Next, a flap (2–10 nucleotide long) is subsequently removed by FEN1. FEN1 mutations are very rare, suggesting that FEN1 is important for normal DNA metabolism [77, 78]. Targeted deletion of the Fen1 gene in mice causes early embryonic lethality [79]. However, several somatic FEN1 mutations have been detected in human cancer (GDC) but the relationship between FEN1 deficiency and cancer susceptibility remains unclear. Recently, it was reported that L209P FEN1 mutation is associated with colorectal cancer. Human L209P FEN1 mutant was lacking the exo- and endonuclease activities but retaining DNA-binding affinity. This was a dominant-negative mutation and mutated protein impaired LP-BER in vitro and in vivo [80]. In contrast, our computational Petri net BER-knockout model with eliminated FEN1 activity did not show any significant change in BER efficiency (Table 11, Path 4). This inconsistency can be explained by the fact that FEN1 functions not only in LP-BER and it is mostly recognized as a central component of cellular DNA metabolism (e.g. processing of Okazaki fragment maturation intermediates, telomere maintenance and rescue of stalled replication fork) [77].

It is estimated that each day as many as 10,000 abasic sites are formed in one human cell [81]. Removing AP-sites from DNA is a daily task for the DNA repair/tolerance system. Apurinic/apyrimidinic endonuclease 1 is responsible for the AP sites processing which is necessary for further steps of DNA repair pathways [82, 83]. In our computational BER model, the elimination of AP-sites endonuclease 1 (APE1) resulted in downstream blocking of DNA repair, both, SP-BER and LP-BER. Lack of AP-sites repair can be an explanation of the embryonic lethality of Ape1 null mice [84, 85] and increasing tumor susceptibility in heterozygous mice [86]. APE1 mutations in humans (APE1 variants: L104R, E126D, and R237A, exhibiting approximately 40–60% reduction in specific incision activity) have been associated with amyotrophic lateral sclerosis (ALS) [87, 88]. Also, somatic APE1 mutations (APEX P112L, W188X, and R237C) were found in endometrial cancers [89].

Preparation of the 3′OH ends is a key step for repair DNA synthesis. In our model, we observed a reduction of a chance for successful repair when polynucleotide kinase/phosphatase (PNKP) was removed from the pathway (Table 11, Path 1). PNKP involvement in BER is mostly due to APE-independent base excision repair pathway in human cells after NEIL1 and NEIL2 action on oxidized base lesions [27, 28]. However, PNKP (two activities: DNA 5'-kinase and DNA 3′-phosphatase) is mostly recognized as an enzyme which generates 5′-phosphate/3′-hydroxyl DNA termini that are critical for ligation by the non-homologous end joining (NHEJ) DNA ligase LigIV during double-strand break repair (DSBR). Microcephaly with early-onset, intractable seizures and developmental delay (MCSZ) is a hereditary disease caused by mutations in PNKP [90, 91].

Eleven human DNA glycosylases were in silico deleted one-by-one in our computational BER model. None of them was critical for BER function because almost each DNA damage can be removed by at least two distinct glycosylases (Table 2). Apart from DNA damage removal, various BER glycosylases (SMUG, MBD4, TDG, UNG2, NEIL1-3) are involved in nucleotide replacement during the active DNA demethylation process [57]. However, unlike other DNA glycosylases, TDG is essential for embryonic development; mice die at day 11.5 [92]. The lethal phenotype is associated with epigenetic aberrations affecting the expression of developmental genes. Mouse embryonic fibroblasts, (MEFs) derived from Tdg null embryos showed impaired gene regulation due to imbalance histone modification and CpG methylation [93]. Other DNA glycosylases are not essential, mouse knockouts are viable, however, showing increase in mutation frequency or some immune dysfunction [36]. Human mutant DNA glycosylases have also been widely described and correlated with a predisposition to various diseases [70]. Also, an interplay between DNA repair of the oxidatively damaged base, 8-oxoG (8-oxo-7,8-dihydroguanine) and transcriptional activation has been documented for mammalian genes: removal of 8-oxoG from the coding strand by OGG1-mediated BER resulted in upregulated transcription [94]. Since oxidation is ongoing and transition of C to U occurs spontaneously or at specific times during differentiation and development, there is a strong suggestion that BER substrates might be epigenetic and modulate transcription factor binding [71, 95].

We believe that our model can be a valuable tool that can help to predict the influence of changes in protein activities or levels on the repair process as well as some regulatory functions. It can be used to predict the sensitivity of the cell with inactivated repair proteins to different types of DNA damage and it can help to identify the by-passing pathways that may lead to lack of pronounced phenotypes associated with mutations in some of the proteins.

Supporting information

S1 Tables.

Two supplementary Tables A–B listing all net places and transitions.


S1 Appendix. Appendix with more detailed description of Petri net elements and theory.


S4 File. The model as a figure in PDF format.



  1. 1. Koch I, Reisig W, Schreiber F. Modeling in systems biology: the Petri Net approach: Springer Science & Business Media; 2010.
  2. 2. Wang J, Pantopoulos K. Regulation of cellular iron metabolism. Biochemical Journal. 2011;434(3):365–81. pmid:21348856
  3. 3. Murata T. Petri nets: Properties, analysis and applications. Proceedings of the IEEE. 1989;77(4):541–80.
  4. 4. Petri CA. Kommunikation mit automaten. Institut für Instrumentelle Mathematik, Bonn. 1962.
  5. 5. Hofestädt R. A Petri net application to model metabolic processes. Systems Analysis Modelling Simulation. 1994;16(2):113–22.
  6. 6. Reddy VN, Mavrovouniotis ML, Liebman MN, editors. Petri net representations in metabolic pathways. ISMB: 0-929280-47-4; 1993:328–336. PubMed PMID: 584354; 1993.
  7. 7. Chen M, Hariharaputran S, Hofestädt R, Kormeier B, Spangardt S. Petri net models for the semi-automatic construction of large scale biological networks. Natural Computing. 2011;10(3):1077–97.
  8. 8. Marwan W, Wagler A, Weismantel R. Petri nets as a framework for the reconstruction and analysis of signal transduction pathways and regulatory networks. Natural Computing. 2011;10(2):639–54.
  9. 9. Milanowska K, Krwawicz J, Papaj G, Kosinski J, Poleszak K, Lesiak J, et al. REPAIRtoire—a database of DNA repair pathways. Nucleic Acids Res. 2011;39(Database issue):D788–92. pmid:21051355; PubMed Central PMCID: PMCPMC3013684.
  10. 10. Kuchta K, Barszcz D, Grzesiuk E, Pomorski P, Krwawicz J. DNAtraffic—a new database for systems biology of DNA dynamics during the cell life. Nucleic Acids Res. 2012;40(Database issue):D1235–40. pmid:22110027; PubMed Central PMCID: PMC3245060.
  11. 11. Dolan D, Nelson G, Zupanic A, Smith G, Shanley D. Systems modelling of NHEJ reveals the importance of redox regulation of Ku70/80 in the dynamics of dna damage foci. PLoS One. 2013;8(2):e55190. pmid:23457464; PubMed Central PMCID: PMC3566652.
  12. 12. Sokhansanj BA, Wilson DM 3rd. Estimating the effect of human base excision repair protein variants on the repair of oxidative DNA base damage. Cancer Epidemiol Biomarkers Prev. 2006;15(5):1000–8. pmid:16702383.
  13. 13. Sokhansanj BA, Rodrigue GR, Fitch JP, Wilson DM 3rd. A quantitative model of human DNA base excision repair. I. Mechanistic insights. Nucleic Acids Res. 2002;30(8):1817–25. pmid:11937636; PubMed Central PMCID: PMC113225.
  14. 14. Rahmanian S, Taleei R, Nikjoo H. Radiation induced base excision repair (BER): a mechanistic mathematical approach. DNA Repair (Amst). 2014;22:89–103. pmid:25117268.
  15. 15. Semenenko VA, Stewart RD. A fast Monte Carlo algorithm to simulate the spectrum of DNA damages formed by ionizing radiation. Radiat Res. 2004;161(4):451–7. pmid:15038766.
  16. 16. Semenenko VA, Stewart RD, Ackerman EJ. Monte Carlo simulation of base and nucleotide excision repair of clustered DNA damage sites. I. Model properties and predicted trends. Radiat Res. 2005;164(2):180–93. pmid:16038589.
  17. 17. Formanowicz D, Kozak A, Głowacki T, Radom M, Formanowicz P. Hemojuvelin–hepcidin axis modeled and analyzed using Petri nets. Journal of Biomedical Informatics. 2013;46(6):1030–43. pmid:23954231
  18. 18. Heiner M. Understanding Network Behavior by Structured Representations of Transition Invariants. In: Condon A, Harel D, Kok JN, Salomaa A, Winfree E, editors. Algorithmic Bioprocesses. Berlin, Heidelberg: Springer Berlin Heidelberg; 2009. p. 367–89.
  19. 19. Koch I, Ackermann J. On functional module detection in metabolic networks. Metabolites. 2013;3(3):673–700. pmid:24958145
  20. 20. Koch I, Heiner M. Petri nets In: Analysis of Biological Networks. In: Pan Y, Zomaya AY, editors. Series in Bioinformatics: Wiley Book; 2007. p. 139–179,, Print ISBN:9780470041444, chapter 7.
  21. 21. Sackmann A, Heiner M, Koch I. Application of Petri net based analysis techniques to signal transduction pathways. BMC bioinformatics. 2006;7(1):482.
  22. 22. Milacic M, Haw R, Rothfels K, Wu G, Croft D, Hermjakob H, et al. Annotating cancer variants and anti-cancer therapeutics in reactome. Cancers (Basel). 2012;4(4):1180–211. pmid:24213504; PubMed Central PMCID: PMC3712731.
  23. 23. Orlic-Milacic M. Base Excision Repair. Reactome—a curated knowledgebase of biological pathways. 2015;52. PMID: 22985186, PMCID: PMC3578965.
  24. 24. Almeida KH, Sobol RW. A unified view of base excision repair: lesion-dependent protein complexes regulated by post-translational modification. DNA Repair (Amst). 2007;6(6):695–711. pmid:17337257; PubMed Central PMCID: PMC1995033.
  25. 25. Hill JW, Hazra TK, Izumi T, Mitra S. Stimulation of human 8-oxoguanine-DNA glycosylase by AP-endonuclease: potential coordination of the initial steps in base excision repair. Nucleic Acids Res. 2001;29(2):430–8. pmid:11139613; PubMed Central PMCID: PMC29662.
  26. 26. Vidal AE, Hickson ID, Boiteux S, Radicella JP. Mechanism of stimulation of the DNA glycosylase activity of hOGG1 by the major human AP endonuclease: bypass of the AP lyase activity step. Nucleic Acids Res. 2001;29(6):1285–92. pmid:11238994; PubMed Central PMCID: PMC29755.
  27. 27. Mokkapati SK, Wiederhold L, Hazra TK, Mitra S. Stimulation of DNA glycosylase activity of OGG1 by NEIL1: functional collaboration between two human DNA glycosylases. Biochemistry. 2004;43(36):11596–604. pmid:15350146.
  28. 28. Wiederhold L, Leppard JB, Kedar P, Karimi-Busheri F, Rasouli-Nia A, Weinfeld M, et al. AP endonuclease-independent DNA base excision repair in human cells. Mol Cell. 2004;15(2):209–20. pmid:15260972.
  29. 29. Marenstein DR, Chan MK, Altamirano A, Basu AK, Boorstein RJ, Cunningham RP, et al. Substrate specificity of human endonuclease III (hNTH1). Effect of human APE1 on hNTH1 activity. J Biol Chem. 2003;278(11):9005–12. pmid:12519758.
  30. 30. Klungland A, Lindahl T. Second pathway for completion of human DNA base excision-repair: reconstitution with purified proteins and requirement for DNase IV (FEN1). EMBO J. 1997;16(11):3341–8. pmid:9214649; PubMed Central PMCID: PMC1169950.
  31. 31. Sukhanova M, Khodyreva S, Lavrik O. Poly(ADP-ribose) polymerase 1 regulates activity of DNA polymerase beta in long patch base excision repair. Mutat Res. 2010;685(1–2):80–9. pmid:19703477.
  32. 32. Das A, Wiederhold L, Leppard JB, Kedar P, Prasad R, Wang H, et al. NEIL2-initiated, APE-independent repair of oxidized bases in DNA: Evidence for a repair complex in human cells. DNA Repair (Amst). 2006;5(12):1439–48. pmid:16982218; PubMed Central PMCID: PMC2805168.
  33. 33. Krokeide SZ, Laerdahl JK, Salah M, Luna L, Cederkvist FH, Fleming AM, et al. Human NEIL3 is mainly a monofunctional DNA glycosylase removing spiroimindiohydantoin and guanidinohydantoin. DNA Repair (Amst). 2013;12(12):1159–64. pmid:23755964; PubMed Central PMCID: PMC3840045.
  34. 34. Radom M, Rybarczyk A, Szawulak B, Andrzejewski H, Chabelski P, Kozak A, et al. Holmes: a graphical tool for development, simulation and analysis of Petri net based models of complex biological systems. Bioinformatics. 2017;33(23):3822–3. pmid:28961696.
  35. 35. Hu J, de Souza-Pinto NC, Haraguchi K, Hogue BA, Jaruga P, Greenberg MM, et al. Repair of formamidopyrimidines in DNA involves different glycosylases: role of the OGG1, NTH1, and NEIL1 enzymes. J Biol Chem. 2005;280(49):40544–51. pmid:16221681.
  36. 36. Jacobs AL, Schar P. DNA glycosylases: in DNA repair and beyond. Chromosoma. 2012;121(1):1–20. pmid:22048164; PubMed Central PMCID: PMC3260424.
  37. 37. Kim YJ, Wilson DM 3rd. Overview of base excision repair biochemistry. Curr Mol Pharmacol. 2012;5(1):3–13. pmid:22122461; PubMed Central PMCID: PMC3459583.
  38. 38. Ocampo-Hafalla MT, Altamirano A, Basu AK, Chan MK, Ocampo JE, Cummings A Jr., et al. Repair of thymine glycol by hNth1 and hNeil1 is modulated by base pairing and cis-trans epimerization. DNA Repair (Amst). 2006;5(4):444–54. pmid:16446124.
  39. 39. Bjelland S, Seeberg E. Mutagenicity, toxicity and repair of DNA base damage induced by oxidation. Mutat Res. 2003;531(1–2):37–80. pmid:14637246.
  40. 40. Masaoka A, Matsubara M, Hasegawa R, Tanaka T, Kurisu S, Terato H, et al. Mammalian 5-formyluracil-DNA glycosylase. 2. Role of SMUG1 uracil-DNA glycosylase in repair of 5-formyluracil and other oxidized and deaminated base lesions. Biochemistry. 2003;42(17):5003–12. pmid:12718543.
  41. 41. Shinmura K, Kato H, Goto M, Tao H, Inoue Y, Nakamura S, et al. Mutation Spectrum Induced by 8-Bromoguanine, a Base Damaged by Reactive Brominating Species, in Human Cells. Oxid Med Cell Longev. 2017;2017:7308501. pmid:29098062; PubMed Central PMCID: PMC5643121.
  42. 42. Liu P, Burdzy A, Sowers LC. Repair of the mutagenic DNA oxidation product, 5-formyluracil. DNA Repair (Amst). 2003;2(2):199–210. pmid:12531390.
  43. 43. Yoon JH, Iwai S, O'Connor TR, Pfeifer GP. Human thymine DNA glycosylase (TDG) and methyl-CpG-binding protein 4 (MBD4) excise thymine glycol (Tg) from a Tg:G mispair. Nucleic Acids Res. 2003;31(18):5399–404. pmid:12954776; PubMed Central PMCID: PMC203315.
  44. 44. Brooks SC, Adhikary S, Rubinson EH, Eichman BF. Recent advances in the structural mechanisms of DNA glycosylases. Biochim Biophys Acta. 2013;1834(1):247–71. pmid:23076011; PubMed Central PMCID: PMC3530658.
  45. 45. Lee CY, Delaney JC, Kartalou M, Lingaraju GM, Maor-Shoshani A, Essigmann JM, et al. Recognition and processing of a new repertoire of DNA substrates by human 3-methyladenine DNA glycosylase (AAG). Biochemistry. 2009;48(9):1850–61. pmid:19219989; PubMed Central PMCID: PMC2883313.
  46. 46. O'Brien PJ, Ellenberger T. Dissecting the broad substrate specificity of human 3-methyladenine-DNA glycosylase. J Biol Chem. 2004;279(11):9750–7. pmid:14688248.
  47. 47. Saparbaev M, Langouet S, Privezentzev CV, Guengerich FP, Cai H, Elder RH, et al. 1,N(2)-ethenoguanine, a mutagenic DNA adduct, is a primary substrate of Escherichia coli mismatch-specific uracil-DNA glycosylase and human alkylpurine-DNA-N-glycosylase. J Biol Chem. 2002;277(30):26987–93. pmid:12016206.
  48. 48. Wyatt MD, Allan JM, Lau AY, Ellenberger TE, Samson LD. 3-methyladenine DNA glycosylases: structure, function, and biological importance. Bioessays. 1999;21(8):668–76. pmid:10440863.
  49. 49. Dou H, Mitra S, Hazra TK. Repair of oxidized bases in DNA bubble structures by human DNA glycosylases NEIL1 and NEIL2. J Biol Chem. 2003;278(50):49679–84. pmid:14522990.
  50. 50. Grin IR, Dianov GL, Zharkov DO. The role of mammalian NEIL1 protein in the repair of 8-oxo-7,8-dihydroadenine in DNA. FEBS Lett. 2010;584(8):1553–7. pmid:20214901; PubMed Central PMCID: PMC3004018.
  51. 51. Aamann MD, Hvitby C, Popuri V, Muftuoglu M, Lemminger L, Skeby CK, et al. Cockayne Syndrome group B protein stimulates NEIL2 DNA glycosylase activity. Mech Ageing Dev. 2014;135:1–14. pmid:24406253; PubMed Central PMCID: PMC3954709.
  52. 52. Liu M, Bandaru V, Holmes A, Averill AM, Cannan W, Wallace SS. Expression and purification of active mouse and human NEIL3 proteins. Protein Expr Purif. 2012;84(1):130–9. pmid:22569481; PubMed Central PMCID: PMC3378769.
  53. 53. Dherin C, Radicella JP, Dizdaroglu M, Boiteux S. Excision of oxidatively damaged DNA bases by the human alpha-hOgg1 protein and the polymorphic alpha-hOgg1(Ser326Cys) protein which is frequently found in human populations. Nucleic Acids Res. 1999;27(20):4001–7. pmid:10497264; PubMed Central PMCID: PMC148667.
  54. 54. Fortini P, Pascucci B, Parlanti E, D'Errico M, Simonelli V, Dogliotti E. 8-Oxoguanine DNA damage: at the crossroad of alternative repair pathways. Mutat Res. 2003;531(1–2):127–39. pmid:14637250.
  55. 55. Roldan-Arjona T, Wei YF, Carter KC, Klungland A, Anselmino C, Wang RP, et al. Molecular cloning and functional expression of a human cDNA encoding the antimutator enzyme 8-hydroxyguanine-DNA glycosylase. Proc Natl Acad Sci U S A. 1997;94(15):8016–20. pmid:9223306; PubMed Central PMCID: PMC21548.
  56. 56. Bennett MT, Rodgers MT, Hebert AS, Ruslander LE, Eisele L, Drohat AC. Specificity of human thymine DNA glycosylase depends on N-glycosidic bond stability. J Am Chem Soc. 2006;128(38):12510–9. pmid:16984202; PubMed Central PMCID: PMC2809119.
  57. 57. Bochtler M, Kolano A, Xu GL. DNA demethylation pathways: Additional players and regulators. Bioessays. 2017;39(1):1–13. pmid:27859411.
  58. 58. Goto M, Shinmura K, Matsushima Y, Ishino K, Yamada H, Totsuka Y, et al. Human DNA glycosylase enzyme TDG repairs thymine mispaired with exocyclic etheno-DNA adducts. Free Radic Biol Med. 2014;76:136–46. pmid:25151120.
  59. 59. Hashimoto H, Hong S, Bhagwat AS, Zhang X, Cheng X. Excision of 5-hydroxymethyluracil and 5-carboxylcytosine by the thymine DNA glycosylase domain: its structural basis and implications for active DNA demethylation. Nucleic Acids Res. 2012;40(20):10203–14. pmid:22962365; PubMed Central PMCID: PMC3488261.
  60. 60. Maiti A, Morgan MT, Drohat AC. Role of two strictly conserved residues in nucleotide flipping and N-glycosylic bond cleavage by human thymine DNA glycosylase. J Biol Chem. 2009;284(52):36680–8. pmid:19880517; PubMed Central PMCID: PMC2794782.
  61. 61. Sibghat U, Gallinari P, Xu YZ, Goodman MF, Bloom LB, Jiricny J, et al. Base analog and neighboring base effects on substrate specificity of recombinant human G:T mismatch-specific thymine DNA-glycosylase. Biochemistry. 1996;35(39):12926–32. pmid:8841138.
  62. 62. Slyvka A, Mierzejewska K, Bochtler M. Nei-like 1 (NEIL1) excises 5-carboxylcytosine directly and stimulates TDG-mediated 5-formyl and 5-carboxylcytosine excision. Sci Rep. 2017;7(1):9001. pmid:28827588; PubMed Central PMCID: PMC5566547.
  63. 63. Talhaoui I, Couve S, Ishchenko AA, Kunz C, Schar P, Saparbaev M. 7,8-Dihydro-8-oxoadenine, a highly mutagenic adduct, is repaired by Escherichia coli and human mismatch-specific uracil/thymine-DNA glycosylases. Nucleic Acids Res. 2013;41(2):912–23. pmid:23209024; PubMed Central PMCID: PMC3553953.
  64. 64. Krokan HE, Nilsen H, Skorpen F, Otterlei M, Slupphaug G. Base excision repair of DNA in mammalian cells. FEBS Lett. 2000;476(1–2):73–7. pmid:10878254.
  65. 65. Krwawicz J, Arczewska KD, Speina E, Maciejewska A, Grzesiuk E. Bacterial DNA repair genes and their eukaryotic homologues: 1. Mutations in genes involved in base excision repair (BER) and DNA-end processors and their implication in mutagenesis and human disease. Acta Biochim Pol. 2007;54(3):413–34. pmid:17893748.
  66. 66. Burgers PM. Saccharomyces cerevisiae replication factor C. II. Formation and activity of complexes with the proliferating cell nuclear antigen and with DNA polymerases delta and epsilon. J Biol Chem. 1991;266(33):22698–706. pmid:1682322.
  67. 67. Jain AK, Dubes RC. Algorithms for clustering data. Prentice-Hall, Inc.; 1988.
  68. 68. Kaufman L, Rousseeuw PJ. Finding groups in data: an introduction to cluster analysis: John Wiley & Sons; 2009.
  69. 69. Caliński T, Harabasz J. A dendrite method for cluster analysis. Communications in Statistics-Simulation and Computation. 1974;3(1):1–27.
  70. 70. D'Errico M, Parlanti E, Pascucci B, Fortini P, Baccarini S, Simonelli V, et al. Single nucleotide polymorphisms in DNA glycosylases: From function to disease. Free Radic Biol Med. 2017;107:278–91. pmid:27932076.
  71. 71. Fleming AM, Burrows CJ. Formation and processing of DNA damage substrates for the hNEIL enzymes. Free Radic Biol Med. 2017;107:35–52. pmid:27880870; PubMed Central PMCID: PMC5438787.
  72. 72. Meas R, Smerdon MJ. Nucleosomes determine their own patch size in base excision repair. Sci Rep. 2016;6:27122. pmid:27265863; PubMed Central PMCID: PMC4893620.
  73. 73. Zhou T, Pan F, Cao Y, Han Y, Zhao J, Sun H, et al. R152C DNA Pol beta mutation impairs base excision repair and induces cellular transformation. Oncotarget. 2016;7(6):6902–15. pmid:26760506; PubMed Central PMCID: PMC4872757.
  74. 74. Sugo N, Aratani Y, Nagashima Y, Kubota Y, Koyama H. Neonatal lethality with abnormal neurogenesis in mice deficient in DNA polymerase beta. EMBO J. 2000;19(6):1397–404. pmid:10716939; PubMed Central PMCID: PMC305680.
  75. 75. Pan F, Zhao J, Zhou T, Kuang Z, Dai H, Wu H, et al. Mutation of DNA Polymerase beta R137Q Results in Retarded Embryo Development Due to Impaired DNA Base Excision Repair in Mice. Sci Rep. 2016;6:28614. pmid:27358192; PubMed Central PMCID: PMC4928080.
  76. 76. Senejani AG, Liu Y, Kidane D, Maher SE, Zeiss CJ, Park HJ, et al. Mutation of POLB causes lupus in mice. Cell Rep. 2014;6(1):1–8. pmid:24388753; PubMed Central PMCID: PMC3916967.
  77. 77. Balakrishnan L, Bambara RA. Flap endonuclease 1. Annu Rev Biochem. 2013;82:119–38. pmid:23451868; PubMed Central PMCID: PMC3679248.
  78. 78. Sato M, Girard L, Sekine I, Sunaga N, Ramirez RD, Kamibayashi C, et al. Increased expression and no mutation of the Flap endonuclease (FEN1) gene in human lung cancer. Oncogene. 2003;22(46):7243–6. pmid:14562054.
  79. 79. Larsen E, Gran C, Saether BE, Seeberg E, Klungland A. Proliferation failure and gamma radiation sensitivity of Fen1 null mutant mice at the blastocyst stage. Mol Cell Biol. 2003;23(15):5346–53. pmid:12861020; PubMed Central PMCID: PMC165721.
  80. 80. Sun H, He L, Wu H, Pan F, Wu X, Zhao J, et al. The FEN1 L209P mutation interferes with long-patch base excision repair and induces cellular transformation. Oncogene. 2017;36(2):194–207. pmid:27270424; PubMed Central PMCID: PMC5140775.
  81. 81. Lindahl T, Nyberg B. Rate of depurination of native deoxyribonucleic acid. Biochemistry. 1972;11(19):3610–8. pmid:4626532.
  82. 82. Hadi MZ, Ginalski K, Nguyen LH, Wilson DM 3rd. Determinants in nuclease specificity of Ape1 and Ape2, human homologues of Escherichia coli exonuclease III. J Mol Biol. 2002;316(3):853–66. pmid:11866537.
  83. 83. Tsutakawa SE, Lafrance-Vanasse J, Tainer JA. The cutting edges in DNA repair, licensing, and fidelity: DNA and RNA repair nucleases sculpt DNA to measure twice, cut once. DNA Repair (Amst). 2014;19:95–107. pmid:24754999; PubMed Central PMCID: PMC4051888.
  84. 84. Izumi T, Brown DB, Naidu CV, Bhakat KK, Macinnes MA, Saito H, et al. Two essential but distinct functions of the mammalian abasic endonuclease. Proc Natl Acad Sci U S A. 2005;102(16):5739–43. pmid:15824325; PubMed Central PMCID: PMC556297.
  85. 85. Xanthoudakis S, Smeyne RJ, Wallace JD, Curran T. The redox/DNA repair protein, Ref-1, is essential for early embryonic development in mice. Proc Natl Acad Sci U S A. 1996;93(17):8919–23. pmid:8799128; PubMed Central PMCID: PMC38569.
  86. 86. Meira LB, Devaraj S, Kisby GE, Burns DK, Daniel RL, Hammer RE, et al. Heterozygosity for the mouse Apex gene results in phenotypes associated with oxidative stress. Cancer Res. 2001;61(14):5552–7. pmid:11454706.
  87. 87. Hadi MZ, Coleman MA, Fidelis K, Mohrenweiser HW, Wilson DM 3rd. Functional characterization of Ape1 variants identified in the human population. Nucleic Acids Res. 2000;28(20):3871–9. pmid:11024165; PubMed Central PMCID: PMC110798.
  88. 88. Kisby GE, Milne J, Sweatt C. Evidence of reduced DNA repair in amyotrophic lateral sclerosis brain tissue. Neuroreport. 1997;8(6):1337–40. pmid:9172131.
  89. 89. Pieretti M, Khattar NH, Smith SA. Common polymorphisms and somatic mutations in human base excision repair genes in ovarian and endometrial cancers. Mutat Res. 2001;432(3–4):53–9. pmid:11465542.
  90. 90. Aceytuno RD, Piett CG, Havali-Shahriari Z, Edwards RA, Rey M, Ye R, et al. Structural and functional characterization of the PNKP-XRCC4-LigIV DNA repair complex. Nucleic Acids Res. 2017;45(10):6238–51. pmid:28453785; PubMed Central PMCID: PMC5449630.
  91. 91. Reynolds JJ, Walker AK, Gilmore EC, Walsh CA, Caldecott KW. Impact of PNKP mutations associated with microcephaly, seizures and developmental delay on enzyme activity and DNA strand break repair. Nucleic Acids Res. 2012;40(14):6608–19. pmid:22508754; PubMed Central PMCID: PMC3413127.
  92. 92. Saito Y, Ono T, Takeda N, Nohmi T, Seki M, Enomoto T, et al. Embryonic lethality in mice lacking mismatch-specific thymine DNA glycosylase is partially prevented by DOPS, a precursor of noradrenaline. Tohoku J Exp Med. 2012;226(1):75–83. pmid:22200605.
  93. 93. Cortazar D, Kunz C, Selfridge J, Lettieri T, Saito Y, MacDougall E, et al. Embryonic lethal phenotype reveals a function of TDG in maintaining epigenetic stability. Nature. 2011;470(7334):419–23. pmid:21278727.
  94. 94. Fleming AM, Zhu J, Ding Y, Burrows CJ. 8-Oxo-7,8-dihydroguanine in the Context of a Gene Promoter G-Quadruplex Is an On-Off Switch for Transcription. ACS Chem Biol. 2017;12(9):2417–26. pmid:28829124; PubMed Central PMCID: PMC5604463.
  95. 95. Moore SP, Toomire KJ, Strauss PR. DNA modifications repaired by base excision repair are epigenetic. DNA Repair (Amst). 2013;12(12):1152–8. pmid:24216087.