A Comprehensive Analysis of the Dynamic Biological Networks in HCV Induced Hepatocarcinogenesis

Hepatocellular carcinoma (HCC) is a primary malignancy of the liver, which is closely related to hepatitis C and cirrhosis. The molecular mechanisms underlying the hepatocarcinogenesis induced by HCV infection remain clarified from a standpoint of systems biology. By integrating data from protein-protein interactions, transcriptional regulation, and disease related microarray analysis, we carried out a dynamic biological network analysis on the progression of HCV induced hepatocarcinogenesis, and systematically explored the potentially disease-related mechanisms through a network view. The dysfunctional interactions among proteins and deregulatory relationships between transcription factors and their target genes could be causes for the occurrence and progression of this disease. The six pathologically defined disease stages in the development and progression of HCC after HCV infection were included in this study. We constructed disease-related biological networks for each disease stage, and identified progression-related sub-networks that potentially play roles in the developmental stage of the corresponding disease and participate in the later stage of cancer progression. In addition, we identified novel risk factors related to HCC based on the analysis of the progression-related sub-networks. The dynamic characteristics of the network reflect important features of the disease development and progression, which provide important information for us to further explore underlying mechanisms of the disease.


Introduction
Hepatocellular carcinoma (HCC) is a major health problem worldwide. It is the sixth most common neoplasm in the world with more than half a million new cases annually [1], and the main cause of death among cirrhotic patients [2]. Hepatocarcinogenesis is a complex and multistep process, which is associated with many risk factors [3]. Hepatitis B virus (HBV) and hepatitis C virus (HCV) infections, excessive alcohol consumption and aflatoxin are widely recognized as the four major etiological factors of HCC [4]. Chronic infection with hepatitis B virus is the predominant risk factor for HCC in Southeast Asia and Africa, while chronic infection with hepatitis C virus is the predominant risk factor for HCC in Western countries and Japan.
The risk of HCC in patients with chronic hepatitis C is the highest and has been well studied in patients who have established cirrhosis [5,6,7,8]. The incidence of HCC in patients with cirrhosis is between 2%-8% per year as reported based on clinical studies. A previous study showed that [9] 12,008 HCV-positive men conferred a 20-fold increased risk of HCC compared to those HCV-negative subjects. However, the presence or absence of cirrhosis was not evaluated. It was reported that HCV-infected individuals without cirrhosis had a much lower risk of developing HCC [10]. Although previous clinically-based studies revealed close relationships among hepatitis C, cirrhosis and HCC, the underlying molecular mechanism of these phenomena remained unclear.
As the center for the diversified metabolisms and detoxification, liver plays various functions through different parenchymal cell types. Each type of cell is composed of thousands of different types of molecules. A discrete biological function can only rarely be attributed to an individual molecule. In contrast, most biological functions arise from interactions among many components. Usually, 'modules' are used to describe this kind of biological organization. Modules are composed of different types of molecules. They have discrete functions that arise from interactions among their components (proteins, DNA, RNA and small molecules), but these functions cannot easily be predicted by studying the properties of the isolated components [11]. Transcriptional regulatory interaction and protein-protein interaction are two of the most important interactions among these components. Transcriptional regulation is the process in which gene-encoded transcription factors regulate the transcription of other genes. The assembly of regulatory interactions linking transcription factors to their target genes in an organism can be viewed as a directed graph, in which the nodes represent the regulators and their targets, and the regulatory interactions are the edges. Protein-protein interactions (PPIs) are crucial for all biological processes [12]. Previous study on protein-protein interaction networks in liver cancer detected many interactions among proliferation, apoptosis-related proteins and differential glycoproteins, suggesting that a ''molecule groups'' concept should be introduced in the diagnosis and metastasis prediction of HCC instead of analyzing a single, or a few proteins [13]. In all, because the majority of gene products function together with other gene products, biological processes should be considered as complex networks of interconnected components. In other words, for any biological process, one might consider a 'modular approach' in which the behavior and function of the corresponding network are studied as a whole, in addition to studying some of its components individually [11].
Recently, the innovation of high throughput experimental strategies makes it possible for biology to become holistic [14]. With the tremendous increase in human protein interaction data, network approaches have already been employed to understand molecular mechanisms of diseases [15], particularly to analyze carcinogenesis and the genotype related to the cancer phenotype. However, most efforts only focused on protein-protein interaction network. Moreover, few of these efforts studied dynamic changes of the network during the development and progression from pre-HCC disease to HCC, which will facilitate us to understand the molecular mechanisms of hepatocarcinogenesis.
Our hypothesis is that the dysfunctional genes/proteins and interactions among them at each progression-related sub-network are strongly related to the special disease pathogenic state, and play a role in the disease progression. Networks related to each stage of the disease can capture the pathogenic characteristics of corresponding disease stages and reflect the role of those genes/ proteins in the development and progression of hepatocarcinogenesis at the disease specific stage. Based on this hypothesis, we have integrated multiple data at different levels, especially with regard to biological pathways and interaction networks, and built a hepatocellular carcinoma biological network database previously [16]. The integrated network lays the foundation for our current work. With the integration of protein-protein interactions data, transcriptional regulatory interactions (TRIs) data and microarray data covering hepatitis C, cirrhosis and Barcelona Clinic Liver Cancer (BCLC) staging HCC, we carried out a dynamic biological network analysis for the progression of HCV induced hepatocarcinogenesis.

Biological network and interaction annotation
In our previous work, we have integrated a set of human protein-protein interaction data and transcriptional regulatory interaction data, which contains 37811 experimentally confirmed protein-protein interactions and 9148 experimentally confirmed transcriptional regulatory interactions [16]. These interactions form a molecular biological network in humans. Gene Ontology is commonly used for annotation of genes and proteins [17]. However, the function of a module in a network is not only determined by a group of individual molecules, but also the functional association between them. Therefore, annotating the function of these interactions in a biological network will facilitate us to understand systemic functions in the biological network. A transcription factor may achieve its function by regulating the expression of target genes that implement the same function. In this way, if a transcription factor and its target gene could be mapped to a same function annotation, the TRI between them may also participate in this function. Based on Gene Ontology annotation data, which is supported by experimental evidence, we annotated interactions of the human biological network. Finally, 21365 PPIs and 3587 TRIs were annotated by this method.

Disease-related biological network
Biological networks provide valuable information for understanding cellular function and biological processes. The dysfunction of some interactions causes many diseases, including cancers. Therefore, studying the deregulated biological networks will help us to understand molecular mechanisms of these diseases. First, we identified DEGs (Differentially Expressed Genes) by bioinformatics analysis of the microarray data that cover Hepatitis C, Cirrhosis, and BCLC staging HCC. Then we mapped those DEGs to protein-protein interactions data and transcriptional regulatory interactions data integrated in HCCNet database [16], which was a HCC network database previously established by our team. In each disease stage, the PPI between DEGs-encoded proteins, and the TRI between the DEG-encoded transcription factor and its target gene in DEGs were regarded as deregulated. By integrating these interactions, we constructed disease-related biological networks that were deregulated in each disease stage (see details in material and method).
Several classification systems are available for HCC. The Barcelona Clinic Liver Cancer (BCLC) classification has emerged during recent years as the standard classification that is used for trial design and clinical management of patients with HCC [18,19]. It divides HCC into five stages. Patients with end-stage disease (stage D) have a reported median survival of only 3 months [18]. Therefore, this stage is excluded in the following study. Six disease stages in the development and progression of HCV induced hepatocarcinogenesis are included in our study including Hepatitis C, Cirrhosis, BCLC stage 0 HCC, BCLC stage A HCC, BCLC stage B HCC, BCLC stage C HCC. We identified diseaserelated biological networks for each disease stage with the process described above (Figure 1). The deregulated biological network of hepatitis C was constructed, bearing 605 nodes and 999 edges, while the deregulated biological network of cirrhosis included 166 nodes and 197 edges. The deregulated biological network of BCLC stage 0 HCC was composed of 883 nodes and 1545 edges. Meanwhile, the deregulated biological network of BCLC stage A HCC contained 734 nodes and 1057 edges. The deregulated biological network of BCLC stage B HCC was generated with 1053 nodes and 1783 edges. The deregulated biological network of BCLC stage C HCC was constructed by 1277 nodes and 2378 edges (Supplementary Data S1). In these networks, edges represent PPIs or TRIs. Line in PPI is undirected when line in TRI is directed. Nodes linked by the edge in PPI are proteins. The edge in TRI goes from transcription factor to its target gene.

Dynamic deregulated biological network
Comparison among deregulated biological networks shows that when the disease progression switches sequentially from hepatitis C to BCLC stage C HCC, the deregulated biological network changes correspondingly. The dynamic network implies the potential molecular mechanism that underlies the development and progression of the carcinogenesis. Some interactions were deregulated consistently in different stages during the disease development and progression. We hypothesize that these shared interactions participate in the development and progression of the carcinogenesis. For example, an interaction that was deregulated in hepatitis C initially and remained dysfunctional in the following cirrhosis and at all the stages of HCC may participate in the hepatitis C infection, and the development and progression from hepatitis C to BCLC stage C HCC.
Based on the above hypothesis, we made a comparison among the six disease-related biological networks in the development and progression. 42 interactions were deregulated initially from hepatitis C to BCLC stage C HCC. 26 interactions were deregulated initially from cirrhosis, which means that they were not deregulated in hepatitis C but in cirrhosis, BCLC stage 0 HCC, BCLC stage A HCC, BCLC stage B HCC and BCLC stage C HCC. Accordingly, 224 interactions showed deregulated only from BCLC stage 0 HCC when 114 interactions were deregulated initially from BCLC A HCC. Furthermore, 156 interactions were indentified deregulated initially from BCLC B HCC and 1186 interactions were recognized as deregulated initially from BCLC C HCC (Supplementary Data S2). By integrating interactions that were deregulated initially from the same stage, progression-related sub-networks, which contains fifty six sub-networks (node. = 3), were constructed ( Figure 2).

Functional annotation of progression-related sub-networks
In our hypothesis, genes and proteins in a progression-related sub-network may function together in the progression of HCV induced hepatocarcinogenesis. To better understand their functions in the progression, especially the role of the sub-network in the progression, we need to find out which function is significant comparing to the whole human biological network. Gene Ontology enrichment analysis is usually used to identify statistically significant functions of a group of genes. However, this method treats those genes as individual components and ignores their association, which is the base of a module. We have annotated integrated human biological networks with Gene Ontology data, which is supported by experimental evidence. Based on hypergeometric distribution, we carried out a function enrichment analysis between the progression-related sub-network and the whole human biological network. Statistically significant functions (p value,0.01) of the progression-related sub-networks, which indicate the functional difference comparing to the whole biological network, were identified through this analysis. In the total number of fifty six sub-networks, twenty seven of them were identified with statistically significant functions (Supplementary Data S3). The genes and proteins in the progression-related subnetwork identified with statistically significant functions may work together in the progression of HCV induced hepatocarcinogenesis. Furthermore, the statistically significant functions of the progression-related sub-network indicate the functions these genes and proteins perform by synergistic action. We offered these progression-related sub-networks and their statistically significant functions in the supplementary materials. In our hypothesis, the sub-network that is deregulated from hepatitis C is considered a cirrhosis-HCC-risk sub-network. The dysfunction of it could increase the risk of cirrhosis and following HCC incidence of hepatitis C patients. Many functions that related to cirrhosis and cancer are significantly enriched in this sub-network, such as cell proliferation, positive regulation of fibroblast proliferation, negative regulation of apoptosis, etc. (Table 1). The sub-networks which are deregulated from cirrhosis can be regarded as HCC-risk subnetworks. The dysfunction of them could increase HCC incidence for hepatitis C patients with established cirrhosis. Some functions related to cancer are overrepresented in these sub-networks, like cell adhesion, positive regulation of DNA replication, positive regulation of cell proliferation, etc (Supplementary Data S3).
To verify our hypothesis and those identified disease relate subnetworks. We randomly selected three sub-networks from the twenty seven progression-related sub-networks annotated as statistically significant functions and carried out the text mining from published literature. One sub-network is deregulated from hepatitis C, another sub-network is generated from cirrhosis data and the last one signifies the deregulation from BCLC stage 0 HCC. In the sub-network that is deregulated from hepatitis C ( Figure 3A), HLA-DQA1 and DQB1 are associated with development of cirrhosis, and DQB1 could be risk factors for the occurrence of HCC [20,21]. COL1A2, a protein in the subnetwork which is deregulated from cirrhosis, is involved in the development or progression of hepatoma [22]. Moreover, ESR1, the core protein, which interacts with most nodes in this subnetwork, of the network that is deregulated from BCLC stage 0 HCC, have been shown to be associated with an increased hepatocellular carcinoma risk [23].

Discussion
The progression-related sub-network deregulated in hepatitis C infection stage was constructed with four nodes (HLA-DQA1, HLA-DQB1, HLA-DPA1 and CD74) and interactions among them ( Figure 3A). We proposed that this sub-network is potentially related to cirrhosis-HCC-risk, considering that those genes are differentially expressed in hepatitis C stage. It could be seen that many important functions, including positive regulation of B cell proliferation, negative regulation of apoptosis, etc., are enriched significantly (P value,0.01) in this sub-network (Table 1). In this sub-network, CD74 is the major histocompatibility complex, class II invariant chain. HLA-DQA1 and HLA-DPA1 belong to the HLA class II alpha chain paralogues while HLA-DQB1 belongs to the HLA class II beta chain paralogues.
The class II molecule is a heterodimer consisting of an alpha (DPA, DQA) and a beta (DPB, DQB) chain, both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Within the DP and DQ molecules both the alpha chain and the beta chain contain the polymorphisms specifying the peptide binding specificities, resulting in up to 4 different molecules. DQB1*0201 allele has been confirmed to be closely correlated with the progression of liver injury in chronic HCV infection [24]. Meanwhile, DQA1*0103 appears to provide protection against chronic active infection with hepatitis C virus [25]. Moreover, HLA-DQA1 and DQB1 are associated with development of cirrhosis, and DQB1 might be a risk factor for the occurrence of HCC [20,21]. Since full function of major histocompatibility complex needs both the invariant chain and variant chain, the PPIs among DQA1, DQB1 and CD74 may participate in the induction of hepatitis C and the development and progression from hepatitis C to HCC.
Our network-based view shows that three progression-related sub-networks appear deregulated initially in cirrhosis stage. One of the sub-networks is composed of four nodes (VWF, LUM, COL1A1 and COL1A2) with four interactions among them ( Figure 3B), and it can be considered an HCC-risk related subnetwork. Certain functions, like, cell adhesion, are significantly enhanced (P value,0.01) in this sub-network (Table 2). Within the same sub-network, VWF functions as an antihemophilic factor carrier and a platelet-vessel wall mediator in the blood coagulation system. LUM is a member of the small leucine-rich proteoglycan (SLRP) family that includes decorin, biglycan, fibromodulin, keratocan, epiphycan, and osteoglycin. In these bifunctional molecules, the protein moiety binds collagen fibrils and the highly charged hydrophilic glycosaminoglycans regulate interfibrillar spacings. Lumican is the major keratan sulfate proteoglycan of the cornea but is also distributed in interstitial collagenous matrices throughout the body. Lumican may regulate collagen fibril organization and circumferential growth, corneal transparency, and epithelial cell migration and tissue repair. COL1A1 is the pro-alpha1 chains of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. COL1A2 is the pro-alpha2 chain of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. Type I is a fibril-forming collagen found in most connective tissues and is abundant in bone, cornea, dermis and tendon.
COL1A1 and COL1A2 are associated with liver fibrogenesis [26,27]. Moreover, COL1A2 is involved in the development or progression of hepatoma [22]. VWF mRNA has been shown to be significantly upregulated in both fibrosis and HCC [28]. In patients with fulminant hepatic failure and liver cirrhosis, circulating plasma VWF antigen levels are extremely high [29,30,31]. Many fibrin thrombi have been found in the hepatic sinusoids in acute liver failure, suggesting a role for intravascular coagulation in the pathogenesis of hepatic necrosis [32]. In cirrhotic liver tissue [33] and even tissue from patients in early stages of alcoholic liver diseases [34], VWF immunostaining shows positive cells predominantly at the scar-parenchyma interface, within the septum, and in the sinusoidal lining. Portal or hepatic vein thrombosis is often observed in advanced cirrhosis [35,36] and microthrombi formation has been found in one or multiple organs in half of autopsied cirrhotics [37]. This hypercoagulable state in liver diseases may be involved in hepatic parenchymal extinction, the acceleration of liver fibrosis, and disease progression. Lumican has diverse biologic roles but has been thought to be primarily involved in fibrosis of the extracellular matrix through the binding of collagen fibrils and regulation of their lateral growth [38,39]. Lumican expression is also increased with progression of hepatic fibrosis in rats [40]. Decreased sulforylation of lumican side chains stimulates macrophage adhesion and the cellular inflammatory response [41,42], suggesting that changes in the structure of lumican may promote the inflammatory process that precedes and enhances collagen deposition during the process of hepatic fibrosis. Taken together, it implies that PPIs among COL1A1, COL1A2, VWF and LUM may participate in the induction of cirrhosis and play roles in the progression from cirrhosis to HCC.
The progression-related sub-networks deregulated from BCLC stage 0 HCC included 11 sub-networks. The biggest sub-network in this category was comprised of 34 nodes with 42 PPIs and 2 TRIs ( Figure 3C). Many functions are significantly enriched in this sub-network (Table 3), including induction of apoptosis, cell cycle arrest, etc. A core protein of this sub-network is ESR1, which interacts with most nodes in this sub-network. This protein regulates the expression of FOS, and subsequently FOS regulates HBA1. HBA1 participate in a smaller sub-network that constructed by four nodes (HBA1, HBB, HBA2, HBG2) with interactions among them. ESR1 is an estrogen receptor, a ligand-activated transcription factor composed of several domains important for hormone binding, DNA binding, and activation of transcription. FOS can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1. As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation. In some cases, expression of the FOS gene has also been associated with apoptotic cell death. The alpha (HBA) and beta (HBB) loci determine the structure of the 2 types of polypeptide chains in adult hemoglobin, Hb A. The   normal adult hemoglobin tetramer consists of two alpha chains and two beta chains. Mutation in beta globin is the cause of sickle cell anemia. Absence of beta chain causes beta-zero-thalassemia. Reduced amounts of detectable beta globin causes beta-plusthalassemia. Animal models and human epidemiologic studies have suggested that estrogens act as tumor promoters and might induce hepatocarcinogenesis [43,44,45,46]. The estrogens exert the effects by binding to estrogen receptors (ESR). The genetic polymorphisms within ESRs could influence the effects of estrogens, which in turn results in genotype-dependent differences in risk for hepatocellular carcinoma. Indeed, the polymorphisms in the 59 end of the ESR a (ESR1) gene have been shown to be associated with an increased hepatocellular carcinoma risk, supporting the involvement for the estrogen-ESR axis in the estrogen-induced hepatocarcinogenesis [23]. The oncogene c-fos (FOS), which is required for quiescent cells to enter the cell cycle [47], also is up-regulated in HCC [48]. HBV X peptide has been shown to activate the c-fos gene, which is postulated to contribute to hepatocarcinogenesis [49]. Our result implies that the association between ESR1 and FOS may participate in the process of HCC initiation and may play roles in the progression of carcinogenesis from early to advanced HCC. Moreover, the hemoglobin (Hb A) level may be deregulated by this association in the progression of carcinogenesis from early HCC.
Previous studies have indicates that when hepatitis C patients establishes cirrhosis, the HCC incidence will increase largely. Cirrhosis is an important stage in the progression from hepatitis C to HCC. In the cirrhosis-related network, we can recognize some interesting components and relationships ( Figure 4). The nodes that appeared in cirrhosis-HCC-risk sub-network are located close to the nodes belonging to HCC-risk sub-networks. They are connected by CD44. The CD44 is a cell-surface glycoprotein. The CD74, a protein of cirrhosis-HCC-risky related sub-network, is also membrane protein and the rest of the subnetwork proteins are located extracellularly. Similarly, the proteins in HCC-risk related sub-networks are also located in extracellular region. It could be anticipated that the synergistic action between deregulated cirrhosis-HCC-risky related subnetwork and HCC-risky related sub-networks could be one of the reasons for the dramatic increase of HCC incidence when hepatitis C patients develop cirrhosis. As we mentioned before, the dysfunction of cirrhosis-HCC-risk related sub-network emerges from hepatitis C, while the dysfunction of HCC-risk related sub-networks gets started from cirrhosis. Those relationships and results supported by the experimental evidences imply that the dysfunction of HCC-risk related sub-networks is under the influence of the dysfunction of the cirrhosis-HCC-risk related sub-network.
Although we have identified progression-related sub-networks for HCV induced hepatocarcinogenesis and recognized some potential synergistic actions of proteins and genes in the subnetwork, our approach has certain limitations. First, we only use experimentally confirmed interaction data in our analysis. So some potential associations among DEGs may be lost. Second, our analysis depends not only on biological reality but also on sampling, experimental conditions, and reported results; there could be other genes and proteins that significantly contribute to network functions not yet addressed.
In conclusion, we constructed the disease-related biological networks through the integration of those DEGs, PPIs and TRIs data for different stages of hepatitis C infected disease, including liver cancer. By comparison of the disease-related biological networks between each stage, the dynamic characteristics of the networks show that they mostly reflect the important features of the disease development and progression, which provides important information for us to explore the underlying mechanisms of the diseases. We identified progression-related sub-networks in the development and progression through dynamic biological network analysis and annotated significant functions of these sub-networks. Text mining results from published literature confirmed our hypothesis largely in the examples randomly chosen. It implies that these progressionrelated sub-networks, especially the ones which are annotated with significant functions, can be helpful in the understanding of molecular mechanism that underlies the progression of HCV induced hepatocarcinogenesis.

Microarray data and DEG
Microarray data was collected from Gene Expression Omnibus (GEO) database. Two datasets (GSE6764, GSE9843) were used in this analysis. They contain 10 normal samples, 21

Detection of deregulated biological network
Protein-protein interactions data and transcriptional regulatory interactions data came from HCCNet database [16], which was a HCC network database previously established by our team, The database contains 37811 experimentally confirmed proteinprotein interactions and 9148 experimentally confirmed transcriptional regulatory interactions. Those data were used to construct deregulated biological networks of each disease stage. Firstly, DEGs indentified in each disease sample were mapped to PPI and TRI data. Then deregulated interactions between DEGs were identified and used to construct the individual deregulated biological network. An individual deregulated biological network is a network constructed by deregulated PPIs and TRIs identified in one disease sample. When all individual deregulated networks of a disease were constructed, occurrence frequency of each deregulated interaction in the disease could be calculated. R d represents the occurrence frequency of a deregulated interaction in a disease. N d signifies the number of disease samples in which the interaction is deregulated. N a represents the number of all the samples of the disease.
If R d of an interaction is more than 0.5, it means that this interaction is deregulated in more than half of the cases of a disease, the interaction can be regarded as high confidently deregulated one and the dysfunction value X of the interaction is defined as 1, otherwise it is defined as 0. In other words, we believe those interactions whose X value equals to 1 are deregulated in the disease.

X~1
, R d §0:5 0, R d v0:5 With those indentified deregulated interactions, deregulated biological network of each disease stag was constructed for further analysis. Finally, deregulated biological networks in six stages were established accordingly.

Detection of progression-related biological sub-network
When the progression of the disease advances from hepatitis C to BCLC stage C HCC, the deregulated biological network changes correspondingly. Six disease stages were considered in this progression: Hepatitis C, Cirrhosis, BCLC stage 0 HCC, BCLC stage A HCC, BCLC stage B HCC, BCLC stage C HCC. Some interactions are deregulated initially in one stage and keep dysfunction in the following stages, they may participate in the development stage of the disease in which they are deregulated initially and may also play a role in the following progression . Cirrhosis-related network mentioned in discussion. Node color: green, nodes used to be in cirrhosis-HCC-risky sub-network; yellow, nodes in HCC-risky sub-network; purple, other nodes in cirrhosis-related network. Edge color: red, PPI; green, TRI. Edge direction: undirected, PPI; directed, TRI, from transcriptional factor to target gene. doi:10.1371/journal.pone.0018516.g004 process. As defined above, if an interaction is deregulated in a disease stage, the X value of the interaction is 1, otherwise the X value is 0. So when an interaction is deregulated initially in hepatitis C and keeps dysfunction in the following disease stage, the F He (X ) will be 1. In other words, when the value of F He (X ) is 1, the interaction is initially in hepatitis C and keeps dysfunction in the following disease stage.
Generally, we defined an indicator function F S (X) as:

Detection of significant functions of sub-networks
In our previous work, we annotated each interaction of integrated biological network with a new Gene Ontology based annotation method that developed by ourself. Gene Ontology annotation data of Homo sapiens was downloaded on April 20, 2010. We filtered annotations which are supported by experiment evidence and used them in the annotation of interactions. With the annotated interaction data and hypergeometric distribution method, we compare each sub-network to the whole biological network and identified statistical significant functions (p value,0.01) of each subnetwork. Statistical analysis is performed with R.

Supporting Information
Data S1 Disease-related biological networks in each stage of HCV induced hepatocarcinogenesis.