To date, there are no highly sensitive and specific minimally invasive biomarkers for detection of breast cancer at an early stage. The occurrence of circulating microRNAs (miRNAs) in blood components (including serum and plasma) has been repeatedly observed in cancer patients as well as healthy controls. Because of the significance of miRNA in carcinogenesis, circulating miRNAs in blood may be unique biomarkers for early and minimally invasive diagnosis of human cancers. The objective of this pilot study was to discover a panel of circulating miRNAs as potential novel breast cancer biomarkers.
Using microarray-based expression profiling followed by Real-Time quantitative Polymerase Cycle Reaction (RT-qPCR) validation, we compared the levels of circulating miRNAs in plasma samples from 20 women with early stage breast cancer (10 Caucasian American (CA) and 10 African American (AA)) and 20 matched healthy controls (10 CAs and 10 AAs). Using the significance level of p<0.05 constrained by at least two-fold expression change as selection criteria, we found that 31 miRNAs were differentially expressed in CA study subjects (17 up and 14 down) and 18 miRNAs were differentially expressed in AA study subjects (9 up and 9 down). Interestingly, only 2 differentially expressed miRNAs overlapped between CA and AA study subjects. Using receiver operational curve (ROC) analysis, we show that not only up-regulated but also down-regulated miRNAs can discriminate patients with breast cancer from healthy controls with reasonable sensitivity and specificity. To further explore the potential roles of these circulating miRNAs in breast carcinogenesis, we applied pathway-based bioinformatics exploratory analysis and predicted a number of significantly enriched pathways which are predicted to be regulated by these circulating miRNAs, most of which are involved in critical cell functions, cancer development and progression.
Citation: Zhao H, Shen J, Medico L, Wang D, Ambrosone CB, Liu S (2010) A Pilot Study of Circulating miRNAs as Potential Biomarkers of Early Stage Breast Cancer. PLoS ONE 5(10): e13735. https://doi.org/10.1371/journal.pone.0013735
Editor: Chad Creighton, Baylor College of Medicine, United States of America
Received: July 14, 2010; Accepted: October 8, 2010; Published: October 29, 2010
Copyright: © 2010 Zhao et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The study is supported by grants from NIH/NCI (5R01CA136483) www.nih.gov and Congressionally directed Medical Research Program (BC074340) http://cdmrp.army.mil/. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Discovery of sensitive and specific minimally invasive biomarkers that can be exploited to detect early neoplastic changes, thus facilitating the detection of breast cancer at an early stage, is one of the most important challenges in the management of breast cancer. The ideal biomarker should be easily accessible such that it can be sampled relatively noninvasively, sensitive enough to detect early presence of tumors in almost all patients, and absent or minimally present in healthy, tumor free individuals. Unfortunately, none of the existing diagnostic tools and biomarkers for breast cancer meets the above criteria –. For example, mammography, currently the gold standard diagnostic tool, needs to use ionizing radiation and has a false positive rate of 8% to 10% . A number of circulating tumor markers, such as carcinoembryonic antigen and carbohydrate antigen 15–3, have shown promise in the management of breast cancer, but the sensitivity of these markers is low, and so they are not useful for the early detection of breast cancer –. Clearly, the development of minimally or noninvasive, highly sensitive and specific breast cancer biomarkers which can complement and improve on current strategies for breast cancer detection is urgently needed.
In the last decades, the relationship between microRNA (miRNA) and human cancer has been extensively investigated. Research has shown that miRNAs are deregulated in a widespread manner in almost every type of human cancer, and the signature of miRNA expression in tumors can be potentially used as biomarkers for tumor characterization and cancer prognosis –. Circulating RNAs have been identified in the serum/plasma of cancer patients for more than a decade. Recently, several studies have reported the occurrence of circulating miRNAs in serum and plasma samples from both cancer patients and healthy controls –. For example, in colorectal cancer, Huang et al  found plasma miRNAs were highly sensitive for detecting colorectal cancer and advanced adenomas, and that miR-29a and miR-92a were associated with advanced neoplasia. To further explore the origins of these circulating miRNAs, they compared the levels of miR-29a and miR-92a expression in plasma samples harvested from pre-operative and post-operative bloods and found a significant reduction in both miRNAs compared to the pre-operative samples of the same patients. In breast cancer, Heneghan et al  surveyed a panel of 7 candidate miRNAs in whole blood RNAs from 148 breast cancer patients and 44 age-matched and disease free controls. They found the expression of miR-195 was significantly elevated in breast cancer patients. Additionally, they observed a significant reduction in miR-195 in post-operative whole blood compared to the pre-operative samples of the same patients.
Using microarray-based expression profiling, the goal of this pilot study was to identify a panel of circulating miRNAs which are differentially expressed in plasma samples from breast cancer patients and matched healthy controls, and to examine if there were differences in circulating miRNA expression between Caucasians Americans (CAs) and African Americans (AAs). We also aimed to apply bioinformatics tools to explore the potential biological function of identified candidate miRNAs.
Materials and Methods
The study has been approved by Institutional Research Board (IRB) of Roswell Park Cancer Institute. Anonymized biospecimens and questionnaire data used in this study were made available through the Roswell Park Cancer Institute's (RPCI) Data Bank and BioRepository (DBBR) . Patients are enrolled through site-specific clinics prior to surgery and/or chemotherapy, and controls are individuals who are free from cancer and are visitors or family members of patients. Relationships between patients and controls are carefully annotated, so that we avoid overmatching patients to their own family or friends. Written consent is obtained from every individual before he/she enrolls in the DBBR. The consent will allow DBBR to provide anonymized biospecimens and questionnaire data for research (such as this study) without further consent. Patients and controls are consented to provide a non-fasting blood sample and to complete a questionnaire. Blood samples are drawn in phlebotomy and transferred to the DBBR laboratory. Following DBBR standard operating procedure (SOP), samples are processed and blood components stored within one hour of collection to minimize degradation. Ten milliliters of whole blood was obtained from each study subject. Plasma was extracted by centrifuging whole blood at 3,000 rpm for 10 minutes at room temperature. All extracted plasma samples are stored in phased liquid nitrogen. To minimize the effect of freeze-thaw on circulating miRNAs, we only used plasma samples which had not been previously thawed. In this study, a total of 20 women with breast cancer and 20 cancer-free women were included in the microarray profiling analysis. Same AA study participants (10 AA cases and 10 AA controls) were included in the RT-qPCR validation analysis. The CA study participants (15 CA cases and 15 CA controls) included in RT-qPCR validation analysis were also obtained from DBBR, but they were different from the ones used in microRNA profiling.
Total RNA, including miRNA from plasma, was isolated using the miRNeasy kit (Qiagen) with minor modifications. In brief, 700 µl of QIAzol reagent was added to 200 µl of plasma sample. The sample was mixed in a tube, followed by the addition of 3 µl of miSPIKE, spiked-in miRNA, at a concentration of 0.1 µM (IDT) and 140 µl of chloroform. After mixing vigorously for 15 s, the sample was then centrifuged at 12,000 g for 15 minutes. The upper aqueous phase was carefully transferred to a new collection tube, and 1.5 volume of ethanol was added. The sample was then applied directly to a silica membrane-containing column and the RNA was bound and cleaned using buffers provided by the manufacturer to remove impurities. The immobilized RNA was then collected from the membrane with a low salt elution buffer. The quality and quantity of the RNA was evaluated by 260/280 ratio using NanoDrop spectrophotometry (NanoDrop ND-1000 Technologies Inc.) and Agilent 2100 Bioanalyzer (Agilent Technologies). The efficiency of small RNA isolation was monitored by the amount of spiked-in miRNA recovered by using PCR with sequence specific primers (IDT).
MicroRNA Quantification of miRNA Expression
Two hundred ng of total RNA from each sample were labeled and hybridized on Human v2 MicroRNA Expression BeadChips (Cat. no. MI-102–1024; Illumina), according to the manufacturers recommendations (Illumina MicroRNA Expression Profiling Assay Guide). The raw intensity of Illumina Plasma V2 MicroRNA expression array was scanned and extracted using BeadScan, with the data corrected by background subtraction in GenomeStudio module. The lumi package in the R-based Bioconductor Package was used to normalize the log2 transformed intensity data with the Quantile normalization algorithm. The expression profiles have been deposited in NCBI's Gene Expression Omnibus (GEO) with accession number GSE22981.
microRNA MicroArray Data Analysis
All data analysis was performed under R programming environment (www.r-project.org). Differential Expression Testing For the 40 subjects, we compared cases and controls stratified by race/ethnicity. We used the Limma program in the R-based Bioconductor package to calculate the level of differential expression for each comparison. Briefly, for each comparison, a linear model was fit to the data (with cell means corresponding to the different conditions and a random effect for array) . For each comparison, we obtained the list of differentially expressed microRNA constrained by P-value less than 0.05 and then checked for candidates with at least two-fold expression change. Clustering Following single miRNA-based significance testing, we used the expression value of miRNAs (P value <0.05 and at least two-fold expression) to cluster the patients for each comparison. Our purpose was to check whether the identified miRNAs for each comparison, as a whole, were able to serve as potential miRNA signature to classify patients into their corresponding case/control status. Hierarchical clustering based on the average linkage of Pearson Correlation was employed . The expression value of the same miRNA lists was also used as the input for Principal Component Analysis (PCA). Briefly, the 1st principal component (i.e., the direction along which the miRNAs show largest variation) and the 2nd principle component (i.e., the direction uncorrelated to the 1st component along which the miRNAs show the largest deviation) were shown to capture the clustering structures. The PCA implementation here is based on the single value decomposition (SVD) package. ROC curve analysis The expression profile of each identified miRNA was used as the input for Receive Characteristics Curve (ROC) analysis with the ROCR package . ROC curve is displayed as the True Positive Rate (TPR) versus the False Positive Rate (FPR). The area under the ROC curve (AUC), a measure of discrimination accuracy, is reported. Pathway Enrichment Analysis Using TargetScan (http://www.targetscan.org), we obtained the list of genes predicted to be targeted by the miRNAs identified from each comparison. The predicted miRNA target genes were analyzed for enriched KEGG pathways by using the NCBI DAVID server (http://david.abcc.ncifcrf.gov) with default setting, in which the null hypothesis is that no difference exists between the number of genes falling into a given pathway in the target gene list and the genome as a whole .
Real-time quantitative PCR analysis
The expression levels of miRNA were confirmed with a Taqman-based real-time quantitative PCR (RT-qPCR) using individual miRNA-specific primers and probes as described by the manufacturer (Applied Biosystems). The first-strand miRNA-cDNA PCR template was generated from 50 ng of total RNA according to the manufacturer's instructions. Approximately 2.5 ng of cDNA was then used in the PCR on a StepOnePlus Real-Time PCR System from Applied Biosystems. Triplicate samples, validated endogenous controls, and inter-assay controls were used throughout. The RT-qPCR results were analyzed by SDS 2.2.2. For the AA group, let-7d* and miR-425* were chosen for RT-qPCR validation (Table S2). For the CA group, let-7c and miR-589 were chosen for RT-qPCR validation (Table S3, S4). So far, there is no reliable endogenous control microRNA in studying circulating microRNAs. We have surveyed the literature and found that expression of miR-16 is relatively stable. Therefore, in this study, we chose miR-16 as the endogenous control. RT-qPCR data were the normalized expression values in which the endogenous control miR-16 was used as the reference gene. For each assay, the Ct (Cycle threshold) of miRNA of interest in the TaqMan qPCR assay was subtracted from the average miR-16 Ct value to obtain a ΔCt value (miR-16 - miRNA of interest). A higher delta Ct value indicates a higher expression level of the miRNA of interest.
Demographic and clinical characterization of study population
Twenty breast cancer cases (10 CAs and 10 AAs) and 20 healthy controls (10 CAs and 10 AAs) were included in the analysis. The cases and controls were well-matched on age (mean age: 56 vs 57, P = 0.562). All cases had histologically confirmed early stage (I and II) invasive ductal carcinoma. Blood samples were drawn prior to surgery. The tumor size ranged between 0.2 to 2.5 cm. ER, PR and HER2/neu status data were available for 18 patients: 11 ER+, 7 ER−; 10 PR+, 8 PR−; and 6 HER2/neu+, 12 HER2/neu−. Four patients had triple negative breast cancer.
A large number of miRNAs are detected in plasma
The expression-detection P-value of Illumina Expression BeadChips was used to quantify the number of miRNA detected in the plasma of participants, designed to test the null hypothesis that the expression intensity of a given miRNA is indistinguishable from the background intensity. As shown in Table S1, among a total of 1,145 miRNAs profiled by the chips, among CAs,there were 886, 518 and 266 miRNAs detected (with detection Pvalue<0.05) in the plasma of at least 1, 5, and 10 controls, respectively. The corresponding number for case subjects in CA group was 873, 503 and 278, respectively. Similar number of miRNAs was detected in the plasma of AA participants (Table S1). Overall, this demonstrates that plasma contains a large amount of detectable miRNAs which provide a valuable repertoire that can be used to discover circulating miRNA-based biomarkers for breast cancer early detection.
Identification of differentially expressed miRNAs
At first, we made case-versus-control comparisons using all 40 participants (i.e., 20 vs. 20). We identified 26 miRNAs with at least two-fold differential expression at the significance level of p<0.05 (Table 1), which reasonably separate the 20 cases from the 20 controls (Figure 1A–B). Compared with the differentially expressed miRNAs derived from case-versus-control comparisons stratified by race, 10 of these 26 miRNAs can be derived using CA group only, while 5 miRNAs can be derived using AA group only (Figure 1C). There are 11, 19 and 9 miRNAs unique to the comparison using AA group only, CA group only, and all 40 participants, respectively (Figure 1C). A total of 39 miRNAs are specific to one of the three groups, compared with the 15 miRNAs shared by two groups and 2 miRNAs shared by three groups, suggesting a potential racial difference in circulating miRNAs. Clearly, separate case-versus-control comparisons stratified by race are necessary.
A–B) Hierarchical clustering and principal component clustering of differentially expressed microRNAs C) The overlap of differentially expressed microRNAs obtained from all 40 participants with those from AA group only (20 participants) and CA group only (20 participants), respectively.
In the case-versus-control comparison in CA group, we identified 31 miRNAs with at least two-fold differential expression at the significance level of p<0.05, with 17 miRNAs up-regulated in cases and 14 miRNAs down-regulated. Applying the same criteria to the AA group, we identified 18 differentially expressed miRNAs, with 9 miRNAs up-regulated in cases and 9 miRNAs down-regulated. The number of identified differentially expressed miRNAs (p<0.05) and the subgroup restricted by desired fold change for each comparison are summarized in Table 2. A detailed list of differentially expressed miRNAs with at least two-fold expression change is shown in Table 3.
Characteristics of differentially expressed miRNAs
Remarkably, we found that there is little overlap between the differentially expressed miRNAs identified from the CA groups and those from the AA groups (Table 2–3). Only two of the 31 CA-derived miRNAs were also found in the 18 AA-derived miRNAs, namely miR-181a and miR-1304. These results, while preliminary, suggest that it might be necessary to develop race-specific circulating miRNA based biomarkers in breast cancer early detection.
The expression values of differentially expressed miRNAs identified from each comparison were used to cluster the participants into their corresponding case/control status. As shown in Figure 2A–B, the 18 differentially expressed miRNAs derived from the AA group reasonably separate the 10 cases from the 10 controls. Similarly, the 31 differentially expressed miRNAs derived from the CA group can reasonably separate the 10 cases from the 10 controls (Figure 2D–E). The clustering results demonstrate that the lists of differentially expressed miRNAs derived from case-versus-control study might collectively serve as potential circulating miRNA-based biomarkers in breast cancer early detection.
A–B) Hierarchical clustering and principal component clustering of miRNAs in samples from AA participants. D–E) Hierarchical clustering and principal component clustering of miRNAs in samples from CA participants. C, F) The distribution of novel vs. known differentially expressed miRNAs obtained from AA group and CA group, separately.
The Illumina Human v2 MicroRNA Expression BeadChips contains 1,145 miRNAs including 858 known miRNAs annotated by miRBase (http://www.mirbase.org/) and 287 novel miRNAs not found by miRBase (i.e., obtained from deep sequencing in human tissues http://www.illumina.com/). In both AA and CA groups, we found that novel miRNA constitute ∼15% of identified differentially expressed miRNAs (Figure 2C, 2F). For example, the serum level of HS_242 and HS_217 are the first and second most elevated in case vs. control in AA group and CA group, respectively. As the rapidly developingd next-generation sequencing technique provides unprecedented power to discover and characterize new miRNAs –, it is expected that there will be more currently-unfound circulating miRNA whose biomarker potential remains to be explored.
Exploratory in silico pathway analysis
As the number of experimentally validated miRNA targets is limited, we used the widely used TargetScan algorithm to obtain the list of Entrez genes predicted to be targeted by the miRNAs obtained from the AA study and CA study, respectively. We then use NCBI DAVID server to identify the significantly enriched canonical pathways (P<0.01) in these conserved targets. As shown in Figure 3, although there are few overlaps at the individual miRNA level, there is a much higher degree of convergence at the pathway level regulated by identified miRNAs. Specifically, 27 of 30 enriched pathways regulated by AA-derived miRNAs are also enriched in the predicted targets of CA-derived miRNAs.
A) The differentially expressed microRNAs (P<0.05 & FC >2). B) The target genes predicted to be regulated by differentially expressed microRNAs. C) The enriched pathways (P<0.01) in target genes predicted to be regulated by differentially expressed microRNAs.
As shown in Table 4, pathways in cancer (KEGG: hsa05200) is one of the most enriched pathways among the genes predicted to regulated by AA-derived (P = 3.78e-09) and CA-derived (P = 1.20e-13) miRNAs, respectively. This reassures the correctness of our approach and indicates the potentially important functional role of circulating miRNAs in tumor involvement. Pathways involved in various signal transduction and cell-cell interactions such as ErbB signaling pathways, Focal adhesion, and Adherens junction are also significantly enriched in both AA and CA case-control comparisons. Taken together, these exploratory analyses suggest that variation in the plasma level of key circulating miRNAs might affect critical pathways involved in breast cancer formation and progression, an important mechanism warranting follow-up research. As miRNA target prediction algorithm is known to contain both false positives and false negatives, and our in silico pathway enrichment analysis is based on mRNA genes predicted to be targeted by circulating miRNAs, a full understanding of the potential functional role of circulating miRNAs can only be established using functional experiments.
Specific differentially expressed miRNAs
The expression levels of miRNA were confirmed with a Taqman-based RT-qPCR. Let-7c and miR-589 were selected to be validated in CA group and miR-425* and let-7d* were selected to be validated in AA group. For the CA group, plasma levels of let-7c were, in general, lower in cases than in controls (Fig 4A) based on microarray profiling (FC = −3.0, P = 0.015). The expression data of let-7c in case and control groups were then used to build a ROC plot (Fig 4B), which reflects reasonable separation between the two groups (AUC = 0.84). To confirm the pattern of let-7c is reproducible in independent participant cohorts, we performed a validation study using 30 independent samples (15 cases vs. 15 controls). Taqman based miRNA RT-qPCR assays were used to quantify the levels of plasma miRNAs in the validation study. As shown in Figure 4C–D, a similar separation pattern was observed in this independent cohort (FC = −1.9, P = 0.01, AUC = 0.78). MiR-589 was identified from the microarray study with up-regulation pattern for cases over controls in CA group (Fold-Change = 6.9, p = 0.013). However, the separation is less obvious with an AUC value of 0.62 (Figure 5A–B), which ranked it as a relatively weak candidate in our microarray-based discovery phase. To further explore its biomarker potential, Taqman based miRNA RT-qPCR assays were used to quantify its expression level in the 30 independent samples (15 cases vs. 15 controls) described above. The RT-qPCR results show that this microRNA is indeed characterized by an up-regulation (FC = 3.3, P = 0.0009), and stronger separation is shown with an AUC value of 0.85 (Figure 5C–D).
A–B) Data from microarray profiling of 20 participants: The fold change of let-7c in case relative to control is −3.0 (P = 0.015, AUC = 0.84). C–D) Data from RT-qPCR validation in an independent set of 30 participants: The fold change of let-7c in case relative to control is −1.9 (P = 0.01, AUC = 0.78).
A–B) Data from microarray profiling of 20 participants: The fold change of miR-589 in case relative to control is 6.9 (P = 0.0131, AUC = 0.62). C–D) Data from RT-qPCR validation in an independent set of 30 participants: The fold change of miR-589 in case relative to control is 3.3 (P = 0.0009, AUC = 0.85).
For the AA group, plasma levels of miR-425* (the minor form of hsa-miR-425) were, in general, higher in cases compared with controls (Fig 6A) based on microarray profiling (FC = 5.0, P = 0.00328). The expression data of miR-425* in case and control groups were used to build a ROC plot (Fig 6B), which reflects reasonable separation between the two groups (AUC = 0.79). To check the accuracy of miR-425* expression pattern obtained from microarray study, we used Taqman based miRNA RT-qPCR assays as a golden-standard platform to quantify the levels of plasma miRNAs in the same 20 samples. As shown in Figure 6C–D, a similar separation pattern was observed (FC = 3.3, P = 0.01226, AUC = 0.83). Let-7d* was identified from the microarray study with down-regulation pattern in cases over controls of AA group (FC = -6.6, p = 0.03063, AUC = 0.73). Our RT-qPCR measurements in the same 20 samples confirmed the microarray results, showing that this microRNA is indeed down-regulated (FC = −9.4, P = 1.6–7), and a close-to-perfect separation is shown with an AUC of 0.99 (Figure 7). While we found that microarray and RT-qPCR are consistent in capturing the overall expression pattern of circulating miRNA (i.e., up or down), the choice of different technology might result in different sensitivity and specificity (e.g., AUC of 0.73 from microarray vs. AUC of 0.99 from RT-qPCR for let-7d*).
A–B) Data from microarray profiling of 20 participants: The fold change of miR-425* in case relative to control is 5.0 (P = 0.00328, AUC = 0.79). C–D) Data from RT-qPCR validation in the same 20 participants: The fold change of miR-425* in case relative to control is 3.3 (P = 0.01226, AUC = 0.83).
A–B) Data from microarray profiling of 20 participants: The fold change of let-7d* in case relative to control is −6.6 (p = 0.03063, AUC = 0.73). C-D) Data from RT-qPCR validation in the same 20 participants: The fold change of let-7d* in case relative to control is −9.4 (P = 1.6e-7, AUC = 0.99).
This is the first report of a comprehensive interrogation of circulating miRNAs in breast cancer patients and healthy controls. Our results demonstrate that circulating miRNAs in plasma can potentially serve as novel minimally invasive biomarkers for early detection of breast cancer. We found that 31 miRNAs were significantly differentially expressed between CA cases and CA controls and 18 miRNAs were significantly differentially expressed between AA cases and AA controls. Using selected miRNAs from those differentially expressed miRNAs, breast cancer cases and healthy controls can be discriminated with reasonable specificity and sensitivity. Intriguingly, there are only two differentially expressed miRNAs overlapping between CA and AA study subjects, suggesting potential racial differences in circulating miRNA expression. In addition, in silico pathway-based exploratory analysis predicted that these differentially expressed circulating miRNAs might affect critical pathways conducive to breast cancer formation and progression, a potentially important mechanism warranting further investigations.
In our study, 17 out of 31 differentially expressed miRNAs were up-regulated in CA study subjects and 9 out of 18 differentially expressed miRNAs were up-regulated in AA study subjects. Several of these up-regulated miRNAs have been reported to play an important role in carcinogenesis. For example, miR-425*, a minor form of miR-425, was significantly up-regulated in AA breast cancer patients compared to AA healthy controls. Although the role of miR-425* in human cancer is not clear, miR-425 was reported to be altered in cancer cell lines and tumor tissues –. miR-425 resides at 3p21, next to miR-191 precursors embedded in the 1st intron of DALRD3 gene. Significant abundance and co-expression of miR-191 and miR-425 were demonstrated in various cancer cell lines . miR-425 is reported to provide downregulation of ncRNA pathway via direct targeting of DICER1 expression. miR-302b, which was significantly up-regulated in CA breast cancer patients compared to CA healthy controls, belongs to a panel of 12 miRNAs which are associated with inflammatory breast cancer . miR-302 family has been demonstrated to directly regulate p21. In human mammary epithelial cells, overexpression of miR-302b could modulate the activity of p21 and consequently alter the oncogenic phenotypes. In addition to up-regulated circulating miRNAs, we also observed a similar number of down-regulated circulating miRNAs. For example, we found the levels of circulating let-7c were significantly higher in healthy controls than the cases. This is consistent with the notion that let-7 family acts as tumor suppressor genes in breast tumors. Recent studies have found that let family, especially let-7a, 7b and 7c, could inhibit cell proliferation and subsequently induce apoptosis in MCF-7 breast cancer cell lines by directly regulating ER-α .
The observation of decreased levels of circulating miRNAs in breast cancer patients raises an interesting question concerning the origin of circulating miRNAs and their potential functions in breast tumorigenesis. Obviously, our observation cannot be explained by the popular hypothesis that the origin of circulating miRNAs is from tumor as a result of tumor cell death and lyses. Alternatively, it is possible that both normal and tumor cells can secrete certain miRNAs. Tumor cells may secrete miRNAs that are transferable and functional in the recipient cells. However, the biological significance of such actions is still unclear. Our observations fit generally with this alternative hypothesis. However, it is clear that further functional experiment studies are needed to solve the question of the origin of circulating miRNAs.
In a previous breast cancer study, Heneghan et al  surveyed a small panel of 7 candidate miRNAs in whole blood RNAs from 148 breast cancer patients and 44 age-matched and disease free controls. They found the expression of miR-195 was significantly higher in breast cancer patients than healthy controls (P<0.001). In addition, they found the expression of let-7a, which is well-regarded as a reliable endogenous control for analysis of miRNA in breast cancer, was increased over 5-fold in breast cancer patients compared to healthy controls (P<0.001). In our study, we did not observe differential expression for miR-195 (P = 0.169) or let-7a (P = 0.106) between cases and controls. The discrepancy between two studies might be due to different study materials. The Heneghan study used whole blood for detection of miRNAs, while we used plasma for our analyses. Whole blood contains different types of cells, so miRNAs detected might be circulating miRNAs as well as cellular miRNAs from additional cells types. The discrepancy might also reflect the heterogeneity of breast cancer. Different molecular pathways are involved in different subtypes of breast cancer, with different molecular characterizations between luminal A, luminal B, and basal like subtypes. In the Heneghan study, 59% of breast cancers were stage I and II, 71% were invasive ductal cancer, and 82% were ER positive. Meanwhile, in our study, all patients had stage I and II invasive ductal cancer, and only 55% of patients were ER positive. These differences in clinical characteristics, together with the relatively small sample size used in this study, might contribute to the difference between their study and our study. In our ongoing validation study, we have significantly increased sample size and included different subtypes of breast cancer. Hopefully, this will give us an opportunity to validate our findings in this study as well as test the predictive value of miR-195 and let-7a.
Another interesting observation from our study is the potential racial difference in terms of differentially expressed miRNAs between CA and AA study specimens. In the United States, breast cancer mortality is higher among AA women compared to women of European ancestry (EA). Breast cancer in AA women is characterized by earlier age at onset, later stage at diagnosis, higher nuclear grade, higher mitotic index and lower prevalence of estrogen receptor (ER) and/or progesterone receptor (PR) expression compared with EA women . However, whether the genetic alterations leading to breast carcinogenesis are different between AA and CA breast cancer patients is not clear. More intriguingly, we found that the in silico predicted biological pathways were similar between CA and AAs although their miRNA expression profiles were different. This is consistent with the clinical observations that the course of breast cancer may be characterized by certain common pathways and the balance between tumor and host traits influences the pace of the common pathways. However, it should be emphasized that our observations are exploratory in nature and need to be assessed in further studies with larger sample size. More importantly, functional experiments are required to verify and establish the causal association between differentially expressed miRNAs and the predicted pathways.
One limitation of this study is the relatively small sample size which does not provide us enough power to assess relationships between levels of circulating miRNAs and clinical characteristics. To achieve an 80% power to detect a gene expression difference of 2-fold and above, we will need 18 pairs of samples, assuming a set of 1,145 human miRNA genes on the microarray, a standard deviation of 0.7, 1 expected false positive, and an adjusted P-value of 0.001 (http://bioinformatics.mdanderson.org/MicroarraySampleSize).
Also, as we observed in the study, there might be unknown nc-RNAs existing in the circulation, which cannot be studied using microarray analysis relying on pre-designed probes. Some of these nc-RNAs might have important predictive values and significant functional roles in breast carcinogenesis. The rapidly developing massive parallel sequencing technology is not dependent on any prior probe information, instead providing information about all known microRNAs in the sample and allowing for discovery of novel microRNAs . In our data, we found that novel miRNAs constitute ∼15% of identified differentially expressed miRNAs. Nevertheless, this is the first genome-wide study to comprehensively survey the circulating miRNAs in breast cancer patients and healthy controls. The results presented here show significantly altered circulating levels of certain miRNAs in breast cancer patients compared with healthy controls. Due to the genetic and clinical heterogeneity of breast cancer, some of the separation is not perfect. Therefore, the data should be interpreted with caution. Future large studies and advanced technologies are warranted to confirm our findings and further explore the existing potential of circulating miRNAs to be utilized clinically as novel biomarkers for breast cancer.
We thank Dr. Warren Davis and Ms. Mary Nesline, the Co-Directors of DataBank and Biorepository (DBBR) in Roswell Park Cancer Institute, for proving high-quality biospecimens and related data for this study. The Roswell Park Cancer Institute DBBR is a CCSG Shared Resource (NIH P30 CA016056-27).
Conceived and designed the experiments: HZ CBA SL. Performed the experiments: JS LM. Analyzed the data: DW SL. Contributed reagents/materials/analysis tools: SL. Wrote the paper: HZ JS SL.
- 1. Taplin S, Abraham L, Barlow WE, Fenton JJ, Berns EA, et al. (2008) Mammography facility characteristics associated with interpretive accuracy of screening mammography. J Natl Cancer Inst 100: 876–887.
- 2. Thompson A, Brennan K, Cox A, Gee J, Harcourt D, et al. (2008) Evaluation of the current knowledge limitations in breast cancer research: a gap analysis. Breast Cancer Res 10: R26.
- 3. Piccart-Gebhart MJ, Procter M, Leyland-Jones B, Goldhirsch A, Untch M, et al. (2005) Herceptin Adjuvant (HERA) Trial Study Team. Trastuzumab after adjuvant chemotherapy in HER2-positive breast cancer. N Engl J Med 353: 1659–1672.
- 4. Harris L, Fritsche H, Mennel R, Norton L, Ravdin P, et al. (2007) American Society of Clinical Oncology. American Society of Clinical Oncology 2007 update of recommendations for the use of tumor markers in breast cancer. J Clin Oncol 25: 5287–5312.
- 5. O'Hanlon DM, Kerin MJ, Kent P, Maher D, Grimes H, et al. (1995) An evaluation of preoperative CA 15–3 measurement in primary breast carcinoma. Br J Cancer 71: 1288–1291.
- 6. Uehara M, Kinoshita T, Hojo T, Akashi-Tanaka S, Iwamoto E, et al. (2007) Long-term prognostic study of carcinoembryonic antigen (CEA) and carbohydrate antigen 15–3 (CA 15–3) in breast cancer. Int J Clin Oncol 13: 447–451.
- 7. Lu J, Getz G, Miska EA, Alvarez-Saavedra E, Lamb J, et al. (2005) MicroRNA expression profiles classify human cancers. Nature 435: 834–838.
- 8. Esquela-Kerscher A, Slack FJ (2006) Oncomirs - microRNAs with a role in cancer. Nat rev Cancer 6: 259–269.
- 9. Johnson R, Zuccato C, Belyaev ND, Guest DJ, Cattaneo E, et al. (2008) A microRNA-based gene dysregulation pathway in Huntington's disease. Neurobiol dis 29: 438–445.
- 10. Asangani IA, Rasheed SA, Nikolova DA, Leupold JH, Colburn NH, et al. (2008) MicroRNA-21 (miR-21) post-transcriptionally downregulates tumor suppressor Pdcd4 and stimulates invasion, intravasation and metastasis in colorectal cancer. Oncogene 27: 2128–2136.
- 11. Huang Q, Gumireddy K, Schrier M, le Sage C, Nagel R, et al. (2008) The microRNAs miR-373 and miR-520c promote tumor invasion and metastasis. Nature cell biol 10: 202–210.
- 12. Ma L, Teruya-Feldstein J, Weinberg RA (2007) Tumor invasion and metastasis initiated by microRNA-10b in breast cancer. Nature 449: 682–688.
- 13. Zhu S, Si ML, Wu H, Mo YY (2007) MicroRNA-21 targets the tumor suppressor gene tropomyosin 1 (TPM1). J Biol Chem 282: 14328–14336.
- 14. Zhu S, Wu H, Wu F, Nie D, Sheng S, et al. (2008) MicroRNA-21 targets tumor suppressor genes in invasion and metastasis. Cell Res 18: 350–359.
- 15. Tavazoie SF, Alarcon C, Oskarsson T, Padua D, Wang Q, et al. (2008) Endogenous human microRNAs that suppress breast cancer metastasis. Nature 451: 147–152.
- 16. Valastyan S, Reinhardt F, Benaich N, Calogrias D, Szász AM, et al. (2009) A pleiotropically acting microRNA, miR-31, inhibits breast cancer metastasis. Cell 137: 1032–1046.
- 17. Hurst DR, Edmonds MD, Welch DR (2009) Metastamir: the field of metastasis-regulatory microRNA is spreading. Cancer Res 69: 7495–7498.
- 18. Iorio MV, Ferracin M, Liu CG, Veronese A, Spizzo R, et al. (2005) MicroRNA gene expression deregulation in human breast cancer. Cancer Res 65: 7065–7070.
- 19. Mattie MD, Benz CC, Bowers J, Sensinger K, Wong L, et al. (2006) Optimized high-throughput microRNA expression profiling provides novel biomarker assessment of clinical prostate and breast cancer biopsies. Mol Cancer 5: 24.
- 20. Lawrie CH, Gal S, Dunlop HM, Pushkaran B, Liggins AP, et al. (2008) Detection of elevated levels of tumor-associated microRNAs in serum of patients with diffuse large B-cell lymphoma. Br J Haematol 141: 672–675.
- 21. Mitchell PS, Parkin RK, Kroh EM, Fritz BR, Wyman SK, et al. (2008) Circulating microRNAs as stable blood-based markers for cancer detection. Proc Natl Acad Sci USA 105: 10513–10518.
- 22. Gilad S, Meiri E, Yogev Y, Benjamin S, Lebanony D, et al. (2008) Serum MicroRNAs are promising novel biomarkers. PLoS One 3: e3148.
- 23. Chen X, Ba Y, Ma L, Cai X, Yin Y, et al. (2008) Characterization of microRNAs in serum: a novel class of biomarkers for diagnosis of cancer and other diseases. Cell Res 18: 997–1006.
- 24. Resnick KE, Alder H, Hagan JP, Richardson DL, Croce CM, et al. (2009) The detection of differentially expressed microRNAs from the serum of ovarian cancer patients using a novel real-time PCR platform. Gynecol Oncol 112: 55–59.
- 25. Wang K, Zhang S, Marzolf B, Troisch P, Brightman A, et al. (2009) Circulating microRNAs, potential biomarkers for drug-induced liver injury. Proc Natl Acad Sci USA 106: 4402–4407.
- 26. Heneghan HM, Miller N, Lowery AJ, Sweeney KJ, Newell J, et al. (2010) Circulating microRNAs as novel minimally invasive biomarkers for breast cancer. Ann Surg 251: 499–505.
- 27. Huang Z, Huang D, Ni S, Peng Z, Sheng W, et al. (2010) Plasma microRNAs are promising novel biomarkers for early detection of colorectal cancer. Int J of Cancer 127: 118–126.
- 28. Lodes MJ, Caraballo M, Suciu D, Munro S, Kumar A, et al. (2009) Detection of Cancer with Serum miRNAs on an Oligonucleotide Microarray. PLoS ONE 4: e6229.
- 29. Ambrosone CB, Nesline MK, Davis W (2006) Establishing a cancer center data bank and biorepository for multidisciplinary research. Cancer Epidemiol Biomarkers Prev 15: 1575–1577.
- 30. Smyth GK (2004) Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol 3: R3.
- 31. Eisen MB, Spellman PT, Brown PO, Botstein D (1998) Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA 95: 14863–14868.
- 32. Sing T, Sander O, Beerenwinkel N, Lengauer T (2005) ROCR: visualizing classifier performance in R. Bioinformatics 21: 3940–3941.
- 33. Huang W, Sherman BT, Tan Q, Kir J, Liu D, et al. (2007) DAVID Bioinformatics Resources: expanded annotation database and novel algorithms to better extract biology from large gene lists. Nucleic Acids Res 35: W169–75.
- 34. Morozova O, Hirst M, Marra MA (2009) Applications of new sequencing technologies for transcriptome analysis. Annu Rev Genomics Hum Genet 10: 135–151.
- 35. Friedlaender MR, Chen W, Adamidi C, Maaskola J, Einspanier R, et al. (2008) Discovering microRNAs from deep sequencing data using miRDeep. Nature Biotechnol 26: 407–415.
- 36. Grinchuk OV, Jenjaroenpun P, Orlov YL, Zhou J, Kuznetsov VA (2010) Integrative analysis of the human cis-antisense gene pairs, miRNAs and their transcription regulation patterns. Nucleic Acids Res 38: 534–547.
- 37. Selcuklu SD, Yakicier MC, Erson AE (2009) An investigation of microRNAs mapping to breast cancer related genomic gain and loss regions. Cancer Genet Cytogenet 189: 15–23.
- 38. Van der Auwera I, Van Laere S, Limame R, Trinh X, Van Marck E, et al. (2009) A microRNA Expression Profile Consisting of 12 microRNAs Is Associated with Inflammatory Breast Cancer. Cancer Res 69: 6119.
- 39. Zhao Y, Deng C, Wang J, Xiao J, Gatalica Z, Recker RR, Xiao GGLet-7 family miRNAs regulate estrogen receptor alpha signaling in estrogen receptor positive breast cancer. Breast Cancer Res Treat. 2010 Jun 10 [Epub ahead of print].
- 40. Millikan RC, Newman B, Tse CK, Moorman PG, Conway K, et al. Epidemiology of basal-like breast cancer. Breast Cancer Res Treat 2008 May;109(1): 123–39.