Risk Prediction for Breast, Endometrial, and Ovarian Cancer in White Women Aged 50 y or Older: Derivation and Validation from Population-Based Cohort Studies

Ruth Pfeiffer and colleagues describe models to calculate absolute risks for breast, endometrial, and ovarian cancers for white, non-Hispanic women over 50 years old using easily obtainable risk factors. Please see later in the article for the Editors' Summary


Introduction
Several statistical models predict a woman's probability or absolute risk of developing invasive breast cancer based on her age and reproductive, medical, and lifestyle factors [1][2][3][4][5]. Many risk factors associated with breast cancer are also associated with the risk of other gynecologic cancers. For example, four of the seven risk factors used in the publicly available Breast Cancer Risk Assessment Tool (BCRAT; http://www.cancer.gov/bcrisktool) [6] are also strongly associated with ovarian and endometrial cancer risks, including current age, age at menarche, parity, and first-degree family history of breast cancer. Therefore, a woman with a high breast cancer risk due to combinations of these risk factors likely also has above-average endometrial or ovarian cancer risk. However, while some models have been proposed for risk prediction for ovarian cancer [7], no model that predicts the absolute risk of endometrial cancer is available, even though endometrial cancer is the fourth most common cancer in women, and 1 in 38 women will be diagnosed with endometrial cancer during her lifetime [8]. With rates of obesity increasing and rates of hysterectomy declining in many regions of the US, endometrial cancer incidence may rise further, and thus it is important to identify women at highest risk for this disease.
Absolute risk prediction models provide useful information for health care providers and patients and aid in the design and recruitment phase of studies of preventive interventions. Several large chemoprevention studies of tamoxifen (Nolvadex; AstraZeneca) and raloxifene (Evista; Lilly) used BCRAT projected breast cancer risk to determine eligibility and to estimate needed sample sizes [9,10]. These models can also aid clinical management of women at elevated risk of one or more of these outcomes. For example, if a woman presents with endometrial bleeding, but is found not to have endometrial cancer in a subsequent workup, an estimate of her risk of developing endometrial cancer over the next 5 or 10 y may aid her and her physician in deciding further steps, including having a hysterectomy or taking progestin. Knowing her risk of endometrial cancer in addition to her risk of breast cancer could also inform a woman's decision about whether or not to consider use of tamoxifen for breast cancer prevention [11], as tamoxifen reduces breast cancer risk, but also increases the risk of endometrial cancer [10]. Additionally, knowing her ovarian cancer risk might influence her decision regarding prophylactic bilateral oophorectomy to reduce her risk of breast cancer.
We developed absolute risk prediction models for breast, endometrial, and ovarian cancer by combining data from two large prospective cohorts at the National Cancer Institute (NCI)the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial (PLCO) and the National Institutes of Health-AARP Diet and Health Study (NIH-AARP)-and from incidence and competing mortality rates in the NCI's Surveillance, Epidemiology, and End Results Program (SEER). The SEER cancer registries cover approximately 28% of the US population (http:// seer.cancer.gov/about/overview.html).
All models were validated using independent data from the Nurses' Health Study (NHS).

Data for Relative Risk Models
We used data on white, non-Hispanic women from two NCI cohorts: the PLCO cohort and the NIH-AARP cohort.
PLCO. PLCO has been described in detail previously [12]. In brief, PLCO enrolled 78,232 women, aged 55 to 74 y at baseline, between November 1993 and June 2001 at ten screening centers. Women were eligible if they had no history of lung, colorectal, or ovarian cancer and were neither undergoing cancer treatment nor participating in other screening or prevention trials. Women who had undergone bilateral oophorectomy or were taking tamoxifen were initially ineligible but were later included. Women randomized to the screening arm of the trial received a single-view chest X-ray annually for 4 y to screen for lung cancer; a CA (cancer antigen) 125 blood test annually for 6 y and a transvaginal ultrasound yearly for 4 y, both to screen for ovarian cancer; and flexible sigmoidoscopy at the beginning of the trial and either 3 y or 5 y later to screen for colorectal cancer. Institutional review boards at the NCI and screening centers approved the study.
At entry, participants completed a self-administered lifestyle questionnaire. Cancers were ascertained via annual study updates and death certificates, and verified via review of medical records.
NIH-AARP. NIH-AARP has been described previously [13]. It included 567,169 men and women who, in 1995-1996, were 50-71 y old and resided in one of eight states. Participants returned a self-administered baseline questionnaire and a second more detailed questionnaire, sent 6 mo after the baseline questionnaire. The NCI Special Studies Institutional Review Board approved the study.
Cancer cases were identified through linkage with state cancer registries with a 90% completeness rate [14]. All cases had a histologic diagnosis. Vital status was ascertained through annual linkage to the Social Security Administration Death Master File and the National Death Index Plus.
Analytic populations. In each study, we restricted the analysis to non-Hispanic, white women who completed the baseline questionnaire, had follow-up information, and had no personal history of the cancer of interest at baseline. For the ovarian and endometrial cancer models, we further excluded women with bilateral oophorectomy or hysterectomy, respectively (see Table 1 for further details on exclusions). After these exclusions, the NIH-AARP and PLCO study populations included 191,604 and 64,440 women, respectively, for the breast cancer analysis, 114,931 and 42,821 women, respectively, for the endometrial cancer analysis, and 151,165 and 58,282 women, respectively, for the ovarian cancer analysis.

Data for Model Validation
Independent data from the NHS cohort from July 1990 to June 2004 were used to validate our models. The NHS cohort included 121,701 women aged 30-55 y in 1976 [15]. All participants gave informed consent at enrollment. With the same exclusions as for the relative risk (RR) models, the validations were based on 57,906 women for the breast cancer model and 37,241 for the endometrial cancer model. For the ovarian cancer absolute risk model we additionally excluded women who reported removal of a single ovary or ovarian surgery with unknown status of the ovaries during follow-up, resulting in 56,638 women for validation. See Table 2 for details on exclusions. Breast and ovarian cancer incidence rates were similar to those in SEER, but endometrial cancer incidence was substantially lower, with an overall incidence at age 50-74 y of 42.16 per 100,000 person-years, compared to 78.1 per 100,000 person-years in SEER for the same age range (see Tables 3 and 4 for further details).

Statistical Methods
Relative risk models. We estimated separate RR models for breast, endometrial, and ovarian cancer using Cox proportional hazards regression (SAS, version 9.1; SAS Institute), with age as the timescale. For each of the cancers, the primary outcome was incident, invasive epithelial cancer. For each outcome we first estimated RRs and 95% confidence intervals (CIs) separately for each cohort. However, we used RRs from a Cox model for the combined cohorts that included a study indicator as covariate (NIH-AARP versus PLCO). Analyses of the combined data that stratified the baseline hazard function in the Cox model on study yielded results in agreement to the third decimal point with this regression method. Proportionality of the hazard functions was assessed by visual inspection of hazard plots and Schoenfeld residuals.
For each RR model, women were considered at risk from the age at study entry (randomization for PLCO and completion of baseline questionnaire for NIH-AARP) until the age at the earliest of the following: (1) diagnosis of cancer of interest, (2) death, or (3) administrative censoring (for PLCO, most recent annual study update through December 31, 2005; for NIH-AARP, December 31, 2003). In PLCO, women were also censored at date of unconfirmed self-reported cancers and non-epithelial cancers (the International Classification of Diseases for Oncology, Third Edition [16] morphology codes 8000 to 8573). For the breast cancer RR analysis, women were additionally censored at diagnosis of breast carcinoma in situ, and for the ovarian cancer RR analysis, at diagnosis of an ovarian tumor of low malignant potential in PLCO.
To synchronize age at menarche (PLCO categories ,10, 10-11, 12-13, 14-15, 16+ y; NIH-AARP categories #12, 13-14, 15+ y), we randomly allocated women in the overlapping PLCO categories (e.g., 12-13 y) to the NIH-AARP categories (#12 y or 13-14 y) using the age at menarche distribution from the nationally representative NHANES study [17]. For OC use we randomly assigned women in the #1 year category in PLCO to the ,1 and 1+ year categories. Alcohol consumption was not ascertained in the PLCO control arm, and thus RRs for association with alcohol consumption were estimated from the PLCO intervention arm and NIH-AARP. RRs for duration of estrogen and progestin MHT use and duration of unopposed estrogen MHT use were estimated from women in the NIH-AARP cohort who had responded to the second questionnaire. Information for benign breast disease was missing on 20% of the women in the dataset, and we thus created an indicator for missing values. For all other variables, women with missing values were excluded as the number missing was very small (,5% for all variables; Table 5). We first assessed all risk factors listed above as possible predictors for each cancer as main effects. Final models included only variables that were significant in multivariable models with p,0.01. We chose a stringent p-value as we did not want to include variables with modest RRs that would not improve prediction. Model building was repeated using stepwise variable selection in Cox proportional hazards models and led to the same variables being selected for each cancer. We also assessed the significance at p,0.01 of all first-order interaction terms of variables included in the final models. We fitted variables with trends whenever appropriate. For all risk factors, the reference category was the lowest risk category, to facilitate attributable risk (AR) computations. To accommodate missing data, we calculated AR for the breast cancer model in women with complete data. Using time on study as the timescale, adjustment for calendar period or additional censoring at age of diagnosis of any other cancer (e.g., censoring the breast cancer models at diagnosis of endometrial cancer or ovarian cancer) or adjustment for gynecologic surgery did not change the results appreciably.
Absolute risk models. The absolute risk p(a,b) of cancer c (c = breast, endometrial, or ovarian cancer) in the age interval (a,b] is the probability of developing a specific cancer c during that interval, given that one is alive and free of previous cancer c at the beginning of the interval. The absolute risk is reduced by death from causes other than cancer c and is defined by In Equation 1, l 1 and l 2 are the cause-specific hazard rates for cancer c and for competing causes, respectively. We modeled l 1 (a,x) = l 10 (a) rr 1 (a,x) as the product of the agespecific baseline hazard rate l 1 (a) and the RR model, rr 1 (a,x) = exp(b'x), where x denotes covariates. We did not include covariates in the hazard l 2 for competing causes of death. The age-specific baseline hazard rates l 10 (a) are computed by multiplying the agespecific SEER incidence rates, l 1 *(a), by one minus the estimate of Table 3. Age-specific incidence per 100,000 person-years in non-Hispanic white women from the NHS cohort, ex luding only women with prevalent cancer of interest and no positive follow-up time.  the AR, i.e., l 10 (a) = l 1 *(a)(12AR 1 (a)), as outlined in [2]. The AR for cancer type c was obtained as one minus the number of women in the dataset divided by the sum of the multivariate RR estimates for cancer c over all women. We used a model with piecewise constant hazards to approximate Equation 1, with 5-y age intervals.
In SEER, endometrial and ovarian cancer incidence rates are calculated based on all women in the population in a given age group. These rates thus are lower than rates that are based on women with an intact uterus or ovaries, respectively. We therefore adjusted the age-specific SEER rates by dividing them by the percentage of women who had not had a hysterectomy (estimated from the Behavioral Risk Factor Surveillance System (BRFSS) survey [18] for the same areas included in SEER) or oophorectomy (estimated from NHANES [17]) (see Tables 4 and S1, S2, S3). In Table S1 we also present rates for models that allow for the possibility that a woman might have a hysterectomy or an oophorectomy during the projection interval, by adding hysterectomy and oophorectomy rates to the competing risks for the two models, respectively.
Statistical analysis used for model validation. We compared the expected and the observed numbers of cases overall and in subgroups defined by risk factor combinations. The expected number of cases was calculated by summing the individual projected probabilities, given the baseline covariate values for each woman from entry (July 1990) into the NHS cohort to June 2004. The 95% CIs for the expected/observed (E/O) ratios were calculated using the normal approximation to the cates that the model overestimates cancer risk, and an E/O less than one indicates that the model underestimates cancer risk. We evaluated the discriminatory accuracy of the prediction models using the area under the receiver operating characteristic curve (AUC), with 95% bootstrap CIs.

Results
The characteristics of the women in the study and cancer incidence rates are shown in Table 1.

Absolute Risks
The AR estimates used to compute baseline hazard rates for the models were 0.52 for breast cancer, 0.81 for endometrial cancer, and 0.43 for ovarian cancer. Table 7 shows examples of 10-and 20-y absolute risks for several risk profiles and for initial ages 50 and 65 y. Breast cancer absolute risk estimates ranged from 1.57% to 21.78% for 10-y projections and from 3.64% to 35.11% for 20-y projections. Risk for endometrial cancer ranged from 0.35% to 10.50% for 10-y and from 1.22% to 17.08% for 20 y. The highest 20-y risk, 17.08%, was seen for a 65-y-old woman in the 40+ kg/m 2 BMI category who had never smoked. For specific risk factor combinations, endometrial absolute risk can be higher than breast cancer absolute risk. For example, profile 8 (Table 7) corresponds to a premenopausal nulliparious 50-y-old woman with a BMI of 35 kg/ m 2 , who never smoked, has no family history of breast or ovarian cancer, and never had a breast biopsy. Her 10-y endometrial cancer absolute risk is 4.9%, while her 10-y absolute risk of breast cancer is only 1.62%. 10-y absolute risk of ovarian cancer ranged from 0.28% to 0.96%, and 20-y risk, from 0.74% to 1.77%.
Comprehensive tabulations of 5-, 10-, and 20-y risks for 50-and 60-y-old women for all possible combinations of risk factors for the three outcomes, and software written in SAS to compute absolute risk estimates for any age, projection length, and combination of risk factors, are freely available for download under Breast/ Endometrial/Ovarian Risk Assessment at http://dceg.cancer. gov/tools/risk-assessment.

Validation of the Models in the Nurses' Health Study
RR estimates in the NHS cohort for all three cancers (Table 8) were very similar to those used in the model (Table 6).
Breast cancer. A total of 2,934 incident breast cancers were diagnosed among women included in the analyses, and the model predicted 2,930, resulting in an E/O ratio of 1.00 (95% CI: 0.96-1.04) ( Table 9). The model significantly underestimated the number of breast cancers in premenopausal women. For all other variables, the E/O ratios were not statistically significantly different from unity. The overall AUC (discriminatory power) in the NHS cohort was 0.58 (95% CI: 0.57-0.59). We also compared the performance of the new breast cancer model to that of NCI's BCRAT [6] on the same validation data. BCRAT predicted 2,947 cases, resulting in E/ O = 1.00 (95% CI: 0.97-1.04). The number of cases was overpredicted significantly in cells defined by family history using either the coding for the new model or BCRAT (  (Table 10). The number of endometrial cancers was underestimated for women in the highest BMI category and for premenopausal women, and significantly underestimated for women who reported taking MHT for 10 or more years. The AUC was 0.68 (95% CI: 0.66-0.70).
Ovarian cancer. A total of 377 incident ovarian cancers were diagnosed among women included in the analyses, and the model predicted 406 (Table 10), resulting in E/O = 1.08 (0.97-1. 19). The number of ovarian cancers was overestimated for all covariate categories, albeit not statistically significantly, with the exception of the nulliparous group and women taking MHT for 10 or more years. The AUC was 0.59 (95% CI: 0.56-0.63).

Cross-Classification of Breast and Endometrial Cancer Risk in the Nurses' Health Study
Current American Society of Clinical Oncology guidelines indicate that premenopausal women and postmenopausal women with low risk of side effects and a 5-y projected BCRAT risk $1.66% may benefit from tamoxifen and/or raloxifene for breast cancer prevention [11]. Among the 3,837 premenopausal women   aged 50-55 y at baseline in the NHS cohort who reported having a uterus during follow-up, 784 had a 5-y absolute breast cancer risk $1.66%. Of those, three women had a $2% 5-y absolute endometrial cancer risk. Tamoxifen reduces breast cancer risk by approximately 50% and increases endometrial cancer risk approximately 4-fold in older women [9]. Gail et al. [19] and Freedman et al. [20] weighed the risks and benefits of tamoxifen assuming average age-specific risks of health outcomes, apart from breast cancer. Using models both for breast cancer and for endometrial cancer yields a more accurate weighing of risks and benefits. For example, based on an average 5-y risk of endometrial cancer of 0.41% [19], tamoxifen would increase absolute endometrial cancer risk by 1.23% while reducing a 2.5% breast cancer risk by 1.25%. These risks and benefits are nearly equal. If, instead, the woman had an endometrial cancer risk of 2%, tamoxifen would increase endometrial cancer risk by 6%, which greatly exceeds the reduction in breast cancer risk. In carefully balancing risk and benefits, however, the comparative lethality of the two outcomes might also be an additional important factor in a decision concerning whether to take tamoxifen.

Discussion
We developed models that predict individualized probabilities of developing breast, endometrial, and ovarian cancers among US white women aged 50+ y. We chose these three cancers because they share several risk factors, presumably reflecting a common hormonal etiology, and because management decisions may depend jointly on these risks.
To our knowledge, there is no other absolute risk model for endometrial cancer, despite the fact that it is the fourth most common cancer in women [8] and its absolute risks are quite high,  particularly in obese women ( Table 7). Knowledge of endometrial cancer risk might inform decision-making about diagnostic workup, clinical management, surgical interventions, and the use of agents, such as tamoxifen or unopposed estrogen that increase endometrial cancer risk. Such a model might also aid in designing intervention trials to prevent endometrial cancer and in identifying women with elevated risk who might benefit from such interventions. The endometrial cancer model may also be useful in assessing the burden of that cancer in the general population, which may increase, as more than a third of all US women now have a BMI of 30 kg/m 2 or higher [21]. In combination with data on trends in the prevalence of obesity, one could use the model to investigate the extent to which the increasing prevalence of obesity accounts for the significant 2% per year increase in endometrial cancer incidence seen among white women 2006-2010 [22]. Unlike most other models for breast cancer, our model includes factors that are potentially modifiable for the individual or in populations over time, such as alcohol consumption, BMI, and use of MHT. This model had slightly better discriminatory accuracy (AUC = 0.58) in the validation data than the widely used BCRAT (''Gail model'') (AUC = 0.56), which predicts breast cancer risk based on reproductive factors, number of breast biopsies, and atypical hyperplasia. However, despite this increase in AUC, the discriminatory accuracy of the breast cancer absolute risk model is still modest, and limits its clinical applicability, particularly for screening. Several breast cancer risk prediction models include non-modifiable risk factors such as family history (e.g., [1]), mammographic density [23][24][25], and reproductive and medical factors such as age at menarche and number of breast biopsies [6]. A few models for US women include potentially modifiable risk factors, such as BMI and alcohol use [26]. Our model also incorporates use, and duration of use, of combined estrogen and progestin MHT and other types of MHT. Use of estrogen and progestin MHT had the second largest RR in our model, and had the same RR (RR = 1.40) for a one category increase in duration as benign breast disease. Petracci et al. [27] illustrate public health and counseling applications of a breast cancer model with modifiable risk factors in Italian women, and similar calculations could be performed for the US with the model we developed. These calculations could also aid in understanding the impact of increases in obesity on US breast cancer incidence. For ovarian cancer, a model based on the NHS includes age at menopause, age at menarche, OC use, and tubal ligation [7].
To build the models, we combined data from two large prospective studies from well-characterized populations. We used another large cohort, the NHS cohort, to independently validate our models. The RR estimates agreed well with those in the NHS cohort for all three models. The discriminatory power, as assessed by the AUC, was 0.58 and 0.59 for the breast and ovarian cancer models, respectively. While these values indicate modest discriminatory ability, they are similar to those reported for other cancer risk models for these cancers [2,7]. The AUC value for endometrial cancer was 0.67, which is larger than that seen for most models of cancer incidence.
The breast and ovarian model were well calibrated in the NHS cohort; however, because women in the NHS cohort were censored at the age of diagnosis of an in situ breast cancer, the breast model may have underestimated slightly. The endometrial model significantly over-predicted the number of endometrial cancers. This reflects the fact that the NHS cohort has considerably lower endometrial cancer rates (Table 3) than those seen in SEER (Table 4). Further studies of the calibration of this model in additional cohorts would be desirable to assess its applicability to the general US population.
Well-calibrated risk models, even those with modest discriminatory accuracy, have several public health applications. These include designing cancer prevention trials, assessing the absolute burden of disease in the population and in subgroups, and gauging the potential absolute reductions in risk from preventive strategies. Using risk models to select individuals for screening or other interventions usually requires high discriminatory accuracy [28]. Well-calibrated risk models with modest discriminatory accuracy can also aid in individual decision-making. Such models can provide realistic information on level of risk that is useful in making decisions, such as whether or not to have a mammogram [29]. Such models are also useful in decisions on whether or not to take an intervention that has both beneficial and harmful health effects [19,28].
There are several potential limitations to our models. The exact number of breast biopsies, an important predictor in BCRAT, was not available in our cohorts, and the models were restricted to women aged 50+ y. Our RR models were built using white, non-Hispanic women and may not generalize to other races or ethnicities. We adjusted the age-specific endometrial and ovarian cancer incidence rates for the prevalence of hysterectomy and oophorectomy among US women estimated from populationbased surveys, but some residual error may exist. Another limitation is that MHT use was probably more prevalent during the period of study of PLCO and NIH-AARP, from 1993 to 2005, than it is currently. While such changes in the prevalence of risk factors would not affect model calibration, they could reduce the variation of risk in the population and hence reduce the discriminatory accuracy of the models. Among the strengths of our models are large study sample size, nearly complete end point ascertainment, and information on most of the important risk factors.
Our models are not intended to predict the probability of the three cancers among women known to be at much higher than average risk, e.g., women with a mutation in BRCA1 or BRCA2 or with hereditary non-polyposis colorectal cancer (HNPCC). Each model is applicable to women without a prior diagnosis of that particular cancer, and thus in principle the breast cancer model can be applied to predict breast cancer risk for women with a prior diagnosis of any other cancer, including endometrial cancer. However, in applying the risk estimates, one needs to consider that a woman's risk may be altered by treatment for a previous cancer.
In conclusion, we developed and assessed models that project the probabilities of developing breast, endometrial, or ovarian cancer among white, non-Hispanic women aged 50+ y. These models might improve the ability to identify potential participants for research studies and assist in clinical decision-making related to the risks of these cancers. Why Was This Study Done? Several statistical models (for example, the Breast Cancer Risk Assessment Tool) have been developed that estimate a woman's absolute risk (probability) of developing breast cancer over the next few years or over her lifetime. Absolute risk prediction models are useful in the design of cancer prevention trials and can also help women make informed decisions about cancer prevention and treatment options. For example, a woman at high risk of breast cancer might decide to take tamoxifen for breast cancer prevention, but ideally she needs to know her absolute endometrial cancer risk before doing so because tamoxifen increases the risk of this cancer. Similarly, knowledge of her ovarian cancer risk might influence a woman's decision regarding prophylactic removal of her ovaries to reduce her breast cancer risk. There are few absolute risk prediction models for ovarian cancer, and none for endometrial cancer, so here the researchers develop models to predict the risk of these cancers and of breast cancer.

Supporting Information
What Did the Researchers Do and Find? Absolute risk prediction models are constructed by combining estimates for risk factors from cohorts with population-based incidence rates from cancer registries. Models are validated in an independent cohort by testing their ability to identify people with the disease in an independent cohort and their ability to predict the observed numbers of incident cases. The researchers used data on white, non-Hispanic women aged 50 years or older that were collected during two large prospective US cohort studies of cancer screening and of diet and health, and US cancer incidence and mortality rates provided by the Surveillance, Epidemiology, and End Results Program to build their models. The models all included parity as a risk factor, as well as other factors. The model for endometrial cancer, for example, also included menopausal status, age at menopause, body mass index (an indicator of the amount of body fat), oral contraceptive use, menopausal hormone therapy use, and an interaction term between menopausal hormone therapy use and body mass index.
Individual women's risk for endometrial cancer calculated using this model ranged from 1.22% to 17.8% over the next 20 years depending on their exposure to various risk factors. Validation of the models using data from the US Nurses' Health Study indicated that the endometrial cancer model overestimated the risk of endometrial cancer but that the breast and ovarian cancer models were well calibrated-the predicted and observed risks for these cancers in the validation cohort agreed closely. Finally, the discriminatory power of the models (a measure of how well a model separates people who have a disease from people who do not have the disease) was modest for the breast and ovarian cancer models but somewhat better for the endometrial cancer model.
What Do These Findings Mean? These findings show that breast, ovarian, and endometrial cancer can all be predicted using information on known risk factors for these cancers that is easily obtainable. Because these models were constructed and validated using data from white, non-Hispanic women aged 50 years or older, they may not accurately predict absolute risk for these cancers for women of other races or ethnicities. Moreover, the modest discriminatory power of the breast and ovarian cancer models means they cannot be used to decide which women should be routinely screened for these cancers. Importantly, however, these well-calibrated models should provide realistic information about an individual's risk of developing breast, ovarian, or endometrial cancer that can be used in clinical decision-making and that may assist in the identification of potential participants for research studies. N Information on the Breast Cancer Risk Assessment Tool, the Surveillance, Epidemiology, and End Results Program, and on the prospective cohort study of screening and the diet and health study that provided the data used to build the models is also available on the NCI site N Cancer Research UK, a not-for-profit organization, provides information about cancer, including detailed information on breast cancer, ovarian cancer, and endometrial cancer N The UK National Health Service Choices website has information and personal stories about breast cancer, ovarian cancer, and endometrial cancer; the not-for-profit organization Healthtalkonline also provides personal stories about dealing with breast cancer and ovarian cancer