Figures
Abstract
Background
Natural menopause is defined as the permanent cessation of menstruation that occurs after 12 consecutive months of amenorrhea without any obvious pathological or physiological cause. The age of this phenomenon has been reported to be associated with several health outcomes.
Objectives
This study aimed to estimate the Age at Natural Menopause (ANM) and to identify reproductive and demographic factors affecting ANM.
Methods
This cross-sectional, population-based study was conducted on 2517 post-menopausal women aged 40–70 years participating in the first phase of the PERSIAN cohort study of Kharameh, Iran, during 2014–2017. To more accurately detect the determinants of ANM, we applied multiple linear regression beside some machine learning algorithms including conditional tree, conditional forest, and random forest. Then, the fitness of these methods was compared using Mean Squared Error (MSE) and Pearson correlation coefficient.
Results
The mean±SD of ANM was 48.95±6.13. Both applied forests provided more accurate results and identified more predictors. However, according to the final comparison, the conditional forest was the most accurate method which recognized that more pregnancies, longer breastfeeding, Fars ethnicity, and urbanization have the greatest impact on later ANM.
Citation: Pasokh Z, Seif M, Ghaem H, Rezaianzadeh A, Ghoddusi Johari M (2024) Age at natural menopause and its determinants in female population of Kharameh cohort study: Comparison of regression, conditional tree and forests. PLoS ONE 19(4): e0300448. https://doi.org/10.1371/journal.pone.0300448
Editor: Zahra Cheraghi, Hamadan University of Medical Sciences, School of Public Health, ISLAMIC REPUBLIC OF IRAN
Received: November 2, 2023; Accepted: February 28, 2024; Published: April 16, 2024
Copyright: © 2024 Pasokh et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data cannot be shared publicly because data contain potentially identifying and sensitive patient information. Data are available from the kharameh cohort center (contact via: kheramehcohort@gmail.com) for researchers who meet the criteria for access to confidential data.
Funding: This study was supported by the Research Vice-chancellor of Shiraz University of Medical Sciences (grant No. 25007). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Age at the onset of menopause is the most important determinant of women’s future health outcomes [1]. During the aging process, the number and quality of the oocytes in the ovaries decrease, the associated cyclic endocrinological activities stop, and consequently, women enter menopause [2]. Natural menopause corresponds to the last menstrual period (LMP) and is recognized to have occurred after 12 consecutive months of amenorrhea for which there is no other obvious pathological or physiological cause [3].
The median age of the world’s population is increasing due to the decline in fertility and the increase in life expectancy [4]. Accordingly, most women spend more than a third of their lives in the postmenopausal period. The timing of menopause is claimed to be a substantial risk factor for some chronic diseases. Previous studies have demonstrated that lower ANM is associated with cardiovascular disease [5], all-cause mortality [6], worse cognitive function [7], type 2 diabetes [8], osteoporosis, and fracture [9]. In contrast, higher ANM is associated with breast cancer [10], and uterine/ovarian cancer [11]. These complications highlight the importance of identifying factors that can affect ANM.
Globally, the average age of menopause is estimated at 50 years [12]. Several studies have indicated that women living in developing countries, experience natural menopause several years earlier than those in developed countries [13]. According to a systematic review and meta-analysis performed in 2017, the average age of menopause in Iran was estimated as 48.57 years [14] which is lower than Europe (50.54), Australia (51.25), and the USA (49.11) but higher than the Middle East (47.37), Latin America (47.24) and almost equal to Asia (48.75) and Africa (48.38) [15].
ANM substantially varies across different ethnicities, regions, and countries due to genetic background, socioeconomic position, environment, lifestyle, and reproductive or early childhood factors [15]. Many studies have been conducted around the world to investigate the determinants of menopause onset, and Iran is no exception. They have indicated that demographic factors such as socioeconomic status, occupation, education, residence, smoking, and some reproductive factors like breastfeeding, menstruation start age, marital status, pregnancy, and other factors are related to ANM [16–24] but whether these factors are definitive in all women or even increase or decrease the age of menopause remain controversial. For instance, in Farjam et al.’s study, menopausal age was lower in not-married, employed and illiterate women [19] while in another study conducted by Vatankhah et al., Education, employment, and marital status were not associated with early menopause [25]. Although in several studies, women with lower socioeconomic status had a younger menopausal age [20, 23], in some other studies, it wasn’t associated with menopausal age [18]. Therefore, it is difficult to predict the ANM due to these conflicting results. In other words, the impact of many of these predictors on ANM may be influenced by racial, ethnic, and environmental characteristics and specific to a particular geographical and racial area. Furthermore, to date, there are a few studies that have used machine learning algorithms to determine the factors influencing the age of menopause.
This study aimed to estimate the ANM and to identify reproductive and demographic factors affecting ANM. Therefore, we used some machine learning algorithms including conditional tree, random and conditional forest beside multiple linear regression as a basic method hoping that by more precisely identifying the determinants of the ANM, a step will be taken to prevent its complication and thereby promote the health and the quality of life of menopausal and postmenopausal women.
Materials and methods
This cross-sectional, population-based study was conducted on 2517 menopausal women aged 40–70 years who were enrolled from 2014 to 2017 in the first phase of the PERSIAN (Prospective Epidemiological Research Studies in Iran) cohort study of Kharameh County (located in Fars Province in the southern part of Iran).
The PERSIAN cohort study was launched in 2014 by the Ministry of Health and Medical Education to identify the most prevalent Noncommunicable Diseases (NCDs) among Iran’s ethnic groups and to investigate effective methods of prevention [26]. The PERSIAN Cohort Study started in Kharameh in October 2014, and enrollment ended in February 2017.
It should be declared that in the Kharameh cohort study, eligible participants were entered through the census. After obtaining the consent of the participants, standardized questionnaires containing reproductive and demographic information were completed by trained staff through face-to-face interviews. The validity and reliability of the questionnaires applied in this study had been already checked [26].
In the Kharameh cohort study, the whole residents of Kharameh, aged 40–70 years with Iranian nationality were invited to participate in this study. Some people were excluded from the study due to not attending the clinics for physical examination or mental retardation or unwillingness to participate in the study. Eventually, out of 10,836 people, 10,663 people (5,944 women and 4,719 men), which is equivalent to 98.4% of the population of Kharamah, participated in this census. Data collected in the Kharameh cohort study were accessed for our study in February 2022 after approval by the research committee. Since this study aims to determine associated factors with the ANM, we used the information of women who reported having natural menopause. In other words, we refused to use the information of women who had induced menopause due to hysterectomy, bilateral oophorectomy, illness, or other causes. Out of 5944 women, 2517 people were reported to be naturally menopause.
This study received approval from the Ethics Committee of Shiraz University of Medical Sciences (IR.SUMS.SCHEANUT.REC.1400.104). Informed consent was obtained from all participants prior to their involvement. The participants were provided with assurance regarding confidentiality and anonymity.
In the current survey, we utilized several main reproductive and demographic information collected from the Kharameh cohort study to predict the ANM. Reproductive factors included marital status (single, married, divorced, widowed, others), menstruation start age, age at first marriage, number of pregnancies, history of abortion and stillbirth, breastfeeding duration, tubectomy, use of contraceptive drugs, and hormonal replacement drugs. Demographic factors included socioeconomic status (SES) which is calculated using the principle component analysis method by considering the following variables: homeownership, home size, number of bathrooms in the house, owing cars and their prices, domestic and international travels, reading books, access to the internet, owning mobile phones, computers, televisions, washing machines, dishwashers, refrigerators, vacuum cleaners, and microwaves. The resulting index is categorized into four levels (low, moderate, high, and very high). Other demographic variables are as follows: education years, residence (city or village), job (employed or housewife), and mother and father’s ethnicity (Fars or other ethnicities).
In this study, the data were analyzed using the R statistical software, version 4.2.0. To recognize the probable factors associated with ANM and also to determine the importance of these factors, we employed multiple linear regression (using the “stepwise” method) and machine learning methods through a recursive partitioning algorithm consisting of the conditional tree (using the “party” package), conditional forest (using the “party” package) and random forest (using the “randomForestSRC” package). In this study, the level of significance was 0.05.
Random and conditional forests are ensemble methods that combine a large collection of trees. The main difference between random forests (Rforest) and conditional forests (Cforest) is the different base learners exerted by the two methods. Rforests are built from classification and regression trees (CART trees), while Cforests are made from conditional inference trees (Ctree) [27]. It can be stated that the greater number of trees in the forest leads to lower variance and higher accuracy.
When fitting a Ctree, the conditional distribution of the statistics that measure the associations between the response variable and the predictor variables is driven. Multiple testing procedures (adjusted P-values) can be used to define whether there are statistically significant associations between any of the predictors and the response. If present, the predictor variable that has the strongest association with the response variable is selected for splitting; otherwise, the recursive procedure will stop. However, when fitting the CART trees, at each node, a small random subset of predictor variables could be chosen and the best split over these variables which causes the purest node with the lowest MSE will be found. Indeed, the splitting procedure is repeated until a certain stopping criterion (the node is pure or its size is smaller than a pre-specified value) is met. Generally, the tree is grown until it is adequately large, without pruning [27]. However, pruning is less important for trees that form a forest; because a forest is an ensemble of trees and a limited number of over-fitted trees do not change the main result of the forest which is an aggregation over all trees. Therefore, the trees of a forest are usually unpruned.
Another property of the forest is the number of candidate variables for splitting in each node. As mentioned, in the growth of a tree, it is possible to first select a random subset of variables and then after testing the selected variables, the splitting is done on the most suitable variable. However, this property, like pruning, is less important in trees of a forest than the fitness of one tree and remains controversial. It is shown that the performance of forests is insensitive to the number of predictors considered at each split [28], and its results are typically near optimal over a wide range of the number of candidate variables for splitting [29]. Breiman suggested that a third of the total number of predictors should be randomly selected for regression trees of a forest, however, the square root of the number of predictors should be used if a forest is consisted of classification trees [30].
The result of a forest is usually reported as the importance of variables. In this study, Breiman-Cutler permutation VIMP is used, which is the decrease of prediction accuracy in conditions where the observations are permuted over the interested variable. Simply, the importance of a variable is the difference in the accuracy of the prediction obtained from the following two forests:
- A forest that grows normally with the real data set.
- A forest that grows with real data but is permuted on the interested variable.
Although other methods have been proposed to measure the importance of a variable in the forests; For example, decrease in the prediction accuracy due to the random allocation to the daughter node in a situation where splitting has occurred on the interested variable.
In this study, the forests consisted of 1,000 trees, and the number of candidate variables for splitting was considered one-third of the total number of predictors. For tree growing, due to the small number of predictors, especially in comparison to the number of observations, all variables were candidates as split variable at each node split.
To evaluate the performance of the models, we split the dataset into a training set (containing 70% of the data) and a test set (containing 30% of the data). The training set was used to train the models. The models fit and learn from this data. The performance of these models was measured using the test set. Finally, the goodness of fit of the models was assessed by calculating and then comparing the MSE and Pearson correlation coefficient between predicted and observed values.
Results
This study was conducted on 2517 menopausal women aged 40 to 70 years. The mean±SD (95%CI) and median (1st Qu, 3rd Qu) of ANM were 48.95±6.13 (48.71, 49.19) and 50.00 (45.00,53.00) years, respectively. Table 1 shows the reproductive and demographic factors associated with ANM via regression model. As can be seen, increasing the socioeconomic level would increase the ANM by 0.45 year (p-value <0.001). Urban women who contain 31% of the population, had an average 0.67 year higher ANM than rural ones (p-value = 0.027). In Kharameh, 78% of women were housewives who had 0.70 years higher ANM than employed ones (p-value = 0.019). The people of Kharameh are mostly Fars. So, we divided the ethnicity of women’s mother and father into 2 groups: Fars and other ethnicities. In this study, other ethnicities are Arab, Turk, Lor, Azeri, and Baluch. But each group consists of a small number of people. Therefore, we considered all of them as other ethnicities. Regression analysis revealed that only the mother’s ethnicity is related to ANM. According to the mother’s ethnicity, Fars women, with 68% of the population, had an average 0.98 year higher ANM than women with other ethnicities (p-value<0.001). In this population, most of the women were illiterate or poorly educated. Regression also confirmed that education years is adversely related to ANM (p-value = 0.001). We found the mean±SD of the first menstruation 13.57±2.02 years. With each year increase in the age of first menstruation, the ANM increases by 0.21 year (p-value<0.001). The mean±SD duration of breastfeeding was 10.11±4.56 years, which had a positive association with ANM (p-value = 0.009). Although breastfeeding has a significant relationship with ANM, every year increase in breastfeeding, ANM would increase only by 0.082 years or almost one month. We split the number of pregnancies based on the median (7 pregnancies), which is one of the most common methods for splitting [31]. Women who had more than 7 pregnancies had 1.065 years later menopause (p-value<0.001). Further analyses confirm the results of multiple regression; we found a correlation between the ANM and the number of pregnancies (r = 0.132, p<0.001), breastfeeding duration (r = 0.137, p<0.001), and menstruation start age (r = 0.093, p<0.001). Given the results of multiple linear regression, women with the experience of stillbirth had earlier menopause by 0.88 year (p-value = 0.005). Only 1.87% of women had used hormonal replacement drugs; they had 1.94 years earlier menopause. Actually, women who experienced earlier menopause were treated with hormone replacement drugs (p-value = 0.03).
Conditional tree
In our effort to predict the determinants of menopause age according to fertility factors, a conditional inference tree was used (Fig 1). The tree found the number of pregnancies as the most important factor related to ANM. Other associated factors in order of their importance are as follows: breastfeeding duration, job, residence, use of hormone replacement drugs, and mother’s ethnicity; so that Fars mother’s ethnicity increases the ANM by almost two years. Among women with a maximum of one pregnancy, the duration of breastfeeding is the second most important factor. As can be seen in Fig 1, women with the highest (50.70) and the lowest ANM (41.22) are placed in nodes 7 and 10, respectively.
At each terminal node, the box plot shows the distribution of menopause age.
Conditional forest
For a better understanding of predictors of menopause timing, we used Cforest to rank the variables according to their importance. As shown in Fig 2, the number of pregnancies, duration of breastfeeding, mother’s ethnicity, and residence are the most important variables. In fact, 64% of correct predictions were due to these variables. Other predictors in order of their importance are as follows: menstruation starts age, education years, SES, father’s ethnicity, marital status, use of contraceptive drugs, job, stillbirth, and use of hormonal replacement drugs.
Random forest
Similar to Cforest, Rforest recognized the mother’s ethnicity, number of pregnancies, breastfeeding duration, and residence as the most important predictors. However, in Rforest, 58% of correct predictions were due to these variables. According to Fig 3, other predictors in order of their importance are as follows: socioeconomic status, job, menstruation start age, marital status, stillbirth, education years, use of contraceptive drugs, use of hormonal replacement drugs, and father’s ethnicity. It is worth mentioning that the father’s ethnicity, marital status, and use of contraceptive drugs were simultaneously identified by Cforest and Rforest. Actually, Fars women, married ones, and contraceptive users had a later ANM.
Comparison of methods
As we mentioned, we used two indices for comparing the goodness of fit of the applied methods regarding the ANM prediction and its related factors.
As shown in Table 2, MSE in the train set of Rforest is the lowest while in the test set, Cforest has the lowest amount of MSE. For comparing the methods according to the Pearson correlation coefficient, it was indicated that all the correlations between observed and predicted values were significant, by all methods and within both train and test sets. Based on this criterion, Rforest in the train set has the best performance while the performance of this method in the test set is similar to the Cforest. Therefore, in conclusion, according to the test set and by considering both MSE and correlation coefficient, it can be claimed that Cforest’s performance is the most accurate one. It is also valuable to note that there are many differences in Rforest’s performance in two the train and test sets; In the way that mentioned indices indicated a much better fit in the train set in comparison to the test set. Therefore, it seems that the Rforest algorithm is overfitted in the train set.
Discussion
In this population-based study on the evaluation of age at natural menopause and its determinants among post-menopausal women who participated in phase I Persian cohort Kharameh, we found the following: firstly, the mean±SD (95%CI) and median (1st Qu, 3rd Qu) of ANM were 48.95±6.13 (48.71, 49.19) and 50.00 (45.00, 53.00) years, respectively. Secondly, associated factors with ANM were: the number of pregnancies, duration of breastfeeding, mother’s ethnicity, residence, menstruation start age, education years, SES, father’s ethnicity, marital status, use of contraceptive drugs, occupation, stillbirth, and use of hormonal replacement drugs. Lastly, Cforest showed the best performance among other applied methods.
Compared to the other cities of Iran, the menopause age in this study is higher than Isfahan 48.66 [20], Shiraz 48.3 [32], Kerman 48.56 [33], and Gorgan 47.6 [34] and lower than Sari and Mazandaran 49.2 [18] and Hamadan 49.6 [35]. With a Global vision, it’s higher than Turkey 47 [36], India 46.6 [37], Brazil 47.5 [38], China 48.8 [39] but still lower than developed countries like Taiwan 50.2 [40], Japan 50.4 [41], USA 52.6 [42], Spain 51.7 [42], France 52 [43] and Italy 51.2 [44].
The differences between our findings and previous results may be explained by sample size variations and differences in genetic background, lifestyle, and socioeconomic status in studied women.
As mentioned, we used four different methods for a better understanding of the determinants of menopause age. Associated Factors recognized by Ctree, were identified by other methods too; including number of pregnancies, breastfeeding duration, job, residence, mother’s ethnicity, and use of hormone replacement drugs. Menstruation start age, socioeconomic status, education years, and stillbirth were detected in regression and both forests. Marital status, use of contraceptive drugs, and father’s ethnicity were simultaneously identified by both forests.
Several studies have found a significant association between lower SES and lower education with earlier menopause [13, 20, 23, 45, 46]. In the current study, women with a better SES had later menopause, since it ranges from 48.36 years in a low socioeconomic background to 49.52 years in a very high socioeconomic background (Table 1). However, we found an inverse relationship between education years and ANM. Actually, there is an inverse correlation between the number of pregnancies and education years; in the way that people with more pregnancies are less educated. Due to the existence of education years and the number of pregnancies in a single regression model, it seems that the relationship between education years and ANM is confounded. Anyway, our results are in line with the findings of Sinha et al. [47] and Ismaeeli et al. [48]; that women with higher education and employment have an earlier menopause [47]. Actually, in our study, earlier menopause among employees may be because most of them (87.61%) are rural dwellers. As mentioned, rural women have an earlier menopause. However, these variables are not associated with ANM in some investigations [18, 21]. Another reason for the lower ANM in employed women may be due to their exposure to workplace stressors. Cassou et al. revealed that some job stressors might be associated with an increased risk of earlier age at menopause [43].
Previous investigations including a study conducted in Iran in 2013 found that menopause occurs earlier in rural areas than in urban areas [49–52]. Similarly, in this study, ANM in urban and rural areas was 49.39 and 48.01 years, respectively. A reason for the higher ANM in urban women can be their better socioeconomic status. So, socioeconomic status seems to be a confounder.
The effect of ethnicity/race has been investigated in several studies [53, 54]. For instance; Gold. E.B. et al. demonstrated that Japanese women have a later natural menopause than Caucasian, African-American, Hispanic, or Chinese women do [45]. According to another study, Japanese-American race/ethnicity was associated with late natural menopause and Latinas experienced early natural menopause, with those born in the United States experiencing menopause later than those born outside the United States [55]. The majority of the population of Kharamah are Fars and we observed that Fars ethnicity is a notable factor for later menopause. To the best of our knowledge, there is still no study that compares Fars ethnicity with other ethnicities. In addition, we considered both the father’s and mother’s ethnicity in our analysis. But against regression and Ctree which only identified the mother’s ethnicity as a significant factor, both Cforest and Rforest, introduced the father’s ethnicity as a less important factor next to the mother’s ethnicity. According to the mother’s ethnicity, the Fars group had almost 1 year later menopause. The mean of ANM was 49.39 and 48.01 for the Fars group and other ethnic group, respectively. This ethnic difference supports the hypothesis that ANM is a heritable trait. Torgerson et al. showed that there is a relationship between ANM and maternal ANM [56]. According to twin and family studies, the heritability score of ANM is 44% to 65% [57, 58].
Moreover, our further analysis showed that there is a significant correlation between Residency and the mother and father’s ethnicity (p-value<0.001). So that Fars ethnic groups are more urban dwellers than rural dwellers and as mentioned, urban women have an older age at menopause. So, this may be another explanation for later menopause among Fars women. In summary, we found the ethnicity of the mother as a great predictor for menopause age which has not been considered in previous studies; perhaps girls mostly inherit menopause traits from their mothers. Therefore, more investigations are needed for a better understanding of this question.
Another reason for earlier ANM among rural women, women of other ethnicities, and women with a lower SES could be that these people have lower health literacy and less access to health providers [59, 60].
According to the regression model, breastfeeding has little impact on ANM and this statistical significance may be due to the large sample size but both forests identified it as one of the most important variables. However, in agreement with other studies [18, 22, 61], we found that longer breastfeeding and more pregnancies cause later menopause. Also, we observed that women with higher age at first menstruation have higher ANM which is consistent with some other studies [61–64]. Nonetheless, some studies have not shown a relationship [18]. Our results may be explained by the fixed follicle pool and therefore possibly a fixed number of ovulatory cycles. This mechanism may also explain the relationship between later menopause and more pregnancies, longer breastfeeding, and the use of hormonal contraceptives which all disturb the ovulation cycle [61]. Indeed, the onset of menopause is theorized to be related to the rate of loss of oocytes and thus to the occurrence of ovulatory cycles [47]. Past studies suggest that the use of hormonal contraceptives can delay menopause [61, 65–69], which has been proved by both Cforest and Rforest; although some studies have found no association [18].
Previous studies have mostly concentrated on abortion than stillbirth but we considered both of them. Although we didn’t find any association between having abortions and ANM, we observed that women with the experience of stillbirth had an earlier menopause by 0.88 year. However, Dorjgochoo et al. found no relationship [68].
Hormone Replacement drugs contain the hormones that a woman’s body stops producing after menopause. Actually, it is used to treat menopausal symptoms and it’s recommended for women who experience early menopause until the average age of menopause [70]. We also found that women with younger menopause age had a history of HRT use.
According to Kamyabi et al., single women reach menopause earlier due to the regular activity of ovaries and successive stimulation of follicles, under the influence of pituitary hormones [71]. Both of our applied forests revealed that married women have a higher ANM, which is also consistent with some studies [19, 72]. However, it’s in contrast with some other investigations [20, 25].
In summary, although the age of menopause is affected by non-modifiable factors such as genetics and ethnicity, some factors related to people’s lifestyles are modifiable and should be considered in women’s health programs.
Based on our research, this is the first study in which machine learning algorithms including Ctree and forests are used in addition to linear regression to analyze factors related to ANM. It should be declared that all applied methods confirm each other due to the overlap in identifying the factors related to ANM. Past studies have indicated that predictions provided by Rforest are more accurate than Cforest [27, 73], although in another study conditional forest outperformed the regularized random forest [74]. However, given the comparison of these 4 methods using MSE and correlation coefficient, unlike Rforest which was overfitted in the train set, the performance of Cforest was much better than other methods in the test set. Furthermore, we found the Ctree as the least accurate method. In line with our results, the inconsistency and higher variance of tree estimates compared to forests [75–77] as well as logistic regression have been noted in previous studies [73]. Although in linear regression MSE was almost lower than Rforest, the correlation coefficient denotes the better performance of Rforest. So, our findings confirm the study of Ryu et al. [78], in which Rforest was more accurate than linear regression. however, in some other studies linear regression outperformed Rforest [79].
The strength of this population-based study is the large population of participants who entered the study through the census. Furthermore, we used four different methods for better prediction of factors affecting menopause age while three of these models have not been applied in Iranian society so far. This study has some limitations; it is a cross-sectional design, and the obtained relationships are not necessarily causal. In Addition, the ANM and other factors were self-reported by women; therefore, it depends on the participants to recall those events. Since the study group is women 40–70 years old, we were not able to consider postmenopausal women under 40 and over 70 years. Our study aimed to identify reproductive factors affecting menopause age by controlling some demographic variables. So, factors like genetics, physical activity, and illnesses have not been considered. More extensive and multicenter studies that examine all aforementioned variables simultaneously are needed.
Supporting information
S1 Checklist. Human participants research checklist.
https://doi.org/10.1371/journal.pone.0300448.s001
(DOCX)
References
- 1. Yazdkhasti M, Tourzani ZM, Roozbeh N, Hasanpour V, Saeieh SE, Abdi F. The association between diabetes and age at the onset of menopause: A systematic review protocol. Systematic Reviews. 2019;8(1). pmid:30940191
- 2. Amanvermez R, Tosun M. An Update on Ovarian Aging and Ovarian Reserve Tests. International journal of fertility & sterility. 2016;9(4):411–5. pmid:26985328
- 3.
Research on the menopause in the 1990s. Report of a WHO Scientific Group. World Health Organ Tech Rep Ser. 1996;866:1–107.
- 4.
Kinsella KG, Velkoff VA. An aging world: 2001: Bureau of Census; 2001.
- 5. Zhu D, Chung HF, Dobson AJ, Pandeya N, Giles GG, Bruinsma F, et al. Age at natural menopause and risk of incident cardiovascular disease: a pooled analysis of individual patient data. The Lancet Public health. 2019;4(11):e553–e64. pmid:31588031
- 6. Muka T, Oliver-Williams C, Kunutsor S, Laven JS, Fauser BC, Chowdhury R, et al. Association of Age at Onset of Menopause and Time Since Onset of Menopause With Cardiovascular Outcomes, Intermediate Vascular Traits, and All-Cause Mortality: A Systematic Review and Meta-analysis. JAMA Cardiol. 2016;1(7):767–76. pmid:27627190
- 7. Ryan J, Scali J, Carrière I, Amieva H, Rouaud O, Berr C, et al. Impact of a premature menopause on cognitive function in later life. BJOG: an international journal of obstetrics and gynaecology. 2014;121(13):1729–39. pmid:24802975
- 8. Anagnostis P, Christou K, Artzouchaltzi AM, Gkekas NK, Kosmidou N, Siolos P, et al. Early menopause and premature ovarian insufficiency are associated with increased risk of type 2 diabetes: a systematic review and meta-analysis. Eur J Endocrinol. 2019;180(1):41–50. pmid:30400047
- 9. Svejme O, Ahlborg HG, Nilsson J, Karlsson MK. Early menopause and risk of osteoporosis, fracture and mortality: a 34-year prospective observational study in 390 women. BJOG: an international journal of obstetrics and gynaecology. 2012;119(7):810–6. pmid:22531019
- 10. Menarche, menopause, and breast cancer risk: individual participant meta-analysis, including 118 964 women with breast cancer from 117 epidemiological studies. Lancet Oncol. 2012;13(11):1141–51. pmid:23084519
- 11. Ossewaarde ME, Bots ML, Verbeek AL, Peeters PH, van der Graaf Y, Grobbee DE, et al. Age at menopause, cause-specific mortality and total life expectancy. Epidemiology. 2005;16(4):556–62. pmid:15951675
- 12. Cramer DW, Xu H, Harlow BL. Family history as a predictor of early menopause*. Fertility and Sterility. 1995;64(4):740–5. pmid:7672145
- 13. Gold EB. The timing of the age at which natural menopause occurs. Obstetrics and gynecology clinics of North America. 2011;38(3):425–40. pmid:21961711
- 14. Saei Ghare Naz M, Sayehmiri F, Kiani F, Ozgoli G. A Systematic Review and Meta-analysis on the Average Age of Menopause among Iranian Women. Evidence Based Care. 2019;8(4):26–34.
- 15. Schoenaker DA, Jackson CA, Rowlands JV, Mishra GD. Socioeconomic position, lifestyle factors and age at natural menopause: a systematic review and meta-analyses of studies across six continents. International journal of epidemiology. 2014;43(5):1542–62. pmid:24771324
- 16. Canavez F, Werneck G, Parente R, Keller Celeste R, Faerstein E. The association between educational level and age at the menopause: A systematic review. Archives of gynecology and obstetrics. 2009;283:83–90. pmid:20013287
- 17. Özdemir O, Çöl M. The age at menopause and associated factors at the health center area in Ankara, Turkey. Maturitas. 2004;49(3):211–9. pmid:15488349
- 18. Zamaniyan M, Moosazadeh M, Peyvandi S, Jaefari K, Goudarzi R, Moradinazar M, Kheradmand M. Age of Natural Menopause and Related Factors among the Tabari Cohort. Journal of menopausal medicine. 2020;26(1):18–23. pmid:32307946
- 19. Farjam M, Amiri Z, Sharafi M, Bahramali E. Factors Associated with Delayed Menopause in Iran: Findings from Fasa Cohort Study, a Branch of Persian Cohort Study. Galen medical journal. 2018;7:e922. pmid:34466418
- 20. Golshiri P, Akbari M, Abdollahzadeh MR. Age at Natural Menopause and Related Factors in Isfahan, Iran. Journal of menopausal medicine. 2016;22(2):87–93.
- 21. Bustami M, Matalka KZ, Elyyan Y, Hussein N, Hussein N, Abu Safieh N, et al. Age of Natural Menopause Among Jordanian Women and Factors Related to Premature and Early Menopause. Risk management and healthcare policy. 2021;14:199–207. pmid:33500673
- 22. Wang M, Gong W-W, Hu R-Y, Wang H, Guo Y, Bian Z, et al. Age at natural menopause and associated factors in adult women: findings from the China Kadoorie Biobank study in Zhejiang rural area. PloS one. 2018;13(4):e0195658. pmid:29668705
- 23. Ahuja M. Age of menopause and determinants of menopause age: A PAN India survey by IMS. Journal of mid-life health. 2016;7(3):126–31. pmid:27721640
- 24. Al Shahrani AS, Faraj S, Alhargan A, Aljumaid M, Gosty A, Al-Sugair M, et al. Factors affecting age at menopause among saudi women in riyadh, sa: A cross-sectional study. Family Medicine and Primary Care Review. 2018;20(1):55–60.
- 25. Vatankhah H, Khalili P, Vatanparast M, Ayoobi F, Esmaeili-Nadimi A, Jamali Z. Prevalence of early and late menopause and its determinants in Rafsanjan cohort study. Scientific Reports. 2023;13(1):1847. pmid:36725860
- 26. Poustchi H, Eghtesad S, Kamangar F, Etemadi A, Keshtkar A-A, Hekmatdoost A, et al. Prospective Epidemiological Research Studies in Iran (the PERSIAN Cohort Study): Rationale, Objectives, and Design. American journal of epidemiology. 2017;187(4):647–55.
- 27.
Xia R. Comparison of random forests and Cforest: variable importance measures and prediction accuracies. 2009.
- 28.
Bayley S, Falessi D. Optimizing prediction intervals by tuning random forest via meta-validation. arXiv preprint arXiv:180107194. 2018.
- 29. Meinshausen N, Ridgeway G. Quantile regression forests. Journal of machine learning research. 2006;7(6).
- 30. Breiman L. Random forests. Machine learning. 2001;45:5–32.
- 31.
Heller GZ. Survival Analysis: Techniques For Truncated And Censored Data, 2nd Edition. John P. Klein and Melvin L. Moeschberger, Springer-Verlag, New York, 2003. Hardcover. No. of pages: xv +536. ISBN 0-387-95399-X. 2004.
- 32. Ayatollahi S, Ghaem H, Ayatollahi S. Determinants of Age at Natural Menopause in Shiraz. Iranian Journal of Medical Sciences. 2015;27(3):131–3.
- 33. Foroud A, Mehdipour S, Nikvarz F, Shafieian T. The natural menopause age and associated factors at women menopause in Kerman, Iran. Caspian Sea J. 2016;10(1):298–301.
- 34. Abdollahi AA, Qorbani M, Asayesh H, Rezapour A, Noroozi M, Mansourian M, et al. The menopausal age and associated factors in Gorgan, Iran. Medical journal of the Islamic Republic of Iran. 2013;27(2):50–6. pmid:23741165
- 35. Shobeiri F, Nazari M. Age at Menopause and Its Main Predictors among Iranian Women. International journal of fertility & sterility. 2014;8(3):267–72. pmid:25379155
- 36. Vehid S, Aran SN, Köksal S, Işiloglu H, Senocak M. The prevalence and the age at the onset of menopause in Turkish women in rural area. Saudi Med J. 2006;27(9):1381–6. pmid:16951777
- 37. Prasad JB, Tyagi NK, Verma P. Age at menopause in India: A systematic review. Diabetes & Metabolic Syndrome: Clinical Research & Reviews. 2021;15(1):373–7. pmid:33524647
- 38. Hachul H, Polesel D, Nozoe K, Sanchez Z, Prado M, Andersen M, et al. The age of menopause and their associated factors: a cross-sectional population-based study. J Women’s Heal Care. 2016;5:5.
- 39. Zhang L, Bao L, Li Y, Wang C, Dong X, Abdulai T, et al. Age at menopause, body mass index, and risk of type 2 diabetes mellitus in postmenopausal Chinese women: The Henan Rural Cohort study. Nutrition, metabolism and cardiovascular diseases. 2020;30(8):1347–54. pmid:32600954
- 40. Shen T-Y, Strong C, Yu T. Age at menopause and mortality in Taiwan: A cohort analysis. Maturitas. 2020;136:42–8. pmid:32386665
- 41. Akahoshi M, Soda M, Nakashima E, Tominaga T, Ichimaru S, Seto S, Yano K. The effects of body mass index on age at menopause. International journal of obesity. 2002;26(7):961–8. pmid:12080450
- 42. Reynolds RF, Obermeyer CM. Age at natural menopause in Spain and the United States: results from the DAMES project. Am J Hum Biol. 2005;17(3):331–40. pmid:15849704
- 43. Cassou B, Mandereau L, Aegerter P, Touranchet A, Derriennic F. Work-related factors associated with age at natural menopause in a generation of French gainfully employed women. American journal of epidemiology. 2007;166(4):429–38. pmid:17557899
- 44. Parazzini F. Determinants of age at menopause in women attending menopause clinics in Italy. Maturitas. 2007;56(3):280–7. pmid:17069999
- 45. Gold EB, Bromberger J, Crawford S, Samuels S, Greendale GA, Harlow SD, Skurnick J. Factors associated with age at natural menopause in a multiethnic sample of midlife women. American journal of epidemiology. 2001;153(9):865–74. pmid:11323317
- 46. Lawlor DA, Ebrahim S, Smith GD. The association of socio-economic position across the life course and age at menopause: the British Women’s Heart and Health Study. BJOG: an international journal of obstetrics and gynaecology. 2003;110(12):1078–87. pmid:14664879
- 47. Sinha I, Tigga P, Mondal N, Sen J. Association between age at menarche and age at menopause among women of an indigenous population of North Bengal, India. Journal of biosocial science. 2021;53(3):319–35. pmid:32375912
- 48. Esmaili H, Alimi R, Javanrouh N, Azizi H, Vahid Roudsari F. Age at Menopause and Associated Factors in Middle-Aged Women in Mashhad City, 2010–2011. The Iranian Journal of Obstetrics, Gynecology and Infertility. 2013;15(36):1–7.
- 49. MacMahon B, Worcester J. Age at menopause. United States—1960–1962. Vital and health statistics Series 11, Data from the National Health Survey. 1966(19):1–20. pmid:5298639
- 50. Lim YM, Jeong K, Lee SR, Chung HW, Lee W. Association between premature ovarian insufficiency, early menopause, socioeconomic status in a nationally representative sample from Korea. Maturitas. 2019;121:22–7. pmid:30704561
- 51. Sievert LL, Huicochea-Gómez L, Cahuich-Campos D, Whitcomb BW, Brown DE. Age at menopause among rural and urban women in the state of Campeche, Mexico. Menopause (New York, NY). 2021;28(12):1358–68. pmid:34854837
- 52. Mohammad K, Sadat Hashemi SM, Farahani FK. Age at natural menopause in Iran. Maturitas. 2004;49(4):321–6. pmid:15531128
- 53. Bromberger JT, Matthews KA, Kuller LH, Wing RR, Meilahn EN, Plantinga P. Prospective study of the determinants of age at menopause. American journal of epidemiology. 1997;145(2):124–33. pmid:9006309
- 54. Sievert LL, Hautaniemi SI. Age at menopause in Puebla, Mexico. Hum Biol. 2003;75(2):205–26. pmid:12943159
- 55. Henderson KD, Bernstein L, Henderson B, Kolonel L, Pike MC. Predictors of the Timing of Natural Menopause in the Multiethnic Cohort Study. American journal of epidemiology. 2008;167(11):1287–94. pmid:18359953
- 56. Torgerson DJ, Thomas RE, Campbell MK, Reid DM. Alcohol consumption and age of maternal menopause are associated with menopause onset. Maturitas. 1997;26(1):21–5. pmid:9032743
- 57. Murabito JM, Yang Q, Fox C, Wilson PW, Cupples LA. Heritability of age at natural menopause in the Framingham Heart Study. The Journal of clinical endocrinology and metabolism. 2005;90(6):3427–30. pmid:15769979
- 58. van Asselt KM, Kok HS, Pearson PL, Dubas JS, Peeters PH, Te Velde ER, van Noord PA. Heritability of menopausal age in mothers and daughters. Fertil Steril. 2004;82(5):1348–51. pmid:15533358
- 59. Chen X, Orom H, Hay JL, Waters EA, Schofield E, Li Y, Kiviniemi MT. Differences in Rural and Urban Health Information Access and Use. J Rural Health. 2019;35(3):405–17. pmid:30444935
- 60. Svendsen MT, Bak CK, Sørensen K, Pelikan J, Riddersholm SJ, Skals RK, et al. Associations of health literacy with socioeconomic position, health risk behavior, and health status: a large national population-based survey among Danish adults. BMC Public Health. 2020;20(1):565. pmid:32345275
- 61. Wang M, Kartsonaki C, Guo Y, Lv J, Gan W, Chen ZM, et al. Factors related to age at natural menopause in China: results from the China Kadoorie Biobank. Menopause (New York, NY). 2021;28(10):1130–42.
- 62. Ruth KS, Perry JR, Henley WE, Melzer D, Weedon MN, Murray A. Events in Early Life are Associated with Female Reproductive Ageing: A UK Biobank Study. Sci Rep. 2016;6:24710. pmid:27094806
- 63. Bjelland EK, Hofvind S, Byberg L, Eskild A. The relation of age at menarche with age at natural menopause: a population study of 336 788 women in Norway. Human reproduction (Oxford, England). 2018;33(6):1149–57. pmid:29635353
- 64. Chang SH, Kim CS, Lee KS, Kim H, Yim SV, Lim YJ, et al. Premenopausal factors influencing premature ovarian failure and early menopause. Maturitas. 2007;58(1):19–30. pmid:17531410
- 65. Gottschalk MS, Eskild A, Hofvind S, Gran JM, Bjelland EK. Temporal trends in age at menarche and age at menopause: a population study of 312 656 women in Norway. Human reproduction (Oxford, England). 2020;35(2):464–71.
- 66. Gold EB, Crawford SL, Avis NE, Crandall CJ, Matthews KA, Waetjen LE, et al. Factors related to age at natural menopause: longitudinal analyses from SWAN. American journal of epidemiology. 2013;178(1):70–83. pmid:23788671
- 67. Pokoradi AJ, Iversen L, Hannaford PC. Factors associated with age of onset and type of menopause in a cohort of UK women. Am J Obstet Gynecol. 2011;205(1):34.e1–13. pmid:21514918
- 68. Dorjgochoo T, Kallianpur A, Gao YT, Cai H, Yang G, Li H, et al. Dietary and lifestyle predictors of age at natural menopause and reproductive span in the Shanghai Women’s Health Study. Menopause (New York, NY). 2008;15(5):924–33. pmid:18600186
- 69. Stepaniak U, Szafraniec K, Kubinova R, Malyutina S, Peasey A, Pikhart H, et al. Age at natural menopause in three central and eastern European urban populations: the HAPIEE study. Maturitas. 2013;75(1):87–93. pmid:23489553
- 70.
Department of Health SGoV, Australia. Hormone replacement therapy (HRT) and menopause Australia2018 [https://www.betterhealth.vic.gov.au/health/conditionsandtreatments/hormone-replacement-therapy-hrt-and-menopause.
- 71. Kamyabi Z, Aflatonian MR, Bahrampour A. Determination of the mean age of menopause and it’s distribution terms of known variables in Kerman. Journal of Kerman University of Medical Sciences. 1997;5(4):178–85.
- 72. Lay AAR, Duarte YAO, Filho A. Factors associated with age at natural menopause among elderly women in São Paulo, Brazil. Menopause (New York, NY). 2019;26(2):211–6.
- 73. Mirahmadizadeh A, Sahraian S, Delam H, Seif M. Adherence to insulin treatment in participants with type 2 diabetes: Comparison of logistic regression and conditional tree and forests to determine the effective factors. Journal of Health Sciences & Surveillance System. 2019;7(4):171–7.
- 74. Chung T, Yan R, Weller DL, Kovac J. Conditional forest models built using metagenomic data could accurately predict Salmonella contamination in Northeastern streams. bioRxiv. 2022:2022.07.11.499664.
- 75. Franklin J. The elements of statistical learning: data mining, inference and prediction. The Mathematical Intelligencer. 2005;27(2):83–5.
- 76. Leung RKK, Wang Y, Ma RCW, Luk AOY, Lam V, Ng M, et al. Using a multi-staged strategy based on machine learning and mathematical modeling to predict genotype-phenotype risk patterns in diabetic kidney disease: a prospective case—control cohort analysis. BMC Nephrology. 2013;14(1):162. pmid:23879411
- 77.
Xu Y, Greene TH, Bress AP, Bellows BK, Zhang Y, Zhang Z, et al. An Efficient Approach for Optimizing the Cost-effective Individualized Treatment Rule Using Conditional Random Forest. arXiv preprint arXiv:220410971. 2022.
- 78. Ryu K-J, Yi KW, Kim YJ, Shin JH, Hur JY, Kim T, et al. Machine learning approaches to the determinants of women’s vasomotor symptoms using general hospital data. 2020.
- 79. Smith PF, Ganesh S, Liu P. A comparison of random forest regression and multiple linear regression for prediction in neuroscience. Journal of Neuroscience Methods. 2013;220(1):85–91. pmid:24012917