Figures
Abstract
Background
Infectious diseases are a major threat to public health, causing serious medical consumption and casualties. Accurate prediction of infectious diseases incidence is of great significance for public health organizations to prevent the spread of diseases. However, only using historical incidence data for prediction can not get good results. This study analyzes the influence of meteorological factors on the incidence of hepatitis E, which are used to improve the accuracy of incidence prediction.
Methods
We extracted the monthly meteorological data, incidence and cases number of hepatitis E from January 2005 to December 2017 in Shandong province, China. We employ GRA method to analyze the correlation between the incidence and meteorological factors. With these meteorological factors, we achieve a variety of methods for incidence of hepatitis E by LSTM and attention-based LSTM. We selected data from July 2015 to December 2017 to validate the models, and the rest was taken as training set. Three metrics were applied to compare the performance of models, including root mean square error(RMSE), mean absolute percentage error(MAPE) and mean absolute error(MAE).
Results
Duration of sunshine and rainfall-related factors(total rainfall, maximum daily rainfall) are more relevant to the incidence of hepatitis E than other factors. Without meteorological factors, we obtained 20.74%, 19.50% for incidence in term of MAPE, by LSTM and A-LSTM, respectively. With meteorological factors, we obtained 14.74%, 12.91%, 13.21%, 16.83% for incidence, in term of MAPE, by LSTM-All, MA-LSTM-All, TA-LSTM-All, BiA-LSTM-All, respectively. The prediction accuracy increased by 7.83%. Without meteorological factors, we achieved 20.41%, 19.39% for cases in term of MAPE, by LSTM and A-LSTM, respectively. With meteorological factors, we achieved 14.20%, 12.49%, 12.72%, 15.73% for cases, in term of MAPE, by LSTM-All, MA-LSTM-All, TA-LSTM-All, BiA-LSTM-All, respectively. The prediction accuracy increased by 7.92%. More detailed results are shown in results section of this paper.
Conclusions
The experiments show that attention-based LSTM is superior to other comparative models. Multivariate attention and temporal attention can greatly improve the prediction performance of the models. Among them, when all meteorological factors are used, multivariate attention performance is better. This study can provide reference for the prediction of other infectious diseases.
Citation: Feng Y, Cui X, Lv J, Yan B, Meng X, Zhang L, et al. (2023) Deep learning models for hepatitis E incidence prediction leveraging meteorological factors. PLoS ONE 18(3): e0282928. https://doi.org/10.1371/journal.pone.0282928
Editor: Lu Peng, Wuhan University of Technology, CHINA
Received: February 2, 2023; Accepted: February 27, 2023; Published: March 13, 2023
Copyright: © 2023 Feng et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are available at: https://github.com/guoyanhui03/dataset.git.
Funding: This work was supported by Shandong Medical Health Science and Technology Development Programs (No. 2018WS309) to YF, Taishan Scholar Program of Shandong Province(No. tstp20221164) to LZ, ZhiFei Disease Prevention and Control Technology Research Fund Project (No. LYH2017-08) to YF, Science and Technology Project for the Universities of Shandong Province (No. J18KB171) to YG, and Shandong Women’s University High level scientific research project Cultivation Fund (No. 2020GSPGJ08) to YG. YF: Conceptualization; Data curation; Methodology; Writing original draft. LZ: Data curation; Writing review and editing. YG: Conceptualization; Methodology; Writing review and editin.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Viral hepatitis is recognized as a major public health problem and is now considered to be comparable with the three major infectious diseases(AIDS, malaria, and tuberculosis) [1]. Hepatitis E virus (HEV) is the most common cause of acute viral hepatitis worldwide, leading to a major public health problem [2]. HEV infection mainly breaks out in developing and industrialized countries, especially in Asia, Africa and Central America [3]. There are around 20 million cases worldwide, with approximately 3.3 million symptomatic cases, leading to 55,000 HEV-related deaths annually. The fatality rate in young adults reached 0.5–3%.HEV infection has a poor prognosis among pregnant woman, especially in the third trimester, where the fatality rate can reach up to 30% [4–6]. Sporadic hepatitis E has caused over 50% of acute viral hepatitis cases in recent years [7], which caused the huge social, economic, and health burden. To better mitigate future outbreaks, a method is needed to accurately predict the incidence of hepatitis E. US Centers for Disease Control and Prevention have openly endorsed adopting models to inform decision making [8].
It is relatively well accepted that climate change can affect human pathogenic diseases. Camilo Mora indicated that over half of known human pathogenic diseases can be aggravated by climate change [9]. Li et al. [10] studied the relationship between the epidemic of SARS, dengue, influenza, respiratory syncytial virus and meteorological factors, and believed that specific meteorological factors were the driving force of the virus epidemic. A large number of literature show that meteorological factors are related to hand foot mouth disease [11, 12], COVID-19 [13, 14] and other diseases. Some researchers suggested that rainfall has a certain impact on the spread of hepatitis A virus [15]. Kiook Baek [16] analyzed the association between temperature and precipitation and the incidence of hepatitis A in Seoul, which proved that meteorological factors have an impact on hepatitis A and are helpful to predict the incidence of hepatitis A. For hepatitis E, Anna investigated that population density and water balance influence the global occurrence of hepatitis E epidemics. Vianney [17] showed that hepatitis E outbreak associated with rainfall in the Central African Republic. Understanding which meteorological factors have a greater impact on the incidence of hepatitis E is conducive to effective prevention and treatment of hepatitis E. In this paper, we analyzed the incidence data and meteorological data of hepatitis E in the past 13 years, and reached an effective conclusion.
Besides, exploring the rules of historical data and predicting the future incidence of hepatitis E can achieve the purpose of accurate prevention and control. Time series methods are commonly used to solve the above problems. The traditional prediction models are the Autoregressive Integrated Moving Average (ARIMA) and its variants, including Seasonal ARIMA (SARIMA), ARIMAX. The ARIMA model, as a statistical regression model, is widely used to predict the incidence of various diseases, such as influenza [18], AIDS [19], COVID-19 [20]. However, the result might be unsatisfactory due to linear assumption requirements. Another mainstream to analyze time series is utilized by artificial intelligence methods, such as Markov model [21], artificial neural network [22], support vector machine(SVM) [23], etc. Among them, SVM model has been successfully used in many fields of time series prediction, including financial prediction [24] and disease prediction [25, 26], due to the generalization ability and nonlinear regression estimation. At present, benefiting from the powerful feature representation capabilities of deep learning, Recurrent Neural Network(RNN) [27] is an effective approach to analyze temporal data. Subsequently, LSTM, overcoming the lack of vanishing gradients in the RNN, was widely used in various fields, including disease prediction [28], energy consumption forecasting [29], solar power forecasting [30], oil markets prediction [31] and so on. Recently, some researchers have applied transformer to predict wind speed [32]. For hepatitis E incidence prediction, our previous work [25] adopted ARIMA, SVM, LSTM methods to predict hepatitis E incidence, obtaining the state of the art result by LSTM. Later, Xiaoqing Cheng et al. [33] utilized a variant of LSTM(Bi-LSTM) to predict hepatitis E incidence in Jiangsu, China.
In order to further improve the prediction accuracy, researchers used multiple factors to predict disease incidence. Some researchers leverage Google search index to improve the accuracy of disease prediction, such as influenza [34, 35], COVID-19 [36, 37] etc. Meteorological factors, as another important aspect to improve the performance of disease prediction, have been widely studied. A large number of literature have proved that the combination of meteorological factors and machine learning algorithms is conducive to improving the accuracy of disease incidence prediction, including dengue fever [38, 39], hand-foot-mouth disease [40], mumps [41] etc. For hepatitis E incidence prediction, Tu Peng et al. [42] demonstrated that meteorological factors (radiation, air pressure, precipitation) can contribute to the prediction effectiveness. Xiaoqing Cheng et al. [33] adopted Bi-LSTM model with the meteorological factors of temperature, rainfall to improve the prediction accuracy of hepatitis E incidence. However, Bi-LSTM model treats all factors equally, which will cause data with weak correlation to interfere with the model. In order to make full use of meteorological factors, we explore attention-based LSTM models with multi-factors, which can make different factors have different contributions to the models.
In this study, we adopt grey correlation model to analyze the importance of multiple meteorological factors with hepatitis E incidence. Then, LSTM and attention-based LSTMs are used to predict hepatitis E monthly incidence with and without meteorological factors, respectively. Experiments show that our proposed method obtains state-of-the-art performance. The construction and conclusions of models provide some references for the prevention and control of hepatitis E. Meanwhile, these methods are general and could also be suitable for predicting other diseases. The main contributions are listed as follows.
- Grey correlation model is employed to analyze the correlation between hepatitis E incidence and meteorological factors. The conclusion is beneficial to the prevention and control of hepatitis E.
- We propose several methods for predicting the incidence of hepatitis E, including MA-LSTM, TA-LSTM, BiA-LSTM. The performance of these methods are compared with other methods, significantly improving the prediction accuracy of hepatitis E.
- The conclusion that attention mechanism and multiple factors can improve prediction accuracy of hepatitis E, provides a reference for other prediction problems.
Materials and methods
Materials source
This study collected information regarding hepatitis E from January 2005 to December 2017 in Shandong Province, China. Data were provided by the Shandong Center for Disease Control and Prevention, mainly including monthly incidence and monthly cases number of hepatitis E in Shandong. The meteorological data is abstracted from the China meteorological data sharing service system, which contains the statistical data of many meteorological stations. We take the average value of the meteorological data observed by each station as the provincial value of monthly meteorological data.
Grey relational analysis
Grey relational analysis(GRA) is a multi-factor statistical analysis method to explore the similarity and dissimilarity among factors. GRA judges the relationship between different sequences by computing the similarity of the geometric shape of the sequence curve. It uses the grey relational grade to measure the relational degree of factors. First, we select the reference series and alternative series in all series. The reference series can express the behavior characteristics of the system, similar to the dependent variable. Alternative series is the factor that affects the reference series, similar to independent variable. In this study, monthly incidence of hepatitis E is regarded as reference series, and meteorological factors are taken as alternative series. Then, the series needs to be normalized to make the values free of unit. This process is called grey relational generating. We normalize all the series to [0, 1] by min-max normalization.
The next step is to calculate the grey relational coefficient(GRC), which is an indicator of the similarity between the reference series and alternatives series. The principle is shown in the formula (1)–(3). Where, y denotes the grey relational coefficient. a,b denote the extremum of the matrix. x0 denotes the reference series. ρ denotes the discrimination coefficient, set to 0.5.
(1)
(2)
(3)
Fially, the grey relational grade(GRG) is calculated by GRC for evaluating all the alternatives, as shown in Eq (4). A large GRG indicates a higher correlation with the reference series.
(4)
LSTM model
LSTM is the most popular variant of RNN, which can solve the problem of long-term dependency and is suitable for processing and predicting time series. Different from RNN, LSTM consists of the memory cells to replace the hidden layer neurons. The state of memory cells are controlled by gate structure, including input gate, forget gate and output gate. These gates, as information filters, determine which information needs to be retained or ignored. Fig 1 shows the structure of LSTM memory cells. Assuming that the cell is at a state of t, the calculation is as follows.
(5)
(6)
(7)
(8)
(9)
The forget gate f(t) controls how much information of the last cells state c(t−1) is ignored. The input gate i(t) indicates how much the current input information is retained to the state of c(t). The output gate o(t) controls to generates output result h(t) according to the current state, which should be transferred to the next memory cell. Among the formula (5)–(9), W, U denote the weight matrix to be optimized, b represents the bias term. The symbol σ denotes the sigmoid activation function, ⊗ is the dot product operation.
Model for hepatitis E prediction.
In this study, we adopt LSTM model to implement incidence prediction of hepatitis E, by univariate and multivariate, respectively. Univariate method only employs the previous four monthly data(monthly incidence, monthly cases) of hepatitis E to predict the next monthly data. Multivariate method employs the previous four monthly data and multiple meteorological factors(temperature data, rainfall data, photoperiod data, etc). The form of input and output is shown as formula (10), according time step. , is the result that the model want to obtain. x(i) is a number of monthly data or a vector of multi-factor data, for two methods, respectively. In order to design the network structure of LSTM, we employ the grid search method to carry out our experiments. The time step ranges from 1 to 5, and the number of nodes changes from 25 to 32. The experimental results show that the performance is the best when time step is set to 4 and the number of nodes is set to 30, as shown in Fig 2. The number of nodes in hidden layer of LSTM is 30. And the number of input layer nodes is 1 or 11 for univariate and multivariate methods. The output layer has 1 nodes for two methods.
(10)
Parameters setting of LSTM.
To overcome the over-fitting problem of small samples, we employ dropout and regularization strategies. Dropout rate is set to 0.15, and regularization parameter is set to 0.001 in our models. The epochs for training is set to 220. Besides, we adopt Adam as the optimizer which is faster and better than stochastic gradient descent (SGD) method. Learn rate is set to 0.001.
Attention-based LSTM
Attention mechanism, derived from the research of human vision, is to improve the attention to the key local features and ignore the useless information. By introducing the attention mechanism, deep learning models can solve the problem of information overload and improve the efficiency and performance. Firstly, the model needs to calculate the attention distribution by softmax function. Then, the input information is weighted according to the attention. For LSTM model, attention mechanism can be added to the time step to make the model focus on the time context. Attention mechanism can also be added to multivariables, so that the model can distinguish the importance of input information.
Temporal attention.
In this paper, we propose temporal attention-based LSTM(called TA-LSTM) for multivariate prediction of hepatitis E, based on a hypothesis, the closer the data is to the predicted data, the greater its impact on the results. We employ a linear layer to implement attention computation. The principle is as the following formula (11)–(13). Among them, etj means the output of the attention linear layer a, and j denotes a time step of T which is set to 4 in this paper. xj and h(t−1) are the input of LSTM at time step j. atj represents the weight of each time step leveraging softmax function. denotes the weighted input of LSTM. Fig 3 shows the principle of TA-LSTM working. TA-LSTM has the same network structure and parameter settings with the above mentioned LSTM model.
(11)
(12)
(13)
Multivariate attention.
We also propose multivariate attention-based LSTM(called MA-LSTM) for multivariate prediction of hepatitis E, to increase the expression ability of factors with strong correlation and reduce the interference of factors with weak correlation. A linear layer is used to implement the attention distribution. Firstly, we obtain attention score by a learnable linear layer. Then, computing the attention weight by softmax function is needed, as shown in formula (14). N denotes the dimension of features. Finally, the new input is obtained by dot product. MA-LSTM also has the same network structure and parameter settings with the above mentioned LSTM model.
(14)
Temporal and multivariate attention.
Can we adopt the two attentions (temporal, multivariate attention) to improve the performance of the LSTM? We propose bi-attention-based LSTM (called BiA-LSTM), to verify our ideas. In order to achieve the goal, we first fix the time step attention to train multivariable attention. Then, we alternate the two attention to train again.
Model evaluation
To verify the effectiveness of the methods, we apply three widely used quality indexes, including Root Mean Square Error (RMSE), Mean Absolute Percent Error (MAPE), Mean Absolute Error (MAE). RMSE is sensitive to the maximum or minimum errors in a group of data, and can well express the precision of measurement, as shown in the formula (15). MAPE is a relative error representation method and is also the most popular indicator for evaluating prediction performance, as shown in the formula (16). MAE shows the actual prediction error, as shown in the formula (17). In the above formulas, yi and denote true value and predictive value, respectively. N denotes the number of samples.
(15)
(16)
(17)
Results
Results of GRA
In this study, we analyzed the correlation between the monthly incidence, cases of hepatitis E and meteorological factors by GRA, respectively. There are 10 kinds of meteorological data to be analyzed, including maximum temperature(°C), minimum temperature(°C), total rainfall (mm), average temperature(°C), average water vapor pressure(hPa), average minimum temperature(°C), average maximum temperature(°C), days with daily rainfall greater than 0.1mm(days), duration of sunshine(hours), maximum daily rainfall(mm), which are arranged in the order.
The analysis results are displayed in the form of heat map matrix, as shown in Fig 4. Among them, the Fig 4(a) shows the GRA results between the incidence of hepatitis E and meteorological factors, and the Fig 4(b) illustrates the GRA results between the cases of hepatitis E and meteorological factors. The conclusions of the above two analyses are consistent, so we will only discuss the results of Fig 4(b). The meteorological factors relevant to the cases of hepatitis E are duration of sunshine, total rainfall, maximum daily rainfall, days with daily rainfall greater than 0.1mm, respectively. Besides, we can also find that meteorological factors of the same category have strong correlation, such as temperature factors, rainfall factors, etc.
(a) GRA heat map on the incidence of hepatitis E; (b) GRA heat map on the cases of hepatitis E.
Results of incidence prediction of hepatitis E
To verify the effectiveness of the methods, we also introduced the experimental results of ARIMA and SVM models, which are derived from our previous work [25]. We divide into two groups of our experiments, namely, univariate prediction experiments(ARIMA, SVM, LSTM and A-LSTM) and multivariate prediction experiments(LSTM-MF, MA-LSTM-MF, TA-LSTM-MF, BiA-LSTM-MF, LSTM-All, MA-LSTM-All, TA-LSTM-All and BiA-LSTM-All), to compare the influence of meteorological factors on the incidence prediction of hepatitis E. The performances of all employed models, in terms of RMSE, MAE, and MAPE, are presented in Table 1. The specific prediction values of hepatitis E incidence in all employed models are illustrated in Fig 5. In these methods, the prefix MA(multivariate attention) represents the multi-variable attention mechanism, and the prefix TA(temporal attention) represents the multi-time attention mechanism. The prefix BiA indicates two attention mechanisms (TA and MA). The suffix MF(main factors) indicates that the method uses several meteorological factors with stronger correlation(total rainfall, days with daily rainfall greater than 0.1mm, duration of sunshine, maximum daily rainfall). The suffix All indicates that all meteorological factors are used in the method.
For univariate prediction experiments, we can see from Table 1 that A-LSTM achieves the best performance in term of MAPE(19.50%), comparing with ARIMA, SVM, LSTM(23.50%, 21.70%, 20.74%, respectively). Among the methods with main factors, TA-LSTM-MF obtains the best results in term of MAPE by 13.74%, superior to LSTM-MF, MA-LSTM-MF, BiA-LSTM-MF(14.87%, 13.84%, 14.65%). However, among the methods with all meteorological factors, the best model is TA-LSTM-All, which achieves 12.91% in term of MAPE. BiA-LSTM-All method performs the worst(16.83% in term of MAPE) among all the multivariate prediction methods. Comparing LSTM, LSTM-MF, and LSTM-All(20.74%, 14.87%, 14.74% in term of MAPE, respectively), we can find that the more meteorological factors used, the better the prediction effect. When comparing A-LSTM(19.50% in term of MAPE) with MA-LSTM(13.84%, 12.91%, for MF and All, respectively) and TA-LSTM(13.74%, 13.21%, for MF and All, respectively), we can get the same conclusion. However, the conclusion of method BiA-LSTM-MF and method BiA-LSTM-All is an exception, which the all foactor method is inferior to the method with main factors. We will explain the reasons in the discussion section.
Fig 5 presents the predictive monthly incidence of hepatitis E from July 2015 to December 2017 by all the employed methods. It can be seen from Fig 5 that univariate prediction methods(ARIMA, SVM, LSTM and A-LSTM) perform poorly in predicting inflection point, as shown in the middle of Fig 5. The remaining methods are consistent with the trend of grandtruth, especially MA-LSTM-All and TA-LSTM-All methods.
Results of cases prediction of hepatitis E
Table 2 depicts the evaluation of prediction performance by all the employed methods, in term of RMSE, MAE, and MAPE. The conclusion of this experiment is consistent with that of the previous experiment. A-LSTM is the best model in univariate prediction of hepatitis E cases. MA-LSTM-All is the best in multivariate prediction and all experiments. BiA-LSTM-All also performs poorly in all the multivariate prediction methods. The only difference from the incidence prediction of hepatitis E is that MA-LSTM-MF is better than TA-LSTM-MF. Predictive monthly cases of hepatitis E is shown in Fig 6. It can be seen that the trend of the predictive line generated by MA-LSTM-All is closest to that of grandtruth. Also, the predictive line of MA-LSTM-All is smoother than lines generated by other methods.
Discussion
Analyzing the related factors of infectious disease is crucial to prevent the spread of infectious diseases. Weather condition is one of the main factors that influence the incidence of diseases. In this paper, we analyzed the correlation between incidence of hepatitis E and 10 meteorological factors, as shown in Fig 4. Duration of sunshine is the most important factor influencing the incidence of hepatitis E, followed by rainfall-related factors, and finally temperature-related factors.
Another idea is whether we can leverage machine learning models to predict the incidence of hepatitis E, to prevent the spread of hepatitis E early. To achieve the above goal, we conducted univariate prediction experiments on hepatitis E incidence data and cases data. The experimental results show that deep learning methods(LSTM, A-LSTM) are superior to the statistical method(ARIMA) and the traditional machine learning method(SVM), due to its powerful feature learning capability. Attention mechanism can improve the performance of LSTM.
We want to implore whether meteorological factors can benefit the prediction performance of hepatitis E. So, we did two groups experiments, one group used the meteorological factors with strong correlation, and the other group used all meteorological factors. Compared with univariate methods, the methods with meteorological factors can greatly improve the prediction performance of hepatitis E incidence(from 19.39% to 12.91%, in term of MAPE). Two sets of multivariate prediction experiments show that meteorological factors with weak correlation are also helpful to improve the prediction capability of the model(from 13.84% to 12.91%, in term of MAPE).
Besides, we propose two types of attention mechanisms, including multivariate attention, temporal attention. The multivariate attention mechanism can distinguish the contribution of each factor to the prediction of the model and carry out weighted processing. As for the temporal attention mechanism, we initially assumed that the closer the location to the predicted value, the greater its impact on the predicted value. Compared with LSTM(14.74%), the two attention-based methods improves to 12.91%, 13.21%, in term of MAPE, respectively. When all meteorological factors are used, MA-LSTM-All(12.91%) is better than TA-LSTM-All(13.21%). This means that the multivariate attention mechanism is more suitable for the prediction of hepatitis E than temporal attention mechanism.
Finally, we try to adopt the two attention mechanisms(BiA-LSTM-All) to improve performance of hepatitis E incidence. Unfortunately, the BiA-LSTM methods perform poorly(14.65%, 16.83%). Meanwhile, BiA-LSTM-All is worse than BiA-LSTM-MF, which is different from other methods. We speculate that this is because our sample is too small, resulting in the inability of model to learn two kinds of attention. In addition, it should be noted that the overall performance of hepatitis E incidence prediction is relatively poor, compared with the prediction in other fields. The reason is that the incidence of hepatitis E is affected by many factors, and the historical data is less. In the future, we will further explore methods to improve the prediction performance of hepatitis E incidence.
Conclusion
In this study, we analyzed the correlation between the incidence of hepatitis E and meteorological factors. We implemented more than 10 prediction methods using LSTM and attention-based LSTM. The study presents several important conclusions. 1) By the GRA correlation analysis, we found that duration of sunshine and total rainfall are the important meteorological factors affecting the incidence of hepatitis E. 2) Comparing all the employed methods, it can be found that the meteorological data played a positive role in the incidence prediction of hepatitis E and could greatly improve the prediction performance. 3) We can also draw the conclusion that attention mechanism can improve the prediction performance of hepatitis E incidence. 4) Multivariate attention mechanism(MA-LSTM-All) is superior to temporal attention mechanism(TA-LSTM-All) when all meteorological factors are used.
However, we only explored the correlation between the incidence of hepatitis E and meteorological factors, and more detailed research needs to be carried out. In the future work, we will further study the reasons why the dual attention mechanism method(BiA-LSTM) is not effective. In the future work, we will also study how to play the positive role of exogenous data in order to improve the prediction performance of hepatitis E.
References
- 1. Locarnini S, Chen DS, Shibuya K. No more excuses: viral hepatitis can be eliminated. The Lancet. 2016;387(10029):1703–1704. pmid:27116263
- 2. Lee L, Lin V, Wang R, Zhao H. Public health in China: history and contemporary challenges. Global public health: a new era. 2009;185(10.1093).
- 3. Pérez-Gracia MT, García M, Suay B, Mateos-Lindemann ML. Current knowledge on hepatitis E. Journal of clinical and translational hepatology. 2015;3(2):117. pmid:26355220
- 4. Boccia D, Guthmann JP, Klovstad H, Hamid N, Tatay M, Ciglenecki I, et al. High mortality associated with an outbreak of hepatitis E among displaced persons in Darfur, Sudan. Clinical infectious diseases. 2006;42(12):1679–1684. pmid:16705571
- 5. Bhatnagar G, Sharma S, Kumar A, Prasad S, Agarwal S, Kar P. Reduced glutathione in hepatitis E infection and pregnancy outcome. Journal of Obstetrics and Gynaecology Research. 2016;42(7):789–795. pmid:27075039
- 6. Pérez-Gracia MT, Suay-García B, Mateos-Lindemann ML. Hepatitis E and pregnancy: current state. Reviews in medical virology. 2017;27(3):e1929. pmid:28318080
- 7. Ren H, Li J, Yuan ZA, Hu JY, Yu Y, Lu YH. The development of a combined mathematical model to forecast the incidence of hepatitis E in Shanghai, China. BMC infectious diseases. 2013;13(1):1–6. pmid:24010871
- 8. Meyers LA, Meltzer MI, Jernigan DB, Hatchett R, Iskander JK, Thorpe P, et al. Staying ahead of the curve: modeling and public health decision-making. 2016;.
- 9. Mora C, McKenzie T, Gaw IM, Dean JM, von Hammerstein H, Knudson TA, et al. Over half of known human pathogenic diseases can be aggravated by climate change. Nature climate change. 2022;12(9):869–875. pmid:35968032
- 10. Xiaocen Li CY Xiangying Zhu. Influence of climatic factors on virus epidemic. Science and Technology Review. 2020;38(23):138.
- 11. Yu G, Li Y, Cai J, Yu D, Tang J, Zhai W, et al. Short-term effects of meteorological factors and air pollution on childhood hand-foot-mouth disease in Guilin, China. Science of the Total Environment. 2019;646:460–470. pmid:30056233
- 12. Duan C, Zhang X, Jin H, Cheng X, Wang D, Bao C, et al. Meteorological factors and its association with hand, foot and mouth disease in Southeast and East Asia areas: a meta-analysis. Epidemiology & Infection. 2019;147.
- 13. Chien LC, Chen LW. Meteorological impacts on the incidence of COVID-19 in the US. Stochastic Environmental Research and Risk Assessment. 2020;34(10):1675–1680. pmid:32837311
- 14. Yuan J, Wu Y, Jing W, Liu J, Du M, Wang Y, et al. Association between meteorological factors and daily new cases of COVID-19 in 188 countries: A time series analysis. Science of The Total Environment. 2021;780:146538. pmid:34030332
- 15. Leal PR, Guimarães RJdPSe, Kampel M. Associations Between Environmental and Sociodemographic Data and Hepatitis-A Transmission in Pará State (Brazil). GeoHealth. 2021;5(5):e2020GH000327. pmid:34027261
- 16. Baek K, Choi J, Park JT, Kwak K. Influence of temperature and precipitation on the incidence of hepatitis A in Seoul, Republic of Korea: a time series analysis using distributed lag linear and non-linear model. International Journal of Biometeorology. 2022;66(9):1725–1736. pmid:35829753
- 17. Tricou V, Bouscaillou J, Laghoe-Nguembe GL, Béré A, Konamna X, Sélékon B, et al. Hepatitis E virus outbreak associated with rainfall in the Central African Republic in 2008–2009. BMC Infectious Diseases. 2020;20(1):1–7. pmid:32245368
- 18. Wang C, Li Y, Feng W, Liu K, Zhang S, Hu F, et al. Epidemiological features and forecast model analysis for the morbidity of influenza in Ningbo, China, 2006–2014. International journal of environmental research and public health. 2017;14(6):559. pmid:28587073
- 19. Li Z, Li Y. A comparative study on the prediction of the BP artificial neural network model and the ARIMA model in the incidence of AIDS. BMC medical informatics and decision making. 2020;20(1):1–13.
- 20. Roy S, Bhunia GS, Shit PK. Spatial prediction of COVID-19 epidemic using ARIMA techniques in India. Modeling earth systems and environment. 2021;7:1385–1391. pmid:32838022
- 21. Li J, Pedrycz W, Jamal I. Multivariate time series anomaly detection: A framework of Hidden Markov Models. Applied Soft Computing. 2017;60:229–240.
- 22. Faruk DÖ. A hybrid neural network and ARIMA model for water quality time series prediction. Engineering applications of artificial intelligence. 2010;23(4):586–594.
- 23. Cortes C, Vapnik V. Support-vector networks. Machine learning. 1995;20:273–297.
- 24. Guo Y, Han S, Shen C, Li Y, Yin X, Bai Y. An adaptive SVR for high-frequency stock price forecasting. IEEE Access. 2018;6:11397–11404.
- 25. Guo Y, Feng Y, Qu F, Zhang L, Yan B, Lv J. Prediction of hepatitis E using machine learning models. Plos one. 2020;15(9):e0237750. pmid:32941452
- 26. Khan S, Ullah R, Khan A, Wahab N, Bilal M, Ahmed M. Analysis of dengue infection based on Raman spectroscopy and support vector machine (SVM). Biomedical optics express. 2016;7(6):2249–2256. pmid:27375941
- 27.
Mikolov T, Karafiát M, Burget L, Cernockỳ J, Khudanpur S. Recurrent neural network based language model. In: Interspeech. vol. 2. Makuhari; 2010. p. 1045–1048.
- 28. Wang Y, Xu C, Zhang S, Yang L, Wang Z, Zhu Y, et al. Development and evaluation of a deep learning approach for modeling seasonality and trends in hand-foot-mouth disease incidence in mainland China. Scientific reports. 2019;9(1):8046. pmid:31142826
- 29. Peng L, Wang L, Xia D, Gao Q. Effective energy consumption forecasting using empirical wavelet transform and long short-term memory. Energy. 2022;238:121756.
- 30. Lim SC, Huh JH, Hong SH, Park CY, Kim JC. Solar Power Forecasting Using CNN-LSTM Hybrid Model. Energies. 2022;15(21):8233.
- 31. Wu B, Wang L, Wang S, Zeng YR. Forecasting the US oil markets based on social media information during the COVID-19 pandemic. Energy. 2021;226:120403. pmid:34629690
- 32. Wu B, Wang L, Zeng YR. Interpretable wind speed prediction with multivariate time series and temporal fusion transformers. Energy. 2022;252:123990.
- 33. Cheng X, Liu W, Zhang X, Wang M, Bao C, Wu T. Predicting incidence of hepatitis E using machine learning in Jiangsu Province, China. Epidemiology & Infection. 2022;150:e149.
- 34. Samaras L, García-Barriocanal E, Sicilia MA, et al. Syndromic surveillance models using web data: the case of influenza in Greece and Italy using Google Trends. JMIR public health and surveillance. 2017;3(4):e8015. pmid:29158208
- 35. He Y, Zhao Y, Chen Y, Yuan HY, Tsui KL. Nowcasting influenza-like illness (ILI) via a deep learning approach using google search data: An empirical study on Taiwan ILI. International Journal of Intelligent Systems. 2022;37(3):2648–2674.
- 36. Prasanth S, Singh U, Kumar A, Tikkiwal VA, Chong PH. Forecasting spread of COVID-19 using google trends: A hybrid GWO-deep learning approach. Chaos, Solitons & Fractals. 2021;142:110336. pmid:33110297
- 37. Ayyoubzadeh SM, Ayyoubzadeh SM, Zahedi H, Ahmadi M, Kalhori SRN. Predicting COVID-19 incidence through analysis of google trends data in Iran: data mining and deep learning pilot study. JMIR public health and surveillance. 2020;6(2):e18828. pmid:32234709
- 38. Nguyen VH, Tuyet-Hanh TT, Mulhall J, Minh HV, Duong TQ, Chien NV, et al. Deep learning models for forecasting dengue fever based on climate data in Vietnam. PLoS Neglected Tropical Diseases. 2022;16(6):e0010509. pmid:35696432
- 39.
Saturi S, Sravani M, Hruthika SC, Sambaraju M, Prudvendra R, Kiran S. Development of Prediction and Forecasting Model for Dengue Disease Based on the Environmental Conditions Using LSTM. In: Data Engineering and Intelligent Computing: Proceedings of 5th ICICC 2021, Volume 1. Springer; 2022. p. 425–435.
- 40. Zhang R, Guo Z, Meng Y, Wang S, Li S, Niu R, et al. Comparison of ARIMA and LSTM in forecasting the incidence of HFMD combined and uncombined with exogenous meteorological variables in Ningbo, China. International journal of environmental research and public health. 2021;18(11):6174. pmid:34200378
- 41. Zha Wt, Li Wt, Zhou N, Zhu Jj, Feng R, Li T, et al. Effects of meteorological factors on the incidence of mumps and models for prediction, China. BMC Infectious Diseases. 2020;20(1):1–11.
- 42. Peng T, Chen X, Wan M, Jin L, Wang X, Du X, et al. The prediction of hepatitis E through ensemble learning. International journal of environmental research and public health. 2021;18(1):159.