Figures
Abstract
Background: Head and neck cancer (HNC) becomes a vital global health burden. Accurate assessment of the disease burden plays an essential role in setting health priorities and guiding decision-making.
Methods: This study explores data from the Global Burden of Disease (GBD) 2021 study, involving totally 204 countries during the period from 1980 to 2021. The analysis focuses on age-standardized incidence, mortality, and disability-adjusted life years (DALYs) for HNC. A Transformer-based model, HNCP-T, is used for the prediction of future trends from 2022 to 2030, quantified based on the estimated annual percentage change (EAPC).
Results: The global age-standardized incidence rate (ASIR) for HNC has escalated between 1980 and 2021, with men bearing a higher burden than women. In addition, the burden rises with age and exhibits regional disparities, with the greatest impact on low-to-middle sociodemographic index (SDI) regions. Additionally, the model predicts a continued rise in ASIR (EAPC = 0.22), while the age-standardized death rate (ASDR) is shown to decrease more sharply for women (EAPC = -0.92) than men (EAPC = –0.54). The most rapid increase in ASIR is projected for low-to-middle SDI countries, while ASDR and DALY rates are found to decrease in different degrees across regions.
Conclusions: The current work offers a detailed analysis of the global burden of HNC based on the GBD 2021 dataset and demonstrates the accuracy of the HNCP-T model in predicting future trends. Significant regional and gender-based differences are found, with incidence rates rising, especially among women and in low-middle SDI regions. Furthermore, the results underscore the value of deep learning models in disease burden prediction, which can outperform traditional methods.
Citation: Hu Q, Lv S, Wang X, Pan P, Gong W, Mei J (2025) Global burden and future trends of head and neck cancer: a deep learning-based analysis (1980–2030). PLoS ONE 20(4): e0320184. https://doi.org/10.1371/journal.pone.0320184
Editor: Enes Erul, Ankara University, TÜRKIYE
Received: October 31, 2024; Accepted: February 14, 2025; Published: April 9, 2025
Copyright: © 2025 Hu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data source files are available from the GBD database (URLs: https://www.healthdata.org/research-analysis/gbd).
Funding: This work was supported in study design by the Anhui Provincial Science and Technology Department under Grant 2022AH050662, and in data collection and analysis by the Anhui Provincial Postgraduate Education Quality Engineering Project under Grant 2022zyxwjxalk060, and in decision to publish by the Research Fund of Anhui Institute of translational medicine under Grant 2022zhyx-C42, and in preparation of the manuscript by the National Natural Science Foundation Incubation Program of The Second Affiliated Hospital of Anhui Medical University under Grant 2021GMFY04.
Competing interests: The authors have declared that no competing interests exist.
Introduction
In both developed and developing countries, cancer ranks as a primary reason for death, also bringing about substantial emotional, social, and economic effects [1,2]. Among them, head and neck cancer (HNC) is regarded as the seventh most common cancer worldwide, responsible for over 660,000 new cases and 325,000 deaths annually [3,4]. Anatomically, HNC is a group of malignant tumors occurring above the clavicle, below the skull base, and in the anterior region of the cervical vertebral column, such as thyroid cancer (TC), lip and oral cavity cancer (LOC), laryngeal cancer (LC), nasopharyngeal cancer (NPC), and other pharyngeal cancers (OPC). These cancers pose a great global health burden, with patients frequently confronting severe challenges both from the tumor itself and from treatment [5]. Head and neck tumors can invade or compress vital structures, leading to difficulty swallowing (dysphagia) when the esophagus or pharynx is affected and difficulty breathing (dyspnea) when the airway is blocked. Involvement of the larynx or vocal cords can result in speech disorders, making communication difficult. Tumor invasion of nerves may cause pain and sensory impairment, further reducing quality of life. In addition, treatments such as surgery and radiation therapy may result in functional impairment and disfigurement. These physical burdens are often accompanied by psychosocial complications such as depression and social isolation, emphasizing the severe impact of the disease on patients.
Despite the obtained advances in surgical techniques, radiotherapy [6], and molecular targeted therapies [6] that have enhanced early diagnosis and treatment, HNC is a vital public health issue, especially in regions with limited resources. Efforts made by the World Health Organization (WHO) to lower non-communicable diseases, including HNC [7], through targeted interventions addressing behavioral and dietary risk factors such as smoking, hypertension, and poor nutrition, are in consistence with the United Nations’ Sustainable Development Goals to decrease premature mortality by 2030. Nevertheless, predicting future trends in HNC plays a vital role in guiding these global health initiatives [8], as current efforts to lower incidence, mortality, and disability-adjusted life years (DALYs) still remain unclear.
Reliable global predictions of HNC trends are crucial for shaping effective prevention strategies and optimizing healthcare services. Previous studies on HNC burden have been limited by small sample sizes, sociological factors [9], and regional economic disparities [10,11], hindering a comprehensive understanding of the global impact. The GBD database provides a valuable resource for overcoming these limitations by offering standardized methodologies to assess disease burden across populations and time periods. However, to better verify and quantify the HNC burden, more comprehensive global data is required [12].
Therefore, we apply the latest GBD data from 1980 to 2021 to assess global trends in HNC. Traditional statistical methods, such as linear regression, are commonly applied in epidemiological studies but often fall short in their ability to capture the complex interactions and patterns existing in disease progression. In recent years, deep learning methods have emerged as a powerful tool in medical predictions [13]. Among these, transformer-based models have gained prominence due to their ability to capture long-range dependencies and handle complex nonlinear relationships more effectively than traditional Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) networks [14,15]. This makes them particularly suitable for predicting trends in diseases such as head and neck cancers, where intricate temporal dynamics and multifaceted interactions between various risk factors are prevalent. By utilizing the advanced models, our goal is to improve the accuracy and reliability of predictions [16], ultimately contributing to the development of more feasible public health strategies.
In this study, we conduct a comprehensive analysis of global trends in HNC incidence, mortality, and DALYs from 1980 to 2021, and employ a transformer-based HNC prediction method (HNCP-T) to forecast age-standardized rates of HNC incidence, mortality, and DALYs from 2022 to 2030. By utilizing data from 1980 to 2021, we are aimed at providing actionable insights for future decision-making, emphasizing the potential of deep learning to improve disease burden prediction and support public health policy.
Materials and methods
As shown in Fig 1, we propose a novel strategy for predicting the global burden of HNC, referred to as HNCP-T. This section details the design and methods behind the HNCP-T strategy. In the section Data source, we introduce the used dataset. Meanwhile, the employed data preprocessing techniques are outlined in the section Data preprocessing. Further introduction of the model architecture is provided in the section Design of the predictive model, while the section Model training and validation presents the model’s training process.
Data processing flow for the Global Burden of Disease (GBD) dataset. At first, the K-Nearest Neighbors (KNN) imputation technique is employed to address missing data in the GBD dataset. Then, calendar years from 1980 to 2021 are transformed into time steps for time series analysis. Subsequently, Z-score normalization is applied to standardize the dataset after deriving new variables. The dataset is split into two parts: data from 1980 to 2016 is used for training, while that from 2017 to 2021 is reserved for testing; Training, testing, and prediction strategy of the proposed model. The model is consisted of an embedding layer (yellow), transformer layers (orange, with structural details shown in the figure, and a fully connected (FC) layer (green). First, the model is trained using the training data, after which its performance is tested on the test dataset. Finally, the trained model is used for predicting the global burden of HNC from 2022 to 2030; Transformer layer architecture. Each transformer layer includes a block of layer normalization followed by multi-headed self-attention, and another block of layer normalization followed by a multi-layer perceptron (MLP). Skip connections are applied across each block to enhance model performance.
Data source
In this study, we utilize the GBD 2021 dataset, providing comprehensive data on HNC incidence, mortality, DALYs, as well as age-standardized HNC incidence, mortality, and DALY rates worldwide between 1980 and 2021. In addition, the dataset also includes estimates of HNC risk factors between 2022 and 2030, stratified by age, gender, and sociodemographic index (SDI) levels.
Following a comprehensive literature review and analysis of available data, the global burden of HNC data is chosen from the Global Health Data Exchange (GHDx), provided by the Institute for Health Metrics and Evaluation (IHME) [17]. Sponsored by IHME at the University of Washington, GBD is a highly exhaustive global epidemiological study on a global scale [18–20].
The specifics of the data sources are available in other references [17]. In the most recent GBD study, covering data through 2021, age standardization is conducted using the direct method, based on the global age structure of 2021. Based on on global average estimates from 1980 to 2021 obtained from the GHDx, our study further predicts age-standardized HNC incidence, death, and DALY rates for the period 2022 to 2030.
Data preprocessing
This study performs an extensive analysis and prediction using the GBD dataset, spanning the years 1980 to 2021, with forecasts extended through 2030. As displayed in Fig 1, the data preprocessing phase involved several critical steps to sustain the integrity and quality of the data applied in the present study.
Handling missing data with K-Nearest neighbors imputation.
To address the missing data, the K-Nearest Neighbors (KNN) imputation technique was applied. KNN imputation is a reliable method estimating missing values based on the nearest neighboring data points within the dataset, effectively preserving local structures and patterns. By maintaining these local relationships, this method enhances the predictive power of our analysis, ensuring that the underlying associations within the data are retained for more accurate predictions.
Derivation of new variables: EAPC.
One of the variables derived in our analysis is the Estimated Annual Percentage Change (EAPC), a key metric which can be used for measuring trends in age-standardized rates (ASR) over a specified time period [21]. EAPC provides novel insight into the annual rate of change for metrics like incidence or death rates. Then, it is determined by:
where p ( y ∣ m ) indicates the slope of the natural logarithm of the ASR, obtained from a linear regression model in which the calendar year refers to the independent variable.
Then, along with its 95% confidence interval (CI), the EAPC is applied for evaluating whether the rate is increasing or lowering over time.
Variable selection and transformation.
Standardized indicators more effectively account for changes in population structure, offering more accurate predictions and supporting the development of improved public health strategies and policies [19,22]. By providing consistent and comparable metrics across different regions and time periods, these indicators help public health officials identify trends, allocate resources efficiently, and implement targeted interventions. Since predicting these indicators yields valuable data at both global and regional levels [18], age-standardized HNC incidence, death, and DALY rates have been selected as the primary outcome variables for this study. The predictor variables for age-standardized HNC can be detected through an extensive literature review on Google Scholar, using search terms including “risk factors", or “predictors" of HNC. Following this review, three key factors j = 1 , … , 8 elevated plasma glucose levels, high body mass index (BMI), and the SDI are selected for the prediction model, on the basis of data from the GHDx and the IHME. Additionally, the calendar years from 1980 to 2021 are transformed into time steps for time series analysis. Furthermore, this transformation plays an essential role in converting data into a format suitable for time series modeling, allowing this model to effectively obtain temporal dependencies and trends.
Standardization using Z-score.
In order to standardize the dataset, this study employs Z-score normalization, a widely applied method in data preprocessing, ensuring that all features are centered at a mean of zero with a standard deviation of one. This step is of particular importance when handling variables with varying scales, as it can prevent any single variable from disproportionately affecting the model.
The Z-score for a given data point is expressed by:
where suggests an individual data point, j = 1 , … , 8 represents the mean of the dataset, and j = 1 , … , 6 denotes the standard deviation of the data .
Z-score normalization is vital in machine learning models, especially when features differ in units or scales. Through converting the data into a standardized form, Z-score normalization helps the model converge more efficiently and prevents larger-scale features from dominating the model’s behavior. This standardization ensures that each feature makes equal contributions to the analysis, enhancing the overall accuracy and performance of the model.
Data splitting.
The dataset is categorized into two parts: data from 1980 to 2015 is applied as the training set, while data from 2016 to 2021 is reserved as the test set. When compared with shuffling and randomly splitting the dataset, this approach ensures that the model is trained on historical data and evaluated on more recent data, which allows for a more accurate assessment of its predictive performance on unseen data, referring to the test dataset that the model has not encountered during training.
Through carefully following these preprocessing steps, it can be ensured that the dataset is optimized for analysis and predictive modeling, offering a solid basis for reliable and valid results.
Design of the predictive model
The architecture of our predictive model, as displayed in Fig 1, consists of several key components specifically designed to enhance the handling of temporal and contextual dependencies within the dataset. Unlike traditional RNNs and LSTMs, which process data sequentially and may struggle with long-term dependencies, the transformer-based architecture employs self-attention mechanisms that allow for parallel processing and more efficient capturing of long-range interactions [15]. This architectural choice not only improves computational efficiency but also enhances the model’s ability to discern complex patterns within the data.
- Embedding layer: This initial layer converts categorical input data into dense vector representations. Reducing dimensionality and noise is crucial in this predictive context because high-dimensional data, especially with categorical variables, can lead to overfitting and obscure important patterns. By embedding categorical inputs, the model simplifies the data representation, removing irrelevant variations and highlighting essential relationships. This reduction in complexity not only improves the model’s predictive accuracy but also enhances its ability to generalize to new data, offering a clearer, more reliable understanding of the underlying patterns in HNC trends. This dense representation offers a richer set of features for the subsequent layers, therefore enhancing the model’s capability of effectively processing and learning from the input data.
- Transformer layers: Following the Embedding layer, the model employs three Transformer layers, as displayed in Fig 1. These layers are vital for obtaining complex dependencies and interactions across different time steps in the input data. The multi-head self-attention mechanism within the Transformer layers allows the model to concentrate on multiple segments of the input sequence simultaneously, enabling it to recognize vital features regardless of their positional context. Moreover, this capability is vital for modeling the intricate dynamics present in time series data. This ensures that the model can accurately obtain long-range dependencies.
- Fully connected layer: Concluding the architecture is a Fully Connected (FC) layer that can aggregate the high-dimensional features extracted by the Transformer layers. This layer combines information by taking the rich, multi-dimensional representations learned through the Transformer’s self-attention mechanism and integrating them into a more compact form. It does so by weighting and summing these features, effectively capturing the complex feature interactions across the entire sequence. Then, the FC layer maps these aggregated features to the desired output dimensions, transforming the complex relationships into predictions tailored to the specific forecasting task. This process ensures that the final predictions leverage all the relevant information captured by the preceding Transformer layers.
The sequential configuration of these components—from the initial data embedding to processing through Transformer layers, and finally, aggregation in the FC layer—is designed to maximize the extraction and interpretation of useful patterns for time series forecasting. This architecture can ensure that each stage of the model contributes to a comprehensive understanding and processing of the input data, culminating in accurate and reliable predictions. As presented in Fig 1, the prediction model consists of an Embedding layer, followed by three Transformer layers, and a final FC layer. Furthermore, this architecture uses the power of Transformer layers to effectively handle sequential data, ensuring that the model captures both temporal dynamics and contextual relationships, which are vital for obtaining high prediction accuracy in time series forecasting tasks.
In summary, the methodological approach employed in this study, which incorporates deep learning methods, plays an important role in capturing the complex temporal patterns and relationships within the HNC data. By utilizing age-standardized incidence, death, and DALY rates as key outcome variables, this approach allows for a more accurate and comprehensive analysis of the global burden of HNC. These methods not only improve the reliability of predictions but also provide valuable insights that can inform public health strategies and policy development aimed at mitigating the impact of HNC globally.
Model training and validation
To explore and predict the global burden of HNC from 1980 to 2030 using the GBD dataset, this study implements a structured training strategy consisting of multiple stages including data preparation, model training, and testing.
Our model is trained on data from 1980 to 2015, including comprehensive information on global HNC incidence, mortality, DALYs, ASIR, ASDR, and age-standardized DALYs. This time range is chosen to ensure that the model could capture long-term trends and patterns in the burden of HNC across regions and demographics.
The model is trained with the Mean Absolute Error (MAE) as the loss function, delimited by the following formula:
where represents the actual observed value,
refers to the predicted value, and
suggests the number of observations. MAE is chosen due to its robustness in the presence of outliers and its straightforward interpretation as the average magnitude of errors in predictions. This loss function is particularly suitable for our task, as it can effectively handle outliers and provide a robust measure of prediction accuracy.
After training the model, we tested its performance using the obtained data from 2016 to 2021. During this period, this test dataset is fed into the trained model to generate predictions for global HNC burden metrics. Specifically, the model predicted global incidence, mortality, DALYs, ASIR, ASDR, as well as age-standardized DALYs. This testing phase is vital for demonstrating the model’s accuracy and generalization, ensuring its ability to reliably predict the global burden of HNC on unseen data.
The model’s performance on the test set provided essential insights into its generalization ability and allowed us to identify areas where the model could be improved. The evaluation on the test set guided the training process in several key ways: Firstly, the results from the test set reveal specific areas where the model’s predictions are less accurate. Also, by analyzing the test set predictions, we identify which predictor variables (such as plasma glucose levels, BMI, and SDI) have the most significant impact on the model’s accuracy. Furthermore, based on the performance discrepancies observed on the test set, we employ regularization techniques, such as dropout and L2 regularization, to prevent overfitting. This ensured that the model maintained its predictive power even on unseen data, improving generalization. Last but not least, the test set results help us identify specific cases of large prediction errors. By focusing on these outliers, we could modify the training process to give more attention to the problematic time frames or regions, thereby refining the model’s accuracy.
Based on a validated and trained model, we proceeded to predict the global burden of HNC from 2022 to 2030. The predictions generated by the model for this period provide novel insights into potential future trends in HNC incidence, mortality, and DALYs, as well as their age-standardized rates. These predictions are crucial for global health planning and policy-making, offering a data-driven foundation for anticipating and addressing the future burden of HNC.
Moreover, the structured approach to model training and testing ensures that the model not only obtains historical trends accurately but also offers reliable forecasts for future global health outcomes which are associated with HNC.
Results
This section analyzes the global burden of HNC and predicts its future trends. We examine global incidence, mortality, DALYs, ASIR, ASDR, and age-standardized DALYs of HNC on the basis of the GBD dataset, and the analysis results are shown in Section. In order to offer a clearer understanding of future global HNC burden, this study forecasts ASIR, ASDR, and age-standardized DALYs from 2022 to 2030 presented in Section.
Analyze the global trend of burden of HNC
Tables 1 and 2 and Figs 2–5 present the incidence, mortality, DALYs, ASIR, ASDR, age-standardized DALYs, and EAPC of HNC worldwide from 1980 to 2021 categorized by gender, SDI and age.
(a) The trends in disability-adjusted life years (DALYs) for different subtypes of HNC, and the overall trend in DALYs for HNC. (b) The trends in Deaths for different subtypes of HNC, and the overall trend in Deaths for HNC. (c) The trends in incidence for different subtypes of HNC, and the overall trend in incidence for HNC. (d) The trends in age-standardized DALYs for different subtypes of HNC, and the overall trend in age-standardized DALYs for HNC. (e) The trend in age-standardized death rate (ASDR) for different subtypes of HNC, and the overall trend in ASDR for HNC. (f) The trend in age-standardized incidence rate (ASIR) for different subtypes of HNC, and the overall trend in ASIR for HNC.
(a) The trend in disability-adjusted life years (DALYs) of HNC across different age groups (5-year intervals). (b) The trend in Deaths of HNC across different age groups (5-year intervals). (c) The trend in incidence of HNC across different age groups (5-year intervals).
(a) The trend in disability-adjusted life years (DALYs) of HNC across different gender groups (male and female). (b) The trend in Deaths of HNC across different gender groups (male and female). (c) The trend in incidence of HNC across different gender groups (male and female). (d) The trend in age-standardized DALYs of HNC across different gender groups (male and female). (e) The trend in age-standardized death rate (ASDR) of HNC across different gender groups (male and female). (f) The trend in age-standardized incidence rate (ASIR) of HNC across different gender groups (male and female).
(a) The trend in disability-adjusted life years (DALYs) of HNC across different SDI groups (high SDI, high-middle SDI, middle SDI, low-middle SDI, and low SDI). (b) The trend in deaths of HNC across different SDI groups. (c) The trend in incidence of HNC across different SDI groups. (d) The trend in age-standardized DALYs of HNC across different SDI groups. (e) The trend in age-standardized death rate (ASDR) of HNC across different SDI groups. (f) The trend in age-standardized incidence rate (ASIR) of HNC across different SDI groups.
The global trend of burden of HNC.
In 2021, the global number of DALYs of HNC is 15,597,835 (j = 7 , 8 CI: 14,180,450–16,948,531 ), and age-standardized DALYs is LRj (π ∼ Uniform ( 0 . 05 , 0 . 5 ) CI: 162.94–194.93) (Table 1, Fig 2a). The number of deaths is σ = 1 (95% CI: 499,649–587,847), causing an ASDR of 6.30 (95% CI: 5.78–6.80) (Table 1, Fig 2b). The number of incidence is 1 , 160 , 696 (95% CI: 1,064,793–1,256,106), with an ASIR 13.40 (95% CI: 12.30–14.51) (Table 1, Fig 2c). By analyzing the age-standardized trends, it is evident that the ASIR has exhibited a progressive rise, with EAPC of 0.12 (95% CI: 0.06–0.19) (Table 1, Fig 2f). By contrast, the ASDR and age-standardized DALYs have revealed a declining trend, with EAPC of –0.65 (95% CI: [–0.72]–[–0.57]) and –0.94 (95% CI: [–1.04]–[–0.84]), respectively (Table 1, Fig 2d and 2e).
The global trend of burden of HNC by SDI.
AS is shown in Table 1. In 2021, compared with 1980, there was a general decrease in ASDR and age-standardized DALYs across all SDI regions, while incidence, death, DALYs, and ASIR exhibited an increasing trend. Notably, in 2021, the high SDI level region exhibited the highest ASIR for HNC, with a value of 15.96 (95% CI: 15 . 10 − 16 . 73) (Table 1 and Fig 5f). However, it also displayed the lowest ASDR 4.00 (95% CI: 3 . 71 − 4 . 21) and the lowest age-standardized DALYs 107.97 (95% CI: 101 . 78–113 . 89) (Table 1 and Fig 5). According to the data from 1980 to 2021, a comprehensive analysis of different SDI regional data reveals a slowly increasing trend in the incidence, mortality, and DALYs (Fig 5a–5c). Conversely, there is a decreasing trend in ASDR and age-standardized DALYs, except in the low-medium or low SDI regions with stable trend (Fig 5d and 5e).
The global trend of burden of HNC by gender.
Table 1 displays that the EAPC of ASIR in females is 0.50 (95% CI: 0 . 45 − 0 . 55) more than the content in males of –0.09 ((95% CI: [–0.17]–[–0.01]) (Fig 4f). Simultaneously, the EAPC of ASDR is − 0 . 56 ((95% CI: [–0.63]–[–0.50]) in females and − 0 . 73 ((95% CI: [–0.81]–[–0.66]) in males (Fig 4e). However, male patients show higher values in incidence, mortality, DALYs, ASIR, ASDR, and age-standardized DALYs relative to female patients (Fig 4a–4f)).
The global trend of burden of HNC by age.
The trends of incidence, deaths, DALYs for HNC across different age groups (5-year intervals) from 1980 to 2021 are presented in Table 2. In 2021, compared with 1990, there is an overall increasing trend in the incidence, deaths, and DALYs for HNC with increasing age, except for a decrease in deaths and DALYs among individuals aged under 20 (Fig 3a–3c). Since 2005, the DALYs in the 55-59 age group have surpassed those of the 50-54 age group, which can maintain the highest values in the subsequent years (Fig 3a). Meanwhile, its incidence has consistently ranked among the top two, along with the 60-64 age group (Fig 3c). In terms of mortality, the 65-69 age group has recently overtaken the 60-64 age group, becoming the group with the highest number of deaths (Fig 3b). When analyzing incidence and mortality trends, we find that the 70-74 age group has exhibited the fastest growth in both categories in recent years, requiring further attention (Fig 3b–3c).
Prediction of the global trend of burden of HNC
After analyzing the global burden of HNC, research was performed to predict the future burden of HNC using our designed HNCP-T framework. In combination with the previous analysis, these predictions offer novel insights for the prevention and management of HNC.
Prediction of the global trend of burden of HNC, from 2022 to 2030.
The ASIR is projected to increase annually from 2022 to 2030 (EAPC = 0.22, 95% CI: 0.09–0.35). By contrast, the ASDR and age-standardized DALYs are expected to decline during the same period, with EAPCs of –0.40 (95% CI: [–0.63]–[–0.16]) and –0.18 (95% CI: –1.03]–[–0.59]), respectively (Table 3 and Fig 6a–6c).
Prediction of the global trend of burden of HNC by Gender, from 2022 to 2030.
From 1990 to 2021, the ASIR of HNC shows an upward trend for both men and women, which is a trend that is projected to continue from 2022 to 2030 (Table 3 and Fig 7a and 7d). The EAPC for ASIR in women is 0.48 (95% CI: 0.26–0.71), which is more than double the growth rate observed in men, whose EAPC for ASIR is 0.20 (95% CI: 0.14–0.26 )(Table 3). However, the ASDR and age-standardized DALYs are expected to decline for both men and women from 2022 to 2030, with EAPCs of –0.54 (95% CI: [–0.61]–[–0.46]) and –0.92 (95% CI: [–1.14] –[–0.69]) , respectively (Table 3 and Fig 7b and 7e). In addition, the age-standardized DALYs rate is predicted to decrease slightly, with an EAPC of –0.14 (95% CI: [–0.83]–[–0.49]) for women and –0.66 (95%: [–0.83]–[–0.49]) for men (Table 3 and Fig 7c and 7f).
(a) to (c) show the comparison between the predicted and actual GDB data for females, covering ASIR, ASDR, and age-standardized disability-adjusted life years (DALYs), respectively, using the HNCP-T strategy. (d) to (f) present the same comparisons for males. As shown in these figures, the blue area represents the comparison between the training results of HNCP-T and actual GDB data from 1980 to 2015, the gray area covers the comparison between test results and actual data from 2016 to 2021, and the yellow area represents predictions for 2022 to 2030.
Prediction of the global trend of burden of HNC by SDI, from 2022 to2030.
From 2022 to 2030, global HNC burden trends show a strong correlation with socio-economic development. The disease indicators in high SDI areas are trending down (Fig 8a–8c), while disease indicators in low SDI areas are trending up (Table 3 and Fig 9e–9g). As illustrated in Figs 8 and 9, the forecasted trends in disease burden from 2022 to 2030 reveal significant discrepancy across regions with varying SDI levels. In high SDI regions, all three indicators are expected to decline, with estimated annual percentage changes (EAPCs) of − 0 . 15, (95% CI: [ − 0 . 38 ] - 0.09), − 1 . 08, (95% CI: [ − 1 . 16 ] − [ − 0 . 99 ] ), and − 1 . 08, (95% CI: [ − 1 . 16 ] − [ − 0 . 99 ] ), respectively (Table 3 and Fig 8). However, in other regions, specifically high-middle SDI, middle SDI, low-middle SDI, and low SDI, an increasing trend in ASIR is expected from 2022 to 2030 (Table 3 and Figs 8d–8f). Notably, the EAPC for ASIR is the highest in low-middle SDI regions (1.39, 95% CI: 1 . 15 − 1 . 63), followed by middle SDI regions (0.45, 95% CI: 0.14, 0.76) (Table 3 and Fig 8a, 8d, 8g and 9b, 9e)). Meanwhile, the ASDR and age-standardized DALY rate are expected to decline in these regions (Fig 9b, 9c, 9e, 9f. High-middle SDI regions notably exhibit the lowest EAPC for ASDR at − 1 . 82 (95% CI: [ − 1 . 97 ] , [ − 1 . 69 ] ) and for age-standardized DALYs at − 0 . 86 (95% CI: [ − 0 . 98 ] , [ − 0 . 74 ] ), followed by high SDI regions with EAPCs of − 1 . 08 (95% CI: [ − 1 . 16 ] , [ − 0 . 99 ] ) for ASDR and − 0 . 86 (95% CI : [ − 0 . 98 ] , [ − 0 . 74 ] ) for age-standardized DALYs, respectively (Fig 9a, 9d, 9e).
(a) to (c) illustrate the comparison between the predicted and actual GBD data for high SDI, covering age-standardized incidence rate (ASIR), age-standardized death rate (ASDR), and age-standardized disability-adjusted life years (DALYs), respectively, using the HNCP-T strategy. (d) to (f) present the same comparisons for high-middle SDI. (g) to (h) show the comparison between the predicted and actual GBD data for middle SDI, covering ASIR and ASDR, using the HNCP-T strategy. As presented in these figures, the blue area indicates the comparison between the HNCP-T training results and actual GBD data from 1980 to 2015, the gray area covers the comparison between test results and actual data from 2016 to 2021, and the yellow area represents predictions for 2022 to 2030.
(a) illustrates the comparison results in age-standardized DALYs between the predicted and actual GDB data for middle SDI, using the HNCP-T strategy. (b) to (d) present the comparison between the predicted and actual GDB data for low middle SDI, covering age-standardized incidence rate (ASIR), age-standardized death rate (ASDR) and age-standardized disability-adjusted life years (DALYs), respectively, using the HNCP-T strategy. (d) to (f) present the same com-parisons for high middle SDI. (e) to (g) indicate the comparison between the predicted and actual GDB data for low SDI, covering ASIR, ASDR and age-standardized DALYs respectively, using the HNCP-T strategy. In these figures, the blue area denotes the comparison between the training results of HNCP-T and actual GDB data from 1980 to 2015, the gray area covers the comparison between test results and actual data from 2016 to 2021, and the yellow area denotes predictions from 2022 to 2030.
Comparative experiment with other models.
Given that LSTM [23] and linear regression [24] models are also often adopted to predict the global disease burden of cancer, to further validate the superiority of our proposed HNCP-T, we conduct a comparative experiment with Linear Regression and LSTM. The experimental setup is as follows:
- Model Selection: Linear Regression is chosen as a representative classical statistical model, and LSTM is selected as an alternative deep learning-based sequence model.
- Training Process: Each model is trained using the training dataset for 100 epochs.
- Evaluation Metrics: MAE is applied as the primary evaluation metric to assess the performance of each model based on the test dataset across three dimensions: ASIR, ASDR, and Age-Standardized DALY rate.
- Comparison Analysis: The MAE values for each model are compared to evaluate their predictive performance.
As shown in Table 4, the proposed HNCP-T model consistently achieves the lowest MAE values across all three metrics compared to Linear Regression and LSTM models. Specifically, the HNCP-T model records an MAE of 0.028 for ASIR, 0.018 for ASDR, and 0.015 for Age-Standardized DALY rate, outperforming the other models by a significant margin. These results demonstrate the HNCP-T model’s superior ability to capture complex nonlinear relationships and temporal dependencies inherent in the HNC burden data, particularly in handling time-series signals and categorical data effectively.
The enhanced performance of the HNCP-T model can be attributed to its self-attention mechanisms, which allow for better modeling of long-range dependencies and interactions within the data. Additionally, the robustness of the HNCP-T model in managing missing and noisy data further solidifies its advantage over traditional and other deep learning-based approaches in this context.
Discussion
In this study, we provide novel insights into the global burden of HNC based on deep learning, specifically a transformer-based model. The transformer-based model offers significant advantages over traditional sequence models such as LSTM and RNNs, including the ability to capture long-range dependencies more effectively through self-attention mechanisms and enable parallel processing, which lead to improved performance in forecasting tasks. Our comparative experiments further validate that the transformer model achieves lower MAE values across key metrics, underscoring its superior capability in handling complex nonlinear relationships and temporal dependencies inherent in the global burden of head and neck cancers. Additionally, the transformer model demonstrates greater robustness in managing missing and noisy data, which is crucial for accurate disease burden predictions in diverse and heterogeneous datasets. We analyze historical trends in HNC incidence, mortality, and DALYs, along with future projections for the age-standardized rates of these metrics. Our results underscore key patterns in disease burden across gender, socioeconomic regions, and age groups, as well as the effects of modern screening and treatment approaches.
Increasing incidence driven by environmental and behavioral factors
Our results emphasize a consistent increase in the age-standardized incidence rate (ASIR) of HNC from 1980 to 2021, with predictions indicating a further 1.3-fold rise by 2030. This increase in incidence is consistent with growing exposure to risk factors such as smoking, alcohol consumption, and human papillomavirus (HPV) [10,11]. The increase in risky behaviors, such as smoking and alcohol consumption, are potential factors in the rising incidence of HNC [25]. Studies have shown that from 1990 to 2017, global adult per capita alcohol consumption increased from 5.9 L (95% CI: 5.8–6.1) to 6.5 L (95%CI: 6.6–6.9), and it is projected to reach 7.6 L (95% CI: 6.5–10.5) by 2030 [26]. The number of smokers have increased significantly from 1980 to 2021 [27,28]. As smoking and alcohol consumption are major risk factors for HNC, public health policies should focus on implementing targeted interventions to reduce these behaviors. This includes increasing awareness, providing support for cessation programs, and enforcing regulations to limit the availability of tobacco and alcohol. Furthermore, the rising global prevalence of HPV, which is also associated with cervical and anal cancers [29,30], is particularly alarming. The growing prevalence of HPV-re lated subsites in HNC is a crucial factor driving the significant changes observed in global HNC trends [31]. Expanding vaccination programs globally, particularly in regions with low coverage, could significantly lower this burden [32].
The role of advancements in diagnostic technologies is considered another significant factor. Narrow-band imaging (NBI) and other early detection techniques have significantly enhanced HNC screening outcomes [33]. Nevertheless, despite these technological improvements, regions with limited access to healthcare continue to exhibit rising incidence rates. Moreover, future studies should concentrate on exploring how further innovation in screening, combined with wider global access, can mitigate the growing incidence.
Declining mortality and DALYs: the role of treatment advancements
While HNC incidence continues to increase, it is suggested in our projections that the age-standardized death rate (ASDR) and DALYs will stabilize or decrease by 2030. This divergence between rising incidence and declining mortality is primarily caused by advances in treatment strategies, including the combined use of surgery, radiotherapy, chemotherapy, and targeted therapies such as Epidermal Growth Factor Receptor (EGFR) inhibitors [34]. Patient outcomes have been further enhanced by improved genetic diagnostics and individualized treatment plans [35].
The predicted decrease in ASDR and DALYs shows the positive impact of modern treatments, while continued monitoring is vital to ensure that these trends persist across different regions and population groups. It is essential for future studies to concentrate on expanding access to these advanced treatment modalities in regions with limited healthcare infrastructure.
Gender-based trends in HNC burden
A key finding is the distinct gender-based trends in HNC burden. Even though men continue to exhibit higher overall ASIR, ASDR, and DALYs compared with women, the incidence rate among women is increasing more rapidly [36,37]. The annual incidence rate in females is expected to increase at a rate of 0.50 (95% CI: 0.45-–0.55), whereas the rate is reducing slightly at –0.09 in males (95% CI: [–0.17]–[–0.01]). Despite the rise in incidence, females show a smaller decrease in mortality rates compared with males [38]. These gender disparities are probably caused by several factors, such as behavioral differences, hormonal influences, and lifestyle choices.
While hormonal factors, such as estrogen’s potential protective role against HNC, have been extensively studied, they do not fully explain the faster rise in incidence among women [36]. Further research is needed to investigate other hormonal mechanisms, such as the role of menopause and hormone replacement therapy in modifying HNC risk. Lifestyle factors, including alcohol consumption and tobacco use, are also evolving differently by gender, with females exhibiting a slower decline in alcohol use and lower rates of tobacco use globally [39]. Notably, alcohol consumption patterns among women, particularly in higher-income regions, are rising, which may contribute to the increasing HNC incidence. Moreover, evolving trends in smoking cessation between men and women might explain some of the disparities in disease progression.
Recurrence and second primary cancers (SPCs) are significant challenges to long-term survival, especially in women. The increase in late-stage HNC among females indicates that gender-based interventions may be necessary to deal with the above-mentioned risks. For example, educational campaigns targeting alcohol consumption among women, or screenings tailored to address late-stage diagnoses could be pivotal in mitigating the rising burden among women. Moving forward, to effectively mitigate the growing HNC burden among women, public health strategies must incorporate these gender differences in risk factors and disease progression.
Socioeconomic disparities in HNC burden
Our findings show substantial disparities in HNC burden across regions with different SDI. High SDI regions reveal a reducing trend in ASIR, ASDR, and DALYs, reflecting advancements in healthcare access and preventive measures. In contrast, low- and middle-SDI regions are expected to bear a greater disease burden by 2030, with middle-SDI regions being projected to confront the highest overall HNC impact [40,41].
Socioeconomic factors, such as income level, education, and healthcare infrastructure, play a significant role in shaping these disparities. For instance, income inequality is associated with limited access to early screening and treatment, leading to delayed diagnoses and poorer outcomes. In regions with lower education levels, public awareness about HNC risk factors—such as tobacco use, alcohol consumption, and HPV vaccination—tends to be lower, further contributing to rising incidence and mortality rates. These regional disparities emphasize the need for targeted healthcare interventions. Improved access to screening and treatment in low- and middle-income regions exerts a critical role in reversing these trends. For example, socioeconomic indicators such as poverty rates, literacy levels, and healthcare expenditure per capita are closely correlated with HNC incidence and mortality rates. Regions with higher poverty and lower healthcare spending usually face more advanced disease stages at diagnosis, resulting in worse prognosis and survival rates. Developing cost-effective, scalable diagnostic tools and expanding vaccination programs could notably alleviate the burden in these areas. In addition, global efforts need to be focused on reducing the delay in diagnosis, remaining a major issue in low-SDI regions [41]. Addressing these socioeconomic barriers through public health policies aimed at improving health literacy, increasing vaccination coverage, and providing affordable access to early screening and care will be essential in reducing HNC disparities. Investing in healthcare infrastructure, including training healthcare professionals and improving diagnostic facilities, is essential to address the disparities in HNC burden across different socioeconomic regions.
Age-Related trends and the complex disease burden
This study also emphasizes age-related trends in HNC, with the incidence rising across most age groups. Nevertheless, a decrease in incidence among older populations indicates underdiagnosis or competing health issues that mask HNC symptoms [42,43]. The decreasing mortality and DALYs among younger populations suggest that advancements in early detection and treatment are especially beneficial for these groups. However, the risks associated with aging, including multiple comorbidities and heightened surgical risks, are significant factors which can influence HNC outcomes [34].
Further research is vital for better comprehending the biological and environmental factors contributing to the age-specific trends found in our analysis. The relationship between accelerated epigenetic aging and adverse outcomes in HNC highlights the need for personalized treatment approaches that address the unique challenges faced by elderly patients [44]. Raising awareness about the risk factors, symptoms, and prevention strategies for head and neck cancers can empower individuals to take proactive measures in reducing their risk and seeking timely medical attention.
Study limitations and future directions
Although the obtained findings provide valuable insights into the global HNC burden, this study still had the following limitations. The GBD dataset, while comprehensive, is subject to variability in reporting accuracy and completeness across different regions [45]. Discrepancies in diagnostic and reporting practices may influence data comparability and introduce bias in this analysis. For instance, underreporting in low- and middle-SDI regions, where healthcare infrastructure may be less developed, could lead to underestimating the true disease burden in these area. This could skew global estimates and potentially mask critical trends, particularly in regions where HNC rates are expected to rise. The summary-level data applied in this study restricts the capability of conducting detailed mechanistic analyses, including those involving genetic or environmental factors. To mitigate these issues in future research, increasing the granularity of the data by incorporating patient-level information would allow for more refined analyses. Enhancing global health surveillance systems, particularly in under-resourced regions, could improve the accuracy and completeness of the data, thereby reducing bias. Collaborations with local health authorities to standardize diagnostic criteria and reporting practices could also help address regional disparities in data quality and comparability.
Future research should concentrate on incorporating individual-level data and exploring the complex interplay of genetic predispositions, environmental pollution, and lifestyle changes in shaping the global HNC burden. Key research questions could include: What specific genetic mutations or variants contribute to the higher incidence of HNC in certain populations? How do environmental pollutants, such as air and water contamination, interact with genetic predispositions to influence HNC outcomes? How do lifestyle factors, including smoking and alcohol consumption, modify these genetic and environmental risks? Expanding research into underrepresented populations and regions will be vital for refining our understanding of HNC dynamics and enhancing the precision of disease burden predictions. Moreover, fostering collaborations with epidemiologists, geneticists, and environmental scientists would provide an integrative approach, allowing for the development of more holistic models considering both biological and external factors. Additionally, these interdisciplinary efforts could also lead to the identification of at-risk populations and the formulation of targeted public health interventions.
Conclusions
To conclude, this study offers a detailed analysis of the global burden of head and neck cancer (HNC) using the GBD 2021 dataset, covering the period from 1980 to 2021, and introduces the HNCP-T model, a novel deep learning-based approach for the prediction of future trends from 2022 to 2030. Our model offers novel insights into the evolving patterns of HNC, especially highlighting significant regional and gender-based disparities. The projected rise in incidence is most pronounced in low- and middle-SDI regions and among women, driven by a combination of enhanced diagnostic capabilities and socio-economic factors. The superior performance of the transformer-based model in our comparative experiments highlights its effectiveness in capturing intricate patterns and dependencies, making it a valuable tool for future disease burden forecasting and public health planning. Through confirming our approach with real-world data, we exhibit the robustness and reliability of using deep learning models to forecast global health trends. This approach not only improves upon traditional statistical methods but also provides a powerful tool for more precise, data-driven public health strategies. The insights from the present study have the potential to inform more effective interventions, guide policy development, as well as better equip healthcare systems to address the growing burden of HNC globally. Our findings underscore the importance of continuing to refine predictive models to deal with global health challenges, particularly in regions and populations with increasing disease burdens. Moreover, the application of such models could cause more targeted and proactive public health responses in the future. However, to enhance the practical application of these findings, specific policy recommendations at local, national, and global levels should be explored. For instance, healthcare systems could adopt adaptive frameworks, integrating predictive models like HNCP-T, to forecast trends and allocate resources more effectively, focusing on high-risk regions and populations.
Acknowledgments
We would like to express our sincere gratitude to the Global Burden of Disease (GBD) collaborators for providing the comprehensive dataset used in this study. We are particularly thankful to our colleagues at the Department of Otolaryngology, Second Affiliated Hospital, Anhui Medical University, and the School of Artificial Intelligence and Data Science, University of Science and Technology of China, for their valuable input and discussions that significantly contributed to this research.
References
- 1. Ferlay J, Soerjomataram I, Dikshit R, Eser S, Mathers C, Rebelo M. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int J Cancer 2015;136(5):E359–86. pmid:25220842
- 2. de Melo NB, de Macedo Bernardino Í, de Melo DP, Gomes DQC, Bento PM. Head and neck cancer, quality of life, and determinant factors: a novel approach using decision tree analysis. Oral Surg Oral Med Oral Pathol Oral Radiol 2018;126(6):486–493. pmid:30243916
- 3. Johnson DE, Burtness B, Leemans CR, Lui VWY, Bauman JE, Grandis JR. Head and neck squamous cell carcinoma. Nature reviews Disease primers 2020;6(1):92. pmid:33243986
- 4. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 2021;71(3):209–49.
- 5. Anderson G, Ebadi M, Vo K, Novak J, Govindarajan A, Amini A. An updated review on head and neck cancer treatment with radiation therapy. Cancers 2021;13(19):4912. pmid:34638398
- 6. Bhatia A, Burtness B. Treating head and neck cancer in the age of immunotherapy: a 2023 update. Drugs 2023;83(3):217–248. pmid:36645621
- 7. World Health Organization. WHO package of essential noncommunicable (PEN) disease interventions for primary health care. WHO. 2020;.
- 8. Bennett JE, Kontis V, Mathers CD, Guillot M, Rehm J, Chalkidou K et al. NCD Countdown 2030: pathways to achieving Sustainable Development Goal target 3.4. Lancet. 2020;396(10255):918–34. pmid:32891217
- 9. Murray CJ. Findings from the Global Burden of Disease Study 2021. Lancet 2024;403(10440):2259–62.
- 10. Tran KB, Lang JJ, Compton K, Xu R, Acheson AR, Henrikson HJ. The global burden of cancer attributable to risk factors, 2010–19: a systematic analysis for the Global Burden of Disease Study 2019. Lancet 2022;400(10352):563–91. pmid:35988567
- 11. Chow LQ. Head and neck cancer. N Engl J Med 2020;382(1):60–72. pmid:31893516
- 12. Raudenská M, Balvan J, Masařík M. Cell death in head and neck cancer pathogenesis and treatment. Cell Death Dis 2021;12(2):192. pmid:33602906
- 13. Wang F, Casalino LP, Khullar D. Deep learning in medicine—promise, progress, and challenges. JAMA Intern Med 2019;179(3):293–4. pmid:30556825
- 14. Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput 1997;9(8):1735–80. http://dx.doi.org/10.1162/neco.1997.9.8.1735
- 15. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN et al. Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems. NIPS’17. Red Hook, NY, USA: Curran Associates Inc.; 2017. pp. 6000–-10.
- 16. Han K, Wang Y, Chen H, Chen X, Guo J, Liu Z. A survey on vision transformer. IEEE transactions on pattern analysis and machine intelligence 2022;45(1):87–110.
- 17. Mattiuzzi C, Lippi G. Worldwide asthma epidemiology: insights from the Global Health Data Exchange database. In: International forum of allergy & rhinology, vol. 10. Wiley Online Library; 2020. pp. 75–80.
- 18. Vos T, Lim SS, Abbafati C, Abbas KM, Abbasi M, Abbasifard M. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet 2020;396(10258):1204–22. pmid:33069326
- 19. Murray CJ, Aravkin AY, Zheng P, Abbafati C, Abbas KM, Abbasi-Kangevari M. Global burden of 87 risk factors in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet 2020;396(10258):1223–49. pmid:33069327
- 20. Wang H, Abbas KM, Abbasifard M, Abbasi-Kangevari M, Abbastabar H, Abd-Allah F. Global age-sex-specific fertility, mortality, healthy life expectancy (HALE), and population estimates in 204 countries and territories, 1950–2019: a comprehensive demographic analysis for the Global Burden of Disease Study 2019. The Lancet 2020;396(10258):1160–1203. pmid:33069325
- 21. Dobson AJ, Barnett A. Exponential family and generalized linear models. An Introduction to Generalized Linear Models. Boca Raton, FL: Chapman and Hall/CRC. 2002.
- 22. Murray CJ, Lopez AD. Global mortality, disability, and the contribution of risk factors: Global Burden of Disease Study. Lancet 1997;349(9063):1436–42. pmid:9164317
- 23. Rauniyar SK, Hashizume M, Yoneoka D, Nomura S. Projection of morbidity and mortality due to breast cancer between 2020 and 2050 across 42 low-and middle-income countries. Heliyon 2023;9(6):e16427. pmid:37274661
- 24. Zhao Q, Chen M, Fu L, Yang Y, Zhan Y. Assessing and projecting the global burden of thyroid cancer, 1990–2030: analysis of the Global Burden of Disease Study. J Glob Health. 2024;14:04090. pmid:38577809
- 25. Manthey J, Shield KD, Rylett M, Hasan OSM, Probst C, Rehm J. Global alcohol exposure between 1990 and 2017 and forecasts until 2030: a modelling study. Lancet 2019;393(10190):2493–502. pmid:31076174
- 26. Jinyi W, Zhang Y, Wang K, Peng P. Global, regional, and national mortality of tuberculosis attributable to alcohol and tobacco from 1990 to 2019: a modelling study based on the Global Burden of Disease study 2019. J Glob Health. 14:04023. pmid:38175959
- 27. Moscowchi A, Moradian-Lotfi S, Koohi H, Sarrafan Sadeghi T. Levels of smoking and outcome measures of root coverage procedures: a systematic review and meta-analysis. Oral Maxillofac Surg 2024;28(2):485–97. pmid:37528316
- 28. Khan Minhas AM, Sedhom R, Jean ED, Shapiro MD, Panza JA, Alam M. Global burden of cardiovascular disease attributable to smoking, 1990–2019: an analysis of the 2019 Global Burden of Disease Study. Eur J Prev Cardiol 2024;31(9):1123–31. pmid:38589018
- 29. Giannone G, Giuliano AR, Bandini M, Marandino L, Raggi D, Earle W. HPV vaccination and HPV-related malignancies: impact, strategies and optimizations toward global immunization coverage. Cancer Treat Rev. 2022;111:102467. pmid:36265336
- 30. Tota JE, Giuliano AR, Goldstone SE, Dubin B, Saah A, Luxembourg A, et al. Anogenital human papillomavirus (HPV) infection, seroprevalence, and risk factors for HPV seropositivity among sexually active men enrolled in a global HPV vaccine trial. Clin Infect Dis 2022;74(7):1247–56. pmid:34265048
- 31. dos Santos Menezes F, Fernandes GA, Antunes JLF, Villa LL, Toporcov TN. Global incidence trends in head and neck cancer for HPV-related and-unrelated subsites: a systematic review of population-based studies. Oral Oncol. 2021;115:105177. pmid:33561611
- 32. Mühr LSA, Eklund C, Lagheden C, Forslund O, Robertsson KD, Dillner J. Improving human papillomavirus (HPV) testing in the cervical cancer elimination era: the 2021 HPV LabNet international proficiency study. J Clin Virol. 2022;154:105237. pmid:35820294
- 33. Muto M, Minashi K, Yano T, Saito Y, Oda I, Nonaka S. Early detection of superficial squamous cell carcinoma in the head and neck region and esophagus by narrow band imaging: a multicenter randomized controlled trial. J Clin Oncol 2010;28(9):1566–72. pmid:20177025
- 34. Mody MD, Rocco JW, Yom SS, Haddad RI, Saba NF. Head and neck cancer. Lancet 2021;398(10318):2289–99. pmid:34562395
- 35. Rodon J, Soria JC, Berger R, Miller WH, Rubin E, Kugel A. Genomic and transcriptomic profiling expands precision cancer medicine: the WINTHER trial. Nat Med 2019;25(5):751–8. pmid:31011205
- 36. Panth N, Simpson MC, Sethi RK, Varvares MA, Osazuwa-Peters N. Insurance status, stage of presentation, and survival among female patients with head and neck cancer. Laryngoscope 2020;130(2):385–91. pmid:30900256
- 37. Hashim D, Sartori S, Vecchia CL, Serraino D, Maso LD, Negri E. Hormone factors play a favorable role in female head and neck cancer risk. Cancer Med 2017;6(8):1998–2007. pmid:28707400
- 38. Bryazka D, Reitsma MB, Griswold MG, Abate KH, Abbafati C, Abbasi-Kangevari M. Population-level risks of alcohol consumption by amount, geography, age, sex, and year: a systematic analysis for the Global Burden of Disease Study 2020. Lancet 2022;400(10347):185–235. pmid:35843246
- 39. Reitsma MB, Kendrick PJ, Ababneh E, Abbafati C, Abbasi-Kangevari M, Abdoli A. Spatial, temporal, and demographic patterns in prevalence of smoking tobacco use and attributable disease burden in 204 countries and territories, 1990–2019: a systematic analysis from the Global Burden of Disease Study 2019. Lancet 2021;397(10292):2337–60. pmid:34051883
- 40. Fitzmaurice C, Allen C, Barber RM, Barregard L, Bhutta ZA, Brenner H. Global, regional, and national cancer incidence, mortality, years of life lost, years lived with disability, and disability-adjusted life-years for 32 cancer groups, 1990 to 2015: a systematic analysis for the global burden of disease study. JAMA Oncol 2017;3(4):524–548. pmid:27918777
- 41. Ravikumar S, Casellas NJ, Shah S, Rieth K. Geographic disparities in head and neck cancer survival in Upstate New York 2011–2019. Head Neck 2022;44(2):472–82. pmid:34845771
- 42. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 2018;68(6):394–424. pmid:30207593
- 43. Hashibe M, Brennan P, Chuang Sc, Boccia S, Castellsague X, Chen C. Interaction between tobacco and alcohol use and the risk of head and neck cancer: pooled analysis in the International Head and Neck Cancer Epidemiology Consortium. Cancer Epidemiol Biomarkers Prev 2009;18(2):541–50. pmid:19190158
- 44. Xiao C, Miller AH, Peng G, Levine ME, Conneely KN, Zhao H. Association of epigenetic age acceleration with risk factors, survival, and quality of life in patients with head and neck cancer. Int J Radiat Oncol Biol Phys 2021;111(1):157–67. pmid:33882281
- 45. Murray CJ. The global burden of disease study at 30 years. Nat Med 2022;28(10):2019–26.