Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Analysis of heavy metal contamination in topsoils across land use types within the Manghe River watershed in South Taihang and its source attribution

  • Xiaoqiang Wan ,

    Contributed equally to this work with: Xiaoqiang Wan, Chengyu Wang

    Roles Conceptualization, Data curation, Writing – original draft

    Affiliations The First Institute of Henan Provincial Resource Environment Survey, Zhengzhou, Henan Province, China, School of Public Administration and Law, Northeast Agricultural University, Harbin, Heilongjiang Province, China

  • Chengyu Wang ,

    Contributed equally to this work with: Xiaoqiang Wan, Chengyu Wang

    Roles Methodology, Supervision, Writing – original draft

    Affiliation School of Public Administration and Law, Northeast Agricultural University, Harbin, Heilongjiang Province, China

  • Quanlai Ma,

    Roles Resources, Software

    Affiliation The First Institute of Henan Provincial Resource Environment Survey, Zhengzhou, Henan Province, China

  • Chongke Yang,

    Roles Data curation, Software, Visualization

    Affiliation The First Institute of Henan Provincial Resource Environment Survey, Zhengzhou, Henan Province, China

  • Jizhou Zhang,

    Roles Data curation, Visualization

    Affiliation School of Public Administration and Law, Northeast Agricultural University, Harbin, Heilongjiang Province, China

  • Yingtao Shang

    Roles Conceptualization, Writing – review & editing

    s231202040@neau.edu.cn

    Affiliation School of Public Administration and Law, Northeast Agricultural University, Harbin, Heilongjiang Province, China

Abstract

To investigate the characteristics of soil heavy metal pollution in the Manghe River watershed, a typical industrial and mining complex area in the Yellow River Basin, concentrations of Hg, Cr, Cu, Ni, Pb, Zn, Cd, and pH were measured in 121 topsoil samples (0–20 cm) collected from the study area. Geostatistical methods were employed to analyze the spatial distribution patterns of heavy metals. The pollution status was assessed using the pollution load index (PLI), while correlation analysis, principal component analysis (PCA), and a positive matrix factorization (PMF) model were applied to identify the sources of heavy metals. The results indicated that: (1) The concentrations of Hg, As, Ni, Cu, Pb, Zn, and Cd exceeded their respective background values, with Hg, Pb and Cd reaching 3.52, 4.85, and 46.4 times of the background levels, respectively.(2) Different elements exhibited distinct spatial distribution and diffusion patterns, revealing their respective sources and influencing factors. (3) The overall PLI was 0.785, reflecting a mild pollution level across the region, while industrial and mining lands exhibited severe pollution (PLI = 4.3). The relative contribution of each heavy metal to the pollution load was ranked as follows: Cd (30.35)> Pb (4.76)> Hg (3.62)> Zn (2.18)> As (1.77)> Cu (1.53). (4) Principal component analysis categorized the sources of heavy metals into anthropogenic activities and natural origins. Further analysis using the PMF model delineated four specific sources: coal combustion (10.87%), natural and agricultural contributions (27.37%), transportation and agricultural actives (26.81%), and industrial emissions (34.95%). Finally, the study identified the following feasible strategies for controlling heavy metal pollution: blocking and remediating industrial pollution sources; treating agricultural non-point source pollution through biological methods; and substituting traditional transportation sources with new energy alternatives. This research could support decision-making processes related to the prevention and control of heavy metal pollution in the study area, as well as regional sustainable development.

Introduction

Soil heavy metal pollution has emerged as one of the prominent global environmental pollution issues, with developing countries experiencing more severe impacts during rapid industrialization and urbanization [1,2]. As the largest developing country in the world, China also faces the pressing issue of soil heavy metal pollution. According to the National Soil Pollution Survey Bulletin jointly released by the Ministry of Ecology and Environment and the Ministry of Natural Resources in 2014, the exceedance rate of heavy metals in soil has reached 16.1%, and the situation is continuously worsening [3]. The 2022 China Ecological Environment Bulletin indicates that preventing and controlling soil heavy metal pollution remains a top priority. Heavy metal pollution is characterized by bioaccumulation, persistence, and concealment, making its remediation particularly challenging. Moreover, heavy metals can accumulate in soil, leading to ecological damage and entering the human body through food chain absorption or inhalation of dust particles, thereby posing a significant threat to human health [4,5]. Therefore, clarifying the distribution patterns and sources of heavy metal contamination in soil is of crucial importance for the prevention, control, and remediation of heavy metal pollution.

Regarding the spatial heterogeneity of soil heavy metal content across different land use types, the academic community has developed a multi-dimensional methodological framework. Traditional geostatistical approaches, such as Kriging interpolation and spatial autocorrelation analysis, are effective in revealing macro-level distribution patterns of heavy metals [6]; however, they fall short in explaining the underlying mechanisms of pollution formation, which are influenced by both anthropogenic activities and natural factors. Recently, GIS-based multi-model integration techniques have emerged to overcome these limitations. By combining spatial interpolation with pollution assessment, these techniques support an integrated analytical process-from spatial differentiation of pollution to land use response and finally to ecological risk classification [7,8]. In terms of pollution assessment, existing researches primarily employ five categories of indicator models: (1)Single-factor evaluation models, including the enrichment factor (EF) and the geo-accumulation index (Igeo), focus on threshold characteristics of individual elements [9,10]; (2) Composite index models, such as the Pollution Load Index (PLI) and Nemerow Index (NIPI), are suitable for assessing synergistic pollution involving multiple elements [11,12]; (3) Ecological risk indices, such as the Hakanson index, are used to quantify the biotoxic effects of heavy metals [13]; (4) Source-Sink relationship models, employing models like PMF and APCS-MLR, analyze pollutant migration pathways [14,15]; (5) Health risk assessment models establish Exposure Dose-Health Effect relationship.

For the scientific issue of pollution source analysis, research paradigms have evolved from the early qualitative identification of natural parent materials and human activities to more advanced quantitative attribution approaches [16]. The PMF model is particularly valuable for analyzing pollution sources in complex contaminated environments, such as industrial and agricultural areas, due to its its capabilities in quantifying uncertainty, enforcing non-negative constraints, and decomposing contributions from multiple sources [17]. Extensive researches show that the PMF model has exceptional capabilities when it comes to analyzing collinear pollution sources. These include industrial emissions (e.g., smelting dust and waste leaching [18]), agricultural non-point sources (e.g., heavy metal impurities in fertilizers and heavy metal accumulation from sewage irrigation [19]), and traffic-related sources (e.g., Zn from tyre wear and Cu from brake pads [20]). The deviation between the model’s calculated pollution source contribution rates and isotope tracing verification results can be controlled to within 15% [21].

This study examines the Manghe River Basin in Jiyuan City, a representative industrial and mining area within the Yellow River Basin. As a key industrial base under China’s “Central Region Rise” strategy, the region has established a comprehensive industrial system that includes steel smelting (accounting for 12% of the nation’s special steel production capacity), lead-zinc smelting (hosting Asia’s largest vertical-furnace zinc smelting base), and precious metal refining (producing 8% of the country’s silver output). Severe heavy metal pollution has resulted from the intertwined and overlapping processes of mining, mineral processing, smelting and manufacturing. The region urgently requires remediation of soil heavy metal pollution. In this study, we (1)combined statistics and geo-statistics methods to reveal the spatial heterogeneity of heavy metals, including Pb, Cr, Cd, Hg, As, Cu, Zn, and Ni; (2) used the Pollution Load Index (PLI) to quantify the contribution of pollution loads from different land use types; (3) applied a coupled PMF-PCA model to characterize source profiles of industrial, agricultural, and traffic emissions. Based on the results, a tiered control strategy is proposed, consisting of “interception and remediation of industrial sources, bio-interception of agricultural non-point sources, and new energy substitution for transportation sources.” This integrated framework of “spatial distribution analysis, pollution source identification, and remediation optimization” provides a systematic and practical approach for controlling soil heavy metal pollution in industrial and mining cities within the Yellow River Basin.

Materials and methods

Study area

The Manghe River watershed is situated in a critical node of the ecological barrier of the Yellow River Basin. It spans from 112°23′37″ to 112°33′2″E and 35°3′4″ to 35°9′40″N (Fig 1). Administratively, the watershed includes 73 villages across three towns-Chengliu, Sili, and Kejing-in Jiyuan City, covering a total area of 112.82 km2. The topography exhibits a distinct stepped pattern: the western part consists of medium-height mountains formed by erosion at the foothills of the Taihang Mountains, with elevations between 650 and 850 meters and a forest coverage rate of 78.2%; the central transitional zone is characterized by alluvial fans at elevations of 300–650 meters, where orchards and sloping farmland predominate; the eastern alluvial plain, at 150–300 meters above sea level, contains 82% of the region’s population and 91% of the industrial and mining enterprises. The climate is warm temperate continental monsoon, with an average annual precipitation of 567.9 mm, 72% of which which occurs during the rainy season from June to September.

Soil sampling and laboratory analysis

To ensure regional soil environmental quality and agricultural product safety, the Henan Provincial Department of Natural Resources conducted a comprehensive soil sampling and analysis program throughout Jiyuan City. This initiative aims to provide a scientific basis for soil pollution remediation and ecological restoration. In accordance with the overall project design, this study focuses on a scientific analysis of sampling data from the Manghe River watershed. All soil samples were collected from publicly accessible areas, and no protected species were disturbed nor was any environmental damage caused during sampling activities. The data sources are legitimate and their use involves no ownership conflicts.

A 1 km × 1 km sampling grid was designed based on remote sensing and land use change survey data. Predefined sampling points were then optimized by taking into account factors such as traffic accessibility, water bodies, residential zones, industrial sites, topographic features, and field reconnaissance. A total of 121 soil sampling points were ultimately established (Fig 1). Sampling was conducted in March 2023, using a handheld GPS for precise positioning. At each point, five topsoil samples (0–20 cm) were collected according the plum-blossom sampling method. Approximately 1 kg of soil was then obtained from each composite sample by quartering and stored for laboratory analysis. Environmental conditions at each sampling site were documented in detail. In the laboratory, all samples were cleaned, air-dried, and finely ground to pass through a 100-mesh nylon sieve. They were then digested using a microwave-assisted four-acid procedure. Concentrations of Cr, Cu, Zn, and Ni were quantified by flame atomic absorption spectrometry(FAAS); Pb and Cd were analyzed using graphite furnace atomic absorption spectrometry(GFAAS); Hg was measured by cold vapor atomic absorption spectrometry (CVAAS), and As was qualified via atomic fluorescence spectrometry (AFS). All measurement were performed in triplicate, with standard deviation maintained within ±5% of the mean value. Quality assurance and control (QA/QC) was conducted using the certified soil reference material GBW07403 (GSS-3) obtained from the National Standardization Reference Materials Center of China. The relative standard deviation (RSD) was below 5%, and recoveries fell within ±10%, confirming the precision and reliability of the analytical results. Method detection limits were as follows: Cr 2 mg/kg, Hg 0.0003 mg/kg, As 0.05 mg/kg, Pb 1 mg/kg, Ni 1 mg/kg, Cd 0.02 mg/kg, Cu 1 mg/kg, and Zn 2 mg/kg. Outliers were identified and excluded using Grubbs’ test.

Methodology

Semi-variance function model.

The semi-variance function model is a fundamental tool in geostatistics for characterizing spatial variability and has been widely utilized to investigate the spatial differentiation patterns of trace elements, soil physichemical properties, and heavy metal contamination [22].

whereas: γ(h) is the semi-variance function model; N(h) represents the total number of sample points when the segmentation distance is h; z(xi) denotes the measured value of the sample point at the spatial position xi; serves as the measured value of the sample point far away from h at xi.

Pollution Load Index.

The Contamination Factor (CF) method is a commonly used method for assessing the pollution level of individual heavy metal elements. To evaluate the overall extent of pollutant accumulation in environmental media such as soil, water, or atmosphere, the Pollution Load Index (PLI) was introduced by Roger Tomlinson et al. [23]. This index quantitatively assesses pollution severity by comparing the concentrations of multiple pollutants to their respective background values. Its calculation formula is as follow:

whereas: CFi is the pollution index of heavy metal i, Ci represents the test concentration of heavy metal i, Cn represents the evaluation criteria of heavy metals (here referring to the soil background value in Henan Province). PLI is the pollution load index of heavy metals, and n is the number of heavy metal elements. The classification of the pollution index is presented in Table 1.

PMF-PCA coupling model.

Principal Component Analysis (PCA) is a widely employed statistical technique for identifying potential sources of heavy metals in soil. Through dimensionality reduction, PCA elucidates the distribution patterns of heavy metals and provides the loadings (weights) of each metal on the principal component [24]. As a powerful receptor model, Positive Matrix Factorization (PMF) model analyzes the sample data matrix by decomposing it into two matrices: factor contributions and factor profiles. This method allows the incorporation of sample-specific uncertainties and enables weighted least-squares optimization for improved resolution of source apportionment [25]. The fundamental formula is expressed as:

whereas: Cij denotes the content of the jth element in sample i, and Cik indicates the contribution of the kth pollutant source in sample i; Fkj signifies the eigenvalue of pollant source k to the jth heavy metal concentration; ij represents the residual, and p donates the number of factors. Based on the pollutant content and uncertainty data of the samples, a weighting coefficient is applied to derive the minimum objective function. The iterative minimization algorithm is then utilized to solve for Q, ensuring that under the condition of minimizing Q, the contribution rate of pollution sources and the pollution source component spectra are accurately calculated.

whereas: n represents the number of samples, m denotes the number of pollutants, and uij is the uncertainty of heavy metals, which is determined as follows:

whereas: c represents the heavy metal concentration, MDL denotes the elemental detection limit. The error fraction (EF) is typically assigned a value of 0.05 ~ 0.2; in this instance, it has been set at 0.1 [26].

Results

Descriptive statistics of soil heavy metals

The characteristics of heavy metal content in the topsoil of Manghe River watershed were summarized in Table 2. The findings indicated that the average pH was 7.73, indicating a neutral soil condition. The mean contents of Hg, As, Cr, Cu, Ni, Pb, Zn and Cd were 0.12 mg/kg, 19.2 mg/kg, 55.4 mg/kg, 31.9 mg/kg, 27.7 mg/kg, 95.2 mg/kg, 128 mg/kg, and 3.43 mg/kg, respectively. Among these, the concentration of Cr remained below the background level for soil in Henan Province. In contrast, the concentrations of Hg, As, Ni, Cu, Pb, Zn and Cd were 3.52, 1.69, 1.62, 1.04, 4.85, 2.14, and 46.4 times their respective background values, indicating significant enrichment-particularly of Hg, Pb, and Cd. The high kurtosis and skewness values for Cd and Hg further suggested pronounced accumulation of these two elements. The coefficient of variation (CV), a dimensionless measure of data dispersion relative to the mean, was used to assess variability in metal concentrations. Generally, CV ≤ 0.15 indicates weak variability, 0.15 < CV < 0.36 denotes medium variability, and CV ≥ 0.36 reflects strong variability [27,28]. The CV values for Cr and Ni were 0.17 and 0.19, respectively, indicating moderate variation. The remaining metals exhibited strong variability, with CV values descending as follows: Cd (3.05)> Hg (2.35)> Pb (1.49)> As (0.67)> Cu (0.64)> Zn (0.57). Notably, the exceptionally high CV values of Cd, Hg and Pb suggest significant anthropogenic influence and pronounced spatial heterogeneity.

thumbnail
Table 2. Descriptive statistics of soil heavy metal pollution.

https://doi.org/10.1371/journal.pone.0335016.t002

Characteristics of heavy metal content in soils under different land use types

As illustrated in Fig 2, the pollution characteristics of heavy metals varied markedly across different land use types. The data, which exhibited small minimal outliers, were highly reliable. While Cr and Ni exhibited a uniform distribution across various land use types, other heavy metals, particularly in industrial and mining areas, exhibited exceptionally high concentrations. Besides Cd, elements like Hg, As, Cu, Pb, and Zn also demonstrated notable distributions in cultivated land, forests, residential zones, and transportation corridors, underscoring the influence of human activities.

thumbnail
Fig 2. Box plots of heavy metal content on different land types.

https://doi.org/10.1371/journal.pone.0335016.g002

Spatial distribution of soil heavy metals

Semi-variance model.

Table 3 presented the semi-variance model fitting results for eight heavy metals utilizing GS + 9.0. The coefficients of determination (R2) ranged from 0.672 and 0.802, all exceeding the threshold of 0.6, indicating a robust model fit. Specifically, Hg and Cu were best fitted by spherical models, As, Cr and Ni by exponential models, and Pb, Zn and Cd by Gaussian model. The nugget-to-sill ratio [C0/(C0 + C)] was used to evaluate the spatial dependence of each element. Ratios below 0.25 for Cr and Ni indicated strong spatial auto-correlation, implying that their distribution was primarily influenced by intrinsic factors such as soil structure and topography. For Hg, Pb, and Cd, the ratios fell between 0.25 and 0.75, suggesting moderate spatial auto-correlation influenced by both natural conditions and random factors. In contrast, As, Cu, and Zn exhibited ratios greater than 0.75, indicating weak spatial auto-correlation and a distribution dominated by stochastic processes and significant spatial variability.

thumbnail
Table 3. Parameters of the fitted semi-variational function model for soil heavy metals.

https://doi.org/10.1371/journal.pone.0335016.t003

Spatial distribution of heavy metals.

Fig 3 illustrated the spatial distribution of the eight heavy metals, as interpolated using indicator kriging. The spatial pattern of Cr and Ni were similar, both exhibiting a concentric structure with concentrations highest at the center and gradually decreasing toward the periphery in a regular ring-like pattern. The high-concentration zone extended from the northwest to the due south. Cu and As also exhibited similarity, with high values displaying a consistent circular gradient from the center to the periphery. Additionally, elevated values formed a distinctive “Y” shape in the northwest to southeast direction. The spatial distribution of Pb, Cd, and Zn shared general similarities, with high-value areas predominantly aligned along a northeast-to -southwest axis. Specific variations were observed: Pb reached its peak in the northeastern corner, with concentrations declining progressively toward the southwest; Cd showed high-value zones in the central and southern regions, whereas Zn showed a patchwork mosaic of high and low values concentrated in the mid-southern area. In contrast, Hg showed no discernible decreasing trend. Its high value region occurred as discrete, isolated patches within lower-concentration surroundings. Specifically, high-value zones in the southwest were contiguous, while those in the north were more fragmented and scattered.

Evaluation of soil heavy metal pollution

Pollution load index method.

The pollution load indices of heavy metals was presented in Table 4. The CFi decreased in the following order: Cd (30.35)> Pb (4.76)> Hg (3.62)> Zn (2.18)> As (1.76)> Cu (1.52)> Ni (0.99)> Cr (0.85). Based on these values, Cr and Ni exhibited minor pollution, As and Cu showed slight pollution, Zn demonstrated moderate pollution, while Hg, Pb, and Cd showed severe pollution. The overall mean Pollution Load Index (PLI) in the study area was 0.79, indicating a slight pollution level. The highest pollution indices for Hg, Cu, Pb, Zn and Cd were found in industrial and mining storage lands. Meanwhile, the pollution index of As peaked in unused land, followed by industrial and mining storage land. The pollution indices of Cr and Ni demonstrated consistent patterns across all land use types. The PLI values across different land use types descended in this order: industrial and mining storage land (6.60)> unused land (4.10)> forests (1.22)> other construction land (0.78)> residential land (0.70)> water bodies and water conservancy infrastructure (0.60) = grassland (0.60) = cultivated land (0.60)> transportation land (0.58). Notably, industrial and mining storage land and unused land were severely polluted, forests were slightly contaminated, and all other land use types remained within unpolluted levels.

thumbnail
Table 4. Heavy metal pollution load index under different site types.

https://doi.org/10.1371/journal.pone.0335016.t004

Sources apportionment of heavy metals

Correlation analysis.

Heavy metals derived from the same or similar sources typically exhibit strong correlations [29], as shown in Table 5. A highly significant positive correlation (p < 0.01) was observed between Cr and Ni, with a correlation coefficient of 0.887. Similarly, Pb, As, Cu, Zn, and Cd showed strong intercorrelations, with coefficient ranging from 0.517 to 0.827, suggesting that these elements likely originated from common or closely related sources. However, Hg showed consistently low correlation coefficient (below 0.354) with the other seven heavy metals, indicating a weak relationship and implying a distinct origin. Further investigation was required to elucidate the specific circumstances.

Principal component analysis.

The findings of the principal component analysis were presented in Table 6. The first principal component (PC1), explaining 44.01% of the total variance, exhibited high loadings for Pb, As, Zn, Cu, Cd, and Hg. Elevated concentrations of As, Pb, Cd, and Zn were observed in industrial and mining storage area, indicating a close link between these four heavy metals and industrial activities. Increased levels of Cd, Cu, and Zn were also detected in forests, grasslands, and cultivated lands, suggesting an association with agricultural practices. Hg was most abundant in industrial and mining storage land, with moderately high levels observed in forest and grassland soils. Its weak correlation with the other five heavy metals implied a distinct origin. PC1 was therefore primarily attributed to anthropogenic activities. The the second principal component (PC2) accounted for 25.38% of the variance, with strong loadings from Cr (0.954) and Ni (0.955). Given that previous analysis indicated Cr and Ni were mainly derived from parent material, PC2 was considered to represent natural sources.

Source apportionment by PMF.

Using EPA PMF 5.0 software, we analyzed the sources of eight heavy metals in the study area. A total of 3–6 factors were set, with 20 iterations performed to minimize analytical bias. Analysis results showed that when comparing different numbers of factors, setting 4 factors resulted in residual values predominantly distributed stably within the range of −3 to 3. Furthermore, the error between Qrobust and Qture did not exceed 25%. When running the PMF model with 4 factors, the R2 values for most elements exceeded 0.6, and the signal-to-noise ratios (S/N) for all elements were greater than 2.0. The category was set to “strong,” indicating that the PMF source apportionment model can meet the requirements for analyzing heavy metal sources in soil. The results of the source allocation are shown in Fig 4.

thumbnail
Fig 4. Source contributions of each factor generated by the PMF model.

https://doi.org/10.1371/journal.pone.0335016.g004

Factor 1 accounted for 34.95% of the total contribution. It was identified as the dominant source of Zn (62.1%) and Cd (60.3%). Given that Jiyuan City hosted Asia’s largest lead-zinc smelting base and possessed substantial zinc reserves, and considering that Cd was intrinsically linked to various industrial processes such as electroplating, metal smelting, and chemical production [30], Factor 1 was attributed primarily to industrial activities.

Factor 2 contributed 10.87%, with Hg accounting for 85% of its profile, while contributions to other metals each remained below 5%. Hg was commonly found in most minerals, and its concentration was amplified through mining and smelting operations. Additionally, coal combustion during smelting facilitated atmospheric deposition of Hg [31]. Thus, Factor 2 was interpreted as originating mainly from coal combustion.

Factor 3 explained 26.81% of the total variance and was a major contributor to Pb (80.95%) and As (35.76%). Traffic emissions, particularly from fuel combustion, engine wear, and catalyst utilization, constituted a primary source of Pb [32]. The exploitation of mineral resources and the improper disposal of industrial waste—such as smelting by-products and fossil fuel combustion residues—constituted major anthropogenic sources of As in soils. For instance, sulfide minerals like pyrite and oxide minerals such as hematite may contain As levels exceeding 100 g/kg [33]. Further more, As was commonly introduced via agricultural practices, such as the application of fertilizers (e.g., ammonium nitrate, ammonium phosphate, and compound fertilizers) and pesticides or herbicides containing inorganic arsenic [33]. Therefore, Factor 3 was considered a mixed source deriving from both transportation, industrial and agriculture activities.

Factor 4 contributed 27.37%, with high loadings of Cr (74.12%), Ni (74.61%), and Cu (50.84%). The contents of Cr and Ni were close to regional soil background values and exhibited low variability, suggesting a origin related to soil parent material and geological background [3436]. In contrast, Cu was widely used in agricultural amendments such as pesticides, fertilizers and insecticides, like copper sulfate, copper oxide or copper carbonate, and livestock feed additives, leading to its accumulation in soils through manure application [37,38]. The elevated Cu content observed in cultivated land supported this inference. Hence, Factor 4 represented a mixed source influenced by both natural processes and agricultural practices.

Discussion

Spatial distribution heterogeneity of heavy metals

This study focuses on mixed soil samples collected from the top 0–20 cm layer, which serves as a critical interface for processes such as atmospheric dry and wet deposition and surface runoff erosion. This layer exhibits significant responses to environmental disturbances, including temperature, precipitation, and industrial thermal emissions [39], resulting in pronounced spatial heterogeneity in heavy metal concentrations. As shown in Figs 2 and 3, industrial and mining areas form distinct composite pollution hotspots of As, Pb, Zn, and Cd, with concentration levels significantly higher than those in other land use types. In contrast, cultivated land and forest-grassland show element-specific differentiation, supporting the value of topsoil as a “pollution process recorder” [40]. The concentration of Cr and Ni show a close relationship with land use types and topographic conditions in the study area. The terrain generally slopes from west to east, with forest areas dominating the west, cultivated land prevailing in the east, and various types of construction land concentrated in the central part. Although Cr and Ni pollution levels in Chinese soils are generally low [41], their distinct ring-like gradient pattern indicates significant anthropogenic interference. Specifically, open-air storage of chromite slag from Zn smelting operation, along with dispersion of Ni-based catalysts, has resulted in a diffusion pattern centered around the smelting plants via both dry and wet deposition, with concentrations decreasing radially outward. While Cu, As, Pb, and Cd also exhibit declining trends from high to low concentrations, their gradient directions and rates vary considerably, indicating different migration mechanisms. As displays a complex decreasing pattern, likely due to combined industrial and agricultural pollution. Cu decreases more uniformly, primarily associated with agricultural activities and population density. Pb, however, is strongly influenced by road networks, forming concentration peaks at major intersections. Hg exhibits a “nested island-type” distribution, influenced by multiple factors such as coal combustion emissions, land use and surface characteristics, monsoon climate effects, and historical pesticide residues [42].

Source apportionment of heavy metals

This study employed correlation analysis, principal component analysis (PCA), and positive matrix factorization (PMF) to systematically identify the sources of heavy metals. Correlation analysis preliminarily revealed symbiotic relationships among elements, suggesting that highly correlated elements may share the same source or similar enrichment pathways [43,44]. PCA further categorized the pollution sources broadly into natural and anthropogenic origins. The PMF model then precisely identified and quantified the specific contributions of each pollution source. Capable of effectively handling missing and uncertain data, the PMF model yields source apportionment results that better reflect actual pollution characteristics [45]. Compared to traditional single-model approaches, the three-stage progressive analysis employed in this study demonstrates remarkable performance. It effectively addresses the resolution limitations of PCA for collinear pollution sources and accurately differentiates between mixed signals originating from coal combustion (Hg) and smelting processes (Cd-Zn). Furthermore, numerous studies have validated that integrating the PMF model with geostatistical methodologies provides a robust and practical strategy for pollution identification and supports the management of regional soil heavy metal contamination [46,47].

Strategies for heavy metal pollution control

The development of chemical and smelting industries in Jiyuan City, coupled with intensive agricultural activities, constitutes the primary cause of soil heavy metal contamination in the region. In addition to Cr and Ni, all six other heavy metals exhibit varying degrees of accumulation, with Cd, Pb, and Hg showing notably elevated levels that warrant urgent attention. Especially, due to its high toxicity and mobility, mercury poses a potential threat to local residents through pathways such as inhaling dust and ingesting food from the food chain. Therefore, health screening must be intensified in pollution hotspots and strict source control measures must be implemented. The first step in controlling heavy metal pollution lies in enhancing monitoring efforts. Based on pollution load levels, a scientific zoning approach should be adopted to facilitate tailored soil utilization and remediation strategies according to contamination severity. Simultaneously, source analysis results should be integrated to enable targeted source control and end-of-pipe treatment. Jiyuan City has a high concentration of smelting and chemical industries, and coal combustion is a significant source of mercury pollution there. Therefore, comprehensive Hg removal throughout the coal combustion process is essential to reduce emissions. To mitigate As and Cu pollution from agricultural sources, modern fertilization techniques such as water-fertilizer irrigation, deep placement and band application should be intensified to enhance utilisation rates and reduce application quantities. For transportation-related Pb pollution, transitioning to clean energy alternatives instead of fuel oil is recommended. For industrial sources such as Zn and Cd, it is particularly crucial to strengthen production process controls to prevent element leakage and enhance waste management.

Limitations and prospects

This study established a comprehensive technical framework based on ‘spatial differentiation, source analysis, and treatment response’. This offers a replicable and scalable “Jiyuan Model” for the prevention and control of heavy metal pollution in industrial and mining cities within the Yellow River Basin. However, given the multiple constraints such as human resources, material resources and funds, this study concentrated on the pollution characteristics and source analysis of heavy metals in the 0–20 cm soil surface over only a one-year period. The absence of data on atmospheric dry and wet deposition fluxes, water quality data (e.g., in rivers and groundwater) and long-term monitoring records resulted in insufficient information regarding the accumulation and migration of heavy metals under different land use types. Furthermore, the generalizability of our findings to other industrial and mining regions, particularly those with differing climatic regimes, industrial profiles, or hydrological settings, may be limited by these constraints. The lack of atmospheric dry and wet deposition flux data prevents a complete understanding of aerial input pathways, especially for highly volatile elements like Hg. The absence of complementary data on heavy metals in water bodies (e.g., rivers, groundwater) and sediments hinders the construction of a comprehensive cross-media migration model. Consequently, while the integrated methodological framework is proposed as a replicable ‘Jiyuan Model,’ its direct application to other areas requires caution and should be supported by region-specific data on atmospheric deposition, hydrology, and long-term environmental monitoring. Future research should focus on enhancing relevant studies in these areas. For instance, integrating a hydrological model like SWAT to simulate water and sediment runoff with a chemical transport model would be the most effective tool to quantitatively simulate the Hg migration pathways under monsoon conditions. Moreover, although the PMF model has been thoroughly validated as a robust and widely accepted tool for allocating heavy metal sources in soil, particularly when the number and distribution of potential sources is unknown, a single model cannot ultimately overcome its inherent limitations. For example, source attribution still relies on subjective judgement. Integrating PMF results with isotope fingerprinting technology can address this shortcoming and validate source identification.

Conclusions

  1. (1) The average concentrations of Cr and Ni in the study area were close to the background values of soils in Henan Province, while Hg, As, Cu, Pb, Zn, and Cd all exhibited varying degrees of enrichment, particularly pronounced in industrial and mining storage land. Hg, Pb and Cd contamination was especially severe, exceeding their background values by factors of 3.52, 4.85 and 46.4, respectively.
  2. (2) The spatial distribution patterns of the heavy metals were distinct: Cr and Ni decreased concentrically from central high-value zones toward the periphery in a regular annular gradient. Although Cu, As, Pb and Cd also demonstrated decreasing trends from high to low values, the gradient directions and rates differed. Hg exhibited a discrete ‘high-low nested’ distribution pattern. Overall, the spatial differentiation of heavy metals correlated strongly with land use types and clearly reflects anthropogenic influences.
  3. (3) Evaluation using the Pollution Load Index (PLI) classified Cr and Ni as slightly polluted, As and Cu as mildly polluted, Zn as moderately polluted, and Hg, Pb, and Cd as heavily polluted. The regional average PLI indicated mild pollution overall.
  4. (4) Source apportionment analysis revealed significant correlations between Cr and Ni, as well as among As, Cu, Pb, Zn and Cd. In contrast, Hg exhibited weak correlations with other elements. PCA showed that Hg, As, Cu, Pb, Zn and Cd originated primarily from human activities, while Cr and Ni were mainly derived from natural sources. The PMF model further identified specific sources: Cr and Ni were influenced by soil parent material; Hg was attributed to coal combustion; Zn and Cd predominantly came from industrial emissions; Cu and As arose from mixed industrial-agricultural sources; and Pb was mainly associated with transportation emissions.

Supporting information

Acknowledgments

We thank our colleagues for their insightful comments on an earlier version of this manuscript.

References

  1. 1. Fei X, Lou Z, Xiao R, Ren Z, Lv X. Source analysis and source-oriented risk assessment of heavy metal pollution in agricultural soils of different cultivated land qualities. J Clean Prod. 2022;341:130942.
  2. 2. Zhao K, Zhang L, Dong J, Wu J, Ye Z, Zhao W, et al. Risk assessment, spatial patterns and source apportionment of soil heavy metals in a typical Chinese hickory plantation region of southeastern China. Geoderma. 2020;360:114011.
  3. 3. PRC M. Technical guidelines for risk assessment of soil contamination of land for construction. China Environment Publishing Group; 2019.
  4. 4. Shi J, Zhao D, Ren F, Huang L. Spatiotemporal variation of soil heavy metals in China: The pollution status and risk assessment. Sci Total Environ. 2023;871:161768. pmid:36740051
  5. 5. Alharbi T, Nour HE, Al-Kahtany K, Zumlot T, El-Sorogy AS. Health risk assessment and contamination of lead and cadmium levels in sediments of the northwestern Arabian Gulf coast. Heliyon. 2024;10(16):e36447. pmid:39247265
  6. 6. Zhang W, Long J, Zhang X, Shen W, Wei Z. Pollution and Ecological Risk Evaluation of Heavy Metals in the Soil and Sediment around the HTM Tailings Pond, Northeastern China. Int J Environ Res Public Health. 2020;17(19):7072. pmid:32992608
  7. 7. Anaman R, Peng C, Jiang Z, Liu X, Zhou Z, Guo Z, et al. Identifying sources and transport routes of heavy metals in soil with different land uses around a smelting site by GIS based PCA and PMF. Sci Total Environ. 2022;823:153759. pmid:35151753
  8. 8. Hossain Bhuiyan MA, Chandra Karmaker S, Bodrud-Doza M, Rakib MA, Saha BB. Enrichment, sources and ecological risk mapping of heavy metals in agricultural soils of dhaka district employing SOM, PMF and GIS methods. Chemosphere. 2021;263:128339. pmid:33297265
  9. 9. Rezapour S, Siavash Moghaddam S, Nouri A, Khosravi Aqdam K. Urbanization influences the distribution, enrichment, and ecological health risk of heavy metals in croplands. Sci Rep. 2022;12(1):3868. pmid:35264644
  10. 10. Cheng B, Wang Z, Yan X, Yu Y, Liu L, Gao Y, et al. Characteristics and pollution risks of Cu, Ni, Cd, Pb, Hg and As in farmland soil near coal mines. Soil Environ Health. 2023;1(3):100035.
  11. 11. Fu K, An M, Song Y, Fu G, Ruan W, Wu D, et al. Soil heavy metals in tropical coastal interface of eastern Hainan Island in China: Distribution, sources and ecological risks. Ecol Indicat. 2023;154:110659.
  12. 12. Nour HE, Aljahdali MH. Ecological and health risk assessment of Sharm El-sheikh beach sediments, Red Sea coast. Mar Pollut Bull. 2025;212:117577. pmid:39832426
  13. 13. Zhang X, Liu H, Li X, Zhang Z, Chen Z, Ren D, et al. Ecological and health risk assessments of heavy metals and their accumulation in a peanut-soil system. Environ Res. 2024;252(Pt 2):118946. pmid:38631470
  14. 14. Dai X, Liang J, Shi H, Yan T, He Z, Li L, et al. Health risk assessment of heavy metals based on source analysis and Monte Carlo in the downstream basin of the Zishui. Environ Res. 2024;245:117975. pmid:38145736
  15. 15. Lv H, Lu Z, Fu G, Lv S, Jiang J, Xie Y, et al. Pollution characteristics and quantitative source apportionment of heavy metals within a zinc smelting site by GIS-based PMF and APCS-MLR models. J Environ Sci (China). 2024;144:100–12. pmid:38802223
  16. 16. Yang S, Zhou Q, Sun L, Qin Q, Sun Y, Wang J, et al. Source to risk receptor transport and spatial hotspots of heavy metals pollution in peri-urban agricultural soils of the largest megacity in China. J Hazard Mater. 2024;480:135877. pmid:39353271
  17. 17. Xu J, Peng X, Guo C-S, Xu J, Lin H-X, Shi G-L, et al. Sediment PAH source apportionment in the Liaohe River using the ME2 approach: A comparison to the PMF model. Sci Total Environ. 2016;553:164–71. pmid:26925728
  18. 18. Zhang K, Chai F, Zheng Z, Yang Q, Zhong X, Fomba KW, et al. Size distribution and source of heavy metals in particulate matter on the lead and zinc smelting affected area. J Environ Sci (China). 2018;71:188–96. pmid:30195677
  19. 19. Niu L, Yang F, Xu C, Yang H, Liu W. Status of metal accumulation in farmland soils across China: from distribution to risk assessment. Environ Pollut. 2013;176:55–62. pmid:23416269
  20. 20. Harrison RM, Jones AM, Gietl J, Yin J, Green DC. Estimation of the contributions of brake dust, tire wear, and resuspension to nonexhaust traffic particles derived from atmospheric measurements. Environ Sci Technol. 2012;46(12):6523–9. pmid:22642836
  21. 21. Liang X, Wang C, Song Z, Yang S, Bi X, Li Z, et al. Soil metal(loid)s pollution around a lead/zinc smelter and source apportionment using isotope fingerprints and receptor models. Appl Geochem. 2021;135:105118.
  22. 22. Qiao P, Wang S, Lei M, Guo G, Yang J, Wei Y, et al. Influencing factors identification and the nested structure analysis of heavy metals in soils in entire city and surrounding the multiple pollution sources. J Hazard Mater. 2023;449:130961. pmid:36801713
  23. 23. Tomlinson DL, Wilson JG, Harris CR, Jeffrey DW. Problems in the assessment of heavy-metal levels in estuaries and the formation of a pollution index. Helgolander Meeresunters. 1980;33(1–4):566–75.
  24. 24. Zhang Y, Li S, Chen Z, Wang F, Chen J, Wang L. A systemic ecological risk assessment based on spatial distribution and source apportionment in the abandoned lead acid battery plant zone, China. J Hazard Mater. 2018;354:170–9. pmid:29751173
  25. 25. Guan Q, Wang F, Xu C, Pan N, Lin J, Zhao R, et al. Source apportionment of heavy metals in agricultural soil based on PMF: A case study in Hexi Corridor, northwest China. Chemosphere. 2018;193:189–97. pmid:29131977
  26. 26. Liu J, Chen Y, Chao S, Cao H, Zhang A, Yang Y. Emission control priority of PM2.5-bound heavy metals in different seasons: A comprehensive analysis from health risk perspective. Sci Total Environ. 2018;644:20–30. pmid:29980081
  27. 27. Zhang S, Yang D, Li F, Chen H, Bao Z, Huang B, et al. Determination of regional soil geochemical baselines for trace metals with principal component regression: A case study in the Jianghan plain, China. Appl Geochem. 2014;48:193–206.
  28. 28. Sawut R, Kasim N, Maihemuti B, Hu L, Abliz A, Abdujappar A, et al. Pollution characteristics and health risk assessment of heavy metals in the vegetable bases of northwest China. Sci Total Environ. 2018;642:864–78. pmid:29925057
  29. 29. Zhou H, Chen Y, Yue X, Ren D, Liu Y, Yang K. Identification and hazard analysis of heavy metal sources in agricultural soils in ancient mining areas: A quantitative method based on the receptor model and risk assessment. J Hazard Mater. 2023;445:130528. pmid:37055956
  30. 30. Zhou X-Y, Wang X-R. Impact of industrial activities on heavy metal contamination in soils in three major urban agglomerations of China. J Clean Prod. 2019;230:1–10.
  31. 31. Jiang H-H, Cai L-M, Wen H-H, Hu G-C, Chen L-G, Luo J. An integrated approach to quantifying ecological and human health risks from different sources of soil heavy metals. Sci Total Environ. 2020;701:134466. pmid:31704412
  32. 32. Men C, Liu R, Xu F, Wang Q, Guo L, Shen Z. Pollution characteristics, risk assessment, and source apportionment of heavy metals in road dust in Beijing, China. Sci Total Environ. 2018;612:138–47. pmid:28850834
  33. 33. Smedley PL, Kinniburgh DG. Arsenic in groundwater and the environment. In: Essentials of Medical Geology: Revised Edition. Dordrecht: Springer Netherlands; 2012. p. 279–310.
  34. 34. Chai Y, Guo J, Chai S, Cai J, Xue L, Zhang Q. Source identification of eight heavy metals in grassland soils by multivariate analysis from the Baicheng-Songyuan area, Jilin Province, Northeast China. Chemosphere. 2015;134:67–75. pmid:25911049
  35. 35. Ćwieląg-Drabek M, Piekut A, Gut K, Grabowski M. Risk of cadmium, lead and zinc exposure from consumption of vegetables produced in areas with mining and smelting past. Sci Rep. 2020;10(1):3363. pmid:32099081
  36. 36. Liu H, Zhang Y, Yang J, Wang H, Li Y, Shi Y, et al. Quantitative source apportionment, risk assessment and distribution of heavy metals in agricultural soils from southern Shandong Peninsula of China. Sci Total Environ. 2021;767:144879. pmid:33550057
  37. 37. Liu J, Liu YJ, Liu Y, Liu Z, Zhang AN. Quantitative contributions of the major sources of heavy metals in soils to ecosystem and human health risks: A case study of Yulin, China. Ecotoxicol Environ Saf. 2018;164:261–9. pmid:30121501
  38. 38. Reboredo F, Simões M, Jorge C, Mancuso M, Martinez J, Guerra M, et al. Metal content in edible crops and agricultural soils due to intensive use of fertilizers and pesticides in Terras da Costa de Caparica (Portugal). Environ Sci Pollut Res Int. 2019;26(3):2512–22. pmid:30471064
  39. 39. Liu J, Chen Y, Shang Y, Li H, Ma Q, Gao F. Contamination Characteristics and Source Apportionment of Heavy Metal in the Topsoil of a Small Watershed in South Taihang. Land. 2024;13(7):1068.
  40. 40. Rahmati M, Or D, Amelung W, Bauke SL, Bol R, Hendricks Franssen H-J, et al. Soil is a living archive of the Earth system. Nat Rev Earth Environ. 2023;4(7):421–3.
  41. 41. Wang Y, Duan X, Wang L. Spatial distribution and source analysis of heavy metals in soils influenced by industrial enterprise distribution: Case study in Jiangsu Province. Sci Total Environ. 2020;710:134953. pmid:31923652
  42. 42. Rodríguez Martín JA, Arias ML, Grau Corbí JM. Heavy metals contents in agricultural topsoils in the Ebro basin (Spain). Application of the multivariate geoestatistical methods to study spatial variations. Environ Pollut. 2006;144(3):1001–12. pmid:16580763
  43. 43. Chai L, Wang Y, Wang X, Ma L, Cheng Z, Su L. Pollution characteristics, spatial distributions, and source apportionment of heavy metals in cultivated soil in Lanzhou, China. Ecol Indicat. 2021;125:107507.
  44. 44. Zhang Q, Xu P, Qian H, Yang F. Hydrogeochemistry and fluoride contamination in Jiaokou Irrigation District, Central China: Assessment based on multivariate statistical approach and human health risk. Sci Total Environ. 2020;741:140460. pmid:32886997
  45. 45. Chen Z, Ding Y, Jiang X, Duan H, Ruan X, Li Z, et al. Combination of UNMIX, PMF model and Pb-Zn-Cu isotopic compositions for quantitative source apportionment of heavy metals in suburban agricultural soils. Ecotoxicol Environ Saf. 2022;234:113369. pmid:35278993
  46. 46. Wu H, Cheng N, Chen P, Zhou F, Fan Y, Qi M, et al. Integrative risk assessment method via combining geostatistical analysis, random forest, and receptor models for potentially toxic elements in selenium-rich soil. Environ Pollut. 2023;337:122555. pmid:37714402
  47. 47. Cui S, Yu W, Han X, Hu T, Yu M, Liang Y, et al. Factors influencing the distribution, risk, and transport of microplastics and heavy metals for wildlife and habitats in “island” landscapes: From source to sink. J Hazard Mater. 2024;476:134938. pmid:38901262