Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Bridging the biomass data gap: A literature-based Length-Weight Relationship framework for estimating representative dry weights of freshwater invertebrates in Korean rivers

Abstract

Representative species weight is a critical ecological index for modeling, vulnerability assessment, and toxicity prediction, yet scientifically validated data for aquatic invertebrates remain limited. To address this gap, we present the first literature-based strategy to estimate and validate representative dry weights of freshwater invertebrate species in Korean rivers using Length-Weight Relationships (LWRs). Species length and dry weight records were compiled from domestic field guides, while dry weight records were compiled from both domestic and global literature. LWR coefficients (a and b) were then calculated at genus, family, and order levels and preprocessed under control conditions. Among the taxonomic levels tested, averaging genus-level coefficients yielded the highest concordance with field-measured dry weights (R2 = 0.6633, n = 240), outperforming broader taxonomic levels. Furthermore, a logarithmic correlation analysis confirmed that greater numbers of LWR sources improve predictive accuracy, particularly at the genus level. Based on this optimal strategy, representative dry weights were estimated for 563 taxa. This methodology fills a critical data gap by leveraging existing literature to generate reliable species-specific weight indices without additional field measurements. Our approach provides a quantitative foundation for biomass estimation in data-limited freshwater ecosystems and supports improved ecological modeling, conservation planning, and machine learning-based impact prediction.

Introduction

Although systematic scientific concern regarding environmental integrity is frequently associated with the late 20th century, its modern foundations were solidified during the ‘Environmental Revolution’ of the 1960s, notably through works such as Rachel Carson’s Silent Spring (1962) [1]. Contemporary freshwater ecosystems face a multifaceted array of global change drivers, including hydrological alterations, invasive species proliferation, and intensive land-use practices. Within the specific context of South Korea, the historical legacy of rapid industrialization coupled with intensifying climatic fluctuations has established climate-driven habitat shifts and chemical perturbations as particularly critical stressors for aquatic biodiversity [213]. These persistent and cumulative perturbations underscore the urgent need for comprehensive research into ecosystem conservation and the refinement of environmental impact assessment (EIA) frameworks [1416]. A fundamental challenge in advancing these frameworks lies in the scarcity of quantitative biomass data for aquatic invertebrates, which serve as essential baseline inputs for ecological models and vulnerability assessments.

Vulnerability assessments and ecological modeling serve as important tools for environmental impact assessments targeting these ecosystems [14,15,1720]. These methods have been developed and utilized to consider complex ecological interactions, surpassing standard population-level toxicity benchmarks such as the median lethal concentration (LC50). By definition, LC50 quantifies acute mortality within a specific test population under controlled conditions, yet it lacks the capacity to inherently reflect system-scale risks or community-level responses [18,20,21]. These methods are employed when conducting impact assessments at or above the community level, serving as tools for ecological research that reflect various environmental perturbations [14,1719]. However, such modeling requires additional ecological information including biomass of species within an ecosystem [14,22]. Therefore, the appropriate calculation of representative weight for species is one of the important tasks in ecosystem conservation research.

This study primarily focuses on generating representative weight of aquatic invertebrates in freshwater ecosystems since aquatic invertebrates are crucial biological group that plays an intermediary role in nutrient cycling and organic matter processing [23]. Among the ecosystems studied, freshwater ecosystems are crucial subjects in ecosystem conservation research because they host more than 10% of all species and one-third of vertebrate species [24]. Furthermore, freshwater ecosystems encompass a vast range of environments with unique ecological characteristics [16,24]. Consequently, constructing biomass information on freshwater invertebrates challenging due to their diversity, but it remains an essential component of ecosystem conservation research.

Studies on aquatic invertebrate biomass are generally conducted through field surveys [25,26]. Investigations of biomass in streams are carried out using DNA analysis, kick net or dip net sampling, and subsequent analysis [26,27]. However, research primarily based on field surveys often quantify each species in terms of individual counts rather than biomass [25,28]. This approach may present challenges when utilizing existing data based on individual counts. Therefore, representative species weight can help to convert individual count data to biomass data.

In the case of Korean aquatic ecosystems, quantitative research on invertebrate biomass began in the 1960s, primarily focusing on a few lakes and the four main rivers [29]. Although the Korean Ministry of Environment conducts biannual aquatic ecosystem health assessments at approximately 3,100 monitoring sites, these surveys currently prioritize individual density and species composition [25,30]. Converting these extensive datasets to biomass-based data can significantly contribute to ecological modeling [28,31,32]. However, there is a scarcity of data for calculating representative weight for many indigenous species, except for a few well-studied taxa.

Estimating species’ representative weight typically involves compiling all available individual weight data and calculating median or mean values. However, when such data are scarce, LWRs provide a practical alternative. LWRs are widely used regression models that relate an organism’s body length to its weight using species-specific coefficients [33,34]. The relationship is generally expressed as W = aLb or in logarithmic form as ln(W) = ln(a)+bln(L), where W is the weight (either wet or dry), L is the body length, and a and b are species-specific constants. This method allows conversion from easily measured length data to weight estimates. However, LWR coefficients can vary due to environmental factors and body shape differences, causing potential discrepancies between estimated and actual weights [34]. To address this, researchers recommend using the geometric or arithmetic means of LWR coefficients calculated from multiple regional studies to obtain representative values [34]. Since region-specific LWR data for various freshwater invertebrate species in Korean rivers are lacking, this study compiled LWR constants from international literature to estimate the body weight of species found in Korean freshwater ecosystems. We hypothesize that taxonomic specificity at the genus level provides significantly higher predictive accuracy for invertebrate biomass than broader family- or order-level aggregations, as it better captures morphological and ecological similarity.

This study estimated the representative dry weight of freshwater invertebrates using Length-Weight Relationships (LWRs) and length data collected from field guide literature. In this study, there are several goals. 1) The preprocessing methodology for LWR coefficients is verified to extrapolate lacking LWR database. 2) The representative species weights are estimated from representative LWR coefficients and representative species length data from field guide literature based on Korean species. 3) The representative species weights are compared to the measured weight of invertebrate species recorded in previous field surveys of Korean freshwater ecosystems to check for significant differences. Despite the ecological importance of freshwater invertebrate biomass, no systematic methodology currently exists for estimating representative species weights for Korean aquatic invertebrates from existing published data. This represents a critical knowledge gap, as decades of monitoring data recorded as individual counts remain unconverted to biomass, limiting their utility for ecological modeling and conservation assessments. To address this gap, we examined whether representative dry weights can be reliably estimated using preprocessed LWR coefficients derived from global literature, without the need for additional field measurements. This entirely literature-based approach is novel in its systematic compilation, preprocessing, and validation of LWR coefficients across multiple taxonomic levels for Korean freshwater invertebrates, and is expected to make a significant contribution to future ecological data generation and biomass-based assessments.

Methods and materials

Study site

This study was conducted in Korean rivers, which exhibit significant ecological diversity in their freshwater ecosystems. Korea is characterized by distinct four seasons, which contribute to a high biodiversity within a relatively small geographical area. This unique feature allows for the coexistence of various species in close proximity [35]. Korea maintains approximately 3,100 ecological monitoring stations where Korean ecologists conduct biannual assessments of aquatic ecosystem health (Fig 1). Several monitoring stations for branch streams are monitored every three years, whereas the main streams are monitored annually [25]. Assessments of aquatic ecosystem health were evaluated for these monitoring stations during 2009 to present biannually. This study examined the invertebrate species monitored in Korean rivers from 2010 to 2020. The research utilized the web database of riverside monitoring points available through the Water Environment Information System (https://water.nier.go.kr/). The dataset for invertebrate species in Korean rivers was organized for all monitoring stations except for mollusk species.

thumbnail
Fig 1. Study area and monitoring stations for ecological assessment in Korean rivers.

https://doi.org/10.1371/journal.pone.0352157.g001

Ethical statement

No specific permits were required for this study as it was based entirely on the synthesis of data from existing literature and published field guides. Since no primary field sampling or animal collection was conducted by the authors, field site access approval was not applicable.

Collection and conversion of individual biomass and length data of invertebrate species in Korean rivers

This study aimed to calculate the representative species weight of aquatic invertebrates inhabiting Korean rivers based on literature to contribute ecological conservation and management. To achieve this goal, the study determined the general lengths of individual species using data from various sources, including the field guide for Korean aquatic invertebrates, online databases such as National Institute of Biological Resources, and taxonomic research papers for the species included in the established database (Fig 2). For several taxonomic groups at the order level, such as Odonata and Plecoptera, where adult forms are not typically aquatic, the study recorded the average lengths based on larval stages. This approach was adopted because larvae have a more direct impact on aquatic ecosystems.

thumbnail
Fig 2. Methodology for calculating representative Length-Weight Relationships (LWRs) coefficient a and b of aquatic invertebrate species.

https://doi.org/10.1371/journal.pone.0352157.g002

Also, the mean dry weight of each invertebrate species living in Korean rivers was evaluated by investigating the freshwater ecology in Korea. Studies from theses, domestic journals, and conferences were used to construct a database of the mean dry weights of the invertebrate species [3641]. The individual dry weight recorded from the literature was determined by dividing the weight density by the population density because the weight data presented in the literature typically shows weight density and population density simultaneously rather than individual weight directly. Specifically, the mean wet weight of Hydropsyche sp. was calculated using the mean wet weight data of five size classes of larvae investigated in a previous study [36]. Because the individual weight recorded in the literature was based on wet weight, we converted it to dry weight using known wet-dry conversion ratios from the literature [42]. The conversion ratios were averaged at the order level for each taxonomic group (S1 Table). Consequently, we documented 341 dry weight data points for species across 49 genera. The abundance of data for each taxonomic group exhibits considerable variation, ranging from 1 to 55 (S1 Fig). The database was used to analyze the concordance between the measured field dry weight and the representative dry weight estimated using LWRs.

Collecting and preprocessing of LWR coefficients for aquatic invertebrates living in Korea

The database of a and b coefficients for LWRs of invertebrate species groups living in Korea was collected from the global literature, including Korean domestic journals. To compile the global database of Length-Weight Relationship (LWR) coefficients, we aimed to cite as many academic literature sources as possible that address invertebrate length-weight relationships, regardless of their geographical origin. However, for efficiency in data collection, we prioritized scholarly papers providing datasets for multiple species (multi-species studies) rather than those focused on single individual species.

Domestic literature was searched using the Research Information Sharing Service (RISS), a comprehensive national research information sharing platform operated by the Korea Education and Research Information Service (KERIS). RISS integrates academic resources from all Korean universities and research institutions, including doctoral and master’s dissertations and journal articles published by Korean academic societies. This was supplemented by global searches in Scopus and Google Scholar using terms such as “freshwater invertebrate” and “length-weight relationship”.

While we acknowledge that intraspecific and population-level variations in body size and weight may exist across different regions and environments, no specific exclusion criteria were applied to the gathered data beyond the following predefined constant preprocessing rules. This is because the application of the geometric mean for the scaling intercept effectively generalizes these parameters and mitigates the influence of environmental or body-shape variance among different populations.

This study recorded the a and b coefficients based on dry weight-length relationships, excluding wet weight-length relationships. The coefficients a and b were incorporated into a formula that utilizes lengths in millimeters (mm) and weights in milligrams (mg). Additionally, the log-based units were converted into decimal-based units. Larval data of several species groups were selected for representative weight because adults of groups such as Odonata, Plecoptera, Diptera, and Trichoptera, which are flying insects and not aquatic organisms.

Coefficients were treated with compiling and categorizing into several groups based on coefficient conditions (R2 and the number of samples used for deriving coefficients (n)). The control group filtered no coefficients. The 4 experimental groups filtered coefficients by using specific criteria with R2 and the number of samples used for deriving coefficients (n). Group 1 included coefficients with R2 > 0.8 and n > 10; Group 2 included those with R2 > 0.9 and n > 10; Group 3 included coefficients with R2 > 0.8 and n > 20; and Group 4 included those with R2 > 0.9 and n > 20 (Fig 2).

Calculating representative LWR coefficients for genus, family, order levels and estimating representative weight of species

The representative a and b coefficients for the LWRs were calculated by averaging coefficients belonging to the same taxonomic group at genus, family, order levels. LWR coefficients are influenced by the living habitat, morphology, and conditions around the target organisms [34]. Therefore, even for the same species, coefficients can vary depending on the study area and environmental conditions. This study generated average coefficients to generalize LWR coefficients at each taxonomic level. The representative a and b coefficients were calculated for each species using a database of coefficients collected from various regions. Coefficient a was averaged using the geometric mean as equation (1), and coefficient b was averaged using the arithmetic mean as equation (2) for the grouped coefficients [34]. The rationale for using these specific formulas lies in the statistical distribution of the LWR parameters. As established in the meta-analysis by Froese (2006), parameter a is typically log-normally distributed, necessitating the use of the geometric mean to account for this skewness and provide a central estimate. Conversely, the growth exponent b generally follows a normal distribution, making the arithmetic mean the most appropriate measure for deriving its representative value.

(1)(2)

The representative dry weight of each species was calculated using the representative LWR coefficients determined for each taxonomic level and group. In the previous step, coefficients were calculated as representative values for the genus, family, and order levels with condition of the control, group 1, group 2, group 3, and group 4. Thus, each species has 15 representative LWR coefficients. Here, the variables aref and bref are defined as the representative scaling intercept and the representative growth exponent, respectively, which were derived for each specific taxonomic level (genus, family, or order) through the aforementioned averaging process. These coefficients were applied to the general total length (GTL) of the invertebrates living in Korean river systems, as recorded in field guides, to calculate the representative dry weight for each species using the following equation (3).

(3)

The calculated representative dry weights for each species were then compared with the actual recorded weight from the field data to determine the methodology that produced the most accurate weight values. The accuracy of the estimated values was quantitatively compared using concordance R2 values across different groups using the following equation (4) (S2 Fig).

(4)

where R2 is the concordance coefficient of determination used to evaluate the agreement between estimated and measured values; is the i-th field-measured dry weight; is the corresponding estimated dry weight derived from the representative LWR coefficients; is the mean of the field-measured dry weights; and n is the total number of data points used for comparison.

The study compared 341 dry weight records of organisms from Korean freshwater ecosystem studies with the estimated representative species weight values. These estimated representative species weights were calculated using representative coefficients for length-weight relationships (LWRs) that were determined from each preprocessing strategy. During the validation of each regression model, dry weight records with Q > 6 were excluded, as suggested by Moen et al (1993). Here, where Qi is score to identify outlier values, yi is each monitored dry weight value for a species, MedAD is the median of the absolute values of differences between estimated representative species weights and field measured weights [43,44]. This exclusion was necessitated by potential discrepancies in the measured weight records. These discrepancies include errors in the adult weight measurements of flying species, as reported by Kim [45].

All statistical analyses, including concordance R2 calculations (Eq. 4), outlier detection based on the Q-score method, and logarithmic regression analyses between the number of LWR constants and prediction accuracy with p-value test, were performed using R (version 4.4.2; R Core Team, 2024) within the RStudio integrated development environment (version 2024.10.0 + 467, Posit Software, PBC). Geometric and arithmetic means for LWR coefficients (Eqs. 12) were computed in Microsoft Excel, and all figures were generated using the ggplot2 package in R.

Results

Representative Length-Weight Relationship coefficients for Korean aquatic invertebrates across taxonomic levels

A total of 524 length-weight relationship coefficients, compiled from 23 global studies on aquatic invertebrates [37,4666], were used to calculate representative values at multiple taxonomic levels for Korean aquatic invertebrates. When different species from the same study belonged to the same higher taxonomic level, each was treated as a separate source at that taxonomic level. Representative LWR coefficients were successfully calculated for 230 genera, 122 families, and 25 orders in the control group, with all coefficients based on body length (mm) to dry weight (mg) relationships (S2-S4 Tables).

Genus‐level source counts ranged from single‐source genera (n = 134) to 15 coefficients for the genus Baetis, yielding a mean of 1.97 coefficients per genus (Fig 3). At the family level, 47 families had a single source coefficient, while Chironomidae was represented by 38 coefficients (mean = 4.30 per family). Order‐level coefficients varied widely, with six orders relying on a single source and Ephemeroptera represented by 123 coefficients (mean = 20.96 per order). The most extensively studied taxonomic groups were Ephemeroptera, Diptera, Plecoptera, and Trichoptera at the order level; Chironomidae, Heptageniidae, Perlidae, and Baetidae at the family level; and Baetis, Simulium, Acroneuria, and Stenonema at the genus level. These patterns highlight ecological research focus areas and data gaps in LWR study across taxonomic groups

thumbnail
Fig 3. The number of Length-Weight Relationship (LWR) coefficients belonging to taxonomic groups at the (a) family level, (b) genus level, and (c) order level in database.

https://doi.org/10.1371/journal.pone.0352157.g003

Representative LWR coefficients under different preprocessing criteria

The application of different preprocessing criteria significantly affected the number of taxonomic groups for which representative LWR coefficients could be calculated (S5 Table). Coverage varied substantially between taxonomic levels and preprocessing strategies. At the genus level, data availability decreased markedly with stricter criteria. Specifically, the control group achieved complete coverage (100%, n = 230), while Group 1 covered 66.70% of genera (n = 154). Groups with stricter R2 thresholds showed progressive reduction: Group 2 (42.61%, n = 98), Group 3 (52.17%, n = 130), and Group 4 (33.48%, n = 77). Family-level coverage ranged from 100% (control) to 40.16% (Group 4), with Group 1 achieving 70.49% representation. Order-level coverage showed the highest retention rates across all strategies, ranging from 100% (control) to 52.00% (Group 4), with Group 1 maintaining 76.00% coverage (S6 Table). Only the control group maintained 100% species coverage, while all experimental groups excluded 20% or more of target species due to insufficient qualifying data.

The validation dataset comprised 341 dry weight records representing 49 genera from Korean freshwater ecosystems. Individual data points were recorded separately for each location, time period, and source to account for environmental and individual variations in length-weight relationships. The number of data points per taxonomic group varied considerably, ranging from 1 to 55 records (S1 Fig). This variation reflects differential research attention across taxonomic groups and species abundance patterns in Korean freshwater systems. All 341 datasets included complete information for species length and wet-to-dry conversion factors where applicable. Representative dry weights were successfully calculated for the entire dataset using the hierarchical LWR coefficient application approach.

Statistical analysis and comparison of estimation methods for calculating representative weight

Among 15 preprocessing strategy groups tested, performance varied significantly across taxonomic levels and selection criteria (Table 1, Figs 4-5). The analysis revealed distinct patterns in concordance between estimated and field-measured weights. Among genus-level strategies, the control group achieved the highest concordance (R2 = 0.6633) with minimal outlier exclusion (8 points, 3.33%) (Fig 5a). Groups 1−4 showed poor model fit despite low outlier numbers. Group 1 achieved the best performance (R2 = 0.2393) among family-level strategies (Fig 5b). The control group showed negative concordance (R2 = −0.0262) with 25 outliers (7.99%). Groups 1 and 3 demonstrated positive R2 values, while Groups 2 and 4 showed negative performance. Group 1 was the only strategy showing positive concordance (R2 = 0.2351) among experimental groups at the order level (Fig 5c). The control group achieved modest positive concordance (R2 = 0.0949) with the highest outlier exclusion (45 points, 13.2%). All strategies achieved R2 values below 0.7, with genus-level control conditions demonstrating superior performance compared to all other preprocessing approaches. The number of outliers increased consistently from genus to order level across all preprocessing strategies (Table 1).

thumbnail
Table 1. R2 Values representing concordance between expected and observed values for 15 groups of coefficients.

https://doi.org/10.1371/journal.pone.0352157.t001

thumbnail
Fig 4. Statistical analysis of (a) R2 values for different taxonomic levels with varying LWR coefficients preprocessing criteria and (b) the number of datasets after excepting outliers used for evaluating R2 values in each group.

https://doi.org/10.1371/journal.pone.0352157.g004

thumbnail
Fig 5. Concordance between representative species weight (mg) and field measured dry weight (mg) for preprocessing groups of LWR coefficients at (a) genus, (b) family, and (c) order level.

Group C, G1, G2, G3, and G4 mean control, group 1, group 2, group 3, and group 4. G, F, and O mean genus, family, and order.

https://doi.org/10.1371/journal.pone.0352157.g005

Preprocessing requirements for representative LWR coefficients

The relationship between the number of LWR constants used for representative weight calculation and prediction accuracy was analyzed across different taxonomic levels. Concordance analyses were performed comparing both linear and logarithmic relationships between the average number of LWR constants per data point and the concordance R2 values. The logarithmic relationship showed superior fit compared to the simple linear relationship. The best-fitting logarithmic model yielded R2 = 0.818 (p = 0.0349, significant explanatory power), significantly higher than the linear correlation (R2 = 0.779, p = 0.0473, significant explanatory power) (Fig 6a, 6e). The strength of concordance between LWR constant quantity and estimation accuracy varied significantly across taxonomic levels (Fig 6e-6h). Based on p-values (α = 0.05), significant concordances were observed only at the genus level and for the total data group. The genus level demonstrated the strongest concordance (R2 = 0.818) compared to the total data group (R2 = 0.315). Hence, the data quantity-accuracy relationship is most pronounced when taxonomic specificity is maintained at the genus level.

thumbnail
Fig 6. Correlation between R2 values and average number of data used for estimating coefficients at each taxonomic level (Genus, Family, Order); Concordance versus mean number of LWR constants used for calculating point at (a) Genus, (b) Family, (c) Order levels and (d) total dataset; Concordance versus Ln (mean number of LWR constants used for calculating point) at (e) Genus, (f) Family, (g) Order levels and (h) total dataset.

https://doi.org/10.1371/journal.pone.0352157.g006

Evaluation of optimal LWR pretreatment and literature-based weight estimation method

The performance of different preprocessing methods was evaluated by comparing estimated representative dry weights with field-measured values of Korean aquatic invertebrate species. Comparative analysis of preprocessing approaches revealed that genus-level control conditions achieved the highest concordance with field measurements (R2 = 0.6633). This performance exceeded other taxonomic level combinations and preprocessing criteria, establishing genus-level mean values as the optimal approach for representative weight estimation.

During validation, quality control procedures identified 8 data points (3.33% of total) as outliers in the genus-control group analysis (Table 1). These outliers consisted predominantly of cases where flying adult specimens were inadvertently included with aquatic samples [45]. After outlier removal, the methodology successfully explained the vast majority of field-measured weight variations (R2 = 0.6633). The study established representative LWR constants for 563 invertebrate species in Korean aquatic ecosystems using a hierarchical approach. The application priority followed genus-level representative constants as the primary choice, supplemented by family-level and order-level constants when genus-level data were unavailable (S1 Data). The achieved R2 value of 0.6633 using representative length data from field guides demonstrated comparable performance to single-species LWR studies [65,67,68], validating the effectiveness of the literature-based representative weight calculation approach for multi-species applications.

Discussions

The compilation of LWR coefficients for aquatic invertebrates revealed substantial taxonomic biases and data quality issues inherent to existing databases. Our analysis of 23 studies showed that most taxonomic groups suffered from limited representation, with 58.26% of genera, 35.61% of families, and 24.00% of orders having only single sources [67,6971] (S2-S4 Tables). This data scarcity particularly affects higher taxonomic levels, six order groups (Arhynchobdellida, Ixodida, Lumbriculata, Opisthorchiida, Sabellida, and Tubificida) relied on single sources (S4 Table), necessitating extrapolation of LWR constants across broader taxonomic ranges [34,66,72,73]. The limited availability of multiple sources makes it difficult to apply strict quality criteria at lower taxonomic levels, though such criteria become more feasible at higher levels where larger species pools can be evaluated. Notably, a significant finding was the ‘filtering paradox’ observed in experimental Groups 2 and 4. Despite applying stricter statistical criteria (R2 > 0.9), these groups showed poorer predictive accuracy (R2 < 0) than the unfiltered control group at the genus level. This suggests that requiring high internal correlation in source studies may lead to selection bias, where coefficients represent niche-specific populations that do not generalize well. Conversely, the aggregate mean of all available literature captures a broader range of allometric variance, making it more robust for diverse field validation data.

Our analysis revealed a fundamental trade-off between taxonomic specificity and data availability in LWR coefficient applications. The strong logarithmic correlation between the number of available LWR constants and predictive accuracy demonstrates that data quantity significantly influences estimation reliability, particularly at the genus level where ecological similarity is maximized [7476]. However, this relationship weakens substantially at higher taxonomic levels due to increased ecological and morphological diversity within groups. While broader taxonomic aggregation provides more available data points, the resulting heterogeneity reduces the biological relevance of derived representative constants, leading to decreased predictive accuracy despite larger sample sizes [47,65].

The performance of our literature-based framework at the genus level aligns well with findings from international river systems. Our achieved concordance (R2 = 0.6633) for independent Korean field data is highly robust, considering these are borrowed indices. This performance sits within the range of explanatory power reported in primary studies from Japan (R2 = 0.51–0.83) and North America (R2 > 0.85). The fact that our literature-based estimates approach the precision of locally derived regressions underscores the validity of the genus-level representative strategy for regional biomass assessments [46,54]. This demonstrates that using representative genus-level constants borrowed from global literature provides a level of predictive reliability that approaches that of locally derived regressions. Furthermore, specific allometric constants show remarkable trans-continental stability; for instance, the growth exponent for Baetis in our study (b = 2.69) is highly consistent with Japanese (b = 2.72) and North American (b = 2.42–3.20) records [46,54]. This confirms that genus-level allometry is relatively stable across geographic regions.

Beyond taxonomic specificity, several ecological and methodological factors may influence the transferability of borrowed LWR constants. From an environmental perspective, lotic organisms often exhibit more streamlined body forms with lower b values as an adaptation to minimize shear stress in flowing waters [57]. Methodologically, the choice of preservative in source studies is a critical source of bias; while formalin maintains structural integrity, ethanol is known to dissolve lipids, which can reduce dry biomass estimates by up to 20% [77,78]. These variations explain why the aggregate mean approach used in our control group, which averages out such contextual noise, proved more robust than selecting high-precision constants from single, potentially niche-specific environments. Additionally, other factors such as differences in field sampling techniques (e.g., kick net vs. Surber sampler), specimen drying and measuring methods, the presence or absence of gut contents at the time of weighing, the range of body lengths measured in each study, and the sample size used for deriving coefficients can introduce systematic biases in reported LWR coefficients [7779]. These considerations further support the rationale for aggregating coefficients from multiple sources to average out such contextual variation.

Field validation confirmed that genus-level representative constants provide the optimal balance between taxonomic specificity and data availability for biomass estimation in Korean aquatic invertebrates. The hierarchical application strategy—prioritizing genus-level constants, followed by family- and order-level constants when necessary—proved effective for practical implementation across diverse taxonomic groups. The approach using field guide-based representative lengths successfully captured field-measured weight variations, with most discrepancies attributed to developmental stage mixing rather than methodological limitations [45]. This methodology provides a robust framework for biomass estimation in data-limited contexts while maintaining biological relevance.

This study establishes a practical protocol for estimating aquatic invertebrate biomass using literature-derived LWR relationships, addressing a critical gap in ecological assessment capabilities for Korean freshwater systems. The demonstrated effectiveness of genus-level representative constants suggests that targeted research efforts should prioritize filling data gaps at this taxonomic level rather than pursuing broader taxonomic coverage. The inherent limitations of representing diverse size classes with single values highlight the need for continued field validation and expansion of species-specific databases. Future studies should focus on reducing taxonomic biases in LWR databases and developing region-specific coefficients to improve estimation accuracy for local ecological applications [66,73].

Conclusion

This study proposed a simple and practical approach to estimate the representative dry weight of freshwater invertebrate species in Korean ecosystems using Length-Weight Relationships (LWRs). This easily applicable method reduces the dependence on time-consuming field measurements. The representative weights calculated from expert-compiled field guide data exhibited strong correlation with field-measured dry weights (R2 = 0.6633), with greatest accuracy achieved through genus-level constants under control conditions. This demonstrates that field guide data can function as reliable resources for ecological assessments and management.

There are, however, limitations due to insufficient LWR data for certain species, which reduces accuracy at higher taxonomic levels (family and order). Furthermore, adopting standardized methods for field sampling, laboratory processing (e.g., drying procedures, gut content removal, and body length measurement protocols), and LWR derivation would help to produce more precise and comparable coefficients across studies and regions. Further research aimed at expanding LWR databases with additional field data collected under such standardized protocols, particularly for understudied taxa, will be essential to improve method accuracy and representativeness. Additionally, future studies should apply machine learning-based approaches for more precise optimization of representative weights where data are limited. Such innovations are expected to provide more robust baseline information to support vulnerability assessments, freshwater resource management, and ecological modeling.

Supporting information

S1 Table. Wet to dry conversion ratio for orders level.

https://doi.org/10.1371/journal.pone.0352157.s001

(DOCX)

S2 Table. Calculated representative coefficients of each taxonomic group at genus level and the number of used literatures (n) for calculating for control group.

https://doi.org/10.1371/journal.pone.0352157.s002

(DOCX)

S3 Table. Calculated representative coefficients of each taxonomic group at family level and the number of used literatures (n) for calculating for control group.

https://doi.org/10.1371/journal.pone.0352157.s003

(DOCX)

S4 Table. Calculated representative coefficients of taxonomic group at the order level and the number of literatures used (n) for calculating for control group.

https://doi.org/10.1371/journal.pone.0352157.s004

(DOCX)

S5 Table. The number and ratio of calculated representative coefficients for each taxonomic group at the genus, family, and order level.

https://doi.org/10.1371/journal.pone.0352157.s005

(DOCX)

S6 Table. Calculated representative coefficients of taxonomic group at the order level and the number of literatures used (n) for calculating for each experimental group.

https://doi.org/10.1371/journal.pone.0352157.s006

(DOCX)

S1 Fig. The number of measured dry weight data in each taxonomic group at (a) genus, (b) family, and (c) order levels.

https://doi.org/10.1371/journal.pone.0352157.s007

(TIF)

S2 Fig. Methodology for Evaluating Representative Biomass in Korean Invertebrates Using R2 Calculation.

https://doi.org/10.1371/journal.pone.0352157.s008

(TIF)

S1 Data. Estimated representative species weight for 564 species in Korean rivers.

https://doi.org/10.1371/journal.pone.0352157.s009

(XLSX)

References

  1. 1. Carson R. Silent spring. Thinking about the environment. Routledge. 2015. 150–5.
  2. 2. Bruinen de Bruin Y, Franco A, Ahrens A, Morris A, Verhagen H, Kephalopoulos S, et al. Enhancing the use of exposure science across EU chemical policies as part of the European Exposure Science Strategy 2020-2030. J Expo Sci Environ Epidemiol. 2022;32(4):513–25. pmid:34697409
  3. 3. Gawlik-Kobylińska M, Gudzbeler G, Szklarski Ł, Kopp N, Koch-Eschweiler H, Urban M. The EU-SENSE System for Chemical Hazards Detection, Identification, and Monitoring. Applied Sciences. 2021;11(21):10308.
  4. 4. Hou J, Gai W, Cheng W, Deng Y. Hazardous chemical leakage accidents and emergency evacuation response from 2009 to 2018 in China: A review. Safety Science. 2021;135:105101.
  5. 5. Jung S, Woo J, Kang C. Analysis of severe industrial accidents caused by hazardous chemicals in South Korea from January 2008 to June 2018. Safety Science. 2020;124:104580.
  6. 6. Kumar M, Chen H, Sarsaiya S, Qin S, Liu H, Awasthi MK, et al. Current research trends on micro- and nano-plastics as an emerging threat to global environment: A review. J Hazard Mater. 2021;409:124967. pmid:33517026
  7. 7. Kumar M, Sridharan S, Sawarkar AD, Shakeel A, Anerao P, Mannina G, et al. Current research trends on emerging contaminants pharmaceutical and personal care products (PPCPs): A comprehensive review. Sci Total Environ. 2023;859(Pt 1):160031. pmid:36372172
  8. 8. Wang L, Lv T, Zhang X, Hu H, Cai X. Global research trends and gaps in ecological compensation studies from 1990 to 2020: A scientometric review. Journal for Nature Conservation. 2022;65:126097.
  9. 9. McCallen E, Knott J, Nunez‐Mir G, Taylor B, Jo I, Fei S. Trends in ecology: shifts in ecological research themes over the past four decades. Frontiers in Ecol & Environ. 2019;17(2):109–16.
  10. 10. Ausubel JH. Current Trends and Environmental Research Needs. Interdisciplinary Science Reviews. 1988;13(3):211–21.
  11. 11. Minteer BA, Collins JP. Move it or lose it? The ecological ethics of relocating species under climate change. Ecol Appl. 2010;20(7):1801–4. pmid:21049870
  12. 12. Travis JMJ. Climate change and habitat destruction: a deadly anthropogenic cocktail. Proc Biol Sci. 2003;270(1514):467–73. pmid:12641900
  13. 13. Grimm NB, Chapin FS III, Bierwagen B, Gonzalez P, Groffman PM, Luo Y, et al. The impacts of climate change on ecosystem structure and function. Frontiers in Ecol & Environ. 2013;11(9):474–82.
  14. 14. Rubach MN, Ashauer R, Buchwalter DB, De Lange HJ, Hamer M, Preuss TG. Framework for traits‐based assessment in ecotoxicology. Integrated Environmental Assessment and Management. 2011;7(2):172–86.
  15. 15. Beroya-Eitner MA. Ecological vulnerability indicators. Ecological Indicators. 2016;60:329–34.
  16. 16. Dudgeon D, Strayer DL. Bending the curve of global freshwater biodiversity loss: what are the prospects?. Biological Reviews. 2025;100(1):205–26.
  17. 17. De Lange HJ, Sala S, Vighi M, Faber JH. Ecological vulnerability in risk assessment--a review and perspectives. Sci Total Environ. 2010;408(18):3871–9. pmid:20004002
  18. 18. De Lange HJ, Lahr J, Van der Pol JJC, Wessels Y, Faber JH. Ecological vulnerability in wildlife: an expert judgment and multicriteria analysis tool using ecological traits to assess relative impact of pollutants. Environ Toxicol Chem. 2009;28(10):2233–40. pmid:19432506
  19. 19. Park RA, Clough JS, Wellman MC. AQUATOX: Modeling environmental fate and ecological effects in aquatic ecosystems. Ecological Modelling. 2008;213(1):1–15.
  20. 20. Bartell SM, Pastorok RA, Akçakaya HR, Regan H, Ferson S, Mackay C. Realism and Relevance of Ecological Models Used in Chemical Risk Assessment. Human and Ecological Risk Assessment: An International Journal. 2003;9(4):907–38.
  21. 21. Yeom J, Jeong H, Son S, Lee YJ, Kim KY, Kim SD, et al. Graph neural network-based prediction and interpretation of Daphnia toxicity using distinct scale molecular representations. Water Res. 2026;297:125676. pmid:41795367
  22. 22. McPherson JM, Jetz W. Effects of species? ecology on the accuracy of distribution models. Ecography. 2007;30(1):135–51.
  23. 23. Malmqvist B. Aquatic invertebrates in riverine landscapes. Freshwater Biology. 2002;47(4):679–94.
  24. 24. Collen B, Whitton F, Dyer EE, Baillie JEM, Cumberlidge N, Darwall WRT, et al. Global patterns of freshwater species diversity, threat and endemism. Glob Ecol Biogeogr. 2014;23(1):40–51. pmid:26430385
  25. 25. Hwang S j, Gong D, Chang M, Joo G, Lee C. Stream/River Ecosystem Survey and Health Assessment (2022~2024). 2022.
  26. 26. Subramanian K, Sivaramakrishnan K. Aquatic Insects for Biomonitoring Freshwater Ecosystems-A Methodology Manual. Bangalore, India: Ashoka Trust for Ecology and Environment (ATREE). 2007.
  27. 27. Pauls SU, Alp M, Bálint M, Bernabò P, Čiampor F Jr, Čiamporová‐Zaťovičová Z, et al. Integrating molecular tools into freshwater ecology: developments and opportunities. Freshwater Biology. 2014;59(8):1559–76.
  28. 28. Cho M. Prediction of foodweb dynamics in reservoir aquatic ecosystem using AQUATOX. Seoul: Department of Environmental Science, Kunkuk university. 2016.
  29. 29. Lee D-Y, Lee D-S, Park Y-S. Taxonomic and Functional Diversity of Benthic Macroinvertebrate Assemblages in Reservoirs of South Korea. Int J Environ Res Public Health. 2022;20(1):673. pmid:36612995
  30. 30. Byeon JS, Kim DG. Development of a Wetland Ecosystem Health Assessment Method Using Benthic Macroinvertebrate Community. Wetlands. 2024;45(1).
  31. 31. Yeom J, Kim I, Kim M, Cho K, Kim SD. Coupling of the AQUATOX and EFDC Models for Ecological Impact Assessment of Chemical Spill Scenarios in the Jeonju River, Korea. Biology (Basel). 2020;9(10):340. pmid:33086610
  32. 32. Yoon CG, Rhee H-P, Son Y. Applicability study of ecological impact assessment using AQUATOX model in Paldang Reservoir, South Korea. Desalination and Water Treatment. 2017;60:39–47.
  33. 33. Nash RD, Valencia AH, Geffen AJ. The origin of Fulton’s condition factor—setting the record straight. Fisheries. 2006;31(5):236–8.
  34. 34. Froese R. Cube law, condition factor and weight-length relationships: history, meta-analysis and recommendations. J Appl Ichthyol. 2006;22(4):241–53.
  35. 35. Suh Y, Park J. The Fifth National Report to the Convention on Biological Diversity Republic of Korea. 2014.
  36. 36. Jeon T. Community studies on aquatic insects in three streams, tributaries of Suyong and Naktong rivers. Pusan: Pusan National University. 1998.
  37. 37. Chung K. Body length-mass relationships of aquatic insect of mountain stream in central Korean peninsula. Korean Journal of Ecology and Environment. 2008;41:320–30.
  38. 38. Kim JW. Standing crops of the aquatic insects communities in the east-coastal river in Korea. Korean Journal of Ecology and Environment. 1970;3:15–22.
  39. 39. Wui IS, Ra CH, Choi CG. Standing Crops of Aquatic Insects in Upper Stream of Yeong-san River. Korean Journal of Ecology and Environment. 1974;7:37–44.
  40. 40. Kim JW, Ree PK. Fauna of Fishes and Water-Insects in Upper Mountain Stream of Hal-River. Korean Society of Limnology. 1975;8:39–50.
  41. 41. Yoon SM. Effect of the heating overflow from the Taean thermoelectric power plant on the intertidal invertebrate community. Journal of Life Science Research. 1999;7:57–89.
  42. 42. James DA, Csargo IJ, Von Eschen A, Thul MD, Baker JM, Hayer C-A, et al. A generalized model for estimating the energy density of invertebrates. Freshwater Science. 2012;31(1):69–77.
  43. 43. Moen MO, Griffin KJ, Kalantar AH. Simple regression and outlier detection using the median method. Analytica Chimica Acta. 1993;277(2):477–87.
  44. 44. Yeom J, Kim SD. Comparing efficiency of literature-based estimation of general biomass using Length-Weight Relationships (LWRs) between freshwater and seawater fishes. PLoS One. 2025;20(6):e0321571. pmid:40460106
  45. 45. Kim JW. Standing crops of the aquatic insects communities in the river Kwangnung in Kyungki-do, Korea. Korean Journal of Ecology and Environment. 1968;1:51–4.
  46. 46. Benke AC, Huryn AD, Smock LA, Wallace JB. Length-Mass Relationships for Freshwater Macroinvertebrates in North America with Particular Reference to the Southeastern United States. Journal of the North American Benthological Society. 1999;18(3):308–43.
  47. 47. Johnston TA, Cunjak RA. Dry mass–length relationships for benthic insects: a review with new data from Catamaran Brook, New Brunswick, Canada. Freshwater Biology. 1999;41(4):653–74.
  48. 48. Rosati I, Barbone E, Basset A. Length–mass relationships for transitional water benthic macroinvertebrates in Mediterranean and Black Sea ecosystems. Estuarine, Coastal and Shelf Science. 2012;113:231–9.
  49. 49. Miserendino ML. Length-mass relationships for macroinvertebrates in freshwater environments of Patagonia (Argentina). Ecología Austral. 2001;11(1):3–8.
  50. 50. McCord SB, Christian AD, Grippo RS. Biomass dynamics of Tipula (Insecta: Diptera) in forested streams of the Interior Highlands, Arkansas. Journal of the Arkansas Academy of Science. 2006;60(1):74–9.
  51. 51. Hamdy R, Dorgham MM, El-Rashidy HH, Atta MM. Biometry and reproductive biology of Pseudonereis anomala Gravier 1901 (Polychaeta: Nereididae) on the Alexandria coast, Egypt. Oceanologia. 2014;56(1):41–58.
  52. 52. Mroczyński R, Daliga K. Biomass estimation using a length-weight relationship in beetle larvae (Coleoptera: Aphodiidae, Histeridae, Hydrophilidae, Staphylinidae) obtained from cow dung. Polish Journal of Entomology. 2016;85(4):399–407.
  53. 53. Giustini M, Miccoli FP, De Luca G, Cicolani B. Length–weight relationships for some plecoptera and ephemeroptera from a carbonate stream in central Apennine (Italy). Hydrobiologia. 2008;605(1):183–91.
  54. 54. Miyasaka H, Genkai-Kato M, Miyake Y, Kishi D, Katano I, Doi H, et al. Relationships between length and weight of freshwater macroinvertebrates in Japan. Limnology. 2008;9(1):75–80.
  55. 55. Smock LA. Relationships between body size and biomass of aquatic insects. Freshwater Biology. 1980;10(4):375–83.
  56. 56. Mura G, Brecciaroli B, Nagorskaya L. The effect of rearing density on body size and weight increase in Streptocephalus torvicornis (Anostraca). Crustac. 2004;77(2):231–43.
  57. 57. Baumgärtner D, Rothhaupt K. Predictive Length–Dry Mass Regressions for Freshwater Invertebrates in a Pre‐Alpine Lake Littoral. International Review of Hydrobiology. 2003;88(5):453–63.
  58. 58. Bowen TW. Production of the predaceous midge tribes Sphaeromiini and Palpomyiini (Diptera: Ceratopogonidae) in Lake Norman, North Carolina. Hydrobiologia. 1983;99(2):81–7.
  59. 59. da Rocha SS, da Silva RLS, Santos JL, Oliveira GC. Length-weight relationship and condition factor of Macrobrachium amazonicum (Heller, 1862) (Decapoda, Palaemonidae) from a reservoir in Bahia, Brazil. Nauplius. 2015;23(2):146.
  60. 60. Lill AWT, Closs GP, Savage C, Schallenberg M. Annual secondary production of two estuarine mysid species (Mysidacea: Mysidae) inhabiting an intermittently closed estuary, south-eastern New Zealand. Marine and Freshwater Research. 2011;62(7):823–34.
  61. 61. Ceylan M, Küçükkara R, Akçimen U, Yener O. Reproduction efficiency of the Horse Leech, Haemopis sanguisuga (Linnaeus, 1758). Invertebrate Reproduction & Development. 2017;61(3):182–8.
  62. 62. Zhao W-H, Wang H-J, Wang H-Z, Liu X-Q. Conversion methods of freshwater snail tissue dry mass and ash free dry mass. Ying Yong Sheng Tai Xue Bao. 2009;20(6):1452–8. pmid:19795658
  63. 63. Daborn GR. Length-weight allometric relationships in four crustaceans from Alberta lakes and ponds. Can J Zool. 1974;52(11):1303–10. pmid:4434275
  64. 64. Mocq J, Dekanová V, Boukal DS. Length-mass allometries of the larvae of aquatic dipterans: differences between taxa, morphological traits, and methods. J Insect Sci. 2024;24(1):10. pmid:38367014
  65. 65. Dekanová V, Venarsky MP, Bunn SE. Length–mass relationships of Australian aquatic invertebrates. Austral Ecology. 2021;47(1):120–6.
  66. 66. Sabo JL, Bastow JL, Power ME. Length–mass relationships for adult aquatic and terrestrial invertebrates in a California watershed. Journal of the North American Benthological Society. 2002;21(2):336–43.
  67. 67. Robinson LA, Greenstreet SPR, Reiss H, Callaway R, Craeymeersch J, de Boois I, et al. Length–weight relationships of 216 North Sea benthic invertebrates and fish. J Mar Biol Ass. 2010;90(1):95–104.
  68. 68. Nakagawa H, Takemon Y. Length-mass relationships of macro-invertebrates in a freshwater stream in Japan. Aquatic Insects. 2014;36(1):53–61.
  69. 69. de Barros G, Brito MTS, Peluso LM, de Faria É, Izzo TJ, Teixido AL. Biased research generates large gaps on invertebrate biota knowledge in Brazilian freshwater ecosystems. Perspectives in Ecology and Conservation. 2020;18(3):190–6.
  70. 70. Leather SR. Taxonomic chauvinism threatens the future of entomology. Biologist. 2009;56(1):10–3.
  71. 71. Cardoso P, Erwin TL, Borges PAV, New TR. The seven impediments in invertebrate conservation and how to overcome them. Biological Conservation. 2011;144(11):2647–55.
  72. 72. Froese R, Thorson JT, Reyes RB Jr. A Bayesian approach for estimating length-weight relationships in fishes. J Appl Ichthyol. 2013;30(1):78–85.
  73. 73. Kulbicki M, Guillemot N, Amand M. A general approach to length-weight relationships for New Caledonian lagoon fishes. Cybium. 2005;29(3):235–52.
  74. 74. Eng J. Sample size estimation: how many individuals should be studied?. Radiology. 2003;227(2):309–13. pmid:12732691
  75. 75. Caudill SP, Turner WE, Patterson DG Jr. Geometric mean estimation from pooled samples. Chemosphere. 2007;69(3):371–80.
  76. 76. Abyani M, Asgarian B, Zarrin M. Sample geometric mean versus sample median in closed form framework of seismic reliability evaluation: a case study comparison. Earthq Eng Eng Vib. 2019;18(1):187–201.
  77. 77. Leuven RSEW, Brock TCM, van Druten HAM. Effects of preservation on dry- and ash-free dry weight biomass of some common aquatic macro-invertebrates. Hydrobiologia. 1985;127(2):151–9.
  78. 78. Mährlein M, Pätzig M, Brauns M, Dolman AM. Length–mass relationships for lake macroinvertebrates corrected for back-transformation and preservation effects. Hydrobiologia. 2015;768(1):37–50.
  79. 79. Dekanová V, Svitok M, Bento S, Caramelo J, Peixe E Sousa P, Carreira BM. Length-mass relationships of pond macroinvertebrates do not hold between Southern and Northern Europe. PeerJ. 2024;12:e18576. pmid:39703909