Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Research on emergency management of urban waterlogging based on similarity fusion of multi-source heterogeneous data

Abstract

Global warming has seriously affected the local climate characteristics of cities, resulting in the frequent occurrence of urban waterlogging with severe economic losses and casualties. Aiming to improve the effectiveness of disaster emergency management, we propose a novel emergency decision model embedding similarity algorithms of heterogeneous multi-attribute based on case-based reasoning. First, this paper establishes a multi-dimensional attribute system of urban waterlogging catastrophes cases based on the Wuli-Shili-Renli theory. Due to the heterogeneity of attributes of waterlogging cases, different algorithms to measure the attribute similarity are designed for crisp symbols, crisp numbers, interval numbers, fuzzy linguistic variables, and hesitant fuzzy linguistic term sets. Then, this paper combines the best-worst method with the maximal deviation method for a more reasonable weight allocation of attributes. Finally, the hybrid similarity between the historical and the target cases is obtained by aggregating attribute similarities via the weighted method. According to the given threshold value, a similar historical case set is built whose emergency measures are used to provide the reference for the target case. Additionally, a case of urban waterlogging emergency is conducted to demonstrate the applicability and effectiveness of the proposed model, which exploits historical experiences and retrieves the optimal scheme for the current disaster emergency with heterogeneous multi attributes. Consequently, the proposed model solves the problem of diverse data types to satisfy the needs of case presentation and retrieval. Compared with the existing model, it can better realize the multi-dimensional expression and fast matching of the cases.

1. Introduction

Nowadays, urban waterlogging occurs in various countries frequently due to abnormal climate, destructed eco-environment, and human activities. Modern cities appear extremely vulnerable to heavy rain and floods, which dramatically increases the difficulty of preventing and controlling urban floods because of irresponsible urban planning and poor infrastructure for waterlogging prevention. Urban waterlogging disasters have brought catastrophic damages to infrastructures and people’s lives worldwide. Not only urban waterlogging but various types of emergencies could cause severe consequences. It continually reminds governments of the importance of emergency decision-making. Hence, to improve the efficiency of emergency decision-making, it is crucial to build a suitable emergency decision model suitable for the characteristics of emergencies.

Many researchers have spent considerable effort on emergency decisions and have achieved significant breakthroughs. In public health emergencies, existing research provided the model which combined the best-worst method (BWM) and data envelopment analysis to select the reasonable sites of makeshift hospitals [1] and the method for interactive multi-criteria group decision-making with probabilistic linguistic information to select emergency assistance areas [2]. Geetha et al. proposed a fuzzy case-based reasoning approach for finding COVID-19 patients’ priority in hospitals during the source shortage period [3]. Wan et al. developed a personalized individual semantic-based consensus reaching process and applied it to select COVID-19 surveillance plans [4]. Cui et al. proposed a new group decision-making method to select the right COVID-19 epidemic prevention and control programs [5]. Based on case-based reasoning(CBR), a model embedding the grey relational analysis and the grey wolf optimization algorithms was proposed for engineering emergency decisions [6]. In the disaster emergency, a bi-objective trapezoidal fuzzy mixed-integer linear program model was designed for emergency logistics locations after the earthquake [7]. Yu et al. proposed a method integrating case-based reasoning with intelligent algorithms to support disaster emergency management [8]. An emergency decision-making model for environmental emergencies was constructed based on CBR to deal with gasoline explosion accidents [9].

From the above studies, emergency events are characterized by urgent time and serious consequences. Because of the limitations of experience and knowledge, humans cannot make an optimal decision in the case of limited time. Compared with other methods, CBR is more suitable for on-site optimization of emergency responses, which can provide a referential solution for decision-makers in the shortest time by analyzing prior experience. CBR has been widely applied in many domains, not only including emergency management [6, 8, 9] but also in complex artificial intelligence [1013], waste treatment [14], and biological domain [15]. In construction projects, CBR helped a lot in risk management and estate valuation [16, 17]. Qin et al. [18] proposed an ontology-supported case-based reasoning approach for computer-aided tolerance specification. Thus, considering the advantages and success of similar applications in various fields, CBR appears to be a promising method for emergency management in urban waterlogging.

In the practical CBR applications, both the historical cases stored in the case base and the target case are described by multiple attributes, and the formats of the attribute values are various. For example, Geetha et al. [3] introduced a fuzzy CBR approach and used fuzzy linguistic variables to represent case attributes. A novel model for predicting traffic emission was proposed using interval-valued intuitionistic fuzzy sets and CBR theory [19]. Zheng et al. [20] developed a new case retrieval method for hybrid multi-attribute, which involved interval numbers and intuitionistic fuzzy numbers. With the diversification of data types, a single data type cannot satisfy the needs of case presentation. Many scholars used heterogeneous data to represent case attributes, including crisp symbols, crisp numbers, and fuzzy numbers [2123]. The description of urban waterlogging cases is complicated, so it is necessary to consider various forms of data. For example, the warning level of urban waterlogging can be represented by crisp symbols, and crisp numbers describe rainfall. Meanwhile, transportation, electrical, and communication equipment under urban waterlogging need to be represented by hesitant fuzzy linguistic term sets.

Because hesitancy is a very common problem in decision-making, we use hesitant fuzzy linguistic term sets to increase the flexibility of eliciting and representing linguistic information. The theory of fuzzy sets was proposed in 1965 by Zadeh. Then, Torra [24] introduced hesitant fuzzy sets, a generalization of fuzzy sets that permits us to represent the situation in which different membership functions are considered possible. Further, Xu and Xia [25] studied the distance and similarity of hesitant fuzzy sets. Farhadinia [26] proposed the systematic transformation of the entropy into the similarity measure for hesitant fuzzy sets. Verma [27] studied some new properties arising from operations on hesitant fuzzy sets. However, uncertainty is produced by the vagueness of meanings whose nature is qualitative rather than quantitative. For such cases, Rodríguez et al. [28] proposed the concept of the hesitant fuzzy linguistic term set. Then, Liao et al. [29] studied the distance and similarity measures for hesitant fuzzy linguistic term sets.

Usually, the retrieval of the CBR method is achieved based on the similarity measure between the historical cases and the target case. Thus, the study on the similarity measure method in case retrieval is necessary. In the research of similarity measures, Li et al. [30] introduced some similarity measures for the fuzzy environment. A new method was developed by calculating the similarity between paths to retrieve previously performed routes based on CBR [31]. With the uncertainty of features in cases, a hybrid similarity measure for case retrieval was designed to solve the problem of data diversification in the CBR method [2023].

Although these approaches are effective in dealing with emergency decision-making problems, there are still some research gaps:

  1. There is no research on emergency management of urban waterlogging. Due to the changeable decision-making environment and the difficulty in attribute selection, it is urgent to establish an effective model for assessing the urban waterlogging situation.
  2. Previous CBR studies generally consider one or two forms of data in case presentation. Now, the description of cases is increasingly complicated, so a single data type cannot satisfy the needs of case presentation. The attributes of the cases should be described by different data types, such as crisp symbols, crisp numbers, interval numbers, fuzzy linguistic variables, and hesitant fuzzy linguistic term sets. Especially, an extensive review of CBR literature reveals few kinds of research on heterogeneous multi-attribute decision-making involving hesitant fuzzy linguistic term sets.
  3. There are hybrid attribute values for a case in CBR practical application. Performing case retrieval with high accuracy for multiple formats of attribute values is a significant challenge, but an in-depth study is lacking. It is necessary to design different matching algorithms for diverse data types. Especially, the similarity measure for hesitant fuzzy linguistic term sets in existing studies ignored the hesitance degree, which leads to information loss and distortion.
  4. Because of the abundance of different types of attributes, it is difficult to elicit appropriate attribute weights from human experts. Therefore, determining criteria weight vectors in heterogeneous multi-attributes decision-making problems becomes another research gap.

In order to fill these research gaps, this paper aims to research emergency management of urban waterlogging based on similarity fusion of multi-source heterogeneous data. The contributions of our work are as follows. First, we establish a multi-dimensional attribute system of urban waterlogging disaster cases based on WSR theory. Second, in the case presentation process, we use five different data types to describe the situation of urban waterlogging cases, including crisp symbols, crisp numbers, interval numbers, fuzzy linguistic variables, and hesitant fuzzy linguistic term sets. At the same time, different matching algorithms are designed for heterogeneous attributes to calculate local similarity. We propose a new method to calculate the similarity of hesitant fuzzy linguistic term sets. Third, we develop a new method of weighting that combines the BWM with the maximal deviation method, which integrates the subjective and objective factors. Finally, attribute similarities for the five formats are aggregated using the weighted method to form a hybrid similarity. Based on these results, a heterogeneous multi-attribute emergency decision-making model based on CBR is developed to retrieve the optimal scheme for the current disaster emergency.

The paper is organized as follows: Section 2 constructs a case database of urban waterlogging, which introduces a multi-dimensional attribute system, case presentation, and data sources. Section 3 presents the construction of an emergency decision-making model based on CBR and the decision-making process. In section 4, case studies of urban waterlogging emergencies are conducted to validate the method’s effectiveness. Section 5 provides some numerical results and comparisons to justify the algorithm. Finally, conclusions and suggestions for further research are summarized in Section 6.

2. Establishment of a case base

2.1 Multi-dimensional heterogeneous attributes system of urban waterlogging

This paper adopts a paradigm of WSR system approach studies, which emphasizes the synthesis of perspectives toward problem description and analysis to determine attributes of urban waterlogging cases. “Wuli-Shili-Renli (WSR)” as a system approach is a methodology of integrated system engineering, a typical system thinking of Chinese traditional philosophical views [32]. Wuli refers to the nature and mechanism of things from natural science. Shili is how people can use the mechanism of things to guide the practice through the understanding of Wuli, that is, to provide an answer to the problem. Renli entails the analysis of the influence and relationship between people with various functions in the system, combined with the traditional cultural, legal, and other factors, to study how to give full play to the initiative in practice. It is better to aim to connect WSR to get a comprehensive scenario of what is concerned and find a satisfactory and feasible result. When applying the WSR approach, we emphasize knowing Wuli, sensing Shili, and caring for Renli.

According to the current study on the construction of the urban waterlogging case and the experience of urban waterlogging emergency management, this paper analyzes attributes of urban waterlogging from the perspectives of WSR. The result shows that thirteen factors should be considered to present the case of urban waterlogging. In addition, these attributes are presented by five types of data, as shown in Table 1.

2.2 Case presentation and data sources

Case presentation mainly presents the historical enforcement cases and the target enforcement case according to a specific format, which provides the basis for the CBR process. An appropriate case presentation method can improve the efficiency of extracting historical cases and enhance the accuracy of the results of waterlogging case analysis. In this paper, the case presentation is composed of the case attributes and the emergency measures.

Assume that Z = {Z1, Z2, … Zn} stands for the set of historical cases and Co represents the target case, in which Zi represents the number i historical case, iN = {1, 2, …, n}. Let X = {X1, X2, …, Xm} be the collection of the attributes of waterlogging cases, where Xj represents the number j attribute in the case, jM = {1, 2, …, m}. At the same time, xi = {xi1, xi2, …, xim} and c = {co1, co2, …, com}, jM represent the set of attributes of historical cases and target case, respectively. Assume that yi = {yi1, yi2, …, yil} stands for the collection of emergency measures of each historical case, where yik represents the number k measure in the number i case, k = 1, 2, …l. Let ω = (ω1, ω2ωm)T be the weight vector of the attributes of the urban waterlogging case, where ωj is the weight of the jth attribute of the case. Meanwhile, the attribute values of waterlogging cases can be expressed in the forms of crisp symbols, crisp numbers, interval numbers, fuzzy linguistic variables, and hesitant fuzzy linguistic term sets.

Nowadays, society is one era of information explosion, and how to fully obtain various data from different sources to serve people is becoming more critical. Therefore, the historical case data of this study is from multiple sources, including existing documents, browsing through emergency management-related websites, and using the crawler technology to pull data. Because of the inconsistency of multi-sources data, it is necessary to carry on the operations of choosing data, proving data, and revising data to produce integrated information. In this paper, data pre-processing is done through python programming and Excel.

3. Construction of emergency decision-making model based on CBR

3.1 Local similarity measurement for heterogeneous attributes

Urban waterlogging attributes are measured by five types of value formats, crisp symbols, crisp numbers, interval numbers, fuzzy linguistic variables, and hesitant fuzzy linguistic term sets, as shown in Table 1. This study extends the applicability of similarity measures of the classic CBR from single-category attributes to heterogeneous multi-attributes. In measuring the similarity between the two cases, local similarities are found by measuring the similarity of each attribute. Then, these local similarities are aggregated to calculate hybrid similarity by using the weights. Therefore, this study constructs different similarity algorithms to calculate the local similarity between attributes regarding five different data types. At the same time, coj and xij are the attribute values of historical case Zi and target case Co, respectively.

  1. Crisp symbols
    Crisp symbols are enumeration values. When the attribute value is a crisp symbol, the only thing to do is to compare the same factors of the target case and historical cases, so the following calculation formula can be obtained:(1)
  2. Crisp numbers
    When we use crisp numbers to describe the attributes, the distance-based method can be employed to measure the similarity between the historical and the target cases, and the formula is [33]: (2) where,
  3. Interval numbers
    Interval numbers are used to describe the uncertainty of the attribute value. When attributes are presented by interval numbers, and are the attribute values of the target case and historical case, respectively. The formula is [34]: (3)
  4. Fuzzy linguistic variables
    For the attributes of fuzzy linguistic variables, values are linguistic variables such as very bad(VB), bad(B), medium(M), good(G), and very good(VG). Each of these linguistic variables is represented by a triangular fuzzy number and these numbers are (0, 0, 0.2), (0.1, 0.3, 0.4), (0.3, 0.5, 0.7), (0.6, 0.7, 0.9), (0.8, 1.0, 1.0), respectively. In the following equations [35, 36], and are the attribute values of the target case and historical case: (4)
  5. Hesitant fuzzy linguistic term sets

Definition 1 [37]. Let xiX(i = 1, 2, …, n), be fixed and S = {st|t = −τ, …, −1, 0, 1, …τ} be a linguistic term set. A hesitant fuzzy linguistic term set (HFLTS) on X, is in mathematical terms of HS = {〈xi, hS(xi)〉|xiX}, where hS(xi)is a set of some values in the linguistic term set S and can be expressed as hS(xi) = {sϕn(xi)|sϕn(xi) ∈ S, n = 1, …, N} with N being number of linguistic terms in hS(xi). hS(xi) expresses the possible degrees of linguistic variable xi to the linguistic term set S. hS(xi) is called the hesitant fuzzy linguistic element (HFLE) and HS is the set of all HFLEs.

Definition 2 [29]. Let b = {bl|l = 1, …, #b} be an HFLTS (#b is the number of linguistic terms in b), b+ and b be the maximum and minimum linguistic terms in b respectively, and ς(0 ≤ ς ≤ 1) be an optimized parameter. Then we can add the linguistic term into the HFLTS.

Definition 3. Let S = {st|t = −τ, …, −1, 0, 1, …, τ} be a linguistic term set, and be two hesitant fuzzy linguistic elements. For many distance measures for HFLTSs, the difference in number of linguistic terms between two HFLEs is ignored. This paper improves the distance measure of HFLTSs based on this. The hesitance degree of the hesitant fuzzy linguistic elements is defined as: (5) where is number of linguistic terms in the intersection of and , as well as being number of linguistic terms in the union of and .

Definition 4. For the hesitant fuzzy linguistic elements and in Definition 3, and are number of linguistic terms in and , respectively. If (otherwise, the shorter one will be extended by adding the linguistic terms through the method in Definition 2.), let the linguistic terms in and be arranged in ascending order. Combining the distance measure for HFLTSs in [29] with the hesitance degree of the hesitant fuzzy linguistic elements in Definition 3, this paper proposes a hybrid similarity measurement, such as improved Hamming hesitant fuzzy linguistic similarity: (6) and Euclidean similarity between and : (7)

Thus, the generalized similarity between and can be defined as: (8) Where λ>0.

Particularly, if λ = 1, the above generalized similarity becomes the Hamming similarity. If λ = 2, the above generalized similarity becomes the Euclidean similarity.

3.2 Determination of weights

The weights of each attribute are vital to hybrid similarity measurement, which directly affects the result of case reasoning. There are so many methods of weight determination, such as the analytic hierarchy process [16], the Delphi method [32], and the maximal deviation [38].

A new approach integrating subjective and objective weights is proposed in this study. It calculates two weights for each of the criteria. The subjective weight is obtained using the best worst method. At the same time, the objective weight is obtained using the maximal deviation method. Then, the final weight is calculated as a combination of these two weights.

3.2.1 A linear model of BWM.

The BWM, a recently developed intuitive and robust multi-criteria decision-making technique, was applied to determine the weights of the decision criteria [39]. The original BWM involves a nonlinear model that sometimes results in multiple optimal weights meaning that the weight of each criterion is presented as an interval. In some cases, a unique solution is preferred. Therefore, this paper adopts a linear model of BWM whose linearity results in a unique solution. The steps followed in BWM to derive the weights of the decision criteria are as follows:

  1. Step 1: Determine a set of decision criteria. A set of decision criteria {X1, X2, …, Xm} are identified, which are used to make a decision.
  2. Step 2: Determine the best criteria (B) and the worst criteria (W). The experts choose the B and W criteria arbitrarily.
  3. Step 3: Establish the best-to-others (BO) vector. The preference of the best criterion over all the other criteria is determined using a number between 1 and 9. Then, the BO vector would be given: AB = (aB1, aB2, …, aBm), and aBj shows the preference of the B over criterion j. Obviously, aBB = 1.
  4. Step 4: Establish the others-to-worst (OW) vector. The preference of all the criteria over the worst criterion is determined using a number between 1 and 9. Then, the OW vector would be given: AW = (a1W, a2W, …, amW)T, where, ajW shows the preference of the criterion j over the worst criterion W. Obviously, aWW = 1.
  5. Step 5: In order to determine the optimal weights of the criteria ω′ = (ω1, ω2ωm)T, the following linear programming problem should be solved. (9)

The solution to the above problem is the final weights of the criteria and the value of ξL. ξL (values closer to zero show better consistency) is an indicator of the consistency of the comparisons.

3.2.2 The maximal deviation model.

This paper adopts the maximal deviation method to determine the weight coefficients because it can automatically obtain the exact and reliable weights without subjectivity. Suppose Z = {Z1, Z2, …Zn} is the set of cases for multi-attribute decision-making, where X = {X1, X2, …Xm} represents the set of attributes. Let ω″ = (ω1, ω2ωm)T be the weight vector. P = (pij)n×m represents the normalized decision matrix, where i ∈ N, jM. Let , where dj expresses the total deviation which is relative to Xj. The total deviation among all the different attributes Xj with respect to all the cases can be maximized.

If there is no information about the criteria weight, this paper uses the model to determine the weight ω″ = (ω1, ω2ωm)T as follows: (10)

Solving the model, and according to , the following result can be derived: (11)

If the criteria weight is partly known and O is the set of incomplete weight information [40], we can also solve the following model to determine the weight vector ω″ = (ω1, ω2ωm)T.

(12)

3.2.3 Combination weight.

Based on the above two models, the subjective weight ω′ = (ω1, ω2ωm)T is obtained using the BWM, and the objective weight ω″ = (ω1, ω2ωm)T is obtained using the maximal deviation method. Then, the final weight is calculated using a linear combination (Eq (13)). The value of the α parameter is determined by the decision-maker, which indicates the degree of subjective preference as a number between 0 and 1.

(13)

The decision-maker can decide which of the two weights will play a more significant role in determining the final weights. Generally, the subjective weight obtained by experts receives more attention. Therefore, α = 0.618 makes more sense, which is the golden ratio.

3.3 Hybrid similarity measurement method

Aiming at calculating the hybrid similarity between target case Co and historical case Zi, all local similarities of attributes calculated as mentioned above are aggregated by using weights based on the K-nearest approach. The hybrid similarity is calculated by the formula as follows: (14) Where, the closer SIM(Co, Zi) is to 1, the greater the similarity between the historical case Zi and the target case Co.

In order to improve the rationality of emergency decision-making, it is necessary to set a similarity threshold. Let λ represent the similarity threshold. In this paper, the similarity threshold λ can be calculated by the formula as follows: (15) where τ is determined by the decision-maker. When SIM(Co, Zi) ≥ λ, historical cases and emergency measures will be extracted. The extracted historical cases will form a similar case set Ω. That is, Ω = {Zi|iN*}, N* = {i|SIM(Co, Zi) ≥ λ, iN}, in which N* is a set of subscripts for historical cases with reference significance for the target case.

3.4 Extraction of similar historical cases

To sum up, this study proposes a CBR-based algorithm based on similarity fusion of multi-source heterogeneous data. The primary extraction process of similar historical cases can be seen in the following flow chart Fig 1.

  1. Step 1: Process the data of historical cases and the target case. For hesitant fuzzy linguistic term sets, extend the shorter one by adding the linguistic terms through the method in Definition 2 and get a standardized decision matrix.
  2. Step 2: Based on the linear model of BWM in section 3, compute the subjective weight ω′ = (ω1, ω2ωm)T.
  3. With no weight information about the criteria, use Eq (11) to compute the objective weight ω″ = (ω1, ω2ωm)T. With incomplete weight information, use Eq (12) to compute the objective weight ω″ = (ω1, ω2ωm)T.
    Then, the final weights of the decision criteria can be calculated using Eq (13).
  4. Step 3: According to Eqs (1)–(4) and (7), use different algorithms to measure the local similarity of heterogeneous attributes involving crisp symbols, crisp numbers, interval numbers, fuzzy linguistic variables, and hesitant fuzzy linguistic term sets.
  5. Step 4: Compute hybrid similarity between target case Co and historical case Zi with Eq (14), all local similarities of attributes calculated in the third step are aggregated using weights based on the K-nearest approach.
  6. Step 5: Depending on the results in the fourth step, let the similarities be arranged in descending order. Extract similar historical cases, which construct a similar case set Ω and output emergency measure of similar cases to provide a reference for the target case.

4. Case study

Comprehensive experiments have been conducted to demonstrate the proposed methodology. This paper selects nine urban waterlogging disasters in China, Japan, and Germany as historical cases. It selects the urban waterlogging event which occurred in Zhengzhou, China, on July 20, 2021, as the target case. There are thirteen attributes determined based on the WSR methodology to describe features of urban waterlogging events. The multi-dimensional attributes system of urban waterlogging is shown in Table 1. The Wuli dimension includes the duration of rainfall (X1), rainfall total (X2), maximum rainfall per hour (X3), reservoir operation (X4), state of the embankment (X5), and depth of surface accumulated water (X6). In addition, the Renli dimension includes direct economic losses (X7), casualties (X8), range of traffic disruption (X9), range of communication outages (X10), and range of power outages (X11). Finally, the Shili dimension includes the early-warning level (X12) and relief supplies demand (X13). These attributes are presented by five data types: crisp symbols, crisp numbers, interval numbers, fuzzy linguistic variables, and hesitant fuzzy linguistic term sets. For the attributes of fuzzy linguistic variables, values are linguistic variables such as very bad(VB), bad(B), medium(M), good(G), and very good(VG). Each of these linguistic variables is represented by a triangular fuzzy number and these numbers are (0, 0, 0.2), (0.1, 0.3, 0.4), (0.3, 0.5, 0.7), (0.6, 0.7, 0.9), (0.8, 1.0, 1.0), respectively. For the attributes described by hesitant fuzzy linguistic term sets, a five-point linguistic scale is constructed, which is S = {s-2 = very large, s-1 = large, s0 = normal, s1 = small, s2 = very small}. Depending on the data of historical cases (Z1, Z2, …, Z9) and target case(Co), the initial decision matrix is shown in Table 2.

Based on the reference comparisons in BWM, the priorities of the criteria are determined as numbers between 1 and 9. Table 3 shows the decision-maker preferences.

The Decision-making process is as follows.

  1. Step 1: Process initial data. For hesitant fuzzy linguistic term sets, extend the shorter one by adding the linguistic terms through the method in Definition 2. The result is shown in Table 4.
  2. Step 2: Using the linear model of BWM in section 3 and the pairwise comparison vectors as shown in Table 3, we have: ω’ = (0.0957, 0.0717, 0.1435, 0.0574, 0.0715, 0.0574, 0.0478, 0.0957, 0.0359, 0.0187, 0.0410, 0.2278, 0.0359)T, and ξL = 0.05 which shows the results consistency.
    Because there is no information about the criteria weight, compute the objective weight of each attribute based on Eq (11) and the weight vector is ω” = (0.1090, 0.0561, 0.0803, 0.0488, 0.0477, 0.0637, 0.0670, 0.0831, 0.0593, 0.0445, 0.0697, 0.2269, 0.0439)T.
    After calculating the double weights of the criteria based on the BWM method and the maximal deviation, the final weights of the decision criteria can be calculated using Eq (13).
  3. Step 3: According to Eqs (1)–(4) and (7), compute the local similarity of heterogeneous attributes through different matching algorithms, as shown in Table 5.
  4. Step 4: Compute hybrid similarity SIM(Co, Zi) between target case Co and historical case Zi with Eq (14), and the results are as follows.
  5. Step 5: Depending on the results in Step 4, let the similarities be arranged in descending order.
    Therefore, the sorted result of historical cases is shown below.

Suppose τ = 0.9. Then, the similarity threshold λ calculated using Eq (15) is 0.8331. Consequently, Z6 meets the threshold condition and constitutes a historical case similarity set Ω = {Z6}, which means the measures of the historical case with the high similarity can provide proper guidance for the decision-making of the target case. The emergency measures of case Z6 are shown in Table 6.

thumbnail
Table 6. Emergency measures of the similar historical case.

https://doi.org/10.1371/journal.pone.0270925.t006

5. Comparisons and discussions

In reference [9], only the common combination of three heterogeneous attributes, including crisp numbers, interval numbers and fuzzy numbers, was selected to test the proposed model. In practice, there are far more than these attribute forms. Due to human cognitive limitations, different data sources, and time constraints, a single data type cannot satisfy the needs of the case presentation. The proposed method comprehensively considers the multiple formats of attribute values involved in case retrieval. For a better description of the cases, these attributes are described in five forms in this paper, including crisp symbols, crisp numbers, interval numbers, fuzzy linguistic variables, and hesitant fuzzy linguistic term sets. This paper proposes a novel heterogeneous attribute emergency decision-making model based on CBR, with more forms of expression embedded to further improve the proposed model’s applicability for more complicated emergency decision events.

This paper proposes a novel similarity measurement between hesitant fuzzy linguistic term sets (HFLTSs), considering the difference in the number of linguistic terms between two HFLEs, which is different from the method in [41]. The hybrid similarities SIM(Co, Zi) between target case Co and historical case Zi calculated by the method in [36] are shown below.

The sorted result of historical cases is as follows:

Compared the results computed by two similarity measures for hesitant fuzzy linguistic term sets, the similarity values between cases have changed. The reason is that the hesitance degree of the hesitant fuzzy element has a different influence on the calculation. Although the method in [41] considers the number of membership degrees in HFLEs, defining the hesitance degree based on membership degrees is inadequate. In order to overcome this problem, this study proposes a new hesitance degree considering the ratio of the number between the intersection and the union in HFLEs, which can provide better results. Compared to the method in [41], the model proposed in this paper efficiently avoids information loss and distortion and reduces errors in measurement.

Additionally, this paper proposes an advanced criteria weighting method, which combines the BWM with the maximal deviation method. The factors’ weights in [9] were determined by the analysis hierarchy process(AHP) method. Compared to AHP, the BWM is a method based on the vector that needs fewer comparisons. The BWM method only requires 2n-3 comparisons, while AHP needs n(n − 1)/2 comparisons. In order to avoid the uncertainty of subjective assigning of weights from the experts and extract the accurate information from the available data, this paper uses the maximal deviation method to obtain the objective weight. The combination weight comprehensively considers the prior knowledge of the experts and objective influence factors of indicators.

6. Conclusions

Learning from previous emergency measures is an effective approach to deal with the present emergency. According to the characteristics of urban waterlogging emergencies, this paper establishes the multi-dimensional heterogeneous attributes system based on WSR, which can better describe the cases and lay a foundation for the emergency decision-making model. Different similarity algorithms are designed for five data types: crisp symbols, crisp numbers, interval numbers, fuzzy linguistic variables, and hesitant fuzzy linguistic term sets. A hybrid similarity measure between the historical and the target cases is developed to rank and select similar historical cases. It greatly improves the efficiency of the emergency model. The measures of the historical case with the high similarity can provide proper guidance for the decision-making of the target case. Finally, this study empirically tests the proposed model with an example of an urban waterlogging emergency in Zhengzhou, which proves the method is reasonable and scientific. However, there is still room for improvement. This paper only concerns five forms of data. In practice, there are far more than these attribute forms. Moreover, this study assumed that decision-makers have the same understanding of the linguistic terms used. In decision-making problems, decision-makers may use unbalanced linguistic term sets and personalized individual semantics to provide linguistic assessments [42, 43]. Thus, it is necessary to further consider this point in linguistic decision-making. Furthermore, the increase in historical cases will result in the deterioration of the CBR resolution time. Effective management of the case base is necessary, which may be solved by artificial intelligence. In the future, this will be implemented through efforts.

Acknowledgments

The authors thank Mr. Cao and Mrs. Li for their invaluable help and advice throughout the study; Mr. Liu for assistance with statistics; and Xiaofei for English revision, who is a translator and spares no effort to polish this paper.

References

  1. 1. Chen Z H, Wan S P, Dong J Y. An efficiency-based interval type-2 fuzzy multi-criteria group decision making for makeshift hospital selection[J]. Applied Soft Computing, 2022, 115: 108243. pmid:34899106
  2. 2. Wan S P, Cheng W B H, Dong J Y. Interactive multi-criteria group decision-making with probabilistic linguistic information for emergency assistance of COVID-19[J]. Applied Soft Computing, 2021, 107: 107383. pmid:35721366
  3. 3. Geetha S, Narayanamoorthy S, Manirathinam T, et al. Fuzzy Case-based reasoning approach for finding COVID-19 patients priority in hospitals at source shortage period[J]. Expert Systems with Applications, 2021, 178:114997. pmid:33846668
  4. 4. Wan S P, Yan J, Dong J Y. Personalized individual semantics based consensus reaching process for large-scale group decision making with probabilistic linguistic preference relations and application to COVID-19 surveillance[J]. Expert Systems with Applications, 2022, 191: 116328.
  5. 5. Cui C, Li B Wang L. The Selection of COVID-19 Epidemic Prevention and Control Programs Based on Group Decision Making[J]. Complex & Intelligent Systems, 2022(1):1–10.
  6. 6. Chen W, Wang X, Wang W, et al. A heterogeneous GRA-CBR-based multi-attribute emergency decision-making model considering weight optimization with dual information correlation[J]. Expert Systems with Applications, 2021, 182: 115208.
  7. 7. Wan S P, Chen Z H, Dong J Y. Bi-objective trapezoidal fuzzy mixed integer linear program-based distribution center location decision for large-scale emergencies[J]. Applied Soft Computing, 2021, 110: 107757.
  8. 8. Yu X, Li C, Zhao W X, et al. A novel case adaptation method based on differential evolution algorithm for disaster emergency[J]. Applied Soft Computing, 2020, 92: 106306.
  9. 9. Wang D, Wan K, Ma W. Emergency decision-making model of environmental emergencies based on case-based reasoning method[J]. Journal of Environmental Management, 2020, 262:110382. pmid:32250833
  10. 10. Corbat L, Nauval M, Henriet J, et al. A fusion method based on Deep Learning and Case-Based Reasoning which improves the resulting medical image segmentations[J]. Expert Systems with Applications, 2020, 147:113200.
  11. 11. Guo Y, Zhang B, Sun Y, et al. Machine learning based feature selection and knowledge reasoning for CBR system under big data[J]. Pattern Recognition, 2020, 112(6):107805.
  12. 12. Homem T, Santos P E, Costa A, et al. Qualitative Case-Based Reasoning and Learning[J]. Artificial Intelligence, 2020, 283:103258.
  13. 13. Quemy A. Binary classification in unstructured space with hypergraph case-based reasoning[J]. Information Systems, 2019,85: 92–113.
  14. 14. Chazara P, Negny S, Montastruc L. Flexible knowledge representation and new similarity measure: Application on case based reasoning for waste treatment[J]. Expert Systems with Applications, 2016, 58: 143–154.
  15. 15. Arshadi N, Jurisica I. Data mining for case-based reasoning in high-dimensional biological domains[J]. IEEE Transactions on Knowledge & Data Engineering, 2005, 17(8):1127–1137.
  16. 16. Okudan O, Budayan C, Dikmen I. A knowledge-based risk management tool for construction projects using case-based reasoning[J]. Expert Systems With Applications, 2021, 173: 114776.
  17. 17. I-Cheng Y, Tzu-Kuang H. Building real estate valuation models with comparative approach through case-based reasoning[J]. Applied Soft Computing,2018,65:260–271.
  18. 18. Qin Y, Lu W, Qi Q, et al. Towards an ontology-supported case-based reasoning approach for computer-aided tolerance specification[J]. Knowledge-Based Systems, 2018, 141(2):129–147.
  19. 19. Wei M, Dai Q, Sun S, et al. A prediction model for traffic emission based on interval-valued intuitionistic fuzzy sets and case-based reasoning theory[J]. Journal of Intelligent & Fuzzy Systems, 2016, 31(6):3039–3046.
  20. 20. Zheng J, Wang Y M, Lin Y, et al. Hybrid multi-attribute case retrieval method based on intuitionistic fuzzy and evidence reasoning[J]. Journal of Intelligent and Fuzzy Systems, 2019, 36(1):271–282.
  21. 21. Fan Z P, Li Y H, Zhang Y. Generating project risk response strategies based on CBR: A case study[J]. Expert Systems with Applications, 2015, 42(6): 2870–2883.
  22. 22. Cui C, Li J, Zang Z. Measuring Product Similarity with Hesitant Fuzzy Set for Recommendation [J]. Mathematics, 2021, 9(21): 2657.
  23. 23. Huang Z, Fan H, Shen L. Case-based reasoning for selection of the best practices in low-carbon city development[J]. Frontiers of Engineering Management, 2019, 6(3): 416–432.
  24. 24. Torra V. Hesitant fuzzy sets[J]. International journal of intelligent systems, 2010, 25(6): 529–539.
  25. 25. Xu Z, Xia M. Distance and similarity measures for hesitant fuzzy sets[J]. Information Sciences, 2011, 181(11): 2128–2138.
  26. 26. Farhadinia B. Information measures for hesitant fuzzy sets and interval-valued hesitant fuzzy sets[J]. Information Sciences, 2013, 240(1): 129–144.
  27. 27. Verma R. Operations on hesitant fuzzy sets: Some new results[J]. Journal of Intelligent & Fuzzy Systems, 2015, 29(1): 43–52.
  28. 28. Rodríguez R.M., Martínez L., Herrera F., Hesitant fuzzy linguistic terms sets for decision making, IEEE Trans. Fuzzy Syst, 2012, 20: 109–119.
  29. 29. Liao H, Xu Z, Zeng X J. Distance and similarity measures for hesitant fuzzy linguistic term sets and their application in multi-criteria decision making[J]. Information Sciences, 2014, 271(3):125–142.
  30. 30. Li P, Wei C. A Case-Based Reasoning Decision-Making Model for Hesitant Fuzzy Linguistic Information[J]. International Journal of Fuzzy Systems, 2018, 20(7): 2175–2186.
  31. 31. Quirion-Blais O, Chen L. A case-based reasoning approach to solve the vehicle routing problem with time windows and drivers’ experience[J]. Omega, 2020, 102:102340.
  32. 32. Gu J F, Tang X J, and Zhu Z X. Survey on Wuli-Shili-Renli System Approach, Journal of Transportation Systems Engineering and Information Technology, 2007(6):51–60.
  33. 33. Fan Z P, Li Y H, Wang X H, et al. Hybrid similarity measure for case retrieval in CBR and its application to emergency response towards gas explosion[J]. Expert Systems with Application, 2014,41(5):2526–2534.
  34. 34. Slonim T Y, Schneider M. Design issues in fuzzy case-based reasoning[J]. Fuzzy Sets and Systems, 2001, 117(2):251–267.
  35. 35. Xu Z, Da Q. A least deviation method to obtain a priority vector of a fuzzy preference relation[J]. European Journal of Operational Research, 2005, 164(1):206–216.
  36. 36. Sevastianov P. Numerical methods for interval and fuzzy number comparison based on the probabilistic approach and Dempste-Shafer theory[J]. Information Sciences, 2007, 177(21): 4645–4661.
  37. 37. Liao H, Xu Z, Zeng X J, et al. Qualitative decision making with correlation coefficients of hesitant fuzzy linguistic term sets[J]. Knowledge-Based Systems, 2015, 76(mar.):127–138.
  38. 38. Wang Y. Using the method of maximizing deviation to make decision for multiindices[J]. Journal of Systems Engineering and Electronics, 1997, 8(3):21–26.
  39. 39. Rezaei J. Best-worst multi-criteria decision-making method: Some properties and a linear model[J]. Omega, 2016, 64: 126–130.
  40. 40. Zhang Z, Gao J, Gao Y, et al. Two-sided matching decision making with multi-granular hesitant fuzzy linguistic term sets and incomplete criteria weight information[J]. Expert Systems with Applications, 2021, 168: 114311.
  41. 41. Rezaei K, Rezaei H. New distance and similarity measures for hesitant fuzzy sets and their application in hierarchical clustering[J]. Journal of Intelligent and Fuzzy Systems, 2020, 39(3).
  42. 42. Zhang Z, Li Z, Gao Y. Consensus reaching for group decision making with multi-granular unbalanced linguistic information: A bounded confidence and minimum adjustment-based approach[J]. Information Fusion, 2021, 74: 96–110.
  43. 43. Zhang Z, Li Z. Personalized Individual Semantics-Based Consistency Control and Consensus Reaching in Linguistic Group Decision Making[J]. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2021.