A fog-assisted group-based truth discovery framework over mobile crowdsensing data streams

Bayan Hashr Saeed Alamri; Muhammad Mostafa Monowar; Suhair Alshehri; Mohammad Haseeb Zafar

doi:10.1371/journal.pone.0330656

Abstract

With the proliferation of mobile crowdsensing (MCS) and crowdsourcing, new challenges are emerging every day. Although crowdsensing has become a popular sensing paradigm for aggregating sensor readings from a variety of sources, data inconsistency has arisen as a serious challenge. Truth discovery (TD) has been developed as an effective method for reducing data inconsistency and for assessing the validity of conflicting data from various sources. In addition, MCS applications and services are moving beyond single individual participants to community groups and are influenced by group behavior. To address these challenges, we propose a novel Fog-assisted Group-based Truth Discovery Framework over MCS Data Streams, an efficient TD system for real-time applications. Specifically, we first initialize the weights for the weight update process in TD with the participants’ credibility level. Then, we develop a novel Two-layer Group-based Truth Discovery (TGTD) mechanism in which the first layer estimates the truth of the group’s members and the second layer estimates the aggregated truth for the groups. We conduct extensive experiments over synthetic and real-world datasets to demonstrate the effectiveness and efficiency of our framework. The results indicate that TGTD achieves superior truth discovery accuracy compared to current streaming truth discovery approaches, while maintaining a reasonable running time. The organization of the streaming process within the fog architecture simulation is identified as an area for further investigation and future work.

Citation: Alamri BHS, Monowar MM, Alshehri S, Zafar MH (2025) A fog-assisted group-based truth discovery framework over mobile crowdsensing data streams. PLoS One 20(8): e0330656. https://doi.org/10.1371/journal.pone.0330656

Editor: Muhammad Anwar, University of Education, PAKISTAN

Received: April 11, 2024; Accepted: July 31, 2025; Published: August 26, 2025

Copyright: © 2025 Hashr Saeed Alamri et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The data is available in files within the Figshare repository https://doi.org/10.6084/m9.figshare.26251022.

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Introduction

In the mobile crowdsensing (MCS) paradigm, data collected by participants are not always accurate or reliable due to background noise, sensor and hardware quality, a lack of effort, and insufficient skill. Moreover, the sensor readings of participants on the same tasks may differ [1–3]. Therefore, while sensor readings are highly valuable, it remains a research challenge to extract truthful data from the conflicting, heterogeneous, and noisy readings reported by participants. Decisions based on untruthful data can cause serious damage. For example, serious consequences may result from patients’ medical records being scattered across different hospitals, and wrong diagnoses can occur when based on incorrect measurements [4]. Furthermore, scientific discovery can be guided in the wrong direction by faulty data [5]. Therefore, it is crucial to extract the most trusted and reliable results from conflicting sources. Research on the quality of ground truth has brought significant improvements in the accuracy of data aggregation [6–9]. Hence, a possible solution for discovering truthful information from unreliable data is truth discovery (TD) [10–12], which has been widely studied recently and applied in many MCS applications [13–15]. The goal of TD is to estimate participants’ and workers’ data quality and infer reliable aggregated information through quality-aware data aggregation [16,17]. The TD process aims to estimate the truth, that is, the value closest to the true value of the task, using participants’ weights as a reliability input. The main principle of the TD method is that participants are given a high weight if they contribute sensor readings closer to the ground truth, and their sensor readings then count more in the aggregation process. Several studies have focused on TD techniques, and a variety of TD approaches have been proposed to calculate participants’ weights and aggregated results based on TD principles [16,17].
It is intuitive to trust reliable participants more when deriving the truth, and the naïve approach that regards all participants as reliable and equal in the aggregation may fail to infer reliable results. In most studies on TD, the researchers have assumed that most participants and workers are reliable [8,11,18–23], though this assumption is not practical in most MCS scenarios, especially when the ground truth is unknown. Some rely on the observation that the majority of users contribute reliable data even in the absence of ground truth [8,11]. Others initialize a random ground truth at the beginning of the TD process [18], assume the worker weights to be known [19,23], or assign random weights at the start [20–22]. Formally, in most previous studies, as long as most of the participants honestly contribute sensing data, TD can generate an effective truth estimate [3,6,8,18–32]. What happens, however, if the misbehaving participants are dominant at first? Specifically, the quality estimation in these studies starts with zero knowledge of participants’ reliability. If the misbehaving participants dominate at first, the quality estimation is likely to be inaccurate and to incorrectly consider these participants reliable, which in turn generates false truth estimations [8]. Furthermore, in most truth discovery scenarios, data are collected in a streaming manner, reported sequentially from multiple sources, as in traffic monitoring applications, flight data, and weather forecast information [33]. Consequently, it is impractical to wait for all the sensor readings to be collected before estimating the truth and source reliability. As this framework focuses on real-time MCS applications, it relies on the incremental conflict resolution (iCRH) algorithm [16], which deals with streaming data [23,34].
In addition, MCS applications and services are moving beyond single individual participants to community groups and are influenced by group behavior [35]. MCS grouping exists to support real-world group activities (e.g., meetings and parties). Grouping is also an important aspect of the design space for MCS systems, touching on management, economics [36], social networks, and social influence phenomena [37,38].

In this work, our framework can still generate accurate truth estimation results even if the majority of participants are unreliable. This is because, by initializing the weights in the TD process with the participants’ credibility level, our estimation approach lets the high-weight participants contribute more to the TD process, which allows the truth estimation result to be accurately generated. In addition, we adopt the Fog Node (FN) architecture to minimize the overhead on the participants’ side. The FNs are located in different geographical locations and provide fog computing services for both the participants and the sensing platform [39]. Each FN communicates with the participants, forwards their sensor readings to the platform, and helps in the TD process along with the sensing platform.

In this paper, we address four challenges when developing the Fog-assisted Group-based Truth Discovery Framework over MCS Data Streams, an efficient TD system for real-time applications. First, participants may be lazy and selfish when collecting sensor readings with their resource-constrained devices. Hence, they may reduce their sensing efforts, such as by reducing their resources, time, and attention in the sensing tasks, which will significantly impair the aggregation [40]. Accordingly, the TD algorithm needs to start the process with credible participants’ weights. Second, sensor readings are collected as streams rather than as static data in real-time applications; hence, each participant submits sensor readings at regular intervals. This challenge requires a streaming TD algorithm [16,41]. Third, we address the shortcomings of existing group-based TD in MCS. Therefore, we present a novel Two-layer Group-based Truth Discovery (TGTD) mechanism to calculate TD for group-based activities in MCS. Finally, for large-scale deployment of real-time applications, the TD scheme should be efficient, even if the number of participants grows dramatically. With the increasing complexity and scale, there is a need to enhance MCS with a fog-computing paradigm to reduce computation complexity and communication overhead. Each FN manages a group of participants and works as an intermediate between participants and the sensing platform.

To the best of our knowledge, this work is the first that investigates group-based TD. Moreover, existing works start the TD process to estimate the data quality with zero knowledge about the ground truth and participants’ weight or uniform initialization of user weight; therefore, if the misbehaving participants dominate at the start, it is likely to generate false truth estimation and classify these participants as reliable. To the best of our knowledge, this work is the first to start the TD process with a credible participant’s weight. Finally, existing works failed to achieve efficient TD for a large population of participants.

In particular, the main contribution of this work is a Fog-assisted Group-based Truth Discovery Framework over MCS Data Streams. Overall, our contributions can be summarized as follows:

We develop a novel TD method for crowdsensing data streams in which we initialize the weights in TD with the participants’ credibility level. We formulate the credibility function based on the participants’ readiness, commitment, and the device’s ability to perform the sensing tasks. This method improves the accuracy of the data, as participants with a high credibility level are likely to provide accurate data.
We develop a novel fog-assisted Two-layer Group-based Truth Discovery (TGTD) mechanism. The first layer estimates the truth of the group’s members and the second layer estimates the aggregated truth for the groups in the area of interest. Additionally, it offloads the process to the FNs deployed in different geographic locations, to minimize the overhead on the participants’ side.
We conduct extensive simulations using synthetic datasets and different real-world datasets and verify that TGTD surpasses other streaming TD methods in terms of TD accuracy, while maintaining reasonable computation time.

The remainder of this article is organized as follows. The Related work section reviews related work on TD schemes in MCS. The Preliminaries section states our system model, design goals, and brief preliminaries of truth discovery. The Fog-assisted Group-based Truth Discovery Framework over MCS Data Streams and its underlying mechanisms are introduced in the Proposed framework section. In the Performance evaluation section, the framework is evaluated through various experiments and simulations. Finally, we conclude the article in the Conclusion section.

Related work

Many studies have directed their attention toward exploring the truth discovery problem, which is an effective technique for MCS quality-aware data aggregation from heterogeneous sources [16,17,42]. Li et al. [17] propose a general truth discovery framework, which has been utilized in many other studies. They propose the Conflict Resolution on Heterogeneous Data (CRH) approach, which iteratively conducts truth aggregation and weight estimation until convergence. Furthermore, in [16], a general truth discovery framework for single, heterogeneous, and stream data types is proposed. Li et al. [42] propose confidence-aware truth discovery (CATD) to automatically infer truths from conflicting data with a long-tail phenomenon.

Thereupon, multiple researchers have adopted the same concept of CRH to find truth discovery from multiple observations to qualify sensor data and reward participants based on their contribution to the truth. The data quality in these studies is based on the deviation between reported sensor data and the ground truth [8,18–20,22–24,26–30]. Yang et al. [8] propose a quality estimation model through unsupervised learning. They adopt the idea of clustering for ground truth estimation and measuring the data quality of each participant. In this model, data quality is based on the deviation between trustworthy data and the ground truth. It achieves better performance in terms of quality estimation compared with other heuristic models. On the other hand, it assumes that every mobile device has equal sensing capabilities; however, the participants’ behaviors are uncertain, and their sensing devices are heterogeneous. Moreover, it assumes that quality data providers dominate at the start of the truth estimation model, which is an impractical assumption in the real-world MCS.

Others address privacy concerns during the truth discovery process [19,20,23–26,28–30,39,43], adopting the Paillier cryptosystem [18,23,26,27,43], anonymity [23,30,39], or differential privacy [19,20,22,30,34,44] to protect customers’ and participants’ sensitive information and the estimated weights. The authors in [39] propose a fog-assisted data collection scheme for MCS. This scheme utilizes a session key agreement mechanism for the MCS data collection environment. It achieves data anonymity and accuracy without relying on a trusted third party (TTP). Liu et al. [23] tackle the dropout of participants in the MCS system by proposing robust and scalable TD for real-time MCS applications. Their scheme processes heterogeneous sensing data streams based on iCRH and achieves highly efficient computation and sufficiently accurate truth estimates. In [30], the authors tackle the problem of truth discovery protocols that impose heavy computation and communication overhead. They propose the PerturbTD protocol to reduce participant overhead when participants share their weights during the truth discovery process. PerturbTD reduces the overhead on the participants by shifting the truth discovery process to two cloud servers. Gao et al. [43] propose a novel and efficient location privacy-preserving truth discovery (LoPPTD) mechanism for MCS. It divides the area of interest into grids and exploits super-increasing sequences. Then, participants structure their sensor readings into a report that is uploaded to the FN, which applies the truth discovery process. Zhao et al. [27] propose PRICE, a privacy-preserving and reliability-aware real-time incentive system in MCS. Furthermore, they design a two-layer stream truth discovery model.
This model tackles the single time-slice of failure (STOF) problem in data streams by adding a one-round truth discovery layer: the first layer processes the sensing data stream, and the second layer processes the estimated ground truth from the first layer. The problem of producing reliable aggregate results while protecting participants’ information in truth discovery is addressed in [26]. Two schemes are proposed that use a two-layer randomized response mechanism and a Gaussian noise mechanism. These schemes adopt the truth discovery method CRH and assign participants’ weights based on information quality; hence, the aggregated data do not deviate much from the true value. The authors in [44] propose a LightPrivacy scheme to balance participants’ personalized privacy and task data practicability in MCS. Chen et al. [22] propose a novel robust privacy truth discovery scheme called RPPTD in which differential privacy allows the operation server to obtain the truth without leaking the participants’ privacy. This scheme adopts CRH as a truth discovery process that includes weight update and truth update processes. RPPTD achieves robustness and eliminates single-point failures without the need for trusted third-party servers. The location privacy truth discovery MCS system is addressed in [20], where the authors propose location obfuscation truth estimation based on differential privacy. They model location privacy as an optimization problem that is solved via linear programming to minimize the global truth estimation deviation under differential privacy constraints. In [19], a novel framework called Truth discovEry via probabilistic eStimation mall under rigorous Local differential privAcy (TESLA) is introduced. In this framework, the privacy-protecting noise has only a weak negative effect on the weight estimation and truth aggregation.
In addition, TESLA includes a probabilistic weight mechanism for determining a more accurate weight for each participant. Moreover, based on the fused value, the sensing platform performs the truth discovery process by adopting CRH and uses the probabilistic weight mechanism for weight estimation. TESLA achieves high effectiveness and efficiency, although stream-sensing data tasks still need to be considered. However, these systems impose an overhead on participants and the sensing platform, which discourages them from engaging in the truth discovery process.

Some works improve the efficiency and performance of truth discovery systems by developing their systems under fog or edge computation [22–25,32,34,39,44–48]. Accordingly, they shift the aggregation of sensor data and the truth discovery process to an external node. Xu et al. [45] tackle the problem of false event reports generated in MCS systems. They design a fog-assisted crowdsensing architecture for vehicular applications. This system converts the trust assessment issue into a maximum likelihood estimation problem, which is solved through the expectation-maximization algorithm. The FN in this scheme verifies data trustworthiness, filters the data, and then uploads local traffic conditions to service providers and cloud servers. The authors in [32] address truth discovery in real-time applications with a large number of participants. They propose a fog-aided privacy-preserving truth discovery framework that is secure and efficient in handling real-time applications with a large group of participants. In addition, they design a secure aggregation protocol, SecAgg, which can securely and efficiently aggregate inputs from workers in smaller groups. This framework adopts a cloud-fog computing architecture to divide the complete worker group into many smaller ones. In [25], edge-assisted truth discovery for large-scale MCS is proposed, utilizing CRH to estimate the truth for both the deep cloud and edge cloud. In addition, the authors propose an incentive mechanism consisting of truth discovery and reverse auction stages. The scheme shows superiority in terms of estimation precision, but it incurs high computation and communication overhead. Zhang et al. [21] tackle truth discovery for MCS applications with stable and moving participants. They update the reliability and the ground truth and filter out false data before sending them to the cloud, while keeping the computational costs and communication overheads minimal.
In these schemes, the fog nodes play an intermediary role between the participants and the sensing platform. They aggregate the sensory data and transmit the aggregated results to the platform. Although they achieve high efficiency and practicality, it is difficult to detect outliers in these schemes. Moreover, these schemes are designed under the two-server settings.

Equally important, multiple studies focus on encouraging participants to engage in improving truth discovery accuracy [18,23,25,44,47] by developing an incentive mechanism to make sure enough trusted sensor data are used in the truth discovery process. The problem of estimated truth discovery on an edge cloud and the incentive for participants to contribute to the truth discovery process is addressed in [25]. They propose edge-assisted truth discovery for large-scale MCS by utilizing CRH to estimate the truth for both the deep cloud and edge cloud. In addition, they propose an incentive mechanism consisting of truth discovery and reverse auction stages. A privacy incentive mechanism based on truth discovery called PAID is proposed in [18]. It sets task constraints such as spatial, temporal, and the type of sensing data to remove untrustworthy participants’ sensor data that do not satisfy these constraints. Then, it calculates the truth discovery based on the remaining qualified participants. In PAID, the servers get the aggregation result and adopt CRH to iteratively calculate the ground truth. Moreover, the data quality is calculated based on the participant’s weight.

Since most of these works are designed for one-time truth discovery, an iCRH is introduced in which the heterogeneous data streams are processed in each time slot [22,23,32,34]. Similarly, [41] introduced centralized S-CATD for streaming crowdsourcing data. Liu et al. [23] process sensing heterogeneous data streams based on iCRH. The participants report their masked data and truth level to the server. Then, the server utilizes secure summation aggregation to learn the sum of participants’ weights and the sum of participants’ sensing data to compute the truth of the sensing data. This scheme achieves highly efficient computation and enough accurate truthful information. Chen et al. [22] address the robustness of the truth discovery framework with a single server model. They propose a novel robust privacy truth discovery scheme called RPPTD in which differential privacy allows the operation server to obtain the truth without leaking the participants’ privacy. This scheme adopts CRH as a truth discovery process that includes weight update and truth update processes. RPPTD achieves robustness and eliminates single-point failures without the need for trusted third-party servers. Wang et al. [34] tackle the truth discovery problem in streaming crowdsourcing tasks and the privacy of the workers. They propose an edge computing-based privacy-preserving truth discovery scheme for streaming crowdsourcing tasks called PrivSTD. It utilizes edge servers to enable workers to estimate local truths and their reliability, based on which the incentive and perturbation mechanisms are developed. It considers correlations among truths over time and the characteristics of participants’ reliability. Mukkamala et al. [41] address the challenge of providing reliable and scalable truth discovery on general streaming data, aiming for higher accuracy and lower cost. They introduce both centralized and decentralized streaming schemes tailored for crowdsourcing applications. 
These schemes leverage CATD [42], incorporating iterative procedures to enhance TD accuracy. The centralized streaming CATD updates participants’ weights based on their task performance.

To the best of our knowledge, no studies have been published on group-based truth discovery. Furthermore, existing works fail to achieve efficient truth discovery for large groups of participants. In addition, they start the truth discovery process with zero knowledge about the ground truth and the participants’ weights, or with a uniform initialization of the participants’ weights. Hence, if the misbehaving participants dominate at the start, the process is likely to generate false truth estimations and classify these participants as reliable. Moreover, the truth discovery process in prior works iteratively conducts aggregation and weight estimation steps until convergence. The convergence criterion can be a threshold for the change of the aggregated results in two consecutive iterations or a predefined iteration number. As a result, overhead is imposed on the participants’ side.

In contrast to prior truth discovery systems, our system focuses on estimating group-based truth discovery. Accordingly, it supports data quality measurement in any group-based MCS application. In addition, it improves the quality of sensor data by estimating the truth while considering only credible participants. Hence, we examine first the participants’ credibility and readiness to engage in the sensing task. In the meantime, the overhead on the participants’ side is minimized by leveraging fog-based computing, which assists in calculating the truth value and data quality. Our truth discovery system is built on the iCRH approach due to its state-of-the-art efficiency performance on the data stream [22,23,32,34]. In addition, we adopt a non-private version of the iCRH in this work. Table 1 summarizes the characteristics of this framework in contrast to the above truth discovery schemes.

Table 1. Comparison with other TD schemes.

https://doi.org/10.1371/journal.pone.0330656.t001

Preliminaries

In this section, we present the system model and design goals and then describe the underlying TD algorithm. The main notations used in this paper are outlined in Table 2.

Table 2. Notations and descriptions.

https://doi.org/10.1371/journal.pone.0330656.t002

System model

We consider a crowdsensing scenario in which the sensing platform monitors a phenomenon without knowledge of the ground truth. The architecture of the Fog-assisted Group-based Truth Discovery Framework over MCS Data Streams mainly involves four entities:

Participant: The mobile device user who collects sensor data for sensing tasks and submits them to a nearby fog node.
Fog node: An entity at the edge of the network with computation capabilities. It is responsible for aggregating the sensing data, managing the group of participants, initializing the weights with the participants’ credibility level, performing the TD process, and uploading the aggregate result to the sensing platform.
Sensing platform: An MCS platform that assigns sensing tasks to the participants, collects the aggregated result from the fog node, cooperates with the FN to perform the TD process, and sends the quality result to the requester.
Requester: The end user who publishes the sensing task about some phenomenon and requests accurate sensor data about this task.

In this work, we refer to the quality of each participant as its weight, to the ability and readiness of the participant to perform a sensing task as credibility, and to the truth of each task as the ground truth. In our model, we formalize the problem by assuming that there are |S| sensing tasks, |I| participants, and |F| fog nodes, so that tasks are indexed by {1,…,|S|}, participants by {1,…,|I|}, and FNs by {1,…,|F|}. In each time slot t, each participant collects a sensor reading, and the estimated truth is then generated from these readings. Participants with higher weights are more likely to be considered reliable. However, in a realistic scenario, starting the TD process with zero knowledge about the ground truth and the participants’ weights, or with a random or uniform initialization of the weights, can lead to faulty results if unreliable participants dominate at first. Furthermore, it is hard to obtain more trustworthy sensor readings without knowing which participants are credible, dependable, and in possession of the required attributes.
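To illustrate why the starting weights matter, the following minimal Python sketch compares a uniformly weighted first estimate with a credibility-weighted one for a single task in a single time slot. The readings, the true value of 20.0, the credibility levels, and the function name `weighted_truth` are hypothetical values chosen for illustration only; they do not come from the experiments in this article.

```python
def weighted_truth(readings, weights):
    """Weighted average of one task's readings (one value per participant)."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(w * x for w, x in zip(weights, readings)) / total


# Hypothetical slot: the true value is 20.0; the first two participants are
# reliable and the remaining three (the majority) report inflated values.
readings = [20.1, 19.9, 30.0, 31.0, 29.0]

uniform_weights = [1.0] * len(readings)        # zero prior knowledge
credibility_weights = [0.9, 0.8, 0.1, 0.1, 0.1]  # assumed normalized credibility

uniform_truth = weighted_truth(readings, uniform_weights)
credible_truth = weighted_truth(readings, credibility_weights)

print(f"uniform start:     {uniform_truth:.3f}")
print(f"credibility start: {credible_truth:.3f}")
```

With uniform weights the unreliable majority pulls the first estimate far from the true value, whereas the credibility-weighted start stays close to it, which is the situation the weight initialization step is meant to handle.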

Design goals

In this article, we intend to devise a TGTD mechanism that can provide a more accurate ground truth for group sensing activities. Specifically, our mechanism targets the following two design goals.

Accuracy: The proposed scheme should output a highly accurate estimation. Accuracy is measured by the deviation between the estimated results and the real truths.
Efficiency: The proposed scheme should impose significantly lower overhead on the participants. Efficiency is measured by the running time of the system, which reflects its scalability.
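The two design goals can be made concrete with a small measurement sketch. This section does not state which deviation metric is used, so the mean absolute error (MAE) and root mean square error (RMSE) below are assumptions, as are the helper names `mae`, `rmse`, and `timed`.

```python
import math
import time


def mae(estimates, truths):
    """Mean absolute deviation between estimated and real truths."""
    return sum(abs(e - t) for e, t in zip(estimates, truths)) / len(truths)


def rmse(estimates, truths):
    """Root mean square deviation between estimated and real truths."""
    return math.sqrt(
        sum((e - t) ** 2 for e, t in zip(estimates, truths)) / len(truths)
    )


def timed(fn, *args, **kwargs):
    """Run fn once and return (result, elapsed wall-clock seconds)."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start


# Hypothetical estimates for three tasks against their real truths.
estimated = [1.0, 2.0, 3.0]
real = [1.0, 2.0, 5.0]
error, seconds = timed(rmse, estimated, real)
print(f"MAE={mae(estimated, real):.4f} RMSE={error:.4f} time={seconds:.6f}s")
```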

Truth discovery

As the underlying TD algorithm, we adopt iCRH [16]. iCRH estimates the truth for sensor data in which the participants’ sensor readings arrive in a streaming manner. In each time slot t, the process has three steps: truth update, distance update, and weight update [23,32]. Algorithm 1 shows the iCRH truth discovery for the data stream in MCS.

Algorithm 1. Truth discovery [16].

Input: Sets of participants’ sensor readings x^t, decay rate α

Output: Estimated truth

1: Initialization:

2: The truth

3: while Not convergent do

4: Estimate truths with Eq 1

5: Estimate distance with Eq 2

6: Estimate weights with Eq 4

7: end while

8: return: Estimated truth

Truth update
This step estimates the truth of each sensing task at time slot t as a weighted average of the participants’ sensor readings, using the latest estimate of each participant’s weight from the previous time slot t–1.
(1)
Distance update
In this step, the incremental distance is updated based on the sensor reading, the estimated truth, and the previous distance.
(2)
Here, D(·) is a distance function that measures the deviation between a participant’s sensor reading for task S and the estimated truth for the same task. Note that the distance function is chosen based on the MCS application, the task requester, and the sensor reading type. In this framework, we focus on the continuous data type, for which the commonly adopted distance function is the squared distance [21,23].
(3)
To allow recent readings to play a more important role in TD, iCRH utilizes the decay rate α [16,21].
Weight update
In this step, each participant’s weight is updated incrementally based on the distance function from the previous time slot t–1. The basic idea is that the smaller the distance between a participant’s sensor readings and the current truth, the higher the weight the participant gets.
(4)
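The three steps above can be sketched for a single time slot as follows. This is a minimal illustration under stated assumptions, not a definitive implementation of iCRH: the truth is the weighted average described for Eq 1, the distance is the previous distance scaled by the decay rate α plus the squared deviation described for Eqs 2 and 3, and the weight update uses the log-ratio form commonly associated with CRH, which is an assumption because the text only states that a smaller distance yields a higher weight. The function name `icrh_step`, the default `alpha=0.5`, the smoothing constant `eps`, and the fallback to uniform weights are all hypothetical.

```python
import math


def icrh_step(readings, prev_weights, prev_distances, alpha=0.5, eps=1e-12):
    """One time slot of an iCRH-style update (illustrative sketch).

    readings[i][s] is participant i's reading for task s in this slot.
    Returns (truths, distances, weights) for the slot.
    """
    n_tasks = len(readings[0])
    weights = list(prev_weights)
    w_sum = sum(weights)
    if w_sum <= 0:  # assumed fallback: treat all participants equally
        weights = [1.0] * len(readings)
        w_sum = float(len(readings))

    # Truth update: weighted average using the previous slot's weights.
    truths = [
        sum(w * r[s] for w, r in zip(weights, readings)) / w_sum
        for s in range(n_tasks)
    ]

    # Distance update: decayed previous distance plus the squared deviation
    # between each reading and the estimated truth, summed over tasks.
    distances = [
        alpha * d + sum((r[s] - truths[s]) ** 2 for s in range(n_tasks))
        for d, r in zip(prev_distances, readings)
    ]

    # Weight update (assumed log-ratio form): smaller distance, larger weight.
    total = sum(distances) + eps
    new_weights = [math.log(total / (d + eps)) for d in distances]
    return truths, distances, new_weights


# Hypothetical slot: three participants, one task, the third is an outlier.
slot = [[20.0], [20.2], [25.0]]
truths, distances, weights = icrh_step(slot, [1.0, 1.0, 1.0], [0.0, 0.0, 0.0])
print(truths, weights)
```

In a stream, the returned distances and weights would be passed as `prev_distances` and `prev_weights` for the next slot, so an outlier's influence shrinks over time.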

Proposed framework

Overview

This section presents the Fog-assisted Group-based Truth Discovery framework and its underlying mechanisms in detail. The framework takes the sensor readings from participants in the area of interest (AoI), quantifies the participants’ credibility, and then initializes the weights with the participants’ credibility level to start the TD process. Moreover, the framework employs two layers of TD to calculate the truth of each group and then the estimated truth of the sensing task. Hence, the framework is divided into two main parts: weight initialization and the Two-layer Group-based Truth Discovery (TGTD) mechanism. The former initializes the TD weights with the participants’ credibility level. The latter contains the weight update and truth update processes within two layers. Finally, the sensing platform presents an accurate estimation for the monitored sensing task.

Therefore, the key point in designing this framework is how to calculate TD for a group-based activity over streaming data while preventing the misbehaving participants’ data from initializing the truth at the beginning of the TD process. An illustration of the framework is shown in Fig 1. The two steps, weight initialization and TGTD, are explained below.

Fig 1. Framework architecture.

https://doi.org/10.1371/journal.pone.0330656.g001

Weight initialization

In each time slot t, given the set of participants’ sensor readings, the fog node and the sensing platform calculate the participants’ credibility C_i to initialize the weights with the participants’ credibility level. Hence, we make sure that the TD process starts with reliable participants’ data and that reliable participants dominate from the start of the TD process.

Here, we adopt the idea from [49] to calculate the participants’ credibility, where the participants satisfy multiple factors that represent their readiness and capabilities to perform the sensing task, according to the following equation:

(5)

In this formula, these weight factors [], are determined by the requester (i.e. task publisher). They specify the importance of each parameter, such that k_i 0 .

Participant’s old credibility (C_i,old): calculated by the sensing platform according to the participant’s previous sensing task, initialized with 0 if the participant is new.

Participant’s readiness and capability (R_i): determined by two device-related parameters [50,51], which are computed by the platform when the participant registers in the MCS system, as follows:

(6)

Here, residual energy (E) measures the device’s battery level and is updated dynamically during the sensing task. Sensor availability (A) is the availability of the required sensors in the participant’s mobile device; we assume that the platform learns each device’s sensors during registration.

Ability ratio (Y_i): the ratio of the sensing tasks that the participant has successfully completed to the total tasks assigned to the participant [50,51]. The set of completed tasks is updated regularly and stored in the sensing platform. The ability ratio of participant i is given by:

(7)

Then, the credibility level computed in Eq 5 is normalized to the range [0,1] as follows:

(8)

The result is the normalized value of the participant’s credibility level.

Finally, the fog node and the sensing platform set the initial weights for the TD process to the participants’ normalized credibility levels:

(9)

Hence, from the beginning of the TD process, only reliable and credible participants contribute to the truth discovery, which supports an accurate estimation of the truth.
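The initialization steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' exact formulas: it assumes the credibility is a weighted sum of the three factors, that R_i is the product of residual energy and sensor availability, and that normalization is min-max. All function names and the numeric values are hypothetical.

```python
from typing import List, Sequence


def ability_ratio(completed: int, assigned: int) -> float:
    """Share of assigned tasks the participant completed (0 if none were assigned)."""
    return completed / assigned if assigned > 0 else 0.0


def readiness(energy: float, sensor_available: bool) -> float:
    """Assumed form of R_i: residual energy in [0, 1], zeroed if the sensor is missing."""
    return energy * (1.0 if sensor_available else 0.0)


def credibility(c_old: float, r: float, y: float, k: Sequence[float]) -> float:
    """Assumed weighted sum of old credibility, readiness, and ability ratio."""
    k1, k2, k3 = k
    return k1 * c_old + k2 * r + k3 * y


def normalize(values: Sequence[float]) -> List[float]:
    """Min-max normalization to [0, 1] (assumed); identical values all map to 1.0."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [1.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Three hypothetical participants: (C_old, E, A, completed tasks, assigned tasks)
participants = [
    (0.0, 0.9, True, 0, 0),    # new participant, old credibility initialized with 0
    (0.6, 0.5, True, 8, 10),
    (0.8, 0.7, False, 9, 10),  # required sensor not available
]
k = (0.4, 0.3, 0.3)  # requester-chosen importance factors (hypothetical values)

raw = [
    credibility(c_old, readiness(e, a), ability_ratio(done, total), k)
    for c_old, e, a, done, total in participants
]
initial_weights = normalize(raw)  # initial TD weights = normalized credibility
print(initial_weights)
```

With min-max normalization the least credible participant starts with a weight of zero, so a deployment that wants every participant to retain some influence would need a different normalization; that choice is outside what the text specifies.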

Two-layer Group-based Truth Discovery (TGTD) mechanism

In this mechanism, we consider both the calculation of the truth among group members and the estimated truth among groups. Each layer contains truth update, distance update, and weight update steps, which finally yield the truth X_S for task S. Fig 2 gives an overview of the mechanism as a stream of TD with two layers: the first layer finds the estimated truth of each group, and the second layer operates on these group-level estimates to find the final truth X_S for the sensing task.

Fig 2. Overview of TGTD.

https://doi.org/10.1371/journal.pone.0330656.g002

The first layer, truth discovery for each group.

This layer performs truth discovery over the group members to find the estimated truth of each group for the sensing task S. Specifically, given the members’ streaming sensor readings and the weight of each member, the first layer of TGTD calculates an estimated truth for each group for task S at time slot t, according to the following steps:

Truth update for each group
Participants submit to the FN their credibility level C_i, their weight from the previous time slot, and their sensor readings for task S at time slot t, all encrypted for protection. Hence, after the weights are initialized with the participants’ credibility levels, the truth is updated based on the participants’ current sensor readings and their weights from the previous time slot t–1. The truth of sensing task S for each group in each time slot t can therefore be estimated as:
(10)
Distance update
The next step calculates each participant’s accumulated distance, where the decay rate α is adopted so that the most recent sensor readings play a larger role in the weight update. Here, the distance function measures the distance between the participant’s reading and the current estimated truth of the group to which the participant belongs.
(11)
Weight update for each participant
The next important function in the TD process is the weight update. The weight of each participant is calculated from the accumulated distance in Eq 11 in each time slot t. Hence, participants whose readings are close to the estimated truth of their group are assigned a higher weight:

(12)
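The three first-layer steps can be sketched for a single group and a single time slot as below. This is an illustrative sketch, not the authors' implementation: it assumes a weighted-average truth update, a decayed squared distance, and a CRH-style negative-log weight update. The function name `first_layer_step`, the participant labels, and all numeric values are hypothetical.

```python
import math
from typing import Dict, Tuple


def first_layer_step(
    readings: Dict[str, float],
    prev_weights: Dict[str, float],
    prev_dist: Dict[str, float],
    alpha: float,
) -> Tuple[float, Dict[str, float], Dict[str, float]]:
    """Run one time slot of member-level truth discovery for a single group."""
    # Truth update: weighted average of the readings, using weights from slot t-1.
    total_weight = sum(prev_weights[p] for p in readings)
    group_truth = sum(prev_weights[p] * readings[p] for p in readings) / total_weight

    # Distance update: decay the accumulated distance, then add the squared
    # distance between the new reading and the group's estimated truth.
    eps = 1e-12  # keeps every distance positive so the logarithm below is defined
    dist = {
        p: alpha * prev_dist.get(p, 0.0) + (readings[p] - group_truth) ** 2 + eps
        for p in readings
    }

    # Weight update (assumed CRH-style form): a smaller share of the total
    # accumulated distance yields a larger weight.
    total_dist = sum(dist.values())
    weights = {p: -math.log(dist[p] / total_dist) for p in readings}
    return group_truth, dist, weights


# Hypothetical group of three members; the initial weights stand in for
# normalized credibility levels.
readings = {"p1": 20.1, "p2": 19.9, "p3": 25.0}
init_weights = {"p1": 1.0, "p2": 0.9, "p3": 0.2}
truth, dist, weights = first_layer_step(readings, init_weights, {}, alpha=0.5)
print(round(truth, 3), {p: round(w, 3) for p, w in weights.items()})
```

In this example the low-credibility outlier `p3` pulls the group truth only slightly and ends the slot with the smallest weight, which is the behavior the weight update is meant to produce.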

The second layer, the truth estimation among groups.

The second layer of TGTD calculates an estimated truth for the task based on the groups’ estimated truths from the first layer. Specifically, given the groups’ estimated truths and the weight of each group, TGTD sets the estimated truth for task S at time slot t according to the following steps:

Truth update among groups
The truth update is performed across the groups. In this step, the estimate of the truth of task S is based on the latest estimate of the groups’ weights and the groups’ truth estimates calculated in the first layer.
(13)
Distance update
Similar to the first layer, the groups’ distance update is based on the squared distance function. We calculate the distance between each group’s estimated truth, computed in the first layer, and the estimated truth for task S. We also adopt the decay rate so that the most recent group truth estimates play a larger role in the final estimated truth.
(14)
Weight update for each group
For each group that participates in the sensing task, the weight is updated based on the distance between the group’s estimated truth, calculated in the first layer, and the estimated truth of the task. A group receives more weight as its estimated truth gets closer to the task’s estimated truth, which means its sensor readings are closer to the true value.

(15)
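The second layer can be sketched the same way, this time streaming over two time slots so that the effect of the weight update on the task-level truth is visible. As before, the update forms (weighted average, decayed squared distance, negative-log weight) are assumptions, and the equal initial group weights, the name `second_layer_step`, and the group truths are hypothetical.

```python
import math
from typing import Dict, List, Tuple


def second_layer_step(
    group_truths: Dict[str, float],
    group_weights: Dict[str, float],
    prev_dist: Dict[str, float],
    alpha: float,
) -> Tuple[float, Dict[str, float], Dict[str, float]]:
    """Run one time slot of group-level truth discovery for a single task."""
    # Truth update among groups: weighted average of the first-layer group truths.
    total_weight = sum(group_weights[g] for g in group_truths)
    task_truth = sum(group_weights[g] * group_truths[g] for g in group_truths) / total_weight

    # Distance update: decayed accumulated squared distance to the task truth.
    eps = 1e-12  # keeps every distance positive so the logarithm is defined
    dist = {
        g: alpha * prev_dist.get(g, 0.0) + (group_truths[g] - task_truth) ** 2 + eps
        for g in group_truths
    }

    # Weight update for each group (assumed CRH-style form).
    total_dist = sum(dist.values())
    weights = {g: -math.log(dist[g] / total_dist) for g in group_truths}
    return task_truth, dist, weights


# Hypothetical first-layer outputs, held fixed over two slots for illustration.
group_truths = {"g1": 20.0, "g2": 20.4, "g3": 26.0}
weights = {g: 1.0 for g in group_truths}  # equal starting weights (assumption)
dist: Dict[str, float] = {}
history: List[float] = []
for _ in range(2):
    task_truth, dist, weights = second_layer_step(group_truths, weights, dist, alpha=0.5)
    history.append(task_truth)
print([round(x, 2) for x in history])
```

In the first slot the task truth is the plain mean of the three group truths. In the second slot the outlying group `g3` has lost weight, so the estimate moves toward the two groups that agree.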

Algorithm 2. Two-layer Group-based Truth Discovery (TGTD).

Input: participants’ sensor readings, participants’ weights from the previous time slot, participants’ credibility levels, decay rate α.

Output: Estimated truth X_S for task S

1: Initialization:

2: //initialize weights with the normalized credibility level of participants (Eq 9)

3: End of Initialization

4: while each time slot t do

5: while there is still a group do

6: for each member of the group do //First layer, calculates the truth for each group

7: Estimate truths with Eq 10

8: Estimate distance with Eq 11

9: Estimate weights with Eq 12

10: end for

11: Estimate truths with Eq 13 //Second layer, calculates the estimated truth

12: Estimate distance with Eq 14

13: Estimate weights with Eq 15

14: end while

15: end while

16: return X_S

The TGTD mechanism is outlined in Algorithm 2. TGTD has four inputs: the participants’ sensor readings, the participants’ weights from the previous time slot, the participants’ credibility levels, and the decay rate. TGTD first initializes the weights with the participants’ credibility levels. In each time slot, and for each group, TGTD calculates the group’s estimated truth as the first layer of the TGTD process, based on Eqs 10, 11, and 12 (lines 7–9). After that, as the second layer, TGTD calculates the final truth based on Eqs 13, 14, and 15 (lines 11–16).

The while loop that goes through all G groups (line 5) takes O(G), and the for-loop that goes through all i members of a group (line 6) takes O(i). Therefore, the running time of TGTD in each time slot is bounded by O(Gi), which is linear in the number of participants when the groups are of equal size. Hence, the TGTD algorithm is computationally efficient.

However, the analysis of fog architecture simulation and the organization of streaming processes were beyond the scope of this study, as they do not constitute its primary contributions. These topics are identified as valuable directions for future research and in-depth investigation.

Performance evaluation

In this section, the TGTD algorithm over data streams is evaluated through experiments and simulations on both real-world and synthetic datasets. We compare our scheme with two approaches: the baseline framework for incremental truth discovery on streaming data (iCRH) [16], Algorithm (2), and the centralized streaming CATD (Cen.CATD) [41], Algorithm (1). iCRH has shown state-of-the-art efficiency in our target real-time MCS scenario. All three approaches use iterative methods to infer truths from streaming data. Like iCRH and Cen.CATD, our approach scans the readings once per time slot, which reduces the number of computational steps. However, unlike iCRH and Cen.CATD, which are designed for individual-participant scenarios, our approach is tailored to group-based mobile crowdsensing scenarios.

Dataset

To demonstrate the effectiveness and efficiency of the proposed TGTD algorithm, we use three datasets: two real-world datasets and one synthetic dataset.

Weather forecast dataset [33]: This dataset contains 18 heterogeneous sources that record daily weather information for 30 cities in the United States, every 45 minutes on a day in Mar 2010 from Jan. 28, 2010 to Feb. 4, 2010. We use the high and low daily temperature properties in the experiments as they are continuous data. Furthermore, we consider the data collected from Accuweather.com as the ground truth.
Stock dataset [33]: This dataset contains trading data for 1000 stock symbols collected from 55 sources over 21 working days in July 2011. The volume, shares outstanding, and market cap properties are used in the experiments as they are continuous data.
The ground truths are given. Because the ground truths are known for both the Weather forecast dataset and the Stock dataset, the weight of a source is quantified by measuring the distance between its readings and the ground truths.
Synthetic dataset: We generate the synthetic dataset by simulating 50 sources and sampling random numbers as the ground truth. We then add different levels of Gaussian noise to simulate sensor readings. Furthermore, we divide these sources into 5 groups.
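A dataset of the kind described for the synthetic case can be generated along the following lines. The source and group counts come from the text; the number of time slots, the ground-truth value range, the candidate noise levels, and the fixed seed are assumptions made only so that the sketch runs.

```python
import random

random.seed(0)  # fixed seed so the sketch is reproducible

NUM_SOURCES = 50  # from the text
NUM_GROUPS = 5    # from the text
NUM_SLOTS = 20    # assumption: the number of time slots is not stated here

# Random ground truth per time slot (the value range is an assumption).
ground_truth = [random.uniform(0.0, 100.0) for _ in range(NUM_SLOTS)]

# Each source gets its own noise level; the candidate levels are assumptions.
noise_std = [random.choice([0.5, 1.0, 2.0, 5.0]) for _ in range(NUM_SOURCES)]

# readings[t][s] is the ground truth at slot t plus zero-mean Gaussian noise
# drawn with the standard deviation of source s.
readings = [
    [truth + random.gauss(0.0, noise_std[s]) for s in range(NUM_SOURCES)]
    for truth in ground_truth
]

# Equal-sized groups: sources 0-9 form group 0, sources 10-19 form group 1, etc.
group_size = NUM_SOURCES // NUM_GROUPS
group_of = [s // group_size for s in range(NUM_SOURCES)]

print(len(readings), len(readings[0]), sorted(set(group_of)))
```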

Performance metrics

To evaluate the framework comprehensively, the following metrics are used:

Accuracy
Accuracy evaluates the deviation between the estimated results and the real truths. We measure it with the standard root mean squared error (RMSE) [52] of the estimated truth against the ground truth, according to:
(16)
The lower the RMSE, the better the performance of the scheme. This metric shows whether our approach obtains a more accurate estimate than the other approaches: the smaller the deviation between the estimated results and the real truths, the better the approach scores.
Efficiency
Efficiency is measured by the running time of the system, which shows the scalability and efficiency of the framework; the lower, the better. This metric shows the computing overhead of our approach and thus indicates its practicality.
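The accuracy metric can be computed as in the short sketch below, which uses the standard definition of RMSE (the square root of the mean of the squared errors). The function name and the sample values are hypothetical.

```python
import math
from typing import Sequence


def rmse(estimated: Sequence[float], ground_truth: Sequence[float]) -> float:
    """Root mean squared error between estimated truths and ground truths."""
    if len(estimated) != len(ground_truth) or len(estimated) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(e - g) ** 2 for e, g in zip(estimated, ground_truth)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical estimates for three sensing tasks against their ground truths.
score = rmse([20.5, 19.0, 22.0], [20.0, 20.0, 21.0])
print(score)
```

Here the errors are 0.5, -1.0, and 1.0, so the mean squared error is 0.75 and the RMSE is its square root.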

Finally, we evaluate the effectiveness and efficiency of TGTD by comparing the results of the three algorithms: TGTD, iCRH, and Cen.CATD.

Simulation setup

In the simulation, to verify the effectiveness and efficiency of the TGTD algorithm, we experiment on two real-world datasets and one synthetic dataset and present the results as follows. All mechanisms are implemented in MATLAB R2022b, and the experiments are conducted on a PC equipped with an Intel(R) Core(TM) i7-8565U CPU and 16.0 GB of RAM, running Windows 10 (64-bit). To evaluate the performance of TGTD comprehensively, we vary two parameters, the timestamp and the sensing task percentage, and observe the accuracy and efficiency of TGTD under different parameter settings. The participants’ credibility is generated randomly following a normal distribution in the range [0,1] in different time slots. For simplicity, the population of participants is divided equally into groups. The simulation is conducted on each dataset, and the average is taken over 10 runs.

Evaluation

Accuracy

First, we evaluate the accuracy of the final estimated truth by varying the timestamp across the three datasets. We use RMSE to measure the deviation between the estimated results and the actual truth. Fig 3 presents the estimation error of TGTD compared to iCRH as a baseline and the centralized streaming CATD (Cen.CATD). As shown in Fig 3a, despite some fluctuations in accuracy, TGTD outperforms both the baseline algorithm and Cen.CATD on the pedestrian dataset. Similarly, Fig 3b and 3c demonstrate that TGTD achieves higher accuracy than both algorithms on the weather and stock datasets. Although TGTD and iCRH show similar accuracy on the stock dataset, TGTD still has a slightly lower estimation error than iCRH. These results indicate that the two-layer TD approach of TGTD enhances the accuracy of the truth discovery process in group-based scenarios. Additionally, TGTD effectively identifies the accurate truth among groups, leading to higher-quality sensing tasks.

Fig 3. Accuracy evaluation varying timestamp.

https://doi.org/10.1371/journal.pone.0330656.g003

Fig 4 presents the comparison of RMSE among TGTD, iCRH, and streaming Cen.CATD with varying percentages of sensing tasks. As shown in Fig 4a, TGTD exhibits a lower estimation error than the other two algorithms as the number of tasks increases on the pedestrian dataset. Similarly, Fig 4b and 4c demonstrate that TGTD maintains high accuracy on both the Stock and Weather real-world datasets, outperforming iCRH and streaming Cen.CATD. These results indicate that TGTD remains feasible as the number of sensing tasks increases. TGTD calculates each participant’s credibility from the beginning of the truth discovery process, ensuring that only reliable and credible participants contribute to achieving an accurate estimation of the truth. Additionally, the implementation of two layers of truth discovery enhances the accuracy of each sensing task.

Fig 4. Accuracy evaluation varying task percentage.

https://doi.org/10.1371/journal.pone.0330656.g004

In summary, TGTD consistently demonstrates high accuracy across both pedestrian and real-world datasets when varying timestamps and task percentages, compared to the baseline algorithms iCRH and streaming Cen.CATD. These findings highlight the significance of the two layers of truth discovery and the positive impact of initiating the process with credible participants on the overall accuracy of the truth.

Efficiency.

In this section, we evaluate the computation time of truth discovery to demonstrate the efficiency of TGTD. The experimental results in Fig 5 show the increase in computation time across the pedestrian and real-world datasets with varying timestamps. Although TGTD’s running time is slightly higher on the pedestrian dataset, it performs nearly as well as the baseline iCRH algorithm, as depicted in Fig 5a. However, it significantly outperforms the streaming Cen.CATD. On the other hand, Fig 5b and 5c clearly show that TGTD has a lower computation time on the Weather and Stock datasets.

Fig 5. Efficiency evaluation varying timestamp.

https://doi.org/10.1371/journal.pone.0330656.g005

Similarly, Fig 6 illustrates the running time when varying the sensing task percentage. As expected, TGTD’s running time is lower than that of streaming Cen.CATD and approaches the running time of the baseline iCRH across all datasets. The slight increase in TGTD’s running time is due to the additional computation required to calculate the credibility level of participants as the task percentage increases.

Fig 6. Efficiency evaluation varying task percentage.

https://doi.org/10.1371/journal.pone.0330656.g006

In summary, TGTD takes slightly more time than the baseline iCRH in all cases, but its running time remains reasonable for real-world scenarios. It consistently outperforms the streaming Cen.CATD across all datasets, as shown in Fig 6a, 6b and 6c. One possible explanation is that TGTD performs two layers of truth discovery starting with highly credible participants, unlike streaming Cen.CATD, which is more affected by outliers. The additional running time relative to iCRH is a trade-off for achieving accurate truth discovery for both the groups and the individual group members after calculating participant credibility.

In conclusion, TGTD is both effective and efficient in comparison to other streaming approaches.

Conclusion

In this article, we address the problem of group-based TD over MCS data streams and the problem of misbehaving participants dominating at the start of the truth discovery process. We propose a Fog-assisted Group-based Truth Discovery Framework over MCS Data Streams. In particular, we developed a novel TD method for crowdsensing data streams in which the weights for the weight update process are initialized with the participants’ credibility levels, where the credibility function is based on the participants’ readiness and commitment and on their devices’ ability to perform the sensing tasks. In addition, we developed a novel TGTD mechanism in which the first layer estimates the truth of each group and the second layer estimates the aggregated truth for the sensing task. Finally, we experimentally evaluated the effectiveness and efficiency of the TGTD algorithm on the pedestrian dataset and two real-world datasets, comparing it against the baseline iCRH algorithm and the centralized CATD. The results demonstrate that TGTD outperforms both algorithms in terms of accuracy. Although TGTD requires slightly more computation time than the baseline iCRH, it significantly outperforms Cen.CATD. This additional computation time is a trade-off for TGTD’s ability to provide accurate truth discovery for group-based activities.

In the future, our aim is to address the organization of the streaming process in the context of fog architecture simulation. In addition, we would like to develop an incentive mechanism for group-based MCS systems. Furthermore, we plan to study the mobility of the participants in the participant recruitment system in MCS.

References

1. Ren H, Li H, Dai Y, Yang K, Lin X. Querying in Internet of Things with privacy preserving: challenges, solutions and opportunities. IEEE Network. 2018;32(6):144–51.
2. Zhang S, Li H, Dai Y, Li J, He M, Lu R. Verifiable outsourcing computation for matrix multiplication with improved efficiency and applicability. IEEE Internet Things J. 2018;5(6):5076–88.
3. Xu G, Li H, Tan C, Liu D, Dai Y, Yang K. Achieving efficient and privacy-preserving truth discovery in crowd sensing systems. Computers & Security. 2017;69:114–26.
4. Ghaffaripour S, Miri A. A Decentralized, Privacy-preserving and Crowdsourcing-based Approach to Medical Research. In: 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC). IEEE; 2020. p. 4510–5. https://doi.org/10.1109/smc42975.2020.9283027
5. Kim M, Gupta BB, Rho S. Crowdsourcing based scientific issue tracking with topic analysis. Applied Soft Computing. 2018;66:506–11.
6. Luo T, Huang J, Kanhere SS, Zhang J, Das SK. Improving IoT data quality in mobile crowd sensing: a cross validation approach. IEEE Internet of Things Journal. 2019;6(3):5651–64.
7. Restuccia F, Ferraro P, Sanders TS, Silvestri S, Das SK, Re GL. FIRST: A framework for optimizing information quality in mobile crowdsensing systems. ACM Transactions on Sensor Networks. 2018;15(1):1–35.
8. Yang S, Wu F, Tang S, Gao X, Yang B, Chen G. On designing data quality-aware truth estimation and surplus sharing method for mobile crowdsensing. IEEE J Select Areas Commun. 2017;35(4):832–47.
9. He Z, Cao J, Liu X. High quality participant recruitment in vehicle-based crowdsourcing using predictable mobility. In: 2015 IEEE Conference on Computer Communications (INFOCOM). IEEE; 2015. p. 2542–50. https://doi.org/10.1109/infocom.2015.7218644
10. Huang C, Wang D, Chawla NV. Scalable uncertainty-aware truth discovery in big data social sensing applications for cyber-physical systems. IEEE Transactions on Big Data. 2017;6(4):702–13.
11. Wan M, Chen X, Kaplan L, Han J, Gao J, Zhao B. From truth discovery to trustworthy opinion discovery: An uncertainty-aware quantitative modeling approach. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016. p. 1885–94.
12. Ouyang RW, Srivastava M, Toniolo A, Norman TJ. Truth discovery in crowdsourced detection of spatial events. In: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. 2014. p. 461–70. https://doi.org/10.1145/2661829.2662003
13. Xiong J, Ma R, Chen L, Tian Y, Li Q, Liu X, et al. A personalized privacy protection framework for mobile crowdsensing in IIoT. IEEE Trans Ind Inf. 2020;16(6):4231–41.
14. Cai C, Zheng Y, Wang C. Leveraging crowdsensed data streams to discover and sell knowledge: a secure and efficient realization. In: 2018 IEEE 38th International Conference on Distributed Computing Systems (ICDCS). IEEE; 2018. p. 589–99. https://doi.org/10.1109/icdcs.2018.00064
15. Zheng Y, Duan H, Tang X, Wang C, Zhou J. Denoising in the dark: privacy-preserving deep neural network-based image denoising. IEEE Trans Dependable and Secure Comput. 2021;18(3):1261–75.
16. Li Y, Li Q, Gao J, Su L, Zhao B, Fan W, et al. Conflicts to harmony: a framework for resolving conflicts in heterogeneous data by truth discovery. IEEE Trans Knowl Data Eng. 2016;28(8):1986–99.
17. Li Q, Li Y, Gao J, Zhao B, Fan W, Han J. Resolving conflicts in heterogeneous data by truth discovery and source reliability estimation. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data. 2014. p. 1187–98. https://doi.org/10.1145/2588555.2610509
18. Wan T, Yue S, Liao W. PAID: privacy-preserving incentive mechanism based on truth discovery for mobile crowdsensing. In: International Conference on Mobile Multimedia Communications. Springer; 2021. p. 264–77.
19. Zhang P, Cheng X, Su S, Wang N. Effective truth discovery under local differential privacy by leveraging noise-aware probabilistic estimation and fusion. Knowl-Based Syst. 2023;261:110213.
20. Zhou T, Cai Z, Su J. Discovering truth in mobile crowdsensing with differential location privacy. In: GLOBECOM 2022 -2022 IEEE Global Communications Conference. IEEE; 2022. p. 903–8.
21. Zhang C, Zhu L, Xu C, Liu X, Sharif K. Reliable and privacy-preserving truth discovery for mobile crowdsensing systems. IEEE Trans Dependable and Secure Comput. 2019:1.
22. Chen J, Liu Y, Xiang Y, Sood K. RPPTD: robust privacy-preserving truth discovery scheme. IEEE Systems Journal. 2021;16(3):4525–31.
23. Liu Y, Tang S, Wu H-T, Zhang X. RTPT: a framework for real-time privacy-preserving truth discovery on crowdsensed data streams. Computer Networks. 2019;148:349–60.
24. Zhao B, Tang S, Liu X, Zhang X, Chen WN. IronM: privacy-preserving reliability estimation of heterogeneous data for mobile crowdsensing. IEEE Internet of Things Journal. 2020;7(6):5159–70.
25. Xu J, Yang S, Lu W, Xu L, Yang D. Incentivizing for truth discovery in edge-assisted large-scale mobile crowdsensing. Sensors (Basel). 2020;20(3):805. pmid:32024221
26. Li Y, Xiao H, Qin Z, Miao C, Su L, Gao J, et al. Towards differentially private truth discovery for crowd sensing systems. In: 2020 IEEE 40th International Conference on Distributed Computing Systems (ICDCS). 2020. p. 1156–66. https://doi.org/10.1109/icdcs47774.2020.00037
27. Zhao B, Liu X, Chen W-N, Liang W, Zhang X, Deng RH. PRICE: privacy and reliability-aware real-time incentive system for crowdsensing. IEEE Internet Things J. 2021;8(24):17584–95.
28. Xue K, Zhu B, Yang Q, Gai N, Wei DS, Yu N. InPPTD: a lightweight incentive-based privacy-preserving truth discovery for crowdsensing systems. IEEE Internet of Things Journal. 2020;8(6):4305–16.
29. Xu G, Li H, Liu S, Wen M, Lu R. Efficient and privacy-preserving truth discovery in mobile crowd sensing systems. IEEE Transactions on Vehicular Technology. 2019;68(4):3854–65.
30. Tang J, Fu S, Liu X, Luo Y, Xu M. Achieving privacy-preserving and lightweight truth discovery in mobile crowdsensing. IEEE Transactions on Knowledge and Data Engineering. 2021;34(11):5140–53.
31. Miao C, Jiang W, Su L, Li Y, Guo S, Qin Z, et al. Privacy-preserving truth discovery in crowd sensing systems. ACM Trans Sen Netw. 2019;15(1):1–32.
32. Yuan S, Zhu B, Liu F, Li J, Xue K. A fog-aided privacy-preserving truth discovery framework over crowdsensed data streams. In: 2021 IEEE Global Communications Conference (GLOBECOM). IEEE; 2021. p. 1–6. https://doi.org/10.1109/globecom46510.2021.9685817
33. Li X, Dong XL, Lyons K, Meng W, Srivastava D. Truth finding on the deep web: is the problem solved? arXiv preprint 2015.
34. Wang D, Ren J, Wang Z, Pang X, Zhang Y, Shen X. Privacy-preserving streaming truth discovery in crowdsourcing with differential privacy. IEEE Transactions on Mobile Computing. 2021;21(10):3757–72.
35. Lane ND. Community-aware smartphone sensing systems. IEEE Internet Comput. 2012;16(3):60–4.
36. Sayankar V. Effect of group behavior and group dynamics in work culture of organization. Int J Marketing Financ Services Manag Res. 2015;3(10):69–75.
37. Mason WA, Conrey FR, Smith ER. Situating social influence processes: dynamic, multidirectional flows of influence within social networks. Pers Soc Psychol Rev. 2007;11(3):279–300. pmid:18453465
38. Li C-T, Shan M-K. Composing activity groups in social networks. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management. 2012. p. 2375–8. https://doi.org/10.1145/2396761.2398644
39. Liu YN, Wang YP, Wang XF, Xia Z, Xu J. Privacy-preserving data collection for mobile phone sensing tasks. In: Information Security Practice and Experience: 14th International Conference, ISPEC 2018, Tokyo, Japan, September 25–27, 2018, Proceedings 14. Springer; 2018. p. 506–18.
40. Zhang Z, He S, Chen J, Zhang J. REAP: an efficient incentive mechanism for reconciling aggregation accuracy and individual privacy in crowdsensing. IEEE Trans Inform Forensic Secur. 2018;13(12):2995–3007.
41. Mukkamala PS, Wu H, Düdder B. Reliable and streaming truth discovery in blockchain-based crowdsourcing. In: 2023 20th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON). IEEE; 2023. p. 492–500. https://doi.org/10.1109/secon58729.2023.10287465
42. Li Q, Li Y, Gao J, Su L, Zhao B, Demirbas M. A confidence-aware approach for truth discovery on long-tail data. Proceedings of the VLDB Endowment. 2014;8(4):425–36.
43. Gao J, Fu S, Luo Y, Xie T. Location privacy-preserving truth discovery in mobile crowd sensing. In: 2020 29th International Conference on Computer Communications and Networks (ICCCN). IEEE; 2020. p. 1–9.
44. Xiong J, Liu H, Jin B, Li Q, Yao Z. A lightweight privacy protection scheme based on user preference in mobile crowdsensing. Transactions on Emerging Telecommunications Technologies. 2021;32(5):e4000.
45. Xu Z, Yang W, Xiong Z, Wang J, Liu G. TPSense: a framework for event-reports trustworthiness evaluation in privacy-preserving vehicular crowdsensing systems. Journal of Signal Processing Systems. 2021;93(2–3):209–19.
46. Wei J, Wang X, Li N, Yang G, Mu Y. A privacy-preserving fog computing framework for vehicular crowdsensing networks. IEEE Access. 2018;6:43776–84.
47. Ni J, Zhang K, Yu Y, Lin X, Shen XS. Providing task allocation and secure deduplication for mobile crowdsensing via fog computing. IEEE Trans Dependable and Secure Comput. 2020;17(3):581–94.
48. Yu Y, Li F, Liu S, Huang J, Guo L. Reliable fog-based crowdsourcing: a temporal–spatial task allocation approach. IEEE Internet Things J. 2020;7(5):3968–76.
49. Alamri BHS, Monowar MM, Alshehri S. Privacy-preserving trust-aware group-based framework in mobile crowdsensing. IEEE Access. 2022;10:134770–84.
50. Azzam R, Mizouni R, Otrok H, Ouali A, Singh S. GRS: a group-based recruitment system for mobile crowd sensing. Journal of Network and Computer Applications. 2016;72:38–50.
51. Alagha A, Mizouni R, Singh S, Otrok H, Ouali A. SDRS: a stable data-based recruitment system in IoT crowdsensing for localization tasks. Journal of Network and Computer Applications. 2021;177:102968.
52. Chai T, Draxler RR. Root mean square error (RMSE) or mean absolute error (MAE)?–Arguments against avoiding RMSE in the literature. Geoscientific Model Development. 2014;7(3):1247–50.

[ref1] 1. Ren H, Li H, Dai Y, Yang K, Lin X. Querying in Internet of Things with privacy preserving: challenges, solutions and opportunities. IEEE Network. 2018;32(6):144–51.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Zhang S, Li H, Dai Y, Li J, He M, Lu R. Verifiable outsourcing computation for matrix multiplication with improved efficiency and applicability. IEEE Internet Things J. 2018;5(6):5076–88.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Xu G, Li H, Tan C, Liu D, Dai Y, Yang K. Achieving efficient and privacy-preserving truth discovery in crowd sensing systems. Computers & Security. 2017;69:114–26.
View Article
Google Scholar

[8] View Article

[9] Google Scholar

[ref4] 4. Ghaffaripour S, Miri A. A Decentralized, Privacy-preserving and Crowdsourcing-based Approach to Medical Research. In: 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC). IEEE; 2020. p. 4510–5. https://doi.org/10.1109/smc42975.2020.9283027

[ref5] 5. Kim M, Gupta BB, Rho S. Crowdsourcing based scientific issue tracking with topic analysis. Applied Soft Computing. 2018;66:506–11.
View Article
Google Scholar

[12] View Article

[13] Google Scholar

[ref6] 6. Luo T, Huang J, Kanhere SS, Zhang J, Das SK. Improving IoT data quality in mobile crowd sensing: a cross validation approach. IEEE Internet of Things Journal. 2019;6(3):5651–64.
View Article
Google Scholar

[15] View Article

[16] Google Scholar

[ref7] 7. Restuccia F, Ferraro P, Sanders TS, Silvestri S, Das SK, Re GL. FIRST: A framework for optimizing information quality in mobile crowdsensing systems. ACM Transactions on Sensor Networks. 2018;15(1):1–35.
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref8] 8. Yang S, Wu F, Tang S, Gao X, Yang B, Chen G. On designing data quality-aware truth estimation and surplus sharing method for mobile crowdsensing. IEEE J Select Areas Commun. 2017;35(4):832–47.
View Article
Google Scholar

[21] View Article

[22] Google Scholar

[ref9] 9. He Z, Cao J, Liu X. High quality participant recruitment in vehicle-based crowdsourcing using predictable mobility. In: 2015 IEEE Conference on Computer Communications (INFOCOM). IEEE; 2015. p. 2542–50. https://doi.org/10.1109/infocom.2015.7218644

[ref10] 10. Huang C, Wang D, Chawla NV. Scalable uncertainty-aware truth discovery in big data social sensing applications for cyber-physical systems. IEEE Transactions on Big Data. 2017;6(4):702–13.
View Article
Google Scholar

[25] View Article

[26] Google Scholar

[ref11] 11. Wan M, Chen X, Kaplan L, Han J, Gao J, Zhao B. From truth discovery to trustworthy opinion discovery: An uncertainty-aware quantitative modeling approach. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016. p. 1885–94.

[ref12] 12. Ouyang RW, Srivastava M, Toniolo A, Norman TJ. Truth discovery in crowdsourced detection of spatial events. In: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. 2014. p. 461–70. https://doi.org/10.1145/2661829.2662003

[ref13] 13. Xiong J, Ma R, Chen L, Tian Y, Li Q, Liu X, et al. A personalized privacy protection framework for mobile crowdsensing in IIoT. IEEE Trans Ind Inf. 2020;16(6):4231–41.

[ref14] 14. Cai C, Zheng Y, Wang C. Leveraging crowdsensed data streams to discover and sell knowledge: a secure and efficient realization. In: 2018 IEEE 38th International Conference on Distributed Computing Systems (ICDCS). IEEE; 2018. p. 589–99. https://doi.org/10.1109/icdcs.2018.00064

[ref15] 15. Zheng Y, Duan H, Tang X, Wang C, Zhou J. Denoising in the dark: privacy-preserving deep neural network-based image denoising. IEEE Trans Dependable and Secure Comput. 2021;18(3):1261–75.

[ref16] 16. Li Y, Li Q, Gao J, Su L, Zhao B, Fan W, et al. Conflicts to harmony: a framework for resolving conflicts in heterogeneous data by truth discovery. IEEE Trans Knowl Data Eng. 2016;28(8):1986–99.

[ref17] 17. Li Q, Li Y, Gao J, Zhao B, Fan W, Han J. Resolving conflicts in heterogeneous data by truth discovery and source reliability estimation. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data. 2014. p. 1187–98. https://doi.org/10.1145/2588555.2610509

[ref18] 18. Wan T, Yue S, Liao W. PAID: privacy-preserving incentive mechanism based on truth discovery for mobile crowdsensing. In: International Conference on Mobile Multimedia Communications. Springer; 2021. p. 264–77.

[ref19] 19. Zhang P, Cheng X, Su S, Wang N. Effective truth discovery under local differential privacy by leveraging noise-aware probabilistic estimation and fusion. Knowl-Based Syst. 2023;261:110213.

[ref20] 20. Zhou T, Cai Z, Su J. Discovering truth in mobile crowdsensing with differential location privacy. In: GLOBECOM 2022 - 2022 IEEE Global Communications Conference. IEEE; 2022. p. 903–8.

[ref21] 21. Zhang C, Zhu L, Xu C, Liu X, Sharif K. Reliable and privacy-preserving truth discovery for mobile crowdsensing systems. IEEE Trans Dependable and Secure Comput. 2019:1.

[ref22] 22. Chen J, Liu Y, Xiang Y, Sood K. RPPTD: robust privacy-preserving truth discovery scheme. IEEE Systems Journal. 2021;16(3):4525–31.

[ref23] 23. Liu Y, Tang S, Wu H-T, Zhang X. RTPT: a framework for real-time privacy-preserving truth discovery on crowdsensed data streams. Computer Networks. 2019;148:349–60.

[ref24] 24. Zhao B, Tang S, Liu X, Zhang X, Chen WN. IronM: privacy-preserving reliability estimation of heterogeneous data for mobile crowdsensing. IEEE Internet of Things Journal. 2020;7(6):5159–70.

[ref25] 25. Xu J, Yang S, Lu W, Xu L, Yang D. Incentivizing for truth discovery in edge-assisted large-scale mobile crowdsensing. Sensors (Basel). 2020;20(3):805. pmid:32024221

[ref26] 26. Li Y, Xiao H, Qin Z, Miao C, Su L, Gao J, et al. Towards differentially private truth discovery for crowd sensing systems. In: 2020 IEEE 40th International Conference on Distributed Computing Systems (ICDCS). 2020. p. 1156–66. https://doi.org/10.1109/icdcs47774.2020.00037

[ref27] 27. Zhao B, Liu X, Chen W-N, Liang W, Zhang X, Deng RH. PRICE: privacy and reliability-aware real-time incentive system for crowdsensing. IEEE Internet Things J. 2021;8(24):17584–95.

[ref28] 28. Xue K, Zhu B, Yang Q, Gai N, Wei DS, Yu N. InPPTD: a lightweight incentive-based privacy-preserving truth discovery for crowdsensing systems. IEEE Internet of Things Journal. 2020;8(6):4305–16.

[ref29] 29. Xu G, Li H, Liu S, Wen M, Lu R. Efficient and privacy-preserving truth discovery in mobile crowd sensing systems. IEEE Transactions on Vehicular Technology. 2019;68(4):3854–65.

[ref30] 30. Tang J, Fu S, Liu X, Luo Y, Xu M. Achieving privacy-preserving and lightweight truth discovery in mobile crowdsensing. IEEE Transactions on Knowledge and Data Engineering. 2021;34(11):5140–53.

[ref31] 31. Miao C, Jiang W, Su L, Li Y, Guo S, Qin Z, et al. Privacy-preserving truth discovery in crowd sensing systems. ACM Trans Sen Netw. 2019;15(1):1–32.

[ref32] 32. Yuan S, Zhu B, Liu F, Li J, Xue K. A fog-aided privacy-preserving truth discovery framework over crowdsensed data streams. In: 2021 IEEE Global Communications Conference (GLOBECOM). IEEE; 2021. p. 1–6. https://doi.org/10.1109/globecom46510.2021.9685817

[ref33] 33. Li X, Dong XL, Lyons K, Meng W, Srivastava D. Truth finding on the deep web: is the problem solved? arXiv preprint 2015.

[ref34] 34. Wang D, Ren J, Wang Z, Pang X, Zhang Y, Shen X. Privacy-preserving streaming truth discovery in crowdsourcing with differential privacy. IEEE Transactions on Mobile Computing. 2021;21(10):3757–72.

[ref35] 35. Lane ND. Community-aware smartphone sensing systems. IEEE Internet Comput. 2012;16(3):60–4.

[ref36] 36. Sayankar V. Effect of group behavior and group dynamics in work culture of organization. Int J Marketing Financ Services Manag Res. 2015;3(10):69–75.

[ref37] 37. Mason WA, Conrey FR, Smith ER. Situating social influence processes: dynamic, multidirectional flows of influence within social networks. Pers Soc Psychol Rev. 2007;11(3):279–300. pmid:18453465

[ref38] 38. Li C-T, Shan M-K. Composing activity groups in social networks. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management. 2012. p. 2375–8. https://doi.org/10.1145/2396761.2398644

[ref39] 39. Liu YN, Wang YP, Wang XF, Xia Z, Xu J. Privacy-preserving data collection for mobile phone sensing tasks. In: Information Security Practice and Experience: 14th International Conference, ISPEC 2018, Tokyo, Japan, September 25–27, 2018, Proceedings 14. Springer; 2018. p. 506–18.

[ref40] 40. Zhang Z, He S, Chen J, Zhang J. REAP: an efficient incentive mechanism for reconciling aggregation accuracy and individual privacy in crowdsensing. IEEE Trans Inform Forensic Secur. 2018;13(12):2995–3007.

[ref41] 41. Mukkamala PS, Wu H, Düdder B. Reliable and streaming truth discovery in blockchain-based crowdsourcing. In: 2023 20th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON). IEEE; 2023. p. 492–500. https://doi.org/10.1109/secon58729.2023.10287465

[ref42] 42. Li Q, Li Y, Gao J, Su L, Zhao B, Demirbas M. A confidence-aware approach for truth discovery on long-tail data. Proceedings of the VLDB Endowment. 2014;8(4):425–36.

[ref43] 43. Gao J, Fu S, Luo Y, Xie T. Location privacy-preserving truth discovery in mobile crowd sensing. In: 2020 29th International Conference on Computer Communications and Networks (ICCCN). IEEE; 2020. p. 1–9.

[ref44] 44. Xiong J, Liu H, Jin B, Li Q, Yao Z. A lightweight privacy protection scheme based on user preference in mobile crowdsensing. Transactions on Emerging Telecommunications Technologies. 2021;32(5):e4000.

[ref45] 45. Xu Z, Yang W, Xiong Z, Wang J, Liu G. TPSense: a framework for event-reports trustworthiness evaluation in privacy-preserving vehicular crowdsensing systems. Journal of Signal Processing Systems. 2021;93(2–3):209–19.

[ref46] 46. Wei J, Wang X, Li N, Yang G, Mu Y. A privacy-preserving fog computing framework for vehicular crowdsensing networks. IEEE Access. 2018;6:43776–84.

[ref47] 47. Ni J, Zhang K, Yu Y, Lin X, Shen XS. Providing task allocation and secure deduplication for mobile crowdsensing via fog computing. IEEE Trans Dependable and Secure Comput. 2020;17(3):581–94.

[ref48] 48. Yu Y, Li F, Liu S, Huang J, Guo L. Reliable fog-based crowdsourcing: a temporal–spatial task allocation approach. IEEE Internet Things J. 2020;7(5):3968–76.

[ref49] 49. Alamri BHS, Monowar MM, Alshehri S. Privacy-preserving trust-aware group-based framework in mobile crowdsensing. IEEE Access. 2022;10:134770–84.

[ref50] 50. Azzam R, Mizouni R, Otrok H, Ouali A, Singh S. GRS: a group-based recruitment system for mobile crowd sensing. Journal of Network and Computer Applications. 2016;72:38–50.

[ref51] 51. Alagha A, Mizouni R, Singh S, Otrok H, Ouali A. SDRS: a stable data-based recruitment system in IoT crowdsensing for localization tasks. Journal of Network and Computer Applications. 2021;177:102968.

[ref52] 52. Chai T, Draxler RR. Root mean square error (RMSE) or mean absolute error (MAE)?–Arguments against avoiding RMSE in the literature. Geoscientific Model Development. 2014;7(3):1247–50.