
Collective evolution learning model for vision-based collective motion with collision avoidance

  • David L. Krongauz ,

    Contributed equally to this work with: David L. Krongauz, Teddy Lazebnik

    Roles Conceptualization, Formal analysis, Investigation, Software, Writing – original draft, Writing – review & editing

    kingkrong@gmail.com

    Affiliation Department of Computer Science, Bar-Ilan University, Ramat-Gan, Israel

  • Teddy Lazebnik

    Contributed equally to this work with: David L. Krongauz, Teddy Lazebnik

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Software, Supervision, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Cancer Biology, Cancer Institute, University College London, London, United Kingdom

Abstract

Collective motion (CM) takes many forms in nature; schools of fish, flocks of birds, and swarms of locusts to name a few. Commonly, during CM the individuals of the group avoid collisions. These CM and collision avoidance (CA) behaviors are based on input from the environment such as smell, air pressure, and vision, all of which are processed by the individual and translated into action. In this work, a novel vision-based CM with CA model (i.e., VCMCA) simulating the collective evolution learning process is proposed. In this setting, a learning agent obtains a visual signal about its environment, and through trial-and-error over multiple attempts, the individual learns to perform local CM with CA, which emerges into global CM with CA dynamics. The proposed algorithm was evaluated in the case of locust swarms, showing the evolution of these behaviors in a swarm from the learning process of the individual in the swarm. Thus, this work proposes a biologically-inspired learning process to obtain multi-agent multi-objective dynamics.

1 Introduction

Collective motion (CM) is a phenomenon occurring in homogeneous populations of interacting individuals [1–4]. It is manifested in multiple forms in nature, such as locusts aggregating into groups [5], birds flocking in an organized manner [6, 7], and fish schools responding to predators and showing migratory movements [8, 9]. During the last few decades, researchers have sought to elucidate the mechanisms allowing animals to achieve synchronized motion while lacking centralized control. Empirical data from experimental observations [10, 11] is analyzed with the aim of uncovering individuals' behavior using mathematical and computational models [12–14].

A large body of work can be categorized as agent-based models, where the collective dynamics are explained by actions at the individual level [15]. One of the leading approaches in agent-based simulation for CM is Self-Propelled-Particles, popularized by the model proposed by Vicsek et al. [16], where agents align their velocity according to short-range interactions with neighboring agents, commonly known as the metric-based approach. In this group of models [17–19], agents move with constant speed [16], aligning their velocity direction in response to inter-agent interactions. Moreover, a notable subgroup of models is the Avoidance-Attraction-Alignment group [4, 20, 21], where agents are governed by three types of superimposed forces: an aligning force (for example, see [16]), a separating force for collision avoidance, and cohesion toward the center of mass of the group. Couzin et al. [22] showed how efficiently information can transfer between members of animal groups. Another model incorporating metric-based interaction, proposed by Bhattacharya and Vicsek [23], described the processes involved in birds flocking and synchronized landing. In addition, models describing the dynamics and synchronization of general clusters of self-propelled particles utilize the metric approach [24, 25].

Contrary to the metric-based approach, the topological-based approach suggests that birds flock according to interactions with a fixed number of neighbors, regardless of their distance from each other [6]. Namely, this approach differs from the metric one in neighbor capacity, since in the metric approach theoretically the entire swarm can be classified as 'neighbors' if they are located within a predefined distance from each other. Shang and Bouffanais [26] proposed a method to find the optimal number of neighbors for the topological-based approach while taking into consideration interacting self-propelled particles that aim to reach a consensus. Camperi et al. [27] compared the metric-based and topological-based approaches, and showed superior stability for the latter, due to the fact that particles displaying a more uniform distribution lead to higher cluster stability. Kumar and De [28] explored and compared the flocking efficiency of the metric-based and topological-based approaches.

Although these models reproduce aspects of the macro-dynamics of natural swarms [29, 30], they do not address the problem of information acquisition by the agents, since these models rely on globally available information. For instance, accurate positions and velocities of agents in the environment are commonly assumed to be fully known to the focal agent [16, 17, 20]. Nonetheless, in reality, agents' perception of the environment is mediated by their sensory systems (e.g., vision, hearing, echolocation, magnetic field sensing, etc.) [31–34]. Distinctively, objects in the environment are reflected on the retina of the focal agent's eye, and the signal is later transmitted to the brain. This input is used as part of the agent's decision-making process, allowing an individual to act upon the visual stimuli it observed [35, 36]. Indeed, many animals use vision as the main sense for coordination and navigation [37–39]. Hence, to obtain more biologically accurate modeling of collective motion, agents in the swarm are assumed to estimate an object's location and velocity using vision-based sensing.

Multiple models that encompass visual projection in CM modelling have been proposed [40–43]. Strandburg-Peshkin et al. [42] explored visual sensor networks and their role in information transfer within schools of fish. Bastien and Romanczuk [44] used elementary occlusions of visual rays to create avoidance and self-organization in disc-shaped agents. The authors demonstrated several group behaviors with steering by angular visual occlusions. A follow-up work introduced an additional collision avoidance mechanism and demonstrated collective behavior in the presence of static obstacles in several settings [45]. Both models are oblivious to object recognition, thus any agent-specific features are ignored. In the present work, we propose and demonstrate a different approach in which the shape and size of the agents in the swarm affect the decision-making process due to the vision-based input.

Several attempts to model CM using learning models have been presented [46–48]. Durve et al. [46] developed a Vicsek-inspired CM model with both teacher-based reinforcement learning and concurrent learning. They conclude that a velocity alignment mechanism can spontaneously emerge from the minimization of the neighbor-loss rate (e.g., agents that leave the visual radius between two observations). Thus, the model learns to keep group cohesion when the perception is limited to the velocity of neighboring agents [46]. They assume the focal agent is accurately aware of the locations and velocities of the neighboring agents. Furthermore, Young and La [47] presented a hybrid system that achieves efficient CM behavior for escaping attacking predators while maintaining a flocking formation, using multiple reinforcement learning methods. Additionally, it was shown that independent learning, with and without function approximation, proved to be unreliable in learning to flock together towards the same target. Similarly to [46], their model assumes the focal agent is accurately aware of the locations and velocities of the neighboring agents while solving a multi-objective task. Nonetheless, they were not able to obtain a collective behavior by learning a policy at the individual level, as obtained by Durve et al. [46]. Lopez-Incera et al. [48] studied the collective behavior of artificial learning agents, driven by a reinforcement learning action-making process with an abstract sensing mechanism, that arises as they attempt to survive in foraging environments. They designed multiple foraging scenarios in one-dimensional worlds in which the resources are either near or far from the region where agents are initialized, and showed the emergence of CM [48]. While their work took into consideration a sensing-based approach, they investigated a single-objective behavior in an extremely simplistic scenario.

The novelty of the proposed work lies in the development of a Collective Evolution Learning (CEL) algorithm that allows learning a multi-objective collective behavior based on the optimization of a single member’s decision-making process even with noisy input data. In particular, we show that CEL is able to achieve flocking with inter-agent collision avoidance using only vision-based inputs.

Formally, our model produces vision-based collective motion (VCM) group behavior combined with collision avoidance (CA), or VCMCA for short. The model is obtained by simulating the evolution process of an animal population that learns the desired behavior over time using a trial-and-error mechanism. We evaluate the performance of the algorithm to obtain a VCMCA model in different settings, showing robust results. The remainder of the manuscript's structure is shown in Fig 1.

Fig 1. A schematic view of the manuscript’s structure.

Section 2 outlines the mathematical formalization for CM with CA in a two-dimensional setting with agents that base their decisions on a vision mechanism. In Section 3, a novel computational model for vision-based CM with CA, called VCMCA, is proposed. In Section 4, the performance of the VCMCA model is evaluated using several in silico experiments. In Section 5, biological outcomes, a comparison with other RL- and Vicsek-based models, and the limitations of the model are discussed.

https://doi.org/10.1371/journal.pone.0270318.g001

2 Vision-based collective motion with collision avoidance

In this section, we describe the proposed VCMCA model, constructed from three components: the vision-based input mechanism on the agent level, the physical movement and collision mechanism, and the swarm-level CM and CA dynamics. A table that summarizes the parameters used in the model, their definitions, and notations is provided in the Appendix.

2.1 Settings and vision-based input

In order to represent the open-field conditions that characterize swarming organisms’ environments in a simple manner, we define a two-dimensional world with two-dimensional agents, in a similar fashion to Vicsek’s model [23]. We use the metric-based approach to define the neighbors of an agent.

The model (M) is defined by the tuple M ≔ (E, P) where E is the environment and P is a fixed-size (‖P‖ ≔ N) population of rectangular-shaped agents populating it. Formally, the environment, E, is a continuous space with height H and width W with a toroidal geometrical configuration. Namely, a two-dimensional rectangle with periodic boundary conditions. Each agent, p ∈ P, is defined by a tuple p ≔ (l, v, a, h, w, r, γ) where l = (l_x, l_y) ∈ E is the location of the agent's center of mass, v is the current velocity vector of the agent, a is the current acceleration vector of the agent, h, w are the agent's height and width, respectively, r is the metric-based sensing radius of the agent, and γ is the agent's vision-based input, defined below.

The parameters used for the agent's visual mechanism are inspired by previous works dealing with vision-based information acquisition of flocking animals [43, 49]. Formally, the vision-based input is parameterized by γ ≔ (θ, ψ, θ̇, ψ̇), where θ and ψ are the angular position and the area subtended on the retina of the focal agent, respectively, and θ̇ and ψ̇ are their respective derivatives in time.

Based on the sensing radius, r, we define the local environment of a focal agent p_o ∈ P (we use o to denote parameters related to the focal agent) to be the subset of agents P* that are located within sensing distance from the focal agent, P* ≔ {p_i ∈ P: ‖l_i − l_o‖ < r_o}, such that i ≠ o. Now, each agent in the local environment (p_i ∈ P*) of a focal agent p_o is sensed using the vision mechanism if it is not fully obscured by other agents. The vision-based properties (γ) are computed for each neighbor n in the local environment of the focal agent, resulting in a vector ν ≔ [γ_1, …, γ_{|P*|}]. The subtended angular area ψ for each neighbor n is obtained as follows:

ψ ≔ max_{i,j ∈ {1,…,4}} arccos((χ_i · χ_j) / (‖χ_i‖ ‖χ_j‖)), (1)

where χ_i are the relative positions of the corners of the n-th neighbor as seen in the focal agent's frame. Formally, the lab-frame coordinates of the corners of the n-th neighbor are

c_i = l_n ± (h/2) v̂ ± (w/2) v̂⊥, (2)

such that the index i ∈ {1, 2, 3, 4} enumerates the four sign combinations, v̂ is the unit vector along the velocity vector v of the neighbor agent, v̂⊥ is the unit vector perpendicular to it, and l_n is the position of the n-th neighbor's center of mass. The coordinates in the focal agent (o) frame result in χ_i = c_i − l_o. Thus, the angular position parameter θ of the n-th neighbor is defined as the angle between the focal agent's heading and the line of sight to the neighbor's center of mass:

θ ≔ ∠(v_o, l_n − l_o). (3)

An agent can be fully hidden from the focal agent by other agents in its local environment. In such a case, it is not taken into consideration in the vision-based computation. A schematic view of the focal agent's vision-based sensing (γ) is shown in Fig 2. More details about the computation of Eqs (2) and (3) are provided in the Appendix.
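
For concreteness, the following Python sketch implements the sensing computation under the reconstructed forms of Eqs (1)–(3): the neighbor's corners are built from its center, heading, and body dimensions, ψ is the largest angle between any pair of corner vectors (as in Fig 11), and θ is the heading-relative bearing of the neighbor's center of mass. Function names are illustrative, not taken from the paper's implementation.

```python
import numpy as np

def corner_vectors(l_o, l_n, v_n, h, w):
    """Chi vectors (Eq 2): from the focal agent's center to the four corners
    of the n-th neighbor, whose body is aligned with its velocity v_n."""
    v_hat = v_n / np.linalg.norm(v_n)            # unit vector along the body axis
    v_perp = np.array([-v_hat[1], v_hat[0]])     # perpendicular unit vector
    corners = [l_n + sh * (h / 2) * v_hat + sw * (w / 2) * v_perp
               for sh in (1.0, -1.0) for sw in (1.0, -1.0)]
    return [c - l_o for c in corners]            # focal-agent frame

def subtended_angle(chis):
    """Angular area psi (Eq 1): the largest angle between any pair of corner
    vectors, matching the 'extreme corners' description in Fig 11."""
    psi = 0.0
    for i in range(len(chis)):
        for j in range(i + 1, len(chis)):
            cos_ij = np.dot(chis[i], chis[j]) / (
                np.linalg.norm(chis[i]) * np.linalg.norm(chis[j]))
            psi = max(psi, np.arccos(np.clip(cos_ij, -1.0, 1.0)))
    return psi

def angular_position(l_o, v_o, l_n):
    """Angular position theta (Eq 3): the heading-relative bearing of the
    neighbor's center of mass."""
    los = l_n - l_o                              # line of sight
    return np.arctan2(los[1], los[0]) - np.arctan2(v_o[1], v_o[0])
```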

Fig 2. A schematic view of the vision sensing mechanism of an agent.

The computation is presented for a single focal agent but performed symmetrically for all the agents in the population. (a) Spatial view from a single agent’s point of view. (b) Computation interaction from a single agent’s point of view.

https://doi.org/10.1371/journal.pone.0270318.g002

2.2 Agent’s movement and collision dynamics

The proposed model has a discrete, synchronized clock that all the agents, P, follow. At the beginning of the simulation (t_0), the locations and velocities of the agents' population (P) are chosen at random from a pre-defined distribution. Then, for each clock tick (marked by t_i for the i-th tick) the following happens: first, each agent p_o ∈ P senses its local environment and computes its action based on a given decision-making model, in a random order. Second, each agent moves as follows:

v(t_{i+1}) = (1 − α) v(t_i) + α x(t_i), l(t_{i+1}) = l(t_i) + v(t_{i+1}), (4)

where α ∈ [0, 1] is a weight variable indicating the influence of the desired direction (x(t_i)) on the current direction. Due to the toroidal geometrical configuration of the environment, if l ∉ E then l_x ← l_x mod W and l_y ← l_y mod H.
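
A minimal sketch of a single clock tick, assuming the reconstructed form of Eq (4) with a unit time step (the weight symbol α is our notation, as the original symbol was lost in extraction):

```python
import numpy as np

def move(l, v, x, alpha, W, H):
    """One clock tick of Eq (4): blend the current velocity with the desired
    direction x using the weight alpha, then move (unit time step) and wrap
    around the toroidal W-by-H environment."""
    v_new = (1.0 - alpha) * np.asarray(v) + alpha * np.asarray(x)
    l_new = np.asarray(l) + v_new
    l_new[0] %= W    # periodic boundary in x
    l_new[1] %= H    # periodic boundary in y
    return l_new, v_new
```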

In case two agents p_1 ∈ P, p_2 ∈ P collide, a collision protocol is employed: first, the simulation identifies each of the colliding agents as either an active or a passive collider. Namely, in a head-tail collision, the head-colliding agent is regarded as the active collider and the tail-colliding agent as the passive one. In the case of a frontal collision, both parties are regarded as active colliders. The active colliding agent(s) are stopped (i.e., v(t_i) = 0), while the passive colliding agent's velocity follows a purely-elastic collision:

v_passive(t_{i+1}) = v_colliding(t_i), (5)

where v_colliding(t_i) is the velocity of the colliding agent at the time of the collision (t_i). A schematic view of the perception-action loop of a single agent is presented in Fig 2b.
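
The collision protocol can be sketched as follows, assuming the reconstructed form of Eq (5) and agents represented as simple objects with a velocity array v:

```python
import numpy as np

def resolve_head_tail(active, passive):
    """Head-tail collision: the passive agent takes the active agent's
    pre-collision velocity (Eq 5), and the active agent stops."""
    passive.v = active.v.copy()
    active.v = np.zeros_like(active.v)

def resolve_frontal(agent_a, agent_b):
    """Frontal collision: both agents are active colliders and stop."""
    agent_a.v = np.zeros_like(agent_a.v)
    agent_b.v = np.zeros_like(agent_b.v)
```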

2.3 Collective motion and collision avoidance

We define the agents' population CM metric by the degree of alignment of the velocity headings of the simulated agents, in a similar fashion to other CM models' metrics [4, 16]. Formally, the CM metric is a function F: 𝒫 → [0, 1], where 𝒫 is the space of all possible agent populations. F(P) is defined as follows:

F(P) ≔ 1 − (1/N) ‖Σ_{i=1}^{N} v̂_i‖, (6)

where v̂_i ≔ v_i/‖v_i‖.

In addition, we define a collision between two agents p_1 ∈ P and p_2 ∈ P as taking place if and only if:

‖(l_1 + v_1) − (l_2 + v_2)‖ < max(h, w). (7)

The motivation behind this definition is to check if two agents p_1, p_2 would collide after they both make a move. In order to evaluate if a collision happened, we check if the distance between the agents' centers of mass (l_1 + v_1 and l_2 + v_2) is smaller than their width or height. This definition is optimal when assuming circular-shaped agents. Nonetheless, for non-circular-shaped agents, such as in our case, it is somewhat conservative, as there are angles in which the condition is met but the agents do not physically collide. Therefore, a function C: 𝒫 → [0, 1], such that C(P) counts the ratio of collisions between the agents of the population to the possible number of collisions at a single point in time, is used as the CA metric. In our context, C is normalized to [0, 1] by dividing the collision count by the largest hypothetical number of collisions possible in a single step in time.

Based on these two metrics, the objective of the CM with CA takes the form:

min_{ω ∈ Ω} Σ_{t=t_0}^{t_f} [ζ F_t + (1 − ζ) C_t], (8)

where Ω is the space of all possible policies of the agents and ζ ∈ [0, 1] is the weight of the CM score compared to the CA score. The optimization is done over any finite duration in time such that t_0 < t_f, and F_t and C_t represent the CM and CA metrics at time t, respectively.
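
Under the reconstructed forms of Eqs (6)–(8), the two metrics and the combined objective can be sketched as follows; the pair-count normalizer in the CA metric is our assumption, standing in for the largest hypothetical number of collisions:

```python
import numpy as np

def cm_metric(V):
    """Eq (6): 1 minus the norm of the mean heading; 0 for perfect alignment."""
    V_hat = V / np.linalg.norm(V, axis=1, keepdims=True)
    return 1.0 - np.linalg.norm(V_hat.mean(axis=0))

def ca_metric(L, V, h, w):
    """Eq (7)-based CA metric: fraction of agent pairs whose next-step centers
    of mass are closer than max(h, w), normalized by the number of pairs."""
    L, V = np.asarray(L), np.asarray(V)
    n = len(L)
    nxt = L + V                                  # predicted centers of mass
    hits = sum(np.linalg.norm(nxt[i] - nxt[j]) < max(h, w)
               for i in range(n) for j in range(i + 1, n))
    return hits / (n * (n - 1) / 2)

def cmca_objective(F_t, C_t, zeta=0.5):
    """The Eq (8) integrand, minimized over policies and summed over time."""
    return zeta * F_t + (1.0 - zeta) * C_t
```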

3 Collective evolution learning algorithm

3.1 Motivation

The CM behavior of locusts is believed to develop over time through an evolutionary process. The locusts try to adapt to their environment and generate the next population of locusts. During this process, multiple locusts experience similar situations and operate slightly differently, based on the unique properties each locust has, which results in different outcomes. Locusts that perform better are more likely to reproduce and keep their behavior in the population over time. As such, the desired behavior (e.g., CM with CA) emerges throughout the collective experience of the locusts and improves over time using a natural trial-and-error approach.

Based on this motivation, we propose a reinforcement learning (RL) based algorithm (A) called Collective Evolution Learning (CEL for short). The proposed algorithm is based on the assumption that the agents in a swarm are identical and share common knowledge (policy). Intuitively, the CEL algorithm receives the observable parameters (γ) of a subset of agents (P* ⊂ P∖{p}) in the local environment (R) of an agent (p) and returns the action (x) that best approximates the optimal action under the CM with CA objective metric (Eq (8)) using a Q-learning based model [50, 51]. Since both CM and CA are swarm-level metrics, CEL has an intermediate genetic algorithm (GA) based model [52–56] to optimize CM and CA on the agent's level. Moreover, to obtain a well-performing Q-learning-based model, one is required to sample the state space of the model well [57]. In order to do that, CEL introduces a layer on top of the Q-learning model that measures its performance for different scenarios and uses the K-nearest neighbors (KNN) algorithm [58] to define a new sampling strategy for further training of the Q-learning model. A schematic view of the model training, validation, and testing phases with the interactions between them is presented in Fig 3.

Fig 3. A schematic view of the model training, validation, and testing phases with the interaction between them.

https://doi.org/10.1371/journal.pone.0270318.g003

3.2 Swarm dynamics using a Q-learner

Formally, CEL is a function such that A(ν) → [x_1, …, x_n], where ν ∈ ℝ^{4k} is the outcome of the vision-based sensing of an agent p ∈ P, 4k is the maximum possible number of agents (k) that an agent p ∈ P can sense at a single point in time multiplied by the number of vision-based parameters per agent (i.e., θ, ψ, θ̇, and ψ̇), and [x_1, …, x_n] are scores over X, the set of all possible actions an agent can make. Following this definition, one can treat A as a policy function, getting a state ν for each agent and returning the action x it should perform. To learn the policy A, we use a Q-learning-based model. The Q-learner stochastically maps each state ν to the set of possible actions by assuming a discretization of both the input and output spaces. Hence, it is assumed that the number of possible actions is finite and that the state space is obtained by the discretization of each vision parameter, defining a classification problem rather than a regression problem.
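
As an illustration of this discretization, a hypothetical binning scheme could map the vision vector ν to a hashable Q-table key as follows (the bin count, parameter ranges, and zero-padding for absent neighbors are assumptions, not values from the model):

```python
import numpy as np

def discretize_state(nu, bins_per_param=8, max_neighbors=4,
                     lo=-np.pi, hi=np.pi):
    """Map the continuous vision vector nu (up to 4*max_neighbors entries)
    to a tuple of bin indices usable as a Q-table key."""
    nu = np.asarray(nu, dtype=float)
    padded = np.zeros(4 * max_neighbors)       # absent neighbors stay zero
    padded[:len(nu)] = nu
    edges = np.linspace(lo, hi, bins_per_param + 1)
    bins = np.digitize(padded, edges) - 1      # bin index per parameter
    return tuple(np.clip(bins, 0, bins_per_param - 1))
```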

The process of obtaining the model is divided into three main phases: training, evaluation, and testing. The training and evaluation phases are executed sequentially multiple times until a stop condition is satisfied. Afterward, the testing phase takes place to compute the VCMCA score of the obtained model.

The training phase is further divided into the first time the training phase is utilized and any subsequent time. For the first time, the Q-learning matrix is initialized with all possible states and their actions such that each action has a uniform probability of being chosen for each state. Afterward, for each utilization of the training phase, the training phase includes three linear processes. First, a scenario including a focal agent and zero, one, or more (limited by k) neighbor agents is generated, at random and in an iterative manner. Based on this scenario, the Q-learner's state is computed from the focal agent's point of view. Second, an approximation to the optimal actions for this scenario is obtained using a GA-based model aiming to optimize the CMCA score for η steps in time. This step is described in detail in Section 3.3. Lastly, the results of the algorithm are used to update the Q-learner. The Q-learner is updated by adding to the current value the score obtained by the GA process, multiplied by a pre-defined learning rate, λ ∈ [0, 1]. This process repeats itself Z_training times.
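
A minimal sketch of such a Q-learner is given below; the table layout and the stochastic action rule are our assumptions, while lam is the learning rate λ described above:

```python
import collections
import random

class QTable:
    """Minimal Q-learner sketch: each state maps to a value per action,
    initialized uniformly; GA-derived scores are added in, scaled by lam."""
    def __init__(self, actions, lam=0.1):
        self.actions = list(actions)
        self.lam = lam
        self.q = collections.defaultdict(
            lambda: {a: 1.0 / len(self.actions) for a in self.actions})

    def update(self, state, action, ga_score):
        # add the GA-obtained score, scaled by the learning rate
        self.q[state][action] += self.lam * ga_score

    def act(self, state):
        # stochastic policy: sample an action proportionally to its value
        values = self.q[state]
        weights = [max(values[a], 0.0) for a in self.actions]
        if sum(weights) == 0.0:
            return random.choice(self.actions)
        return random.choices(self.actions, weights=weights)[0]
```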

The evaluation phase includes three linear processes. First, a set of scenarios of size Z_validate is generated at random. For each scenario, the Q-learner's state is computed from the focal agent's point of view. Then, the performance of the Q-learner for each state is computed by simulating η steps in time such that both the focal and neighbor agents follow the Q-learner's model. Afterward, CEL uniformly samples the state space and computes the estimated performance of the model on these values using the distance-weighted KNN algorithm with a pre-defined number of neighboring data points. The outcome of the last step is a 4k-dimensional distribution function. If the total variation distance between the obtained distribution and a uniform distribution is larger than a pre-defined threshold (κ), then CEL returns to the training phase with the obtained distribution to sample from for the next set of training samples. Otherwise, CEL moves to the testing phase.
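
The KNN-based re-sampling step might be sketched as follows; since the text does not specify how the performance estimates become a sampling distribution, inverting them (so poorly performing regions are sampled more) is our assumption:

```python
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

def resampling_distribution(states, scores, grid, n_neighbors=5):
    """Estimate the Q-learner's performance over a uniform grid of states
    with distance-weighted KNN, then turn the estimates into a sampling
    distribution over the grid points."""
    knn = KNeighborsRegressor(n_neighbors=n_neighbors, weights="distance")
    knn.fit(np.asarray(states), np.asarray(scores))
    est = knn.predict(np.asarray(grid))
    weights = est.max() - est + 1e-9   # low estimated score -> high weight
    return weights / weights.sum()

def total_variation(p, q):
    """Total variation distance between two discrete distributions."""
    return 0.5 * np.abs(np.asarray(p) - np.asarray(q)).sum()
```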

The testing phase includes three linear processes. First, CEL generates a set of simulation scenarios of a pre-defined size, at random. Notably, the simulation scenarios are of the entire population and not of a single focal agent and its neighbor agents, as used during the training and evaluation phases. Second, each simulation scenario is simulated for a pre-defined number of steps in time such that all the agents in the population follow the Q-learner model. Lastly, the performance of the CEL model is defined to be the average CMCA score of the simulations.

3.3 Agent’s decision-making using a genetic algorithm

An approximation to the optimal actions for a given scenario is obtained using a GA-based model aiming to optimize the CMCA score of a single agent over a short duration of time. To use the GA, we first define a local, individual objective function, based on the collective metric of Eq (8), as follows:

f(ϱ) ≔ Σ_{t=t_i}^{t_i+η} [ζ F_t(P* ∪ {p}) + (1 − ζ) C_t(P* ∪ {p})], (9)

where P* ⊂ P is the subset of agents in the local environment of a focal agent p ∈ P, and ϱ is a gene used by the GA, encoding the agent's actions over the evaluated steps. This definition assumes a semi-Markovian property [59] of the action-making processes of the agents, which means that the agents aim to perform the optimal action for the next state, taking into consideration the current and previous states.

First, the gene population is initialized. Formally, a gene population of size τ is generated, such that each gene (ϱ) is defined by a vector of size η, corresponding to the number of steps in time the focal agent needs to perform. Each value of the vector is an action x ∈ X. Afterward, for g generations (iterations), three operations take place: cross-over, mutation, and next-generation selection. To be exact, the cross-over operator gets two genes ϱ_1, ϱ_2 and a weight vector w ∈ [0, 1]^η and returns two new genes ϱ'_1 ≔ (ϱ_1[1], …, ϱ_1[i−1], ϱ_2[i], …, ϱ_2[η]) and ϱ'_2 ≔ (ϱ_2[1], …, ϱ_2[i−1], ϱ_1[i], …, ϱ_1[η]), where i ∈ [2, η] is an index of the genes' vector sampled from a distribution defined by w [60]. The mutation operator MO(ϱ) → ϱ′ gets a gene ϱ and returns a new gene ϱ′ such that one of the values of ϱ is altered to a different action in a uniformly distributed manner. The next generation operator (NGO) keeps a portion ρ ∈ [0, 1] of the genes with the highest fitness scores, while the remaining genes are chosen by a distribution defined by the L1 normalization of the fitness scores of the remaining genes [61]. Once the GA finishes its calculations, the gene with the best fitness score is taken. From this gene, the first action is provided to the Q-learner.
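
A compact sketch of the three operators under the definitions above (the single-point cross-over with a w-weighted cut index follows the reconstructed formula; helper names are illustrative):

```python
import random

def crossover(g1, g2, w):
    """Single-point cross-over: the cut index (i in [2, eta], 1-based) is
    sampled from the distribution defined by the weight vector w."""
    eta = len(g1)
    i = random.choices(range(1, eta), weights=w[1:])[0]  # 0-based cut point
    return g1[:i] + g2[i:], g2[:i] + g1[i:]

def mutate(gene, actions):
    """Replace one uniformly chosen entry with a uniformly chosen action."""
    g = list(gene)
    g[random.randrange(len(g))] = random.choice(actions)
    return g

def next_generation(population, fitness, rho):
    """Keep the top rho fraction by fitness; fill the rest by sampling the
    remaining genes with probability proportional to their fitness (the L1
    normalization is implicit in random.choices)."""
    ranked = sorted(population, key=fitness, reverse=True)
    n_keep = max(1, int(rho * len(population)))
    elite, rest = ranked[:n_keep], ranked[n_keep:]
    fill = random.choices(rest, weights=[fitness(g) for g in rest],
                          k=len(population) - n_keep) if rest else []
    return elite + fill
```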

Notably, one does not have to use the GA approach, as in theory the Q-learning algorithm should converge to a similar policy without the GA component. However, as the number of samples that result in better fitness is very small compared to the action space's size, a Q-learner would require a very large number of samples to achieve similar results. As such, it would lead to a much longer training process.

A time complexity analysis of CEL is provided in the Appendix.

4 Results

To evaluate the ability of the CEL algorithm to obtain a VCMCA model and explore the influence of several properties of the agents on the model's performance, we conduct three experiments. First, we measure the performance of the proposed algorithm compared to two other approaches: vision-based CM without CA and a random walk. Second, we analyze the influence of population density, the agent's height-to-width ratio, and the sensing radius on the VCMCA model's performance. Finally, we approximate the VCMCA model obtained by the CEL algorithm using an explainable Random Forest [62] model to extract the rule-based structure of the agent's action-making process, allowing the user to interpret the connection between the visual stimuli and the reaction of the agent to them.

4.1 Setup

We implemented the VCMCA model and CEL algorithm as a simulator for the case of locusts. We chose locusts since, at the beginning of the flocking, they move on the ground, which is approximated by a 2D plane. However, the model can be adapted to other species as well. The simulation is divided into two phases: 1) training the agent's individual VCMCA model using the CEL algorithm; and 2) using the obtained VCMCA model in the locust swarm simulation. The default hyperparameter values used in the simulation are picked to represent a population of locusts undergoing the transition to the swarming (gregarious) phase. The parameters with their descriptions and values are summarized in Table 1. Unless stated otherwise, these hyperparameter values are used across all experiments. In particular, the agent's height-to-width ratio is set to 7/1. This ratio is obtained by rounding the average ratio of 50 locusts to the closest natural number. To be exact, we manually tagged the bounding boxes of 50 upper-view locust images, obtaining an average height-to-width ratio of 6.78 with a 1.48 standard deviation. Specifically, the locust images were of the Schistocerca cancellata and Schistocerca gregaria species, since these are known to exhibit swarming behaviors and density-dependent phase transition [63, 64]. In addition, group population sizes were set in accordance with available biological records of locust swarms [64, 65]. Though the exact density values are one order of magnitude larger than in the simulations, we concentrate on the emerging phases of the flock, when the density has yet to reach its maximal numbers. The model uses abstract discrete time steps. In order to calibrate the simulation's abstract time step, we define each time step to be the average duration required by the average locust (agent) to make a decision and move [66].

Table 1. The description of the model’s parameters and their default values.

https://doi.org/10.1371/journal.pone.0270318.t001

4.2 CEL algorithm training

We train the locusts' VCMCA model using the CEL algorithm n = 30 times to evaluate the average relationship between the number of training samples and the obtained VCMCA model's performance, due to the stochastic nature of the algorithm. In Fig 4, the blue line indicates the training fitness function's value (i.e., Eq (9)), and the green line indicates the CMCA score (i.e., Eq (8)). The results are shown as mean ± standard deviation. The vertical dashed-yellow lines indicate the validation phases during the training process, as described in Fig 3.

Fig 4. Performance of the learner as training progresses.

The training fitness is the agent's score in the testing phase. The CMCA score is computed by using the trained agent in the simulation. The CMCA score is the complement to 1 of the CMCA objective function (Eq 8). The results are shown as mean ± standard deviation of n = 30 repetitions. Namely, the error bars are one standard deviation. The vertical dashed-yellow lines indicate the validation phases during the training process. The training converged after five validations for κ = 0.05, Z_training = 10000000, and Z_validate = 100000.

https://doi.org/10.1371/journal.pone.0270318.g004

We fitted the average training fitness and CMCA score using the least mean square method, obtaining F1: fitness = 0.189 ln(samples) + 0.053 and F2: cmca = 0.196 ln(samples) − 0.268, with coefficients of determination R² = 0.989 and R² = 0.972, respectively. Hence, the functions F1 and F2 fit the dynamics well. Since both functions are monotonically increasing, this shows that there is a connection between the CMCA score of an individual and the population's CMCA score. Similarly, the Pearson correlation between the training fitness and CMCA scores in the validation stages is R² = 0.971 with a p-value of 0.0057. Thus, the scores are strongly linearly correlated as well.
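
Such a logarithmic fit can be reproduced with SciPy; the data points below are placeholders standing in for the measured curves of Fig 4, not the paper's actual values:

```python
import numpy as np
from scipy.optimize import curve_fit

# Placeholder measurements (illustrative only; the real curves are in Fig 4).
samples = np.array([1e5, 5e5, 1e6, 5e6, 1e7, 5e7])
fitness = np.array([0.45, 0.52, 0.55, 0.62, 0.68, 0.72])

def log_model(x, a, b):
    return a * np.log(x) + b

(a, b), _ = curve_fit(log_model, samples, fitness)
ss_res = np.sum((fitness - log_model(samples, a, b)) ** 2)
ss_tot = np.sum((fitness - fitness.mean()) ** 2)
r2 = 1 - ss_res / ss_tot  # coefficient of determination
print(f"fitness = {a:.3f} ln(samples) + {b:.3f}, R^2 = {r2:.3f}")
```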

Screenshots of the simulator at the beginning and end of a simulation of an average-performance VCMCA model after 50 million training samples are shown in Fig 5a and 5b, respectively. Each agent is represented by a black rectangle with a surrounding gray circle that represents the sensing radius of the agent, along with its index number. Agents marked in dashed blue have collided. One can see in Fig 5a that the agents are heading in random directions, while in Fig 5b the agents are heading in the same direction (top-right corner), obtaining CM.

Fig 5. A screenshot of the simulation.

Blue-dashed agents collided in the current simulation step. (a) At the beginning of the simulation (t = 0). (b) At the end of the simulation (t = 500).

https://doi.org/10.1371/journal.pone.0270318.g005

4.3 Baseline

We used the obtained VCMCA model to evaluate the CM, CA, and CMCA scores over time. We define the scores as the complement values of the objective functions of CM (Eq 6), CA (Eq 7), and CMCA (Eq 8). Thus, higher CM or CA scores signify better flocking performance or lower collision rates, respectively. We compared these results to a model obtained in the same way but with ζ = 1 rather than ζ = 0.5. Namely, in this configuration, the CEL algorithm aims to optimize the CM while ignoring CA. We refer to this model as the CM model. In addition, we compare the proposed model with a random walk. The random walk model assumes a uniform probability distribution over the actions and picks one action at random each time.

The results of the CM, CA, and CMCA scores are shown as mean ± standard deviation of n = 30 simulations in Fig 6a–6c, respectively.

Fig 6. The collective motion, collision avoidance, and CMCA score of the VCMCA algorithm, VCMCA algorithm without CA, and random walk over the simulation steps for n = 30 repetitions.

Namely, the error bars represent one standard deviation. The models’ scores are defined as the complement values of the respective objective functions. (a) Collective motion (CM) score. (b) Collision avoidance (CA) score. (c) Collective motion with collision avoidance (CMCA) score.

https://doi.org/10.1371/journal.pone.0270318.g006

As expected, both the CM and VCMCA models obtain flocking, with CM scores of 0.75 and 0.7, respectively, while the random walk converges to 0.4, as presented in Fig 6a. On the other hand, while the CMCA model converges to a CA score of 0.74, the random walk and CM models obtained 0.29 and 0.17, respectively, as shown in Fig 6b. Of note, the CM model obtains the worst CA score of the three. Since the CM model is trained only to flock with other agents, it disregards collisions in its training, causing the agents to move closer to each other, as shown in [46].

In addition, the mean results of each model over time are shown in Table 2, such that the results are the mean ± standard deviation computed on the values presented in Fig 6a–6c, respectively. As seen in Fig 6a, the agents' convergence to cohesive movement, using the CM and CMCA models, emerges gradually, with the collective motion index increasing at slowing rates over time. This process is similar to displays of hopper-band formations, in which desert locust nymphs gather together into flocks of thousands of individuals. Due to the initially disordered state, it takes time for the visual information to be mediated between the swarm members and for them to change and adapt their heading on an individual level, which later results in the swarm reaching a coordinated heading direction on a group level. Another notable aspect is the collision rate (Fig 6b) for the CM and Random models reaching its final value in a relatively short period, in contrast to the CMCA model, which decreases moderately over the course of the simulation, reaching a saturation value more than four times larger than the CM model's. The gradual decrease in the CA score (which is, by definition, equal to an increase in the collision rate) in the CMCA model could be attributed to the CM component causing the agents to align and follow each other, in turn bringing them closer together. These results align with the biological inspiration, since flock members facilitate coherent movement partly due to their collision-avoidance abilities [67, 68], preventing them from constantly colliding with each other, which could potentially cause high mortality in flocking animals. The models' mean scores for each evaluation metric are summarized in Table 2.

Table 2. A mean ± standard deviation of the CMCA, CM, and CA scores for the CMCA model, CM model, and random walk for n = 30 repetitions.

https://doi.org/10.1371/journal.pone.0270318.t002

4.4 Parameter sensitivity analysis

To estimate the sensitivity of the VCMCA model's performance to the population density, the agent's height-to-width ratio, and the sensing radius, we computed, given the trained VCMCA model from the CEL algorithm, the CMCA score over n = 30 simulations with different population sizes, as presented in Fig 7. The results are shown as mean ± standard deviation. Notably, the results are obtained after the VCMCA model is trained on the default values (see Table 1) and evaluated on the other configurations. The results indicate an optimal flock size (P = 40), suggesting a strong link between the population's size and flocking emergence. For small population sizes, the density is insufficient for the phase transformation to occur, since interactions between the moving agents are scarce and happen across large time intervals, thus preventing the 'directional cluster' aggregation. In other words, agents' behavior is similar to a random walk, since local interactions do not translate to the group level. An increase in population size allows for the accumulation of aligning inter-agent interactions, in which a small cluster of agents' coherent movement gradually aligns the rest of the swarm, resulting in a large-scale coordinated motion reminiscent of natural flocks. Further increasing the population size leads to an inherent rise in the collision rate or in CA maneuvers, where agents steer away from an aligned direction to avoid an imminent collision with an approaching neighbor. Since our model strives to minimize the collision rate, relatively large populations hinder the aligning mechanism, since the CA mechanism dominates the CM mechanism. Moreover, as the size of the environment was kept constant in all the experiments, there is a linear correlation between the population's size and density: σ ≡ |P|/(WH), where σ is the population's density. Hence, one can obtain the same connection between the population's (swarm) density and the flocking emergence.

Fig 7. A mean ± standard deviation of the CMCA score (Eq (8)) as a function of the population size (|P|) for n = 30 repetitions.

Namely, the error bars represent one standard deviation.

https://doi.org/10.1371/journal.pone.0270318.g007

In a similar manner, we computed the influence of the agents' height-to-width ratio (i.e., h/w) on the CMCA score over n = 30 simulations, as shown in Fig 8.

Fig 8. A mean ± standard deviation of the CMCA score (Eq (8)) as a function of the agent's ratio between the height and the width (h/w) for n = 30 repetitions. Namely, the error bars represent one standard deviation.

https://doi.org/10.1371/journal.pone.0270318.g008

Expectedly, larger height-to-width ratios yield lower CMCA scores. As mentioned in Section 2, elongated agent morphologies, as opposed to circular ones, complicate the processing of visual stimuli. A neighbor's orientation greatly affects the subtended angle ψ, especially for longer agents, since the same agent at the same distance to its center could subtend highly differing values of ψ. For example, observing an agent's head or rear vs. observing its side will greatly change the input information and introduce significant noise into the state domain. Moreover, Fig 8 shows that square-shaped agents obtained the highest CMCA score. One can conclude that a square, with its similarity to a disc, can artificially improve the convergence of a vision-based model [44, 45], since it eliminates the problem of orientational noise with its one-to-one mapping from distance to angular area. However, such agent morphologies are biologically inaccurate, especially when addressing flocking animals such as locusts, starlings, and fish [69–71].

In a similar manner, we computed the influence of the agents' sensing radius (r) on the CMCA score over n = 30 simulations, as shown in Fig 9, such that the x-axis indicates the sensing radius both in absolute size and as a multiplier of the agent's height.

Fig 9. A mean ± standard deviation of the CMCA score (Eq (8)) as a function of the agent’s sensing radius (r) for n = 30 repetitions.

Namely, the error bars represent one standard deviation.

https://doi.org/10.1371/journal.pone.0270318.g009

4.5 Explainability of the model

The proposed model is the outcome of multiple stochastic processes, converging into collective and intelligent behavior. In this section, we aim to measure the ability of a simple model, such as a Decision Tree (DT), or even its more complex but still explainable extension, the Random Forest (RF) model, to capture the complexity of the proposed model. First, we generate 1000000 cases at random, in the same way as done for the testing phase of the CEL algorithm (see Section 3), and query the model, resulting in a corresponding action for each state. Based on this data, we train RF models with 1000 trees to allow a large number of parameters, allowing them to fit complex dynamics [62]. The obtained RF models' accuracy relative to the VCMCA model, under a restriction on the number of leaf nodes and on the tree's maximum depth, is shown in Fig 10a and 10b, respectively.
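
A sketch of this distillation experiment using scikit-learn; the state sampling range and the policy callable (standing in for the trained VCMCA model) are assumptions:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

def distill(policy, state_dim, n_cases=1_000_000, max_leaf_nodes=320):
    """Sample random states, label them with the trained policy's actions,
    fit a 1000-tree Random Forest, and report how faithfully it reproduces
    the policy (the accuracy reported in Fig 10)."""
    rng = np.random.default_rng(0)
    states = rng.uniform(-np.pi, np.pi, size=(n_cases, state_dim))
    actions = np.array([policy(s) for s in states])  # query the VCMCA model
    rf = RandomForestClassifier(n_estimators=1000,
                                max_leaf_nodes=max_leaf_nodes, n_jobs=-1)
    rf.fit(states, actions)
    return rf, rf.score(states, actions)  # agreement with the policy
```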

Fig 10. Random Forest model with 1000 trees approximated the proposed algorithm on the same set of samples.

The percent accuracy in reproducing the proposed model in the default configuration, as a function of the tree's depth and the number of leaf nodes, is shown as mean ± standard deviation over the decision trees. Namely, the error bars represent one standard deviation. (a) Decision tree's depth. (b) Decision tree's leaf node count.

https://doi.org/10.1371/journal.pone.0270318.g010

Unsurprisingly, as more leaves, as well as more depth, are available to the DTs, the better the RF model approximates the VCMCA model. However, after 320 leaf nodes (or a max depth of 8), there is no significant improvement in the model's performance over 93.37%, indicating that 6.63% of the cases are anomalies. Moreover, there is a sharp improvement between 70 and 80 leaf nodes. This sharp improvement is associated with the RF model's ability to differentiate between keeping the current vector of movement and rotating 15 degrees (e.g., the smallest turn possible in the default configuration of the model) to either the right or the left. These three actions seem to be very common, and the correct separation between them causes a large performance improvement. A slight improvement can be obtained by using advanced post-pruning methods such as the SAT-PP algorithm [72], but these do not significantly change the overall observed behavior.

5 Discussion

In this work, we propose a vision-based collective motion with collision avoidance (VCMCA) model in a two-dimensional space. In addition, we propose a collective evolution learning (CEL) algorithm that simulates the evolution process of an individual in the context of the group (population), aiming to optimize a local VCMCA task. Individuals that perform better in the task at each generation contribute to the collective knowledge, from which the desired multi-objective collective behavior emerges.

We used the VCMCA model obtained from the CEL algorithm for the case of locusts, showing that multi-objective swarm behavior, such as CM with CA, emerges from the optimization of the individual-level objective. These results are presented in Fig 4 and Table 2. Moreover, Figs 7–9 show that the results are robust over the population's density, the agents' shape, and the sensing radius, respectively. In particular, disc-like agents with vision-based input obtain better results than agents with a more complex shape, reproducing the results of [44].

In addition, by approximating the obtained VCMCA model using explainable machine-learning-based models, we show that small rule-based models are not able to capture the complexity of the VCMCA model, and that several hundred rule combinations are required to capture only 93.3% of the VCMCA model's behavior. This model size is commonly considered beyond the ability of the average person, or even a domain expert, to explain [73]. This outcome suggests that the proposed VCMCA model obtained by CEL is not explainable even when approximated with the RF (or the DT) model, agreeing with similar results reported by [74]. Thus, while one can point to the "reason" the RF model makes each decision, this outcome takes the form of a majority vote among multiple boolean satisfiability problems (on the DT level), making it impractical to analyze or to extract generic rules. Nevertheless, while the proposed model is not explainable on the single-decision level, the overall behavior agrees with known biological behaviors [48].

These results agree with previous outcomes proposed by [46], which show that agents who learned secondary goals, such as keeping contact with neighbors, were able to maintain CM. These results raise the question of whether CM or CA is the primary objective of the individual, as assumed in this work, or whether other biological objectives, such as reducing the risk of predation or increasing the probability of finding resources, are the source of these behaviors. In particular, one can evaluate the performance of CEL for ζ = 0 and ζ = 1, to conclude the influence of each objective on the overall swarm behavior. In a similar manner, evaluating CEL's performance for a topological-based rather than a metric-based approach, which is believed to better represent multiple scenarios in nature, can shed light on the influence of the flocking mechanism on collective learning. In particular, extending the proposed model to handle three-dimensional geometry would make the model more realistic. This direction can be a promising future work.

Since the proposed VCMCA model extends the model proposed by Vicsek et al. [16], a comparison between the two is expected. Nonetheless, such a comparison is infeasible, since Vicsek's model, as well as other similar models, uses different types of input data for the agents. Indeed, one would need to modify the input of these models (or algorithms) to match our own vision-based input to make the comparison legitimate. However, by doing so, the examined model would have to be altered as well to handle the new kind of input. Hence, the altered model cannot be assumed to have the same behavior as the original model.

Moreover, in this work, we tackle a swarm task that assumes a group of identical agents. However, in nature, agents in a population differ in their size, capabilities, and even action-making processes. This diversity may result in more behaviors among the agents, such as picking a leader, introducing new and complex challenges for the model.

These questions and many others can be addressed by introducing modifications to the VCMCA model and CEL algorithm proposed in this work. As such, we believe that the proposed approach can be used as a computational background to investigate more biologically accurate representations of swarm dynamics.

6 Appendix

6.1 The collective evolution learning algorithm time complexity

Analyzing the time and memory complexity of the proposed Collective Evolution Learning (CEL) algorithm is challenging since it uses multiple levels of stochastic processes that depend on each other. First, CEL uses the Q-learning model, which converges from a uniform distribution to the learned distribution in up to e^n steps, where n is the number of states in the Q-learner's table [75]. Moreover, for each evaluation of an action's score for the Q-learner, CEL uses the Genetic Algorithm (GA) [76]. In order to evaluate the time complexity of the GA, it can be reduced to a bit-string task in which the GA starts with a random population of genes, represented by strings of bits, and aims to make the entire population identical to a pre-defined target string of bits. The target bit string is the optimal solution represented in the same manner as the gene. As such, a binary string can be of length min_k(|S| < 2^k). While the target bit string is unavailable to us, we assume that the fitness function implicitly defines it. Thus, the GA is lower bounded by an exponent of the number of generations, g, and the genes' population size, τ. Thus, CEL's time complexity is O(e^{gτ}n).

6.2 Model parameters summary

A summary of the model parameters is provided in Table 3.

Table 3. A summary of the parameters used in the proposed VCMCA model with their notation and detailed description.

https://doi.org/10.1371/journal.pone.0270318.t003

6.3 Vision mechanism special cases

Following the definition of the vision mechanism described in Section 2.1, one can define three special cases for the computation of Eq (2) following the values obtained from Eq (3). These three cases are presented in Fig 11.

Fig 11. Schematic of angular area (ψ) calculation.

FA is the focal agent and χ_i are the vectors connecting the FA body-center and the corners of the neighbor. The angular area ψ, i.e., the angle between the two extreme vectors to the neighbor, is denoted in blue. The extreme neighbor's corners are the vertices subtending the largest possible ψ. This can be classified into three cases: (A) one of the diagonals subtends the largest angular area, or the two special cases (B, C), where the corners are the vertices on the neighbor's edge closest to the FA. The latter occur when the line-of-sight (the vector connecting the FA's and the neighbor's centers of mass) is exactly perpendicular or parallel to the neighbor's velocity, respectively.

https://doi.org/10.1371/journal.pone.0270318.g011

References

  1. 1. Chate H, Ginelli F, Peruani F, Raynaud F. Modeling collective motion: variations on the Vicsek model. The European Physical Journal B. 2008;64:451–456.
  2. 2. Baglietto G, Albano EV. Nature of the order-disorder transition in the Vicsek model for the collective motion of self-propelled particles. Physical Review E. 2009;80:050103. pmid:20364937
  3. 3. Deutsch A, Theraulaz G, Vicsek T. Collective motion in biological systems. Interface Focus. 2012;2(6):050103.
  4. 4. Warren WH. Collective Motion in Human Crowds. Current Directions in Psychological Science. 2018;27(4):232–240. pmid:30197472
  5. 5. Ariel G, Ayali A. Locust Collective Motion and Its Modeling. PLOS computational Biology. 2015;11(12):e1004522. pmid:26656851
  6. 6. Ballerini M, Cabibbo N, Candelier R, Cavagna A, Cisbani E, Giardina I, et al. Empirical investigation of starling flocks: a benchmark study in collective animal behaviour. Animal Behaviour. 2008;76(1):201–215.
  7. 7. Bajec IL, Heppner FH. Organized flight in birds. Animal Behaviour. 2009;78(4):777–789.
  8. 8. Seghers BH. Schooling Behavior in the Guppy (Poecilia reticulata): An Evolutionary Response to Predation. Evolution. 1974;28(3):486–489. pmid:28564850
  9. 9. Weihs D. The hydrodynamics of dolphin drafting. Journal of Biology. 2004;3. pmid:15132740
  10. 10. Cavagna A, Cimarelli A, Giardina I, Parisi G, Santagati R, Stefanini F, et al. From Empirical Data to Inter-Individual Interactions: Unveiling the Rules of Collective Animal Behavior. Mathematical Models and Methods in Applied Sciences. 2010;20:1491–1510.
  11. 11. Lukeman R, Li YX, Edelstein-Keshet L. Inferring individual rules from collective behavior. PNAS. 2010;107(28):12576–12580. pmid:20616032
  12. 12. Gregoire G, Chate H, Tu Y. Moving and staying together without a leader. Physica D. 2003;181:157–170.
  13. 13. Bando M, Hasebe K, Nakayama A, Shibata A, Sugiyama Y. Dynamical model of traffic congestion and numerical simulation. Physical Review E. 1995;51(2):1035–1042. pmid:9962746
  14. 14. Ebeling W, Schimansky-Geier L. Swarm dynamics, attractors and bifurcations of active Brownian motion. European Physical Journal Special Topics. 2008;157:17–31.
  15. 15. Edelstein-Keshet L. Mathematical models of swarming and social aggregation. In: Proceedings of the 2001 International Symposium on Nonlinear Theory and Its Applications, Miyagi, Japan. Citeseer; 2001. p. 1–7.
  16. 16. Vicsek T, Czirok A, Ben-Jacob E, Cohen I, Shochet O. Novel type of phase transition in a system of self-driven particles. Physical Review Letters. 1995;75(6):1226–1229. pmid:10060237
  17. 17. Czirok A, Vicsek T. Collective behavior of interacting self-propelled particles. Physica A. 2000;281:17–29.
  18. 18. Romanczuk P, Couzin ID, Schimansky-Geier L. Collective motion due to individual escape and pursuit response. Physical Review Letters. 2009;102(1).
  19. 19. Wood AJ, Ackland GJ. Evolving the selfish herd: emergence of distinct aggregating strategies in an individual-based model. Proceedings of Royal Society B. 2007;274:1637–1642.
  20. 20. Aoki I. A simulation study on the schooling mechanism in fish. Bulletin of the Japanese Society of Fisheries. 1982;48.
  21. 21. Reynolds CW. Flocks, herds and schools: A distributed behavioral model. In: Proceedings of the 14th annual conference on Computer graphics and interactive techniques; 1987. p. 25–34.
  22. 22. Couzin ID, Krause J, Franks NR, Levin SA. Effective leadership and decision-making in animal groups on the move. Nature. 2005;433(7025):513–516. pmid:15690039
  23. 23. Bhattacharya K, Vicsek T. Collective decision making in cohesive flocks. New Journal of Physics. 2010;12(9):093019.
  24. 24. Peruani F, Schimansky-Geier L, Baer M. Cluster dynamics and cluster size distributions in systems of self-propelled particles. The European Physical Journal Special Topics. 2010;191(1):173–185.
  25. 25. Gao J, Havlin S, Xu X, Stanley HE. Angle restriction enhances synchronization of self-propelled objects. Physical Review E. 2011;84(4):046115. pmid:22181235
  26. 26. Shang Y, Bouffanais R. Influence of the number of topologically interacting neighbors on swarm dynamics. Scientific reports. 2014;4(1):1–7. pmid:24567077
  27. 27. Camperi M, Cavagna A, Giardina I, Parisi G, Silvestri E. Spatially balanced topological interaction grants optimal cohesion in flocking models. Interface focus. 2012;2(6):715–725. pmid:24312725
  28. 28. Kumar V, De R. Efficient flocking: metric versus topological interactions. Royal Society open science. 2021;8(9):202158. pmid:34631117
  29. 29. Giardina I. Collective behavior in animal groups: theoretical models and empirical studies. HFSP Journal. 2008;2(4):205–219. pmid:19404431
  30. 30. Ballerini M, Cabibbo N, Candelier R, Cavagna A, Cisbani E, Giardina I, et al. Interaction ruling animal collective behavior depends on topological rather than metric distance: Evidence from a field study. PNAS. 2008;105(4):1232–1237. pmid:18227508
  31. 31. Sankey DWE, Storms RF, Musters RJ, Russell TW, Hemelrijk CK, Portugal SJ. Absence of “selfish herd” dynamics in bird flocks under threat. Current Biology. 2021;31:3192–3198. pmid:34089647
  32. 32. Anderson JM, Clegg TM, Veras LVMVQ, Holland KN. Insight into shark magnetic field perception from empirical observations. Scientific Reports. 2017;7:11042. pmid:28887553
  33. 33. Steiner G. Orientation and Sense of Smell in Animals. Universitas. 1959;3(1).
  34. 34. Heinze S, Homberg U. Linking the Input to the Output: New Sets of Neurons Complement the Polarization Vision Network in the Locust Central Complex. Journal of Neuroscience. 2009;29(15):4911–4921. pmid:19369560
  35. 35. Masland RH, Martin PR. The unsolved mystery of vision. Current Biology. 2007;17(15):RS577–RS582. pmid:17686423
  36. 36. O’Shea M, Williams JLD. The Anatomy and Output Connection of a Locust Visual Interneurone. J comp Physiol. 1974;91:257–266.
  37. 37. Zeil J, Boeddeker N, Hemmi JM. Vision and the organization of behaviour. Current Biology. 2008;18(8):R320–R323.
  38. 38. Serres JR, Ruffier F. Optic flow-based collision-free strategies: From insects to robots. Arthropod Structure & Development. 2017;46(5):703–717. pmid:28655645
  39. 39. Ros IG, Biewener AA. Pigeons (C. livia) follow their head during turning flight: Head stabilization underlies the visual control of flight. Frontiers in Neuroscience. 2017;11:655. pmid:29249929
  40. 40. Lemasson BH, Anderson JJ, Goodwin RA. Collective motion in animal groups from a neurobiological perspective: The adaptive benefits of dynamic sensory loads and selective attention. Journal of Theoretical Biology. 2009;261(4):501–510. pmid:19699212
  41. 41. Lemasson BH, Anderson JJ, Goodwin RA. Motion-guided attention promotes adaptive communications during social navigation. Proceedings of the Royal Society B: Biological Sciences. 2013;280(1754):20122003. pmid:23325772
  42. 42. Strandburg-Peshkin A, Twomey CR, Bode NWF, Kao AB, Katz Y, Ioannou CC, et al. Visual sensory networks and effective information transfer in animal groups. Current Biology. 2013;23(17):R709–R711. pmid:24028946
  43. 43. Collignon B, Séguret A, Halloy J. A stochastic vision-based model inspired by zebrafish collective behaviour in heterogeneous environments. Royal Society open science. 2016;3(1):150473. pmid:26909173
  44. 44. Bastien R, Romanczuk P. A model of collective behavior based purely on vision. Science advances. 2020;6(6). pmid:32076645
  45. 45. Qi J, Bai L, Xiao Y, Wei Y, Wu W. The emergence of collective obstacle avoidance based on a visual perception mechanism. Information Sciences. 2022;582:850–864.
  46. 46. Durve M, Peruani F, Celani A. Learning to flock through reinforcement. Physical Review E. 2020;102:012601. pmid:32794942
  47. 47. Young Z, La HM. Consensus, cooperative learning, and flocking for multiagent predator avoidance. International Journal of Advanced Robotic Systems. 2020. pmid:34819959
  48. 48. Lopez-Incera A, Ried K, Briegel HJ. Development of swarm behavior in artificial learning agents that adapt to different foraging environments. PLoS ONE. 2020;15(12):e0243628. pmid:33338066
  49. 49. Rosenthal SB, Twomey CR, Hartnett AT, Wu HS, Couzin ID. Revealing the hidden networks of interaction in mobile animal groups allows prediction of complex behavioral contagion. Proceedings of the National Academy of Sciences. 2015;112(15):4690–4695. pmid:25825752
  50. 50. Watkins CJCH, Dayan P. Q-Learning. Machine Learning. 1992;8:279–292.
  51. 51. Watkins CJCH, Dayan P. Technical Note: Q-Learning. Machine Learning. 1992; p. 279–292.
  52. 52. Mirjalili S. Genetic algorithm. In: Evolutionary algorithms and neural networks. Springer; 2019. p. 43–55.
  53. 53. Lazebnik T. Cell-Level Spatio-Temporal Model for a Bacillus Calmette–Guerin-Based Immunotherapy Treatment Protocol of Superficial Bladder Cancer. Cells. 2022;11(15). pmid:35954213
  54. 54. Routledge BR. Genetic Algorithm Learning To Choose and Use Information. Macroeconomic Dynamics. 2001;5(2):303–325.
  55. 55. Kumar M, Husain M, Upreti N, Gupta D. Genetic Algorithm: Review and Application. International Journal of Information Technology and Knowledge Management. 2010;2(2):451–454.
  56. 56. Lazebnik T, Somech A, Weinberg AI. SubStrat: A Subset-Based Optimization Strategy for Faster AutoML. Proceedings of the VLDB Endowment. 2022;16(4):772–780.
  57. 57. Chang Y, Ho T, Kaelbling L. All learning is Local: Multi-agent Learning in Global Reward Games. In: Thrun S, Saul L, Scholkopf B, editors. Advances in Neural Information Processing Systems. vol. 16. MIT Press; 2003.
  58. 58. Zang B, Huang R, Wang L, Chen J, Tian F, Wei X. An Improved KNN Algorithm Based on Minority Class Distribution for Imbalanced Dataset. In: 2016 International Computer Symposium (ICS); 2016. p. 696–700.
  59. 59. Behrends E. Introduction to Markov Chains. Vieweg and Teubner Verlag; 2000.
  60. 60. Davis L. Applying adaptive algorithms to epistatic domains. Proceedings of the international joint conference on artificial intelligence. 1985; p. 162–164.
  61. 61. Bo ZW, Hua LZ, Yu ZG. Optimization of process route by genetic algorithms. Robotics and Computer-Integrated Manufacturing. 2006;22:180–188.
  62. 62. Rokach L. Decision forest: twenty years of research. Information Fusion. 2016;27:111–125.
  63. 63. Pocco ME, Cigliano MM, Foquet B, Lange CE, Nieves EL, Song H. Density-Dependent Phenotypic Plasticity in the South American Locust, Schistocerca cancellata (Orthoptera: Acrididae). Annals of the Entomological Society of America. 2019;112(5):458–472.
  64. 64. Showler AT. In: Desert Locust, Schistocerca gregaria Forskål (Orthoptera: Acrididae) Plagues. Springer Netherlands; 2008. p. 1181–1186.
  65. 65. Sanchez-Arroyo H. Largest Swarm. University of Florida book of insect records University of Florida, Gainesville. 1997;2325.
  66. 66. Edelstein-Keshet L, Watmough J, Grunbaum D. Do travelling band solutions describe cohesive swarms? An investigation for migratory locusts. Journal of mathematical biology. 1998;36(6):515–549.
  67. 67. Robertson RM, Johnson AG. Collision avoidance of flying locusts: steering torques and behaviour. Journal of Experimental Biology. 1993;183(1):35–60.
  68. 68. Rind FC, Santer RD, Wright GA. Arousal facilitates collision avoidance mediated by a looming sensitive visual neuron in a flying locust. Journal of neurophysiology. 2008;100(2):670–680. pmid:18509080
  69. 69. Reis EG, Pawson MG. Fish morphology and estimating selectivity by gillnets. Fisheries research. 1999;39(3):263–273.
  70. 70. Snodgrass RE. Principles of insect morphology. Cornell University Press; 2018.
  71. 71. Hume JP. Systematics, morphology, and ecological history of the Mascarene starlings (Aves: Sturnidae) with the description of a new genus and species from Mauritius. Zootaxa. 2014;3849(1):1–75. pmid:25112426
  72. 72. Lazebnik T, Bunimovich-Mendrazitsky S. Decision tree post-pruning without loss of accuracy using the SAT-PP algorithm with an empirical evaluation on clinical data. Data & Knowledge Engineering. 2023;145:102173.
  73. 73. Blanco-Justicia A, Domingo-Ferrer J. Machine learning explainability through comprehensible decision trees. In: International Cross-Domain Conference for Machine Learning and Knowledge Extraction. Springer; 2019. p. 15–26.
  74. 74. Rudin C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence. 2019; p. 206–215. pmid:35603010
  75. 75. Whitehead SD. A complexity analysis of cooperative mechanisms in reinforcement learning. Proceedings of the AAAI. 1991; p. 607–613.
  76. 76. Bo L, Rein L. Comparison of the Luus–Jaakola optimization procedure and the genetic algorithm. Engineering Optimization. 2005;37(4):381–396.