Participatory approaches and open data on venomous snakes: A neglected opportunity in the global snakebite crisis?

1 Institute of Global Health, Faculty of Medicine, University of Geneva, Geneva, Switzerland, 2 EnviroSPACE Lab, Institute for Environmental Sciences, University of Geneva, Geneva, Switzerland, 3 Division of Tropical and Humanitarian Medicine, University Hospitals of Geneva, Geneva, Switzerland, 4 Médecins Sans Frontières, Geneva, Switzerland, 5 Citizen Cyberlab, CERN-UNITAR-University of Geneva, Geneva, Switzerland

This study aims to build the first global observational map of MIVS based on crowdsourcing existing snake observations from a CS project and these two open-access biodiversity platforms, GBIF and VertNet. With over 715 million and 50 million observations respectively, these are massive platforms that gather global information on animals [10]. The study also aims to identify data gaps in currently available online data for MIVS distribution but does not intend to add new data on snakebite. More widely, the goal of this study is to highlight interest in and discuss the limitations of participatory approaches and open data in the context of MIVS ecology and their application for more specific public health questions around snakebite.

Open global datasets on MIVS
We created a project titled "Medically Important Venomous Snakes" on February 4, 2017, on iNaturalist (www.inaturalist.org), a major CS platform dedicated to biodiversity [11]. We gathered existing geolocalized MIVS observations from iNaturalist user accounts from February 4 to February 15, 2017. Only "research-grade" observations were considered to ensure quality of identifications. These are defined by iNaturalist as observations having the agreement of more than two-thirds of identifiers in the user community on their taxonomic identification. Additional geolocalized observations were collected from five CS projects available on GBIF. After removal of redundant observations, all observations were merged into a single dataset (citizen-generated observations). These are generated by a potentially heterogeneous community of volunteer citizens with or without scientific background and with differing motivations. A second dataset consisting of traditional scientific sources of geolocalized MIVS observations (scientist-generated observations) was built from GBIF and VertNet on February 26, 2017. These observations are generated by scientists in the field as part of their research projects. All observations from these three sources of data (iNaturalist, GBIF, and VertNet) were identified at the species level.
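The filtering and merging steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the text does not specify how redundant observations were detected, so the duplicate key used here (species name, rounded coordinates, and observation date) is hypothetical, as are the `Observation` record and its field names. The research-grade test simply encodes the "more than two-thirds of identifiers" rule quoted above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Observation:
    """Hypothetical record layout; field names are assumptions."""
    source: str          # e.g. "iNaturalist" or "GBIF"
    species: str
    latitude: float
    longitude: float
    observed_on: str     # ISO date string
    agreeing_ids: int = 0
    total_ids: int = 0


def is_research_grade(obs):
    """True when more than two-thirds of identifiers agree."""
    # Integer comparison avoids floating-point issues at exactly 2/3.
    return obs.total_ids > 0 and 3 * obs.agreeing_ids > 2 * obs.total_ids


def dedup_key(obs):
    """Assumed definition of a redundant observation (not from the paper)."""
    return (
        obs.species.strip().lower(),
        round(obs.latitude, 5),
        round(obs.longitude, 5),
        obs.observed_on,
    )


def merge_observations(*batches):
    """Merge batches into one list, keeping the first copy of each duplicate."""
    seen = set()
    merged = []
    for batch in batches:
        for obs in batch:
            key = dedup_key(obs)
            if key not in seen:
                seen.add(key)
                merged.append(obs)
    return merged
```

In such a sketch, iNaturalist records would be passed through `is_research_grade` first, and the surviving records merged with the GBIF citizen science records through `merge_observations`.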

Data analysis
A total of 9,113 citizen-generated and 70,697 scientist-generated observations were statistically and spatially analysed using STATA 14 and QGIS 2.18.2. Descriptive statistics for Global Burden of Disease (GBD) regions were obtained for both types of observations. Estimates of envenoming and mortality were obtained from Kasturiratne et al. (2008) [12] to calculate snakebite-induced mortality rate per GBD region, and a linear regression analysis was performed between the log-transformed mortality rate and the number of observations. To infer the spatial congruence between both types of observations, we performed a linear regression analysis between the log-transformed numbers of citizen-generated and scientist-generated observations in the United States of America for the period 1990-2017. Moreover, distribution patterns of the four most frequently observed MIVS (Agkistrodon contortrix, A. piscivorus, Crotalus atrox, and C. oreganus) in our citizen-generated observations were compared to their respective species range maps for the US. The time frame of these observations was 1990-2017, and the species range maps were provided by the International Union for Conservation of Nature (IUCN). A points-in-polygon analysis was carried out for each species to determine the percentage of observations falling within range. A detailed version of the methodology is provided as supporting information (S1 Text).
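To make the points-in-polygon step concrete, the following is a minimal sketch of the underlying test using the standard ray-casting technique. It is not the authors' QGIS workflow: it assumes planar longitude/latitude coordinates and a single simple polygon without holes, whereas real IUCN range maps are typically multipart polygons. The function names are hypothetical.

```python
def point_in_polygon(lon, lat, polygon):
    """Ray-casting test: is (lon, lat) inside the polygon?

    `polygon` is a list of (lon, lat) vertices of a simple ring.
    Points lying exactly on an edge are not handled specially.
    """
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Does this edge straddle the horizontal line through the point?
        if (y1 > lat) != (y2 > lat):
            # Longitude at which the edge crosses that line.
            x_cross = x1 + (lat - y1) * (x2 - x1) / (y2 - y1)
            if lon < x_cross:
                inside = not inside
    return inside


def percent_within_range(points, polygon):
    """Percentage of (lon, lat) points that fall inside the polygon."""
    if not points:
        raise ValueError("no observations supplied")
    hits = sum(point_in_polygon(lon, lat, polygon) for lon, lat in points)
    return 100.0 * hits / len(points)
```

For real range maps, a GIS library that handles multipolygons, holes, and coordinate reference systems would be the more appropriate choice; the sketch only shows what "percentage of observations falling within range" computes.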

Results
Citizen-generated and scientist-generated observations included respectively 55.7% and 80.8% of species listed by WHO as MIVS and covered 96 and 137 countries from all GBD regions of the world (Table 1). However, the global distribution of observations was strongly biased, with high-income North America, particularly the US, concentrating more than 79% (n = 7,207) of all citizen-generated observations and more than 34% (n = 24,134) of all scientist-generated observations (Fig 1 and Table 2). High-income North America had the highest overlap between citizen-generated and scientist-generated observations, and both groups coincided in 64% of all the species observed for this region (Table 2). The log-transformed number of citizen-generated observations was positively correlated with the log-transformed number of scientist-generated observations across the US (p < 0.01, r = 0.177-0.571, n = 79-466). The most frequently observed species and genus coincided between the two types of observations in 10 GBD regions (Table 3).
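The log-log relationship reported above can be illustrated with a short ordinary least squares sketch in plain Python. Several choices here are assumptions rather than the authors' STATA procedure: the spatial unit over which observations are counted is not given in this section, units with a zero count in either dataset are dropped because the logarithm is undefined there, the natural logarithm is used (the correlation coefficient does not depend on the base), and scientist-generated counts are taken as the explanatory variable. The function name is hypothetical.

```python
import math


def log_log_regression(citizen_counts, scientist_counts):
    """Fit log(citizen) = intercept + slope * log(scientist).

    Returns (slope, intercept, r), where r is the Pearson correlation
    of the log-transformed counts. Pairs with a zero count are dropped
    (an assumption of this sketch).
    """
    pairs = [
        (math.log(s), math.log(c))
        for c, s in zip(citizen_counts, scientist_counts)
        if c > 0 and s > 0
    ]
    n = len(pairs)
    if n < 3:
        raise ValueError("need at least three units with nonzero counts")

    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    syy = sum((y - mean_y) ** 2 for _, y in pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    if sxx == 0 or syy == 0:
        raise ValueError("counts are constant; correlation is undefined")

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r
```

A significance test for r, as reported in the text, would need an additional step (for example a t-test on the slope) that is omitted here.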

Participatory approaches and open data: Opportunities and challenges
We gathered an unprecedented set of 79,810 georeferenced MIVS observations from all GBD regions of the world and built the first global observational map of MIVS that highlights the contribution of volunteer citizens, mobile technologies, and open participatory platforms for a rapid collection and public sharing of snake data. We found a severe geographical bias, with high-income North America concentrating most MIVS observations but very low snakebite mortality (<10 deaths/year), while GBD regions with the highest snakebite mortality (>1,000 deaths/year) represented only 5% of observations [12]. Our study provides insights on the quality of citizen-generated observations and crowdsourced identifications, opening innovative opportunities for contributions by citizens and collaborations with experts to study snakes in the context of the global snakebite crisis. Snakes attract a global community of citizens and snake enthusiasts instantly and continuously sharing experiences and observations on open platforms such as iNaturalist. These observations are collected from wild and/or urban habitats worldwide, offering large volumes of data to complement the efforts of experts, whose data collection is bound to specific regions, species, and/or periods of the year depending on their research objectives [8]. We found that citizen-generated and scientist-generated observations correlated for the US and that 64% of them identified the same MIVS species. Besides, 97.3% of citizen-generated observations gathered from iNaturalist for C. atrox, C. oreganus, A. contortrix, and A. piscivorus were within their known species geographical range. More widely, the most frequently observed species and genera coincided between the two types of observations for 10 GBD regions. This highlights the quantitative and qualitative value of citizen-generated observations. 
Although citizen-generated observations might be expected to carry a reporting bias toward large, photogenic species, they also include smaller and more cryptic MIVS species (e.g., Echis omanensis and Atheris squamigera), which seems to indicate that our iNaturalist community is interested in snake biodiversity in its widest sense. For instance, on September 1, 2017, the top five users with the most species on our iNaturalist project had each on average contributed sightings of 20 different MIVS species.
Although citizen participation has been used in the field of emerging zoonoses [7], its potential remains largely unexploited for snake ecology and its public health implications. The growing mobile technology markets in emerging and developing countries [13] open new opportunities for innovative and dynamic data collection and analysis based on citizen participation to improve our understanding of snake ecology and snakebite eco-epidemiology, particularly with rapid anthropogenic changes (e.g., climate change and urbanization [5,8]).
The strong geographical bias toward North America in our citizen-generated observations matches previously observed gaps in snakebite research [14]. This could be explained in part by the American origin and development of the CS movement in the 1990s [15,16]. The democratizing and transparent culture of CS could clash with many sociocultural and political systems [7] and limit its penetration in certain regions of the world. iNaturalist was launched in California in 2008, and despite its massive and global growth for some taxa (e.g., birds), it still remains undiscovered by many herpetology enthusiasts in emerging and developing countries. On the other hand, the gaps in scientist-generated observations from GBIF and VertNet are also severe and could be due to lack of funding, political and logistical challenges, and/or the risks of working with dangerous snakes in the poorest areas of the world. However, scientist-generated observations from GBIF and VertNet suffer less from the geographical bias toward high-income North America compared to citizen-generated observations. This can be explained by the lower temporal bias in scientist-generated observations from GBIF and VertNet due to their much longer time span (the period 1700-2016) [17] and to the great diversity of international partners around the world. Additionally, it is important to evaluate this global distribution map of MIVS species critically, as it might be biased by the unavailability of web-based technologies for CS activities in the poorest regions of the world or by field stations for research institutions (leading to false conclusions on the presence and abundance of certain MIVS species and subsequent risk of snakebite for local populations). Another similar challenge for open-access biodiversity platforms such as GBIF and VertNet is the Wallacean shortfall [17].
Due to incomplete records of compiled observations from different surveys or improper coverage of certain species along spatiotemporal and environmental dimensions [17], the geographical distribution of species is sometimes flawed by many gaps, resulting in a poor understanding of their ranges [18]. Despite these challenges, CS and open biodiversity platforms could offer a valuable source of information and expertise to be further exploited innovatively to better understand MIVS distribution in a more dynamic and local fashion. For example, the creation of a specific platform with targeted objectives dedicated to collect new contributions on venomous snake observations from enthusiasts could help in understanding their distribution more extensively. An interesting CS model is the one developed by the Cornell Lab of Ornithology, whose CS projects gather millions of species observations annually [19]. Such a CS project would require not only an interdisciplinary team of educators, evaluators, scientists, and technologists to monitor, promote, and safeguard the integrity of the project [19] but also ethicists to ensure that participants are not exposed to unintentional health risks and that their personal data are protected. Moreover, previously field-tested protocols and formally designed data collection forms would be required for data homogeneity and completeness of snake observations, minimization of bias during data collection, and an easier analysis of collected data from different regions of the world [19]. Additionally, CS projects could create links between volunteer citizens and experts and between society and universities, which would in turn ensure data quality of observations made by volunteer citizens and broaden the geographic scope of university research projects on venomous snake species. 
Furthermore, this CS project could gather data on snakebite-induced morbidity and loss of productivity in a relatively easy and cost-effective way by having users collect information from afflicted communities. Such data would otherwise be difficult and expensive to obtain, since the only reliable method would be epidemiological community-based studies [20]. It is also worth noting that our iNaturalist project received quality contributions from volunteer herpetologists, university professors, and professional wildlife photographers. Additionally, the iNaturalist platform anticipated the time delay problem with the crowdsourced identification of species by its community and is now integrating machine learning techniques, such as computer vision, to provide automated higher-quality taxonomic identification of species uploaded on the platform [21].
CS initiatives could be powerful tools for educating and raising awareness on snakebite [7,8] but would require very careful consideration to minimize risks and sociocultural rejection. Public participation could be particularly encouraged in regions where data is severely lacking. This could be done by creating partnerships with local universities, student associations interested in conservation biology or herpetology (e.g., zoology groups), local conservation groups, and snake catchers or rescuers. It could also be done through public service announcements, local workshops [19], and educational campaigns in communities. One successful example is the ongoing "Big 4 Mapping" project, which was launched in 2017 to map the distribution of four MIVS species (Daboia russelii, Naja naja, Bungarus caeruleus, and E. carinatus) responsible for over 90% of snakebite-induced deaths in India by employing a network of mainly snake catchers and/or rescuers [22]. In just a few months, it mapped over 1,600 snake observations in India [22]. This distribution map could then serve as a guide for effective antivenom distribution in snakebite hyperendemic areas [22]. Additionally, community leaders, traditional healers, village headmen, and local public health institutions could also be involved directly, such as in the recruitment of possible participants. The CS project would also benefit from adapting to the literacy level of its users, particularly when poor and vulnerable communities have low literacy rates. For example, 38% of the African population is nonliterate [23], and the use of standard data-collecting forms on smartphones might prove to be challenging in this situation. 
One successful CS project, Cybertracker (www.cybertracker.org), allows literate and nonliterate users to collect complex and rich geolocalized data on species distribution and behavior [24] and even provides recommendations and safety measures when tracking dangerous animals (including venomous snake species [25]). One interesting result of its application was its role as an early surveillance system for Ebola outbreaks in wildlife from 2001-2003, alerting authorities before these outbreaks constituted a public health threat in the Republic of Congo and Gabon [24]. Similar approaches could potentially be considered in the context of snake detection and identification as well as snakebite prevention and management. Participants could also be trained in safety and first aid techniques against snakebite, opening the door to collaboration with academia and health institutions (e.g., hospitals). Training programs with certified courses have been shown to increase adherence in previous CS projects [19].
In the context of snakebite management, taking a photo of the biting snake could be critical for a subsequent correct administration of antivenom or other lifesaving care. In many regions of the world, the victim or bystanders usually try to kill the snake and carry it to the clinician for identification; taking a photo instead could reduce the risk of new bites. However, clinicians are not usually trained in herpetology, and their capacity to identify the snake is generally very limited. Urgent crowdsourced expertise with potentially massive global contributions (e.g., via the iNaturalist community or others) and/or machine learning systems based on computer vision offer potentially interesting decision support tools for clinicians to identify snakes via photos. Observations and associated photos such as those gathered in this study are valuable material to train computers in snake identification. Although promising molecular diagnostic tools could make a substantial contribution in reducing snakebite-induced morbidity and mortality in afflicted regions, they are still not fit to be used as point-of-care testing devices [26]. A pilot study done by Sharma et al. [26] has shown that in resource-limited settings, the time delay for seeking help by the bitten person or the use of inappropriate first-aid techniques (e.g., tourniquet) reduces the sensitivity of these molecular diagnostic techniques. Digital innovations such as crowdsourcing and machine learning (e.g., computer vision) could be complementary to molecular diagnostic tools. In some cases, digital approaches could be the only solution for resource-limited health centers, which lack the financial and technical resources to implement and sustain molecular diagnostics. Nonetheless, these approaches would only effectively reduce the snakebite burden in afflicted regions if the current antivenom crisis is addressed.
For example, pharmaceutical companies have no market incentive for antivenom production, and Africa is running out of one of the most effective antivenoms against vipers and mambas [2,27]. The current antivenom shortage and inadequate distribution of antivenom in different parts of the world is costing many lives. Urgent solutions with possible innovative incentives and mechanisms for accelerated research and implementation are needed.
The renewed momentum at the international level, through reintroduction of snakebite in WHO's NTD list at WHA 70 in Geneva, should help raise funds to tackle snakebite in the most afflicted regions and promote research in the field of antivenomics by designing improved polyspecific antivenoms [28] effective against the venoms of African and Asian MIVS. Such antivenoms could partially help in tackling the current antivenom crisis.
Alternative and complementary tools to antivenom should also be explored using scientific research and innovation (e.g., digital technologies) to tackle snakebite at different levels such as prevention and in different geographical contexts. Snakebite is not only a humanitarian crisis but also a data crisis. This study illustrates for the first time the potential of participatory approaches and citizen-generated data in this context. This innovative social and digital approach could contribute to data collection on snake ecology (e.g., geographical distribution of snakes) and subsequent snakebite epidemiology (e.g., hotspots of snakebite risk). For example, finer geographical maps of MIVS distribution could help to make public health interventions more specific by distributing the current limited supplies of antivenom where most needed. Careful consideration should also be given to the anticipation and prevention of potential harm for participants in this approach through a comprehensive strategy involving key stakeholders (e.g., members of afflicted communities, academia, public health entities, etc.) and adapted to the local socio-cultural context and health systems.