Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A game-theoretic analysis of Wikipedia’s peer production: The interplay between community’s governance and contributors’ interactions

  • Santhanakrishnan Anand ,

    Roles Conceptualization, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review & editing

    asanthan@nyit.edu

    Affiliation Department of Electrical and Computer Engineering, New York Institute of Technology, New York, NY, United States of America

  • Ofer Arazy,

    Roles Methodology, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Department of Information Systems, University of Haifa, Haifa, Israel

  • Narayan Mandayam,

    Roles Conceptualization, Funding acquisition, Project administration, Supervision

    Affiliation Wireless Information Networks Lab (WINLAB), Department of Electrical and Computer Engineering, Rutgers University, North Brunswick, New Jersey, United States of America

  • Oded Nov

    Roles Conceptualization, Funding acquisition, Investigation, Writing – original draft, Writing – review & editing

    Affiliation Department of Technology Management, New York University, Brooklyn, New York, United States of America

Abstract

Peer production, such as the collaborative authoring of Wikipedia articles, involves both cooperation and competition between contributors. Cooperatively, Wikipedia’s contributors attempt to create high-quality articles, and at the same time, they compete to align Wikipedia articles with their personal perspectives and “take ownership” of the article. This process is governed collectively by the community, which works to ensure the neutrality of the content. We study the interplay between individuals’ cooperation and competition, considering the community’s endeavor to ensure a neutral point of view (NPOV) on articles. We develop a two-level game-theoretic model: the first level models the interactions between individual contributors who seek both cooperative and competitive goals and the second level models governance of co-production as a Stackelberg (leader-follower) game between contributors and the communal neutrality-enforcing mechanisms. We present our model’s predictions regarding the relationship between contributors’ personal benefits of content ownership and their characteristics, namely their cooperative/competitive orientation and their activity profile (whether creators or curators of content). We validate the model’s prediction through an empirical analysis, by studying the interactions of 219,811 distinct contributors that co-produced 864 Wikipedia articles over a decade. The analysis and empirical results suggest that the factor that determines who ends up owning content is the ratio between one’s cooperative/competitive orientation (estimated based on whether a core or peripheral community member) and the contributor’s creator/curator activity profile (proxied through average edit size per sentence). Namely, under the governance mechanisms, the fractional content that is eventually owned by a contributor is higher for curators that have a competitive orientation. Although neutrality-seeking mechanisms are essential for ensuring that ownership is not concentrated within a small number of contributors, our findings suggest that the burden of excessive governance may deter contributors from participating, and thus indirectly curtail the peer production of high-quality articles.

1 Introduction

Over the past three decades, peer-produced online information goods such as open source software and Wikipedia have become extremely popular. Wikipedia recruited over 30 million volunteers to produce hundreds of millions of encyclopedic entries in 314 languages. As Wikipedia has become one of the most popular information sources on the web and the destination most internet users turn to when they seek information [1], the quality of its articles has been the topic of public debate. Wikipedia is based on wiki technology: a web-based collaborative authoring tool that allows contributors to add new content, append existing content, and delete or overwrite prior contributions [2]. Thus, Wikipedia articles evolve through the continuous additions, deletions, and shaping of existing content (rewriting, reorganizing, and integrating).

The success of online production communities, such as Wikipedia, has attracted much interest within the scholarly community, given that the organization of such communities seem to defy traditional organizational logic [37]. Broadly speaking, two key issues have been investigated. The first pertains to contributors’ characteristics and their interactions during the collaborative production [812]. Namely, given that most communities do not have formal procedures to select, train and assign contributors to a task, it is not clear how work effectively gets done and how high-quality goods are co-produced. This issue is particularly interesting since communities are often heterogeneous and members vary greatly in terms of their motivation, affiliation with the community and identification with its goals, and their activity profiles. The second key issue is community governance [1315]. Specifically, communities need to balance openness and freedom of action so as to allow contributors to self-select and self-organize, while controlling uncooperative behavior, such as manipulations and vandalism. In other words, there is a fundamental tension between emergent behavior and regulation in peer production [16]. Successful communities, thus, had to devise “light” governance mechanism [15] that would strike a fine balance between these two opposing objectives. Extensive research in the area over the last three decades has been able to shed light on many aspects of peer production; still, fundamental questions remain unclear, specifically regarding the way in which governance shapes individual-level interactions. Hence, the overarching objective of our study is to investigate the interplay between the community’s governance and contributors’ cooperation-competition dynamics in peer production (and specifically, in Wikipedia).

Large and complex socio-technical systems, such as Wikipedia and other large-scale peer production projects, are difficult to study in a holistic manner, and by and large, scholarly investigations have focused on a particular angle, for example contributors’ motivations [11, 17], coordination [9], or conflict management [8]. One particularly relevant approach for studying such complex systems is through modeling. A model, by definition, presents a simplified depiction of the complex reality, formalizing key salient features of the system, while ignoring other less critical aspects. Models have the potential to reveal deep insights about the phenomenon being investigated. Moreover, when the model-based predictions are validated with empirical data, credence is lent to the model’s results. In this study we employ game-theory to study the interplay between the community’s governance and contributors’ individual-level interactions.

Game theory, “the study of mathematical models of conflict and cooperation between intelligent rational decision-makers” [18], is well suited for studying competitive and cooperative dynamics and thus may reveal insights into peer production. Despite its demonstrated utility in studying cooperation and competition, the use of game theory in peer production research has been rather limited [19]. In this study, we employ game-theoretic modeling in order to study the interplay between the community’s governance and contributors’ individual-level interactions. More specifically, we develop a two-step leader-follower game-theoretic model [20, 21] that captures both cooperation-competition dynamics among the contributors to the article’s collective authorship (i.e., “follower”), as well as the community’s governance efforts to enforce neutrality (i.e., “leader”). We briefly detail the key elements of our model, beginning with the former: the dynamics of contributors’ interactions.

The “follower” level of our model assumes a heterogeneous community, where contributors are characterized by two important traits. First, we assume that each contributor combines self-interest (e.g., promoting one’s viewpoints and enhancing their own reputation [22, 23]) with others-oriented interests (e.g., helping the community co-create high-quality goods) [24]. The self-interest is linked to a competitive behavior and others-orientation to a more cooperative conduct [25]. A contributor is driven by a combination of cooperative and competitive goals, and we refer to this trait as the contributor’s cooperative/competitive orientation.

A second important characteristic that we model is contributors’ activity profile. Despite the freedom to select the particular ways in which one contributes (for example, in Wikipedia: adding new content or shaping others’ previous contributions), members of online production communities are typically characterized by highly stable activity patterns (aka “emergent roles”; [26]). Building on prior studies in the area [27, 28], we distinguish between the roles of a creator (in Wikipedia, mostly adding new content) and a curator (refining, shaping and reorganizing existing content), modeling each individual’s activity profile as some combination of creation and curation activities. It should be noted that in the context of Wikipedia, the creator/curator distinction applies to activity in the primary area (referred to as the Main namespace): co-producing encyclopedic entries; other areas (namespaces) within Wikipedia are dedicated to coordination work. For example, each article is coupled with a TalkPage where discussions regarding the article take place. Our model assumes that these two characteristics, i.e., contributors’ cooperative/competitive orientation and their creator/curator profile are stable traits (at least within the timeframe of our analysis).

We further presume that the contributor’s decision variable is the the amount of work to be done (i.e., number of content elements, e.g., sentences, to be edited), reflecting their individual motivation and availability. This decision is influenced by the utility function, i.e., the balance of costs and benefits. We assume that contributors derive both competitive benefits (namely, shaping a Wikipedia article to align with one’s personal viewpoint, thus coming to “own” portions of that article) and cooperative benefits (i.e., co-producing a comprehensive encyclopedic entry). Similarly, the costs of editing the focal article are influenced by one’s activity profile (creator/curator, and hence are applicable to both cooperative and competitive contributors), the amount of coordination work required (more applicable to cooperatively-oriented contributors) and the effort of handling the community’s governance (more applicable to competitively-oriented contributors). When many contributors participating in a collective authoring effort attempt to promote their reputation and perspectives, competition between them is inevitable [25].

Contributors’ dynamics does not occur in a void. The interactions among the contributors are influenced by the community’s governance (i.e., the “leader” in the two-stage game-theoretic model). Wikipedia’s governance mechanisms have been investigated in prior studies, e.g., [1315]. These studies describe Wikipedia’s extensive mechanisms for ensuring that the co-produced content is of high quality, including instruments such as norms, policies, as well as technical tools for facilitating collaboration, resolving conflicts and fighting vandalism [29]. In particular, Wikipedia emphasizes objectivity (Neutral Point of View; NPOV) as a central pillar of the community (http://en.wikipedia.org/wiki/Wikipedia:FivePillars) and lists “susceptibility to editorial and systemic bias” as one of the key aspects of its quality assurance work [25, 30, 31], http://en.wikipedia.org/wiki/ReliabilityofWikipedia. Of particular relevance to our investigation are studies of governance mechanisms that are intended to ensure neutrality by countering attempts toward bias in Wikipedia articles [32, 33].

To recap, contributors are characterized by their cooperative/competitive orientation and by their typical activity pattern (creators or curators of content). A contributor’s decision regarding the amount of content to be edited is based on that individual’s utility function, and is also influenced by the strategies used by other contributors, which in turn are affected by adherence to governance (i.e., the principles of neutrality). Contributors’ optimal strategies are calculated using the Nash equilibrium of the game.

To lend greater credence to our model’s predictions, we validated the findings with empirical data. We performed a computationally-heavy empirical analysis of 219811 distinct contributors co-producing 864 Wikipedia articles (articles’ lifespan ranges from 129 to 4078 days; mean = 2681 days). We performed the empirical validation for the entire period covered by our data set (2001–2012), as well as for each yearly interval independently. We estimated contributors’ cooperative/competitive orientation based on their role in the community (those closer to the community’s core are assumed to be more cooperative) and estimated their position on the creator-curator continuum based on the average size of their edits (i.e., larger edits within a sentence are typical of content creators) [3437].

To verify the validity of the “creator”/“curator” labels, we performed an analysis on the dataset from [38], calculating each contributor’s average edit size and profiling the types of activities that she makes. We found that: (a) the larger the relative proportion of the “Add Content” and “Structural Changes” edit types in a contributor’s profile, the larger is the contributor’s average edit size; and (b) the larger the proportion of “Corrections” and “Add References” edit types, the smaller is the contributor’s average edit size. Together, these patterns suggest that contributors with a large edit size are the “creators” who structure and article and add much of the content, whereas those making small edit size are “curators” that mostly engage in refining existing content.

We used a sentence as our basic content unit, and our estimate of the cost of editing a sentence, i.e., the effort contributors invest, is approximated as a linear function of contributors’ average edit size per sentence [39, 40]. We estimated contributors’ competitive benefit based on the portion of the article that they own, employing the “content ownership” algorithm from a prior study [38], and their cooperative benefit from the comprehensiveness of the co-produced article (estimated through the article’s length).

The key results of our game-theoretic analysis are (a) the factors that determine who ends up owning content are contributor’s characteristics: their cooperative/competitive orientation and their creator/curator activity profile, and (b) governance levels should be limited, so as not to overburden contributors.

2 Background and related work

This section reviews prior work on Wikipedia, addressing the following issues: (a) contributors’ cooperation and competition (Section 2.1); (b) the activities involved in Wikipedia’s collaborative authoring process (Section 2.2); and (c) community-based governance of Wikipedia’s peer production (Section 2.3).

2.1 Cooperation and competition in Wikipedia’s content production

Scholarly accounts of peer production initiatives such as Wikipedia tend to emphasize the collaborative aspects [3, 8, 10, 41, 42]. Namely, contributors collaborate in the creation of encyclopedic entries, collectively attempting to ensure the quality of these articles. Cooperatively, participants in Wikipedia’s collaborative-authoring process attempt to create encyclopedic entries that cover the article’s topic in a comprehensive manner, are factually accurate, as well as are: objective (i.e. unbiased), timely and relevant [43, 44]. Particularly, comprehensiveness is considered a key indicator of information quality [45], for example, the number of words or sentences is a very good predictor of article quality [46, 47]. Hence, cooperative Wikipedia members contribute their knowledge and share content, with the goal of collectively covering the focal topic in a comprehensive manner. Cooperation also entails the investment of additional effort beyond the direct work of peer-producing articles. Namely, collaborative contributors often take on additional roles and responsibilities, such as the coordination of work and in the enforcement of policies. The cooperative goals are well understood, and thus are only described here in brief. However, participants are also driven by competitive goals [48] and explaining these goals requires some elaboration.

Given that Wikipedia has become the primary entry point for those seeking information on the web [1], the impact, and thus the associated benefit, of shaping an article’s contents can be high [49], as demonstrated by commercially sponsored attempts to bias Wikipedia’s content [22, 23]. Recent news stories revealed that commercial entities-often through agents, such as PR agencies or political operatives-have been attempting to manipulate Wikipedia articles’ content [23, 50, 51]. Such manipulations are intended to portray the interested parties favorably, and have been carried out over years, often employing sophisticated methods (e.g., using multiple accounts, building trust in the community and gaining access to special privileges) [52]. Wikipedia fights these manipulations through social, technical and legal means [22].

Additionally, competition over content can also originate from well-intentioned contributors. Prior studies have looked at conflicts of opinion between contributors [8], focusing on conflict resolution mechanisms and the impact of conflicts on the article’s quality. More recently, several research studies explicitly discussed contributors’ attempt to influence articles. For example, [5355] describe various forms of biases and violations of Wikipedia’s NPOV policy, and propose automatic methods for detecting these biases. Moreover, recent studies have demonstrated empirically that Wikipedia contributors often shape an article in a way that is aligned with their worldview, so that the articles often evolve through the “pulling” of the article towards the direction that represents the editor’s perspective. [56] stated that “contributors to a focal artifact manipulate the article according to their particular viewpoints, thus pulling the artifact’s trajectory in different directions” (p. 2014) and then suggested that such “pulls” may possibly stem from “⋯ a more deliberate attempt to shape the artifact according to the contributor’s personal vision.” (p. 2016). Together, these studies demonstrate that competition and struggle over the views expressed in articles are salient features of Wikipedia’s co-production.

Psychological ownership theory [57] may provide a potential explanation for why contributors compete to own pieces of an article. According to this theoretical framework, actors that occupy a shared social space can easily form an ownership feeling over a target (in this case, a Wikipedia article) if they invest much time or energy on it, are familiar with it, or have control over it. This is particularly pertinent in cases where the individual is the originator of the target (or portion of that target), as in the case of contributing content to a Wikipedia article. A contributor of content to the article will feel the content is her personal psychological property, and subsequently will be unwilling to either lose control of or share the ownership with others [58].

Territoriality theory [59] may further explain how psychological ownership could result in competition over an article’s contents. According to this theory, the stronger an individual’s sense psychological ownership of an object, the greater the likelihood they will treat that object as their territory. Territoriality behavior is likely to emerge as community members work together to accomplish a common goal. It signals to others one’s stake in a territory or an object. Territoriality may be expressed through the defense of one’s turf from perceived invasions [59]. From such a viewpoint, anyone experiencing a strong sense of ownership of personally contributed content may exhibit territorial behavior over that section of the article, which subsequently leads one to defend the “owned” content from others’ attempts to change or overwrite it [6063]. Contributors may also try to protect a particular version of the article and exhibit territorial behavior even if they did not edit the article, e.g., by protecting the content of like-minded contributors. Kane et al [64] reported that roughly two-thirds of the collaborative-authoring patterns in the Wikipedia article they had investigated entailed “defensive filtering”. They explained that in the defensive filtering pattern, “The focus of production was on protecting the content that has been created by the co-production community” and that the nature of the interaction was defensive and combative. Contributors may become committed to stabilizing knowledge already co-produced. For example, when assessing whether a change to the article fits with the article’s current articulation, they may try to ensure that it does not “violate organizational and content decisions already made” [64]. [65] describe power dynamics in Wikipedia whereby certain contributors may attempt to dominate an article and block alternative viewpoints. [33] described such a domination dynamic that was related to gender bias, where core contributors wrestled against peripheral contributors to shape articles according to their viewpoints.

While territoriality may be linked to positive behaviors, such as increased motivation and commitment and thus could help in the organization of work, it raises concerns regarding competitive tensions and the potential for biases in the co-produced articles. For example, [60] showed that territorial behavior is directly linked to turf wars (namely, the revert of a Wikipedia article to its previous version). Furthermore, territorial behavior may deter members from participating [63].

The scholarly literature in the field shows that people choose to play different functional roles within an online community [66], broadly organized into a core-periphery structure [67], where the small core includes highly-committed individuals who take on various administrative duties and work to serve the community (i.e., cooperative orientation), and a much larger periphery that includes less-committed contributors who often are interested in promoting personal interests (i.e., competitive orientation) [24]. This community structure has been shown to be mostly consistent, as contributors keep their core/periphery positions over prolonged periods [66], although over extended time, some contributors may gradually transition from the community’s periphery to its core [68].

2.2 Contributors’ activity profiles: Creators and curators

To understand contributors’ competition over content ownership, it is important that we explicate what it means for a contributor to “own” a unit of content, say a sentence. Ownership could be gained by either originating the content (i.e. adding substantive content) or alternatively by shaping existing content (reorganizing, completing missing pieces, adding references, etc.). For example, the content ownership algorithm that was employed by [38] assigns the ownership over a sentence to the contributor who has added/changed over 50% of the words in that sentence. A contributor to Wikipedia may engage in the creation of new content or in the curation of content that was originated by others.

Several studies have attempted to profile the activity patterns of Wikipedia’s contributors, and their empirical results are quite consistent [2628]. Broadly speaking, these studies have identified several prototypical activity patterns that fall into two primary categories: (a) creators of content, e.g., “substantive expert” in [28] or “quick-and-dirty editors” in [26] or “starters” in [27]; (b) curators of content, responsible for the shaping of content and copyediting, e.g. “fact-checker” and “copyeditor” in [28], “content shapers,” “layout shapers” and “copyeditors” in [26], and “content justifiers,” “cleaners” and “copyeditors” in [27]. Interestingly, these studies have also identified a hybrid profile that is engaged in both creation and the curation of content, e.g. “all-round-contributors” in both [26, 27] This suggests that creation and curation are not exclusive, such that a contributor’s activity profile may be positioned anywhere on the creation-curation continuum.

It is worth mentioning that studies of Wikipedia have also identified activity profiles that do not directly influence the content of Wikipedia articles. For example, vandals and vandal-fighters (e.g., “watchdogs”), which represent only a small portion (∼ 8%) of Wikipedia’s editing activity [56]. Vandalism does not have much influence on the content that is made available to readers, as it is very quickly corrected such that it doesn’t have a lasting effect on Wikipedia articles [69, 70]. Hence, the treatment of vandalism is left outside the scope of the current study. In addition, some Wikipedians are engaged in administrative tasks and in the governance of the community, being active primarily in auxiliary areas of Wikipedia, and this activity does not directly affect the main article wiki pages. Hence, these contributor profiles are less relevant for the current study.

2.3 Governance of Wikipedia’s content production: Neutral Point of View (NPOV) policy

Distributed collaborative communities often employ policies and community norms in order to prevent territorial behavior. The Wikipedia community explicitly discourages contributors from taking ownership of their work. Wikipedia founder, Jimmy Wales, speaks openly against the notion of ownership. For example, see a talk in the 21st Chaos Communication Congress in December 2004 (talk titled “Wikipedia Sociographics”); http://ccc.de/congress/2004/fahrplan/event/59.en.html. Also, the “Ownership of Content” policy (see https://en.wikipedia.org/wiki/Wikipedia:Ownership_of_content retrieved February 8, 2021) specifically points out that being a primary contributor is not a reason for asserting possession of an article:

All Wikipedia content − articles, categories, templates, and other types of pages − are edited collaboratively. No one [emphasis in source], no matter how skilled, or how high standing in the community, has the right to act as though they are the owner [originally emphasized in italics] of a particular page. ⋯ Some contributors feel possessive about material they have contributed to Wikipedia. ⋯. Believing that an article has an owner of this sort is a common mistake people make on Wikipedia

How are socio-technical governance mechanisms used to mitigate the effects of ownership and territoriality? Large-scale social production systems, such as Wikipedia, require governance mechanisms in order to direct the integration of dispersed knowledge resources in the process of value creation. Without such mechanisms, coordinating the work of assessing, selecting, shaping, integrating and oftentimes rejecting contributors’ postings would be impossible [13]. Wikipedia employs a community-based governance model that is based on egalitarian principles, rather than on formal contracts and hierarchies [56]. Over the years, Wikipedia has developed an extensive set of mechanisms to ensure that the co-produced content is of high quality, including norms, policies, and technical tools for facilitating collaboration, resolving conflicts and fighting vandalism [14, 15, 29]. What is unique to Wikipedia governance model is that it is collective and dynamic. That is, the community organically develops policies and procedures, and these continuously undergo changes by community members [13]. Thus, governance is collective in the sense that it does not depend on any particular individual or a group of individuals within the system.

Central to Wikipedia’s governance is the NPOV principle [25, 3033], which promotes objectivity and acts to ensure that no one particular viewpoint dominates an article. NPOV is one of the five fundamental principles (or pillars) of Wikipedia. The principles (see https://en.wikipedia.org/wiki/Wikipedia:Five_pillars retrieved February 8, 2021) state that:

We strive for articles that document and explain major points of view, giving due weight with respect to their prominence in an impartial tone. We avoid advocacy and we characterize information and issues rather than debate them. In some areas there may be just one well-recognized point of view; in others, we describe multiple points of view, presenting each accurately and in context ⋯ Editors’ personal experiences, interpretations, or opinions do not belong.

Contributors to Wikipedia are requested to edit articles with a neutral perspective, representing views fairly, and trying to avoid bias. Wikipedia founder, Jimmy Wales, commented in a mailing list posting that “If a viewpoint is held by an extremely small (or vastly limited) minority, it does not belong in Wikipedia regardless of whether it is true or not and regardless of whether you can prove it or not.” (https://lists.wikimedia.org/pipermail/wikien-l/2003-September/006715.html; retrieved February 8, 2021). However, biases, the opposite of NPOV, may arise when subjective information is too difficult to verify, when an issue is too complex to be fully represented (whereby generating a consensus requires considerable effort and expertise), or in cases when contributors intentionally try to impose their own ideology in an attempt to influence readers’ viewpoints [25, 33, 56]. Enforcing NPOV has become a central concern for Wikipedians: the focus of many of the discussions found on articles’ coordination spaces (i.e. TalkPages) [8] and dedicated Internet Relay Chat (IRC) channels [25] is on whether particular section of an article reflects the NPOV principle. Editorial activity is often not distributed evenly within a Wikipedia article, such that a few highly active contributors are responsible for a large share of the activity. Thus, in enforcing neutrality, the community tries to ensure that the “ownership” of an article’s content is not concentrated in the hands of few contributors.

Governance is primarily intended to regulate counter-productive behavior. We suggest two potential mechanisms by which NPOV enforcement could affect the competitive dynamics. On the one hand, as intended, the efforts to ensure neutrality may serve to curtail excessive one-sided ownership of an article, and consequently directly impact contributors’ attempts to own article fractions. On the other hand, the adherence to norms, policies and procedures could deter participation in co-production, leading to lower participation levels, which, in turn, might influence the quality of the co-produced article.

3 Research question

Notwithstanding the value of prior research, the scholarly literature on peer production has not investigated the interplay between individuals’ cooperative/competitive dynamics and the socio-technical governance mechanisms. Our goal in this paper is, thus, to investigate Wikipedians’ cooperative content production and the competition over content ownership, as well as to explore the way in which this competitive dynamic is shaped by the governance community’s effort to ensure an NPOV. We recall that contributors derive benefits from both the collective production of a high-quality article (more applicable to those with a cooperative orientation) and from the ownership of content (more applicable to competitively-oriented contributors). Contributors also incur costs in producing content (effort is a function of one’s activity profile: creator or curator), coordinating and organizing work (largely applicable to cooperative contributors) and in complying with Wikipedia’s neutrality-enforcement mechanisms (primarily for competitive contributors). Hence, we pose the following two research questions, which we investigate to answer in this research.

RQ1: What is the optimal strategy for a contributor who is attempting to balance the costs and benefits of peer production? Namely, how many sentences within an article should be owned by a contributor that is characterized by a certain cooperative/competitive orientation and a particular creator/curator activity profile?

RQ2: How does the community governance mechanism, particularly the attempt to ensure a NPOV, affect the dynamics underlying Wikipedia’s co-production process?

4 A game theoretic model of Wikipedia’s collaborative production and governance

A game-theoretic model should capture essential features of the modeled system. Below we present our model in a descriptive manner, and then introduce the formal notations in the following subsections. Socio-technical systems such as Wikipedia exhibit both cooperative and competitive dynamics. Our game-theoretic model aims to represent (a) contributors’ cooperation in the production of high-quality articles as well as their competition to “own” articles” contents and (b) the community’s attempt to ensure that articles present a neutral and balanced perspective. In other words, our model assumes contributors may be driven by two goals: the first is to contribute towards the creation of comprehensive encyclopedic entries and the second goal is to maximize ownership of article sections, such that content “owned” by that contributor survives multiple rounds of revisions of the article. Our model also assumes that the objective of Wikipedia’s governance mechanisms enacted by the community is to ensure the neutrality of content, such that no single contributor takes ownership of large portions of an article. Wikipedia’s governance has the broader goal of regulating behavior, such that the community thrives and the peer-produced content is of high quality. Our focus here is on one important dimension of governance: that of combatting the attempts to “color” the article according to one particular viewpoint, that is, to ensure objectivity and the lack of bias in Wikipedia’s content [8]. Beyond the importance of this particular dimension of Wikipedia’s governance (as evident by the NPOV pillar), objectivity is particularly relevant for our conceptualization, as it is directly impacted by contributors’ effort to “own” content.

We model the interactions between the governance mechanisms and contributors’ co-production as a Stackelberg (leader-follower) game [20, 21], where the community is the leader that determines a level of governance that increases the neutrality of an article. Given a set level of governance, contributors’ interactions are modeled as a non-cooperative game, where each contributor’s objective is to maximize both the article’s comprehensiveness and the contributor’s content ownership. A high level schematic of the interactive model between the community’s goal of ensuring neutrality and contributors’ cooperation/competition is presented in Fig 1.

thumbnail
Fig 1. A high level schematic of the interactive model between Wikipedia’s governance to ensure neutral point of view (NPOV) and the interactions between contributors.

https://doi.org/10.1371/journal.pone.0281725.g001

Peer production on Wikipedia is a complex socio-technical process and contributions occur asynchronously over time. Our strategy for capturing these dynamics is temporal bracketing: recording a series of yearly “snapshots” of the process over a finite time horizon [71]. We use a static game-theoretic model to study these interactions, and test the model against empirical data for each of the yearly snapshots. Please note that modeling such a complex process requires some simplifying assumptions. We assume that contributors’ activity profiles, namely, their position on the creator-curator continuum and their cooperative/competitive orientation, are stable over time (at least within a yearly snapshot). This assumption is in line with prior research that demonstrated that the changes in contributors’ activity profiles are infrequent [16]. We also assume that Wikipedia’s governance structure, and in particular the goals of ensuring neutrality, are stable over time (as changes in policies are relatively infrequent). It should be stressed that while our model provides an approximation to the complexities of peer production, we also validated it using empirical data that captures Wikipedia’s actual collaborative-authoring dynamics.

We model the net utility as the difference between the utility (the contributor’s fractional ownership and the benefit derived from the co-production of high-quality articles) and the cost (a measure of all the efforts expended by the contributor). The optimal strategies are determined through the Nash equilibrium of the non-cooperative game that models the interactions between contributors. We analyze the conditions for the existence and uniqueness of a Nash equilibrium for the game and discuss its implications.

As noted, we model the interactions between the governance mechanism—namely, enforcing a neutral point of view- and contributors’ co-production as a Stackelberg (leader-follower) game. The role of the neutrality-enforcing mechanism is to choose the optimal level of enforcement, t, defined in our model in terms of the effort involved in complying with governance (i.e., when editing a sentence). The response of the N contributors in the particular article is to make unilateral contributions that maximize their net utility (the difference between the benefit obtained from their content ownership as well as co-producing a comprehensive article and the efforts expended to make edits, maintain the quality of the article and comply with governance). A contributor may choose to make any number of contributions (i.e., sentences owned in an article). Our objective is then to determine for each contributor, the optimal levels of activity, so that each contributor’s net utility is maximized. Please refer to Tables 1 and 2 for the list of notations/variables that are used in our analysis.

thumbnail
Table 1. List of notations used to set up the game theoretic model.

https://doi.org/10.1371/journal.pone.0281725.t001

thumbnail
Table 2. List of notations used to evaluate the Nash equilibrium of the game.

https://doi.org/10.1371/journal.pone.0281725.t002

4.1 Modeling co-production: Contributors’ cooperation and competition

We model the interactions of the N content contributors to a particular Wikipedia article as a non-cooperative game. Please note that while our model assumes a given number of contributors, it does allow for contributors to make a choice regarding their participation. Namely, by deciding to own zero content contributors are actually deciding not to join the game. All contributors can determine their individual activity levels, namely the number of sentences the contributor would attempt to own, a choice that affects that contributor’s utility and costs. The N contributors in the particular article can make unilateral contributions − x1, x2, ⋯, xN, respectively, where xi represents the ith contributor’s overall activity (i.e., the number of sentences the contributor would like to own). That decision affects the comprehensiveness of the article and the content ownership yield. The strategy set, therefore, is the content added to the article by each contributor x1, x2, ⋯, xN. Our model assumes both self-interest (i.e., competitive) and cooperative motivation for contributors. We define a “weight”, wi (0 < wi < 1) to represent the extent to which one is competitive, where wi represents the cooperative aspect. When wi → 0, it indicates that the ith contributor is cooperatively oriented and the ith contributor is considered to be competitively oriented if wi → 1. Additionally, we define a “weight,” βi (0 ≤ βi ≤ 1; i.e., βi is continuous and can take all possible values in the interval, [0, 1]) that identifies a contributor as either a creator or a curator (one may be involved in a combination of creation and curation tasks). Creators are characterized by large-size edits (i.e., larger βi’s); curators’ edits are smaller (i.e., smaller βi’s).

In terms of their benefits, cooperatively, a contributor derives value from the creation of a high-quality and comprehensive article. Particularly, comprehensiveness is considered a key indicator of information quality [45] For example the number of words or sentences is a very good predictor of article quality [46, 47]. We model this aspect of the contributor’s utility as the sum of the article portions that are “owned” by all those contributing to the focal article, (total number of sentences in the article).

Competitively, the goal of individual contributors is to advance their personal point of view and “own” contents of an article. We model the competitive component of utility for each contributor in the non-cooperative game as the contributor’s fractional ownership in the article. The actions of contributors are either originating new content or changing existing content to the co-created artifact. Contributors may come to “own” portions of the article (in our case, in terms of sentences), whereby each sentence a contributor comes to “own” indirectly reduces others’ fractional ownership. The ith contributor has fractional content ownership, (1)

Each contributor incurs a cost in making the contribution, modeled as the sum total of:

  1. the effort expended in producing content and editing an article. This cost element depends on the average effort of taking ownership of a sentence as well as the number of sentences the contributor eventually likes to own, i.e., xi. We assume that changing an existing sentence requires less effort than originating a new sentence, such that the effort expended by a contributor is a linear function the contributor’s position in the creator-curator continuum; (βi), in line with prior works in the area of software development which approximated coding effort based on the quantity of code that is produced (i.e., lines of code) [39, 40], we assume that effort is a linear function of edit size (i.e., one’s position in the creator-curator continuum; βi). Thus, the effort involved in editing a single sentence is a function of the contributor’s activity profile, i, where L is a constant. Hence, the overall effort contributor i makes in editing the focal article is ixi. Please note that this cost is specific to each contributor and is independent of the contributor’s cooperative/competitive orientation.
  2. the effort associated with participation in coordination and administrative work, such as editing the articles’ TalkPages. Such activities take place on auxiliary pages and are independent of one’s content production activity. This cost element is more applicable for the cooperative contributors, i.e., fi(1 − wi).
  3. the effort of complying with Wikipedia’s rules and policies, in particular, neutrality-enforcing policies. This effort is intended to regulate only the self-interest activities (i.e., attempting to “own” article portions) and thus is more applicable to competitive contributors. This cost element depends on the governance-compliance effort associated with editing a single sentence, (similar for all contributors) as well as on the number of sentences edited by a contributor. Therefore, for the ith contributor with competitive orientation, owning xi amount of sentences requires expending an effort twixi to comply with the level of neutrality enforcement, t.

In sum, the net utility experienced by the ith contributor, ui, can be written as the difference between utility of contributor i and the total effort expended by contributor i. Formally stated: (2)

Each contributor then determines the optimal amount of sentences owned, (i.e., ), that maximizes his or her net utility, uiwhen the contributions made by all other contributors, xj, ji are fixed. It is therefore observed that the net utility obtained by the ith contributor not only depends on the strategy of the ith contributor (i.e., xi), but also on the strategies of all the other contributors (i.e., xj, ji). Therefore, ui is also sometimes denoted as ui(xi, xi) where xi represents the vector, , i.e., all elements of the vector, except the ith element. This results in a non-cooperative game of complete information [72] between the contributors. The optimal xi, ∀ i (denoted as ), which is determined by maximizing ui in Eq (2), when all other xj’s, ji are fixed, i.e., the vector, xi is fixed, is then the Nash equilibrium of the non-cooperative game where no contributor can make a unilateral change.

The unique Nash equilibrium, , can be obtained in closed-form as follows. x* is obtained by determining the best response for contributor i, which maximizes its net utility, ui (given by Eq (2)) when the contributions made by all other contributors, i.e., xj, ji are known. However, it is noted that every contributor attempts to do the same. In other words, the unique Nash equilibrium, , is the vector that maximizes the net utility, ui, in Eq (2), i. Applying the first order necessary condition to Eq (2), is obtained as the solution to (3) subject to the constraints , ∀ i.

From Eq (3), we obtain , ∀ i, when xi ≥ 0. Thus, ui is a concave function of xi and , which solves Eq (3) subject to , ∀ i, is a local as well as a global maximum point. In other words, the non-cooperative game has a unique Nash equilibrium, The vector of contributions, , can be obtained by numerically solving the system of N non-linear equations specified by Eq (3). However, to study the effect of the contributors’ type (i.e. creators or curators based on their βi’s) and the level of neutrality enforcement, t, on contributors’ strategies, it is desirable to obtain an expression that relates the vectors, x*, x = [xi]1≤iN, β = [βi]1≤iN and t.

Solving Eq (3), we propose the following theorem that provides the unique Nash equilibrium of the non-cooperative game.

Theorem 1: The unique Nash equilibrium of the non-cooperative game can be obtained as (4)

Proof: See Appendix-1 in the S1 File.

Theorem 2: The unique Nash equilibrium x*, is feasible, i.e., , ∀ i if and only if (5)

Proof: The proof follows by just setting in Eq (4) to be .

Intuitively, Eq (5) implies the following. Let all the other contributors as well as the neutrality enforcement represent “adversaries” of contributor i. The right hand side shows the total adversarial activity level against contributor i. The left hand side shows a term similar to the total activity level in the favor of contributor i if contributor i were to duplicate herself against each individual adversary. Eq (5) then says that a contributor makes a non-zero contribution to an article only if the contributor has to expend effort in his/her favor, that is less than the effort or activity level of the adversaries.

Theorem 3: The fractional content ownership of contributor i at the Nash equilibrium, , is (6) where for any, θ, θ+ = max(θ, 0) and is non-zero only if Eq (5) is satisfied, i.e., if the Nash equilibrium is feasible.

Proof: The expression follows by substittuing obtained from Eq (4) into Eq (1) and setting .

Theorem 4: Asymptotically, (i.e., when the number of contributors, N, becomes very large), for only those contributors for whom , where , is the average size of edits divided by the competitiveness weightage of all contributors making changes to the focal Wikipedia article. In other words, when the number of contributors in an article is significant, only those contributors whose edit size (correlated with their position on the creator-curator continuum) per their competitiveness weight is less than the group’s average end up with non-zero ownership.

Proof: Eq (6) can be re-written as (7) Asymptotically, i.e., as the number of contributors, N, becomes large is evaluated by taking limN→∞ in Eq (7), to obtain (8) which is non-zero if and only if .

Note that the optimal in Eq (4) and the optimal fractional ownership, in Eq (6), do not depend on the fixed cost, fi. This is because, the fixed cost does not change with changing xi (from Eq (2)) and hence has no bearing on the optimal . Similarly, from Eq (1), the fractional ownership, ci, is independent of fi and therefore, in Eq (6) is also independent of fi. The condition in Eq (5) and the expression in Eq (6) have the following interesting implications.

  • From Eq (6), the ownership of contributors depend on the βj of all the contributors. This is intuitively correct in a peer production project such as Wikipedia because changes to the produced artifact are made by multiple contributors and the ownership held by a contributor will depend on the activity of all the contributors co-producing the artifact.
  • In Eq (4), the level of neutrality enforcement, t, appears as a linear factor in the numerator and as a second order quadratic factor (power of 2) in the denominator. Therefore, when neutrality enforcement, t increases, the overall content owned by a contributor decreases. This implies that when the “tax” imposed on contributors in terms of complying with NPOV norms, policies and procedures is too high it outweighs the benefits associated with content ownership along with with making an article comprehensive. Thus, contributors stop co-production work.
  • The given by Eqs (6) and (7) are the exact expressions for the content ownership. The asymptotic behavior in Eq (8) is used only to reach the conclusion that when the number contributors in an article is significant, only those contributors whose is below the group’s average, survive the peer production process and end up owning non-zero percentage of content in the article. For all the numerical analysis, only Eq (6) is used.

A low value of , indicates low βi (i.e., a contributor making small edits acting as a curator) and having wi closer to 1 (i.e. having a competitive orientation). In other words, if a contributor is entirely cooperative, then wi → 0 and and becomes more than the average, . Thus, contributors who are purely cooperative do not own any content. This is a straightforward result, since cooperative contributors are primarily interested in producing high-quality articles, and less driven by the desire to “own” articles.

Competitive participants (wi → 1), with large edit sizes (βi’s), who are creators of content (i.e., with substantial changes per sentence edited) also do not own content since becomes large. In other words, only those with a competitive orientation who choose to act as curators making small edits end up owning significant portion of the content. One might expect that the creators who contribute more content (and in the process exert more effort) would end up owning much of an article’s contents. In contrast, the results of our game-theoretic analysis implies that when competing over content ownership in the presence of Wikipedia’s governance to ensure neutrality, and when controlling for one’s cooperative/competitive-orientation, the creators of content who make on average large contributions would eventually not own any content.

4.2 Neutrality enforcement and its effect on co-production

The community governance factor in our model is the result of contributors’ aggregate governance effort. The level of governance, t, should be chosen so as to maximize the effectiveness of peer production. Our model assumes that the main objective of Wikipedia’s governance is to reduce bias and ensure that all points of view are reflected in an article. Assuming a sufficiently large sample, the set of contributors working on an article would sample a range of viewpoints. The more one contributor (or a small subset of the contributors) is able to dominate the discussion (i.e. take ownership of large portions of the article), the more likely the article is to reflect a single side. In contrast, when content ownership within an article is more equally distributed and when multiple contributors own sections of the article, a more NPOV is expected. Thus, our notion of article neutrality reflects equal ownership of an article’s contents. We are less interested here in the particular biases that may be present in the content added by each contributor.

Building on information theory [73], we use the metric of entropy to estimate the amount of bias in the distribution of content ownership. We note that the entropy metric was employed in prior studies in organization science [74], and in particular in estimating the distribution of work in Wikipedia [8]. In information theoretical terms, if a source emits N possible symbols, with probability distribution, pi, i.e., the source emits symbol, Wi, with probability, pi ≥ 0, , then the entropy of the source, H is defined as [75] (9) In any Wikipedia article, the fractional ownership of all contributors, are all non-negative and satisfy the condition, . Therefore, a page can be viewed as as an information source with the N contributors playing the role of the N possible symbols and the fractional content ownership of each contributor, , playing the role of pi. The entropy of the page can then be written as (10) In Eq (10), the entropy, H(t), may depend on other factors, e.g., the optimal contributions, 1 ≤ iN, obtained from Eq (4), which in turn, depend on β and . However, we represent the entropy as H(t) because the neutrality enforcement, t, is the only parameter in H that represents the action taken by Wikipedia’s governance mechanisms. Our model’s optimization function for neutrality enforcement (maximizing entropy in contributors’ ownership) is normalized (each article’s entropy is calculated in relation to the maximum entropy of any article, which is 1).

It was shown in information theory [75] that the expression of entropy, H(t), in Eq (10) is maximized when , ∀ i. i.e., when all contributors make equal contribution, which corresponds to the scenario when there is complete neutrality. Therefore, we use the entropy in Eq (10) as the objective function for determining the optimal level of neutrality enforcement for the Stackelberg game. Namely, the objective function for neutrality enforcement in our model is to maximize the entropy in content ownership within an article.

The optimal level of neutrality enforcement, t, is the value of z that maximizes H(z) in Eq (10), which is obtained by replacing t with z in Eq (6). Then, (11)

From Eqs (6), (10) and (11), the optimal value of t that maximizes the entropy is the value of t that makes , ∀ i, i.e., t → ∞. See Appendix-2 in the S2 File for details. Our analysis indicates that while neutrality enforcement levels remain moderate, our prior results from the non-cooperative game remain unchanged, such that those with a cooperative orientation who are content creators (i.e., make above-average-size contributions) end up not owning content. However, when neutrality enforcement levels exceed a certain threshold the non-cooperative model collapses, ownership becomes a direct function of neutrality enforcement (rather than contributors’ efforts). Namely, from Eq (6), when (i.e. when the amount of effort a contributor has to exert to overcome neutrality enforcement is significantly- typically one order of magnitude- larger than the aggregate effort that all contributors exert in making unit contributions to the article), . In other words, ALL contributors end up owning an insignificant but equal portion of the article’s contents. Therefore, the optimal value of t is obtained by solving (12) where z* represents the maximum level of neutrality enforcement allowed in order to maintain the neutrality of an article above a required threshold. The resulting Stackelberg game model is depicted in Fig 2.

thumbnail
Fig 2. The comprehensive Stackelberg game depicting the complete interaction between Wikipedia’s governance to ensure neutrality and cooperation/competition peer production dynamics.

https://doi.org/10.1371/journal.pone.0281725.g002

The complete model for the leader-follower game between the governance mechanism and the competitive self-interested contributors suggests that at the Stackelberg equilibrium, the governance mechanism operates in such a manner to result in an implicit outcome that keeps the entropy high, i.e., the difference between maximum and minimum fractional content ownership is kept low, thus ensuring the article’s objectivity. Further, our model limits governance levels to an upper threshold.

5 Verification of the analysis with empirical data

To verify the game theoretic analysis, we compared its predictions against data from Wikipedia. In line with prior studies in the area [26], we employed a double-stratified sampling procedure, randomly selecting 1000 articles from the January 2012 dump of the English Wikipedia. Our strata were based on: (I) the number of revisions the article has gone through (i.e., article maturity) and (II) the article’s topical domains. This sampling strategy is important given that collaboration patterns could differ across articles in different phases of their life cycle across topical domains. Given the power law distributions in the number of articles’ revisions [76], we used the following four maturity strata: (a) 1–10 revisions; (b) 11–100; (c) 101–1000; and (d) more than 1000 revisions. The topical strata were based on Wikipedia’s categorization system, using the main topics scheme. The English Wikipedia main topic categorization scheme is developed by the community and is subject to frequent changes; see http://en.wikipedia.org/wiki/Category:Main_topic_classifications. With 4 maturity strata and 25 topical categories, we have 100 cells with 10 randomly selected articles in each (i.e. 250 articles in each maturity stratum and 40 articles in each topical category). Altogether, this sample contained: 1000 articles and 721,806 editing activities (i.e. article revisions), made by 222,119 contributors. In line with prior works in the area [8], we assume that for non-registered contributors, a unique IP address corresponds to a distinct contributor. About half of the contributors in our sample are non-registered. After excluding articles with a single contributor, we were left with 864 multi-contributor articles, which make up the sample of our analysis.

We began by counting the number of distinct contributors (N) and the total number of edits made by each contributor, across all articles and the total number of sentences edited by the ith contributor (1 ≤ iN), ζi, across all of the articles in our sample. Next, we calculated the average amount of effort expended by the contributor when editing a sentence. A contributor’s average edit size, i, was calculated based on his activity profile across all articles within the sample to which he contributed. The contributor’s position on the creator-curator continuum, βi, was estimated based on the average size of edits per sentence, calculated using the Levenshtein distance [77]. The Levenshtein distance is a string metric used in information theory and computer science for measuring the difference between two sequences (or between two text segments) and is defined as the minimum number of single-character edits (i.e. insertions, deletions or substitutions) required to change one word into another [78]. This metric has been commonly used in the study of Wikipedia to estimate the scope of editing activities [79]. Based on this calculation, we were able to approximate the size of all edits made by the ith contributor (1 ≤ iN), ei, to all articles in our sample. We then computed the average edit size of contributor i per sentence, i, . Note that βi is a trait of the contributor, reflecting her activity profile (whether a creator or a curator of content) such that it remains the same across all articles that we analyzed.

In order to approximate a contributor’s cooperative/competitive orientation, we relied on their functional roles within Wikipedia, as reflected by their access privileges. Wikipedia community has developed a comprehensive and detailed set of procedures for governing the collaborative editing process, including a well-defined scheme of access privileges [80]. Very broadly speaking, these roles could be organized in four strata, reflecting contributors’ commitment and involvement within the community: unregistered members, registered members, privileged members (holding a special privilege), and core members (i.e. administrators) [81] Data regarding contributors’ functional roles were accessed through Wikipedia’s API. We relied on these strata in order to estimate the extent to which one is competitive (i.e. wi) within the community: the greater one’s rights and responsibilities within the Wikipedia community, the more one is considered to have a cooperative orientation. Specifically, we assigned the values of 0.1, 0.4, 0.6, and 0.9 to the different strata (e.g., an administrator is assigned w1 = 0.1, reflecting the least competitive and most cooperative orientation).

We measured contributors’ self-interest benefit (i.e. ownership of articles’ contents) and costs (namely, the effort in making an edit) by calculating the fractional ownership for each contributor in each article of our sample using the algorithm in [38]. The content ownership algorithm tracks the evolution of content, recording the number of sentences owned by each contributor at each revision, until the study’s end date. The algorithm uses a sentence as the unit of analysis, where each full sentence is initially owned by the contributor who first added it; as content on a wiki page evolves, a contributor loses a sentence when more than 50% of that sentence’s content (i.e. the words in the original sentence) is deleted or revised by others (thus, a contributor making a major revision to an existing sentence can take ownership of that sentence). If no single contributor “owns” more than 50% of a sentence, that sentence becomes ownerless. The output of the algorithm indicates the number of sentences (and the fraction of sentences within the focal Wikipedia article) originated by each contributor that persisted in the most recent version of the article. We defined as the number of sentences owned by contributor i. A contributor’s fractional ownership in a particular article is obtained as .

The number of contributors in the 864−article sample was found to follow a uniform distribution with the average number of contributors being 124.6 and a standard deviation, 71.9. We also found that contributors are more likely to make smaller edits per sentence than larger edits, with the distribution of i resembling an exponential probability density function. The average i was 6.8 (measured in Levenshtein distance) and the standard deviation was 6.2. The average content ownership in an article for users, , was found to have a mean of 8% and a standard deviation of 0.55.

Employing the βi’s and wi’s thus obtained, we used the expression in Eq (6) to determine our model’s prediction for a contributor’s fractional ownership in a particular Wikipedia page. Eq (6) provides us with a contributor’s expected fractional ownership. We repeated this analysis for all Wikipedia articles in our sample and for all contributors in each article. Our empirical analysis suggests that a contributor’s creator/curator activity profile (βi) and cooperative/competitive orientation (wi) are two distinct variables, as the correlation between their estimators is quite low (Pearson correlation = 0.167).

We empirically tested our model’s predictions regarding the relation between contributors’ average edit size (i.e. a proxy for the effort typically exerted by a contributor when making an edit, reflecting their position on the creator-curator continuum), their cooperative/competitive orientation and their resulting content ownership. In trying to gain a better understanding of the temporal dynamics underlying the co-production competition over an article’s content, we applied a temporal bracketing technique by breaking each article into yearly brackets (i.e. Year 1 since inception, Year 2, etc.). We then created subsets by yearly periods, capturing the states of all articles after their first year of operation, second year etc., and calculated contributors’ average effort (i.e. their creator-curator profile), cooperative/competitive orientation and fractional ownership at the end of each year in an article’s life. Next, for each of these yearly subsets, we repeated the analysis described above, comparing the model’s prediction to actual fractional ownership. Our analytical results indicate that the number of sentences owned by a contributor, xi, hinges on both wi and βi. Naturally, it is expected that correlates with these two variables. The correlation between and wi is 0.678 (i.e. contributors that have a competitive orientation tend to own more sentences). The correlation between and βi is −0.571 (i.e., curators tend to own more sentences).

Our findings show that the model’s predictions regarding contributors’ fractional ownership are almost identical to the empirical findings. Please see Fig 3 for the results for the entire study period. Looking at the temporal brackets, we observe that already at the end of the year of inception (Year 1), the process converges and our model’s predictions closely match the empirical data (see Fig 4). Fig 5 shows that from Year 2 onward, the results are almost identical to the findings for the entire period. These results suggest that as articles mature and the co-production process stabilizes, our empirical results become more aligned with the model’s prediction.

thumbnail
Fig 3. The percentage ownership obtained from the data and that obtained from Eq (6) for 864 multi-contributor articles in our sample, for the entire period of study.

Results represent the aggregated solution for the 864 games (each game solved with the appropriate number of editors). Contributors are indexed according to the decreasing order of ’s.

https://doi.org/10.1371/journal.pone.0281725.g003

thumbnail
Fig 4. The percentage ownership obtained from the data and that obtained from Eq (6) for 864 multi-contributor articles in our sample, for the first year of inception.

Results represent the aggregated solution for the 864 games (each game solved with the appropriate number of editors). Contributors are indexed according to the decreasing order of ’s.

https://doi.org/10.1371/journal.pone.0281725.g004

thumbnail
Fig 5. The percentage ownership obtained from the data and that obtained from Eq (6) for 864 multi-contributor articles in our sample, for the second year of inception.

Results represent the aggregated solution for the 864 games (each game solved with the appropriate number of editors). Contributors are indexed according to the decreasing order of ’s.

https://doi.org/10.1371/journal.pone.0281725.g005

We measured the Pearson’s correlation coefficient value between the fractional ownership obtained by the game theoretic analysis and that obtained from the measured data (2 years from inception, and onward), and found it to be 0.789, thus demonstrating that the data closely follow our model’s predictions. The stability of our findings across various stages of an article’s life cycle solidifies the validity of our model, demonstrating that despite simplifying assumptions and using a static model, we were able to capture the temporal dynamic underlying cooperation and competition in Wikipedia’s co-production of articles. In order to further test for the effect of articles’ maturity, we analyzed the alignment between the model’s prediction and the empirical findings by the number of contributors per article. We found that the alignment increases as the number of contributors per article grows (stabilizing at roughly 30 contributors per article). In order to verify the robustness of our model, we re-ran our analysis, this time excluding non-human editors (i.e. “bots”). Our findings indicate that the pattern of results for both the entire-period and the temporal bracketing analyses remains the same.

Trying to get a better sense of the (small) discrepancies in ownership values between the model’s prediction and the empirical data, we explored whether the differences could be offset by establishing a linear fit function mapping. Let (representing the ownership of the contributors obtained by the game theoretic analysis) and (representing the ownership of the contributors obtained from the empirical data). For each Wikipedia article, we fit a function (13) where the parameters, ρ and δ, are obtained by the method of least squares [82]. We randomly selected 300 articles as training data in supervised learning to determine the linear regression coefficients, repeating the training/testing split 1000 times. The average error for the training data was 7.6%. We then ran a linear regression on the remaining 564 pages and tested the linear fit as well as its significance by measuring the p−value. The error was found to be between 18–23% and the linear regression was statistically significant at p < 0.05. The implication of this result is that the game theoretic analysis models the contributors’ interactions in Wikipedia accurately up to a linear scaling factor.

We further employed a combination of the game theoretic analysis and findings from the empirical data to determine how governance levels influence the trade off between articles’ neutrality and their comprehensiveness Fig 6 depicts the impact of governance level, t, on the neutrality of an article (measured by its entropy) and the comprehensiveness of the article (measured in terms of the article size). The entropy was measured from empirical data and the article size was obtained analytically using in Eq (4). Fig 6 shows that reasonably low levels of governance t ≈ 6 ensure neutrality (large amount of entropy >0.9), while maintaining an article’s comprehensiveness (high value for ). However, when the governance, t, becomes larger (closer to 100), the comprehensiveness of the article (article size) is reduced significantly yet the gain in neutrality (i.e. entropy) is marginal. This result suggests that that low levels of governance are sufficient for curbing much of the competition over content ownership, ensuring a relatively even distribution of ownership, while resulting in only a very minor reduction in activity levels, thus having a minimal impact on articles’ comprehensiveness.

thumbnail
Fig 6. The effect of governance level,t, on the article size, obtained analytically using in Eq (4) and the entropy, H(t) obtained from empirical data.

For H(t) we just measured the entropy of each article using Eq (10) but with the values of from the data. The articles were indexed in the same order for which the normalized number of sentences were calculated.

https://doi.org/10.1371/journal.pone.0281725.g006

6 Discussion and conclusion

Whereas most research in the area has emphasized the cooperative aspects in Wikipedia’s co-production, answering the call to shift the focus of online collaboration research to issues of scarcity and competition [83], the focus of the current study is on the interplay between contributors’ competition and cooperation. In particular, in this paper we developed a game-theoretic model of cooperation and competition in Wikipedia, under constraints of governance. We corroborated the model through an empirical study. We stress that we did not intend our game-theoretic model to be comprehensive; socio-technical systems such as Wikipedia are much too complex to be fully captured through a game-theoretic model. Instead, we attempted to model a few essential features of Wikipedia’s co-production process, namely contributors’ cooperation in producing high-quality and comprehensive articles, and their competition over content ownership, while taking into account the community’s efforts to ensure that articles represent a balanced perspective.

The key results of our game-theoretic analysis are as follows:

  • Analytically, we found that the contributor’s characteristics, or more specifically, the ratio between the contributor’s position on the creator-curator continuum and the contributor’s cooperative/competitive orientation is the factor that determines who ends up owning content. When this ratio is smaller than the group’s average, the contributor maintains ownership over portions of the article. Namely, under the governance mechanisms, the fractional content that is eventually owned by a contributor is higher for curators (i.e., with a typical small-size edit per sentence) with a competitive orientation (i.e., peripheral community members). In essence, creators with a cooperative orientation lose ownership of the article. This result was corroborated through an empirical analysis of 219, 811 distinct contributors co-producing 864 representative articles over an eleven-year period, which used the proxies mentioned earlier. Employing a temporal bracketing approach, we analyzed the data for each yearly bracket independently, to find that already from the second year of an article’s life and onward the empirical pattern of results converges to our analytical predictions.
  • Analytically, we show that excessive governance should be curtailed, by identifying and maintaining a permissible upper limit, beyond which it discourages contributors from making contributions to an article, bringing the co-production process to a halt. Furthermore, our empirical analysis suggests that a low level of governance i s optimal for ensuring neutrality while maintaining articles’ comprehensiveness.

We note that prior studies have discussed individual contributors’ attempts to influence articles’ contents [33, 56, 64, 65], but these studies did not consider peer production dynamics. Namely, in a large system such as Wikipedia, where participants’ contributions tend to be in reference to some existing content (contributed by others), knowing the effects of contributors’ actions on others is essential for understanding the system’s dynamics. We, thus, extend prior research by investigating the effects of collaborators’ behavior on others’ actions and studying the system’s dynamics.

The finding from the non-cooperative game that a competitive orientation yields content ownership suggests that the benefits associated with content ownership outweigh the effort that is associated with governance-enforcing tax (at least up to a certain limit; see below). This result is rather straightforward, whereas the findings regarding creators and curators is less intuitive, given that creators are major contributors of content to the article. One may argue that this could be explained by the temporal order of editing: creators add content early on, only to be overwriten by curators’ later contributions. However, our temporal bracketing analysis accounts for this possible effect, and we demonstrate that our results hold even when excluding the first year of articles’ lives (i.e. the results from Year 2 and onward). How, then, could this result be explained? One line of explanation points to community’s effort to ensure neutrality [25, 33]. The “creators” who make large contributions and take ownership of large portions of the article pose a potential threat to the article’s neutrality. In response, the community works collectively to fight such manipulations and restores neutrality. In the long run, the result is a more neutral content [30, 84]. Not withstanding the role of community efforts to ensure neutrality, our results may also be explained by paying attention to the profile of those making the large contributions. Prior studies have identified the “creators” [3437, 85] or “content-oriented” contributors as those whose main interest is to demonstrate their knowledge of the topic, and are much less invested in Wikipedia’s internal processes. In contrast, the “curators” (also referred to as “community-oriented” contributors [8]), who engage in refining, shaping, and reorganizing content [86], act as “janitors of knowledge” [87] and end up owning larger portions of articles. That is, at the end of the day, it is the curators who play the leading role in shaping articles’ contents. We note that both these lines of explanation highlight the effectiveness of Wikipedia’s social processes.

A second powerful result of this study is in demonstrating that that the community’s efforts to govern content creation and ensure neutrality, although essential for maintaining a balanced position, should be carefully monitored and kept limited. The reason is that when the “tax” imposed on contributors in terms of complying with NPOV norms, policies and procedures is too high, it outweighs the benefits associated with content ownership, such that contributors stop competing for ownership (and in effect, co-production is stalled). Prior studies have documented the growing bureaucracy of Wikipedia [80], and have suggested that the increasing complexity of Wikipedia’s governance structure is deterring newcomers and may eventually lead to the decline of Wikipedia [1]. Our results provide analytical proof of the risk associated with increased governance cost and highlight the need to balance the cost and benefits associated with the governance of online production communities. Furthermore, our analysis indicates that a low level of governance is optimal for ensuring neutrality while maintaining articles’ comprehensiveness, suggesting that light-weight governance mechanisms may be optimal. Whereas prior research did suggest that governance levels should be curtailed, the current study adds to the literature by identifying the appropriate levels of governance.

In sum, our study makes important contributions to research on peer production, First we shed light on the dynamics underlying peer production. Second, we show how Wikipedia’s governance and in particular, the community’s efforts to maintain neutrality, affect contributors’ behavior and consequently, the quality of the co-produced articles. Although there have been studies using game theory and network ties to study collaboration in social networks in general [88] and applied to Wikipedia in particular [89], we are not aware of prior research in this area that linked governance to individuals’ actions.

Notwithstanding these contributions, our work could be extended in several directions, for example by employing alternative measures for our model’s constructs (alternative measures of articles’ objectivity through an analysis of Wikipedia’s discussion pages) or extending the model to include additional constructs related to contributors (for example: contributors’ intent and their psychological ownership, their compliance with Wikipedia’s policies, or the phase in an article’s life on which they concentrate their edits) or related to the articles (the extent to which articles are controversial or the importance of the knowledge-based product). Future research could also attempt to empirically validate our model’s prediction that excessive levels of governance impede peer production. Finally, a more complex convex cost model (e.g., a quadratic cost model) and its effect on co-production and governance can also be explored.

An additional important contribution of this study is in applying game theory to investigate peer production in Wikipedia. Prior studies have called for the development of new economic models [90] and in particular, the application of game theory to investigate emerging socio-technical systems such as Wikipedia [91]. A few studies have used game-theoretic models to compare the efficiency of closed- and open-source software development regimes [92], investigate the competitive dynamics of the process by which participants provide feedback on each other’s work (i.e. rating and reputation systems) [93], and to study co-production in user generated content [94], crowd sourcing [95] and Wikipedia [96]. Our study extends prior work in the area by using game theory to shed light on the role that governance mechanisms play in moderating the competition between contributors to peer production. Our game-theoretic model provides insights into the complexities of cooperation and competition in peer production, while making several simplifying assumptions.

Although our empirical analysis corroborated the game-theoretic model, we did observe small discrepancies between the model’s predictions and the empirical data. One possible explanation for these differences is that our model provided only an approximation of Wikipedia’s complexities. Future research could help make progress: such work could relax some of our model’s assumptions, for example, by distinguishing between different contributors’ knowledge and capabilities, modeling learning effects, capturing other important qualities of articles such as accuracy or completeness, and exploring multi-stage games to consider various classes of contributors that differ in their roles and goals. In particular, we foresee three avenues for extending our study. First, in our model a contributor owns his newly-added content thus impacting others’ relative ownership. A more sophisticated model could also account for overwriting (or deleting) another contributor’s content, thus directly impacting the ownership of a specific other. A challenge for such a model would be to handle the complexity associated with numerous pair-wise relationships. Second, our model could possibly be extended to capture a tighter linkage between the leader-follower game (community governance) and the non-cooperative game (cooperation and competition over content ownership), for example by linking the effectiveness of the community’s governance to the aggregate of contributors’ governance effort.

Beyond the theoretical contributions and directions for future work, our study’s findings also have important implications for custodians of online production communities. The most important insight pertains to the levels of governance that are required. Namely, we find that low levels of governance are sufficient for curbing much of the competition over content ownership, ensuring a relatively even distribution of ownership, while only resulting in a very minor reduction in activity levels, thus having a minimal impact on articles’ comprehensiveness. Another important contribution pertains to the importance of peripheral community members that have a less cooperative orientation. In fact, as a consequence of their pursuit of content ownership, their activity levels are increased, such that they add contents to the article, making the article more comprehensive and of higher quality. Hence, communities such as Wikipedia should encourage peripheral participation [97] in addition to their efforts to encourage cooperative participants to take on additional functions and roles.

In conclusion, we believe that game theory can reveal deep insights into the complex dynamics underlying peer production. While game theory, and in particular leader-follower game, has already been employed in the context of allocation of divisible resources [98], the application of such techniques to community-based peer production is novel. Future research directions for game theory in this area include the use of cooperative game theory [99] to study the stability of the production process; applying coalitional game models [100] to analyze how competing coalitions of contributors emerge, applying resilience and immunity models [101] to investigate how the peer production process performs in cases of deviations from expected behavior, and using evolutionary game theory [102] to study how unexpected behaviors emerge as a result of sequential actions by different contributors. Game theory can also be used to develop models that would account for the relationship between individual’s governance work and the community’s overall governance level while using algorithm and mechanism design [103] to determine the most effective community governance structures.

Supporting information

S1 File. Appendix-1: Nash equilibrium of the non-cooperative game between the contributors.

https://doi.org/10.1371/journal.pone.0281725.s001

(PDF)

S2 File. Appendix-2: Determining the optimal value of neutrality enforcement, t.

https://doi.org/10.1371/journal.pone.0281725.s002

(PDF)

Acknowledgments

The authors thank the anonymous reviewers for their constructive criticism which greatly enabled us to enhance the results and presentation in the manuscript.

References

  1. 1. Halfaker A, Geiger RS, Morgan JT, Riedl J. The rise and decline of an open collaboration system: How Wikipedia’s reaction to popularity is causing its decline. American Behavioral Scientist. 2012;57:664–688.
  2. 2. Leuf B, Cunningham W. The Wiki way: Quick collaboration on the web. Addison Wesley; 2001.
  3. 3. Benkler Y. The wealth of networks: How social production transforms markets and freedom. Yale Univ Press; 2006.
  4. 4. Gavalda-Miralles A, Choffnes DR, Otto JS, Sanchez MA, Bustamante FE, Amaral LA, et al. Impact of heterogeneity and socioeconomic factors on individual behavior in decentralized sharing ecosystems. Proceedings of the National Academy of Sciences (PNAS). 2014;111:43:15322–15327. pmid:25288755
  5. 5. Giles J. Internet encyclopaedias go head-to-head. Nature. 2005;438.7070(900). pmid:16355180
  6. 6. Michelucci P, Dickinson J. The power of crowds. Science. 2016;351:6268:32–33.
  7. 7. Sauermann H, Franzoni C. Crowd science user contribution patterns and their implications. PNAS. 2015;112:3:679–684. pmid:25561529
  8. 8. Arazy O, Nov O, Patterson R, Yeo L. Information quality in Wikipedia: The effects of group composition and task conflict. Journal of Management Information Systems. 2011;27(4):71–98.
  9. 9. Kittur A, Kraut A. Harnessing the wisdom of crowds in Wikipedia: Quality through coordination. ACM conference on Computer Supported Cooperative Work (CSCW’2008). 2008.
  10. 10. Kittur A, Kraut RE. Beyond Wikipedia: Coordination and conflict in online production groups. Proceedings of the 2010 ACM conference on Computer Supported Cooperative Work (CSCW’2010). 2010.
  11. 11. Nov O. What motivates Wikipedians? Communications of the ACM. 2007;50(11):60–64.
  12. 12. Yang LH, Lai CY. Motivations of Wikipedia content contributors. Computers in Human Behavior. 2010;26(6):1377–1383.
  13. 13. Aaltonen A, Lanzara GF. Building governance capability in online social production: Insights from Wikipedia. Organization Studies. 2015;36(12):1649–1673.
  14. 14. Auray N. Online communities and governance mechanisms. Governance, Regulation and Powers on the Internet: Cambridge University Press. 2012.
  15. 15. Schroeder A, Wagner C. Governance of open content creation: A conceptualization and analysis of control and guiding mechanisms in the open content domain. Jl, American So for Info Sc and Technol. 2012;63(10):1947–1959.
  16. 16. Arazy O, Assaf HL, Balila A. Neither a Bazaar nor a cathedral: The interplay between structure and agency in Wikipedia’s role system. Journal of the Association for Information Science and Technology. 2019;70(1):3–15.
  17. 17. Schroer J, Hertel G. Voluntary engagement in an open web-based encyclopedia: Wikipedians and why they do it. Media Psychology. 2009;12(1):96–120.
  18. 18. Myerson R B. Game theory: Analysis of Conflict. Harvard University. 1991.
  19. 19. Jain S, Parkes DC. The role of game theory in human computation systems. Proc, ACM SIGKDD Workshop on Human Computation. 2009; p. 58–61.
  20. 20. Simaan M, Cruz JB. On the Stackelberg Strategy in Nonzero-Sum Games. Journal of Optimization Theory and Applications. 1973;11(5).
  21. 21. Vonstackelberg H. The Theory of the Market Economy. Oxford University Press, Oxford, England; 1952.
  22. 22. Edwards J. Wikipedia Sends Cease-And-Desist Letter to 300-Strong Sockpuppet Army. Business Insider. 2013.
  23. 23. Richards K. CONFIRMED: Companies Have Been Editing Wikipedia Pages To Make Themselves Look Better. Business Insider. 2014.
  24. 24. Arazy O, Lifshitz-Assaf H, Nov O, Daxenberger J, Balestra M, Cheshire C. On the How and Why of Temporal Dynamics of Emergent Roles in Wikipedia. 20th Conference on Computer-Supported Coop-erative Work and Social Computing (CSCW’2016), Portland, OR, USA. 2016.
  25. 25. Greenstein S, Zhu F. Open Content, Linus? Law, and Neutral Point of View. Information Systems Research (ISR). 2017;27(3):618–635.
  26. 26. Arazy O, Daxenberger J, Lifshitz-Assaf H, Nov O, Gurevych I. Turbulent Stability of Emergent Roles: The Dualistic Nature of Self-Organizing Knowledge Co-Production. Information Systems Research (ISR). 2016;27(4):792–812.
  27. 27. Liu J, Ram S. Who Does What: Collaboration Patterns in the Wikipedia and Their Impact on Article Quality. ACM Trans Management Inf Syst. 2011;2(2):1–23.
  28. 28. Yang D, Halfaker A, Kraut R, Hovy E. Who did what: Editor role identification in Wikipedia. Proceedings of the International AAAI Conference on Web and Social Media. 2016;10(1).
  29. 29. Forte A, Larco V, Bruckman A. Decentralization in Wikipedia governance. Journal of Management Information Systems. 2009;26(1):49–72.
  30. 30. Greenstein S, Zhu F. Is Wikipedia Biased? American Economic Review. 2012;102(3):343–348.
  31. 31. Greenstein S, Zhu F. Do experts or crowd-based models produce more bias? Evidence from Encyclopedia Britannica and Wikipedia. MIS Quarterly. 2018.
  32. 32. Hassine T. The dynamics of NPOV disputes. Proceedings of Wikimania. 2005.
  33. 33. Young AG, Wigdor AD, Kane GC. The Gender Bias Tug-of-War in a Co-creation Community: Core-Periphery Tension on Wikipedia. Journal of Management Information Systems. 2020;37:1047–1072.
  34. 34. Kim J, Agrawala M, Bernstein M. Mosaic: Designing online creative communities for sharing works-in-progress. Computer Supported Cooperative Work (CSCW). 2017.
  35. 35. Rinie L, Brebber J, Purcell K. Photos and videos as social currency online. Pew Internet & American Life Project. 2012; p. 23–29.
  36. 36. Hill BM, Monroy-Herandez A. A longitudinal dataset of five years of public activity in the Scratch online community. Scientific Data. 2017;4:1–14. pmid:28140385
  37. 37. Hull G, Scott J. Curating and creating online: Identity, authorship, and viewing in a digital age. Museum communication and social media: the connected museum, Routledge, New York, NY. 2013.
  38. 38. Arazy O, Stroulia E, Ruecker S, Arias C, Fiorentino C, Ganev V, et al. Recognizing contributions in wikis: Authorship categories, algorithms, and visualizations. Jl, American Soc for Info Sc and Technol. 2010;61(6):1166–1179.
  39. 39. Mendes E, Mosley N, Counsel S. Web Effort Estimation. Web Engineering, Springer Berlin Heidelberg. 2006; p. 29–73.
  40. 40. Shihab E, Kamei Y, Adams B, Hassan AE. Is lines of code a good measure of effort in effort-aware models? Information and Software Technology. 2013;55(11):1981–1993.
  41. 41. Kittur A, Suh B, Pendleton BA, Chi EH. He says, she says: conflict and coordination in Wikipedia. Proceedings of the SIGCHI conference on Human factors in computing systems. 2007.
  42. 42. Ransbotham S, Kane GC. Membership turnover and collaboration success in online communities: Explaining rises and falls from grace in Wikipedia. MIS Quarterly-Management Information Systems. 2011;35(3):613.
  43. 43. Arazy O, Kopak R. On the measurability of information quality. Journal of the American Society for Information Science and Technology. 2011;62(1):89–99.
  44. 44. Wang RY, Strong DM. Beyond accuracy: What data quality means to data consumers. Journal of Management Information Systems. 1996;12(4):5–33.
  45. 45. Arazy O, Nov O. Determinants of Wikipedia Quality: The Roles of Global and Local Contribution Inequality. Proceedings of Computer Supported Cooperative Work (CSCW’2010). 2010.
  46. 46. Blumenstock JE. Size matters: Word count as a measure of quality on wikipedia. Proceedings of the 17th international conference on World Wide Web. 2008; p. 1095–1096.
  47. 47. Wohner T, Peters R. Assessing the quality of Wikipedia articles with lifecycle based metrics. Proceedings of the 5th International Symposium on Wikis and Open Collaboration. 2009; p. 1–10.
  48. 48. Lasfer A, Vaast E. Reaching for the Top: A Practice Approach to Understanding Power Dynamics in Online Communities. Academy of Management Proceedings, Academy of Management Briarcliff Manor, NY 10510, 16695. 2019.
  49. 49. Das S, Lavoie A, Magdon-Ismail M. Manipulation among the arbiters of collective intelligence: How Wikipedia administrators mold public opinion. 22nd ACM Intl Conf on Info and Knowledge Management. 2013; p. 1097–1106.
  50. 50. McKenzie PJ, Burkell J, Wong L, Whippey C, Trosow SE, McNally MB. User-generated online content 1: Overview, current state and context. First Monday. 2012;17(6).
  51. 51. Golumbia D. Commercial Trolling: Social Media and the Corporate Deformation of Democracy. Available at SSRN 2394716. 2013.
  52. 52. Oppong M. Covert PR in Wikipedia. Retrieved from Frankfurt, Germany. 2014.
  53. 53. Umarova K, Mustafaraj E. How partisanship and perceived political bias affect wikipedia entries of news sources. Companion Proceedings of The 2019 World Wide Web Conference. 2019; p. 1248–1253.
  54. 54. Hube C. Bias in Wikipedia. Proceedings of the 26th International Conference on World Wide Web Companion. 2017; p. 717–721.
  55. 55. Hube C, Fatahu B. Detecting biased statements in Wikipedia. Companion Proceedings of the The Web Conference 2018. 2018; p. 1779–1786.
  56. 56. Arazy O, Lindberg A, Rezaei M, Samorani M. The Evolutionary Trajectories of Peer-Produced Artifacts: Group Composition, the Trajectories’ Exploration, and the Quality of Artifacts. Management Information Systems Quarterly (MISQ). 2020;44.
  57. 57. Pierce JL, Kostova T, Dirks KT. Toward a theory of psychological ownership in organizations. Academy of management review. 2001;26(2):298–310.
  58. 58. Pierce JL, Kostova T, Dirks KT. The state of psychological ownership: Integrating and extending a century of research. Review of General Psychology. 2003;7(1):84.
  59. 59. Brown G, Robinson TBLSL. Territoriality in organizations. Academy of Management Review. 2005;30(3):577–594.
  60. 60. Halfaker A, Kittur A, Kraut R, Riedl J. A jury of your peers: Quality, experience and ownership in Wikipedia. Proceedings of the 5th International Symposium on Wikis and Open Collaboration, ACM. 2009; p. 15.
  61. 61. Kriplean T, Beschastnikh I, McDonald DW, Golder SA. Community, consensus, coercion, control: cs*w or how policy mediates mass participation. Proceedings of the 2007 international ACM conference on Supporting group work. 2007.
  62. 62. Scott SD, Carpendale MST, Inkpen KM. Territoriality in collaborative tabletop workspaces. Proceedings of the 2004 ACM conference on Computer supported cooperative work. 2004.
  63. 63. Thom-Santelli J, Cosley DR, Gay G. What’s mine is mine: Territoriality in collaborative authoring. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems ACM. 2009; p. 1481–1484.
  64. 64. Kane GC, Johnson J, Majchrzak A. Emergent life cycle: The tension between knowledge change and knowledge retention in open online coproduction communities. Management Science. 2014;60(12):3026–3048.
  65. 65. Bidar RTH, Asindhgatta R. Destructive Domination in Crowdsourcing. Proceedings of ICIS2020. 2020.
  66. 66. Arazy O, Ortega F, Nov O, Yeo L, Balila A. Functional Roles and Career Paths in Wikipedia. Computer Supported Cooperative Work (CSCW 2015), Vancouver, BC, Canada. 2015.
  67. 67. Crowston K, Wei K, Li Q, Howison J. Core and Periphery in Free/Libre and Open Source Software Team Communications. IEEE Computer Society Proceedings of the 39th Annual Hawaii International Conference on System Sciences. 2006;06.
  68. 68. Preece J, Shneiderman B. The Reader-to-Leader Framework: Motivating technology-mediated social participation. AIS Transactions on Human-Computer Interaction. 2009;1(1):13–32.
  69. 69. Halfake A, A Kittur A, Riedl J. Don’t bite the newbies: How reverts affect the quantity and quality of Wikipedia work. Proceedings of the 7th ACM International Symposium on Wikis and Open Collaboration (WikiSym’2011). 2011; p. 163–172.
  70. 70. Suh B, Chi EH, Pendleton B, Kittur A. Us vs. them: Understanding social dynamics in Wikipedia with revert graph visualizations. IEEE symposium on visual analytics science and technology. 2007; p. 163–170.
  71. 71. Langley A. Strategies for theorizing from process data. Academy of Management Review. 1999;24(4):691–710.
  72. 72. Neumann JV,Morgenstern . Theory of Games and Economic Behavior. Princeton University Press; 1944.
  73. 73. Shannon CE. A Mathematical Theory of Communication. Bell Systems Technical Journal. 1948;27(3):379–423.
  74. 74. Ancona D, Caldwell D. Demography and design: Predictors of new product team performance. Organization Science. 1992;3(3):321–341.
  75. 75. Cover TM, Thomas JA. Elements of Information Theory. John Wiley and Sons; 2006.
  76. 76. Ortega F, Gonzalez-Barahona JM, Robles G. On the inequality of contributions to Wikipedia. Proc, 41st Annual Hawaii Intl Conf on System Sciences (HICSS’2008). 2008; p. 304.
  77. 77. Levenshtein VI. Binary codes capable of correcting deletions, insertions and reversals. Soviet physics doklady. 1966;10:707–710.
  78. 78. Ristad ES, Yianilos PN. Learning string-edit distance. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1998;20(5):522–532.
  79. 79. Keith M, Shao B, Steinbart PJ. The usability of passphrases for authentication: An empirical field study. International Journal of Human-Computer Studies. 2007;65(1):17–28.
  80. 80. Butler B, Joyce E, Pike J. Don’t look now, but we’ve created a bureaucracy: The nature and roles of policies and rules in Wikipedia. Proc, ACM CHI conference on human factors in computing systems. 2008.
  81. 81. Arazy O, Nov O, Ortega F. The Wikipedia World is Not Flat: On The Organizational Structure of Online Production Communities. Proceedings of the 22nd European Conference on Information Systems. 2014.
  82. 82. Meyer CD. Matrix analysis and applied linear algebra. SIAM; 2000.
  83. 83. Wang X, Butler BS, Ren Y. The impact of membership overlap on growth: An ecological competition view of online groups. Organization Science. 2013;24(2):414–431.
  84. 84. Stvilia B, Twidale MB, Smith LC, Gasser L. Information quality work organization in Wikipedia. Journal of the American Society for Information Science and Technology. 2008;59(6):983–1001.
  85. 85. Roque R, Rusk N, Resnick M. Supporting diverse and creative collaboration in the Scratch online community. Mass collaboration and education, Springer. 2016.
  86. 86. Butler B, Sproull S, Kiesler S, Kraut RE. Community effort in online communities: Who does the work and why. Leadership at a Distance Research in TechnologicallySupported Work, Weisband S, Ed London UK: Lawrence Erlbaum Associates. 2007; p. 171–194.
  87. 87. Niederer S, Dijck JV. Wisdom of the crowd or technicity of content? Wikipedia as a sociotechnical system. New Media & Society. 2010;12(8):1368–1387.
  88. 88. Hanaki N, Peterhansl A, Dodds PS, Watts DJ. Cooperation in Evolving social networks. Management Science. 2007;53(7):1036–1050.
  89. 89. Zhang X, Wang C. Network Positions and Contributions to Online Public Goods: The Case of Chinese Wikipedia. Journal of Management Information Systems. 2012;29(2):11–40.
  90. 90. Lerner J, Tirole J. Some simple economics of open source. The Journal of Industrial Economics. 2002;50(2):197–234.
  91. 91. Conte R, Gilbert N, Bonelli G, Cioffi-Revilla C, Deffuant G, Kertesz J, et al. Manifesto of computational social science. The European Physical Journal Special Topics. 2012;214(1):325–346.
  92. 92. Baldwin YC, Clark BK. The architecture of participation: Does code architecture mitigate free riding in the open source development model? Management Science. 2006;52(7):1116–1127.
  93. 93. Dellarocas C. The digitization of word of mouth: Promise and challenges of online feedback mechanisms. Management science. 2003;49(10):1407–1424.
  94. 94. Ghosh A, McAfee P. Incentivizing high-quality user-generated content. Proc, 20th Intl Conf on World wide web. 2011; p. 137–146.
  95. 95. Horton JJ, Chilton LB. The labor economics of paid crowdsourcing. Proc, 11th ACM Conf on Electronic Commerce. 2010; p. 209–218.
  96. 96. Anand S, Arazy O, Mandayam NB, Nov O. A game theoretic analysis of collaboration in wikipedia. Decision and Game Theory for Security (GameSec’2013). 2013; p. 29–44.
  97. 97. Lave J, Wenger E. Situated Learning: Legitimate Peripheral Participation. Cambridge University Press, Cambridge, UK; 1991.
  98. 98. Maheswaran RT, Basar T. Efficient signal proportional allocation (ESPA) mechanisms: Decentralized social welfare maximization for divisible resources. IEEE Jl on Sel Areas in Commun. 2006;24(5):1000–1009.
  99. 99. Branzei R, Dimitrov D, Tijs S. Models in cooperative game theory. Springer; 2008.
  100. 100. Ray D, Vohra R. A theory of endogenous coalition structures. Games and Economic Behavior. 1999;26(2):286–336.
  101. 101. Halpern JY. Beyond Nash equilibrium: Solution concepts for the 21st century. Proc, 27th ACM symposium on Principles of distributed computing. 2008.
  102. 102. Weibull JW. Evolutionary game theory. MIT press; 1997.
  103. 103. Jackson MO. Mechanism Theory. Available at SSRN 2542983; 2000.