Knowledge categorization affects popularity and quality of Wikipedia articles

doi:10.1371/journal.pone.0190674

Fig 1.

Number of edits and quality by category granularity.

The noteworthy pattern is visible in the left-hand side of the diagram (articles in coarse categories). The 20% of articles with the coarsest categories receive above-average numbers of edits but their quality evaluations deteriorates compared to those with a medium category granularity.

More »

Expand

Fig 2.

Average probability to be featured for articles in 10 edit classes.

More »

Expand

Fig 3.

Average return for the ten granularity classes.

Darker bars represent coarser articles.

More »

Expand

Fig 4.

Probability of being top-importance for the ten granularity classes.

More »

Expand

Fig 5.

Return by category granularity separately for 10 different edit classes restricted to the 10% or articles receiving the highest number of edits.

The bar chart for each edit class can be read in the same way as the single bar chart in Fig 3.

More »

Expand

Table 1.

Linear regression for the logarithm of the number of edits.

More »

Expand

Table 2.

Logistic regression for FA-probability.

More »

Expand

Table 3.

Logistic regression for FA-probability.

More »

Expand

Fig 6.

Mean number of edits and effect of granularity on the number of edits, separately for each TLC.

Mean number of edits is displayed in the x-axis. The linear regression coefficient α₁ of the granularity variable explaining the number of edits (compare Eq 1) is displayed in the y-axis. Area of points is proportional to the number of articles in the respective top-level category. All parameter estimates are significantly different from zero (p < 0.001).

More »

Expand

Fig 7.

Average quality and coefficient of granularity explaining quality, separately for each TLC.

The baseline probability of featured articles in the respective TLC is displayed in the x-axis. The logistic regression coefficient of the granularity variable, when controlling for the number of edits (parameter θ₂ in Eq 2), is displayed in the y-axis. Coefficients that are significant (insignificant) at the 5% level are displayed as red (gray) dots.

More »

Expand