Caveat emptor, computational social science: Large-scale missing data in a widely-published Reddit corpus
Fig 5
Gaps are not evenly distributed across communities.
The total historical counts of comments per community comments are mildly correlated with the number of dangling references, while submissions are not very correlated with the number of dangling references.