Critical biblical studies via word frequency analysis: Unveiling text authorship

doi:10.1371/journal.pone.0322905

Fig. 1.

Workflow for comparing Text 1 and Text 2.

Step I: Perform exact binomial testing for each word, measuring the fit of the word’s occurrences to a binomial allocation model. Step II: Conduct Higher Criticism (HC) on the per-word binomial allocation p-values and use it as an index of discrepancy between the texts. HC assesses the global significance of the p-values by comparing their z-scores to the uniform empirical process. Words associated with p-values smaller than the HC threshold are considered to provide meaningful discrimination between Text 1 and Text 2.

More »

Expand

Fig 2.

Examined biblical data displayed using the HC-discrepancy values.

Each point corresponds to a chapter, indicating its HC-discrepancy with respect to each of the corpora (D, DtrH and P). The labels on the nodes correspond to chapters. For the purpose of validation, only the convex hull of the chapters was colored, based on the ground-truth attribution (yellow for D, blue for DtrH and pine green for P).

More »