A Supervised Approach to Quantifying Sentence Similarity: With Application to Evidence Based Medicine

doi:10.1371/journal.pone.0129392

Table 1.

Features used to encode pairwise sentence similarity as a basis for the learning model.

More »

Expand

Fig 1.

Example of parse tree and its reduced version for a sample sentence.

The parse tree represents the syntactic structure of a sentence in the form of a rooted tree. The reduced form retains only the major groups of part of speech tags—i.e., NPs and VPs.

More »

Expand

Fig 2.

Reduced parse trees of the two sample sentences (i.e. Outcome A and B) listed in the Introduction.

More »

Expand

Fig 3.

Example of role-based semantic similarity measure for two sample sentences.

Both measures are computed using Eq 7, with the actual similarity being specific to pre-verb component (as defined in Eq 8) and predicates (as defined in Eq 9).

More »

Expand

Table 2.

Statistics on the SICK corpus [9].

More »

Expand

Table 3.

The statistics of the NICTA-PIBOSO corpus.

More »

Expand

Table 4.

Evaluation of regression algorithms on 10-fold cross-validation on the SICK training corpus.

More »

Expand

Table 5.

Analysis of effects of different similarity measures—Pearson Correlation results for 10-fold cross-validation using Leave-one-Out feature strategy (i.e. the model is trained on all features except the one mentioned in each row) and results for each measure individually (i.e. the model is trained only for the mentioned feature).

More »