Improving the Caenorhabditis elegans Genome Annotation Using Machine Learning
Figure 2
Given Two Sequences, s1 and s2 of Equal Length, Our Kernel Consists of a Weighted Sum to Which Each Match in the Sequences Makes a Contribution wl Depending on Its Length l, Where Longer Matches Contribute More Significantly
For predictions, we use a window of 140 nt around the potential splice site (cf. Materials and Methods for details, including the procedure of how the length of the window is determined).