Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

< Back to Article

Fig 1.

The BrEPS 2.0 workflow.

The protocol consists of six steps to generate the BrEPS database. A: Selection and preparation of sequences. B: All-vs-all BLAST of sequences. C: Complete linkage clustering based on the E-value from BLAST. D: Multiple sequence alignment and pattern creation on selected nodes. E: Pattern verification. F: Preparation of the final database.

More »

Fig 1 Expand

Table 1.

Filters applied to UniProt protein entries to parse enzyme data from UniProt flatfiles.

More »

Table 1 Expand

Fig 2.

Detailed overview of the new data selection.

Only Swiss-Prot sequences with evidence on protein level (A) are used as seed sequences to retrieve additional, non-redundant sequences from TrEMBL and Swiss-Prot using UniRef references with >= 50% sequence identity (B and C). These additional sequences get the corresponding Swiss-Prot annotation and are merged with the seed sequences into one database (D).

More »

Fig 2 Expand

Table 2.

Similarity sets created for every amino acid pair to complement semi-conserved amino acid positions with other highly similar amino acids.

More »

Table 2 Expand

Fig 3.

Extended patterns example.

Semi-conserved pattern positions are extended with amino acids from PAM250-based similarity sets.

More »

Fig 3 Expand

Table 3.

Proposed EC number examples for patterns created from sequences with different EC numbers.

More »

Table 3 Expand

Table 4.

Evaluation results for BrEPS 1.0 and BrEPS 2.0 using UniProt release 2014_10.

More »

Table 4 Expand

Table 5.

Evaluation results for BrEPS 2.0 using UniProt release 2017_01.

More »

Table 5 Expand

Table 6.

Comparison of InterPro 63.0 with BrEPS 2017.1.

More »

Table 6 Expand