TXSelect: A multi-task learning model to identify secretory effectors
Fig 5
Feature process strategies for ESM representations.
(A) Basic pooling operations, including mean, max, min, and standard deviation pooling, applied to residue-level embeddings. (B) Region-specific feature extraction. Protein sequences were divided into N-terminal, core region, and C-terminal segments according to sequence length. The lengths of terminal regions were dynamically determined with lower/upper constraints on amino acid counts. The remaining residues constituted the core region. Minimum length constraints were further applied to ensure balanced representation. Detailed rules for length assignment and minimum thresholds are provided in the Methods.