Automated classification of clinical diagnoses in electronic health records using transformer

doi:10.1371/journal.pone.0329963

Fig 1.

The structure of the model proposed in this study.

It is an end-to-end pipeline integrating Transformer-based text embedding, multi-task learning, and transfer learning modules.

More »

Expand

Fig 2.

The data preprocessing process in this study (This figure details the specialized clinical text normalization workflow, including medical concept recognition, temporal marker preservation, and ontology-based noise reduction to prepare EHR data for Transformer processing).

More »

Expand

Fig 3.

The principle of Transformer-based text embedding process in this study.

More »

Expand

Fig 4.

The multi-task learning framework in this study.

More »

Expand

Fig 5.

The transfer learning framework in this study (This figure explains the two-phase optimization with clinical domain adapters and confusion loss, showing how pretrained ClinicalBERT weights are adapted to target institutions while mitigating catastrophic forgetting through curriculum learning).

More »