Unsupervised cross-lingual model transfer for named entity recognition with contextualized word representations

doi:10.1371/journal.pone.0257230

Fig 1.

An example of neural NER with sequence labeling.

More »

Expand

Fig 2.

The internal structure of a one-layer transformer.

(a) Standard Transformer. (b) Transformer with Adapters.

More »

Expand

Fig 3.

The pretrained transformer-based language model coupling with adapters and PGN.

More »

Expand

Table 1.

Data statistics, where the number of sentences and entities are reported, and Devel indicates the development set.

More »

Expand

Table 2.

Main results of single-source cross-lingual NER, where lavg indicates the averaged performance for each target language, and avg denotes the overall average F-scores of all source-target pairs.

More »

Expand

Table 3.

Main results of multi-source cross-lingual NER, where all other languages except the target language itself are exploited as the source languages.

More »

Expand

Table 4.

Comparisons with previous studies.

More »

Expand

Fig 4.

An example of word alignment visualization between a German sentence and its English translation, where the solid arrows are gold-standard being all correctly predicted by XLM, and the dashed arrows are incorrectly aligned by mBERT, and the others are the same for the two types of word representations.

More »

Expand

Table 5.

The comparisons between the fine-tuning and feature-based adapter exploration methods, where XLM is used as the input language model.

More »

Expand

Fig 5.

The similarity heatmap of language embeddings for different language pairs, where deeper color indicates higher similarity.

More »

Expand

Table 6.

A case study, where the text with underlines indicates errors.

More »

Expand