Skip to main content
Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

< Back to Article

SensiMix: Sensitivity-Aware 8-bit index & 1-bit value mixed precision quantization for BERT compression

Fig 5

Inverse Layer-wise Fine-tuning (ILF).

Given the model with k MP encoder layers, ILF adds one more MP encoder layer on the bottom of it and fine-tune the model with k + 1 MP encoder layers.

Fig 5