SensiMix: Sensitivity-Aware 8-bit index & 1-bit value mixed precision quantization for BERT compression
Fig 4
Full-precision weight distributions of three binarized FFN in SensiMix (3+3) before and after applying ABWR on the QQP task.
Click through the PLOS taxonomy to find articles in your field.
For more information about PLOS Subject Areas, click here.
Full-precision weight distributions of three binarized FFN in SensiMix (3+3) before and after applying ABWR on the QQP task.