UltraFastBERT, a BERT model using only 0.3% of its neurons for inference, matching the performance of standard BERT models. Achieving up to 78x speedup over traditional methods. #NLP #LLMs #BertGoBrrrt
8
4
0
0
UltraFastBERT, a BERT model using only 0.3% of its neurons for inference, matching the performance of standard BERT models. Achieving up to 78x speedup over traditional methods. #NLP #LLMs #BertGoBrrrt