【DAILY READING】DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
My Own Conclusion
This paper is about distilling the BERT model. It is not very hard to read, but because I lack background in knowledge distillation, I still cannot clearly follow what it says, haha. It in...