EELBERT: Tiny Models Using Dynamic Embeddings
We present EELBERT, an approach for transformer-based model compression (e.g. BERT), with minimal impact on the accuracy of downstream tasks. ...
We present EELBERT, an approach for transformer-based model compression (e.g. BERT), with minimal impact on the accuracy of downstream tasks. ...