Precise knowledge distillation through top N reranking
We propose to use n-best reranking to improve sequence-level knowledge distillation (Kim and Rush, 2016), where we extract pseudo-labels for ...
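As a rough illustration of the general idea, the sketch below picks one pseudo-label per source sentence by rescoring the teacher's n-best list with a weighted sum of scorers. It is not the paper's implementation: the names `rerank_nbest` and `build_pseudo_labels`, the scorer interface, and the weights are all hypothetical, and the actual scoring models are not specified in the text above.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical interface: a scorer maps (source, hypothesis) to a score,
# where higher means better. Real scorers might be model log-probabilities.
Scorer = Callable[[str, str], float]


def rerank_nbest(
    source: str,
    hypotheses: Sequence[str],
    scorers: Sequence[Scorer],
    weights: Sequence[float],
) -> str:
    """Return the hypothesis with the highest weighted combined score."""
    if not hypotheses:
        raise ValueError("n-best list is empty")
    if len(scorers) != len(weights):
        raise ValueError("need exactly one weight per scorer")

    def combined(hyp: str) -> float:
        # Weighted sum of all scorer outputs for this hypothesis.
        return sum(w * s(source, hyp) for s, w in zip(scorers, weights))

    return max(hypotheses, key=combined)


def build_pseudo_labels(
    nbest: Dict[str, List[str]],
    scorers: Sequence[Scorer],
    weights: Sequence[float],
) -> Dict[str, str]:
    """Map each source sentence to its top reranked hypothesis."""
    return {
        src: rerank_nbest(src, hyps, scorers, weights)
        for src, hyps in nbest.items()
    }
```

The student would then be trained on the resulting (source, pseudo-label) pairs, as in sequence-level knowledge distillation; with a single scorer equal to the teacher's own score, the selection reduces to taking the teacher's top hypothesis.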