SpectralKD

When Transformers Sing: Adapting SpectralKD for Text-Based Knowledge Distillation

While working on my Knowledge Distillation problem for intent classification, I faced a puzzling roadblock. My setup involved a teacher model, which is RoBERTa-large (finetuned on my intent classification), and a student model, which...

Recent posts

Popular categories

ASK ANA