Skip to content

Knowledge Distillation (KD)

Techniques for compressing models by transferring knowledge from teacher to student models.

  • Student-Teacher Framework
  • Soft Targets
  • Temperature Scaling
  • Dark Knowledge
  • Response-based vs Feature-based Distillation
  • Cross-modal Distillation