Knowledge Distillation
Neural Networks
Training a small model (the student) from a large model (the teacher)
What is Knowledge Distillation?
The student model is trained to match the teacher's softened (temperature-scaled) output probabilities, reaching comparable performance in a much smaller model suited to edge deployment and faster inference.
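A minimal sketch of a distillation loss in PyTorch, assuming hypothetical student/teacher logits; the temperature `T` and mixing weight `alpha` are illustrative values, not from the source.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    """Blend the soft-target loss (teacher's softened outputs) with the hard-label loss."""
    # Soften both distributions with temperature T and compare them with KL divergence.
    soft_loss = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale so gradient magnitude is comparable to the hard loss
    # Standard cross-entropy against the ground-truth labels.
    hard_loss = F.cross_entropy(student_logits, labels)
    return alpha * soft_loss + (1 - alpha) * hard_loss

# Illustrative usage with random tensors standing in for model outputs.
if __name__ == "__main__":
    batch, num_classes = 8, 10
    teacher_logits = torch.randn(batch, num_classes)  # frozen teacher's predictions
    student_logits = torch.randn(batch, num_classes, requires_grad=True)
    labels = torch.randint(0, num_classes, (batch,))
    loss = distillation_loss(student_logits, teacher_logits, labels)
    loss.backward()  # gradients flow only into the student
    print(loss.item())
```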
Real-World Examples
- Compressing BERT to DistilBERT
Related Terms