Paper Explanation: Net2Net – Accelerating Learning via Knowledge Transfer

Motivation One of the biggest challenges during designing new neural network architectures is time. During real-world workflows, one often trains many different neural networks during the experimentation and design process. This is a wasteful process in which each new model is trained from scratch. In a typical workflow, one trains multiple models, with each model … Continue reading Paper Explanation: Net2Net – Accelerating Learning via Knowledge Transfer