Events & Conferences3 years ago
Knowledge distillation for better convergence in multitask learning
Validation curves in a five-task multitask learning setup, where training minimizes the sum of the task losses. The tasks corresponding to the blue, purple, and red...