Events & Conferences · 1 year ago
Knowledge distillation method for better vision-language models
Large machine learning models based on the transformer architecture have recently demonstrated extraordinary results on a range of vision and language tasks. But such large models...