ICML Archives - aistoriz.com

Improving LLM pretraining with better data organization

The documents used to train a large language model (LLM) are typically concatenated to form a single “superdocument”, which is then divided into sequences that match...

Events & Conferences12 months ago

A quick guide to Amazon’s papers at ICML 2024

Amazon’s papers at the International Conference on Machine Learning (ICML) lean — like the conference as a whole — toward the theoretical. Although some papers deal...

Events & Conferences2 years ago

How dynamic lookahead improves speech recognition

Automatic speech recognition (ASR) models, which convert speech into text, come in two varieties, causal and noncausal. A causal model processes speech as it comes in;...