When a large language model (LLM) is prompted with a request such as “Which medications are likely to interact with St. John’s wort?”, it doesn’t search...
The 10 most viewed blog posts of 2024: Large language models remained a hot topic, but posts about cryptography and automated reasoning also drew readers. Staff...
Until the recent, astonishing success of large language models (LLMs), research on dialogue-based AI systems pursued two main paths: chatbots, or agents capable of open-ended conversation,...
The documents used to train a large language model (LLM) are typically concatenated to form a single “superdocument”, which is then divided into sequences that match...
At this year’s Computer Vision and Pattern Recognition Conference (CVPR) — the premier computer vision conference — Amazon Web Services’ vice president for AI and data,...
Amazon’s papers at the International Conference on Machine Learning (ICML) lean — like the conference as a whole — toward the theoretical. Although some papers deal...
In the past few years, foundation models and generative-AI models — and particularly, large language models (LLMs) — have become a major topic of AI research....
For all their remarkable abilities, large language models (LLMs) have an Achilles heel, which is their tendency to hallucinate, or make assertions that sound plausible but...
Teaching large language models (LLMs) to reason is an active topic of research in natural-language processing, and a popular approach to that problem is the so-called...
As they are everywhere, large language models are a major topic of conversation at this year’s meeting of the Association for Computational Linguistics (ACL). Yang Liu,...