Jointly developed by open-source initiative Agentica and San Francisco-based start-up Together AI, DeepSWE was trained on the Qwen3-32B large language model (LLM) – part of Alibaba...
Reinforcement learning from human feedback (RLHF) is the standard method for aligning large language models (LLMs) with human preferences — such as the preferences for nontoxic...
Code generation — automatically translating natural-language specifications into computer code — is one of the most promising applications of large language models (LLMs). But the more...
Photo, from left to right: David Luan, VP of Autonomy and head of Amazon’s AGI SF Lab, and Pieter Abbeel, Amazon Scholar, Robotics.
Today, we’re excited to...
Amazon’s papers at the International Conference on Machine Learning (ICML) lean — like the conference as a whole — toward the theoretical. Although some papers deal...
The Conference on Neural Information Processing Systems (NeurIPS) takes place this week, and the Amazon papers accepted there touch on a wide range of topics, from...
Lihong Li, a senior principal scientist in Amazon Ads, has won the 2023 Seoul Test of Time award for the 2010 paper “A Contextual-Bandit Approach to...