AI Research
Multi-turn conversations with Action-Based Contrastive Self-Training
Are action-based preferences necessary? One of ACT's key properties is that its contrastive pairs highlight differences between conversational actions. In “ACT w/ Random Actions”, we examine the importance of action selection by randomly sampling both the winning and losing action when constructing each preference pair, and observe that this variant underperforms standard ACT.
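As a concrete illustration, here is a minimal sketch of action-based versus random pair construction (the action labels and function are hypothetical, not code from the paper):

```python
import random

def build_preference_pair(candidates, target_action, random_actions=False):
    """Pick a (winner, loser) response pair from sampled candidates.

    candidates: list of (response_text, action) tuples from the policy model.
    target_action: the action (e.g., "CLARIFY" vs. "ANSWER") that the
        dialogue state actually calls for.
    random_actions=True mimics the "ACT w/ Random Actions" ablation, where
    the pair no longer contrasts the right action against a wrong one.
    """
    if random_actions:
        winner, loser = random.sample(candidates, 2)
        return winner[0], loser[0]
    # Action-based selection: winner takes the target action, loser does not.
    winner = next(r for r, a in candidates if a == target_action)
    loser = next(r for r, a in candidates if a != target_action)
    return winner, loser
```

Under random selection the winning response may itself carry the wrong action, so the contrastive signal about when to clarify versus answer is lost.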
Do we need on-policy sampling? In “ACT w/o on-policy sampling”, we examine the importance of on-policy sampling by evaluating standard off-policy DPO on the dataset as constructed in Phase 1. While we do observe some improvement over SFT (e.g., from 69.0 to 74.8 Macro F1), the gains are much larger with on-policy sampling, as in full ACT. This may be because the off-policy negative responses are not guaranteed to lie on the language manifold of the policy model, and the resulting distribution shift may be too difficult to overcome with off-policy learning.
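For reference, the per-pair DPO objective that both variants optimize can be sketched as follows (a standard formulation of the DPO loss, not code from the paper):

```python
import math

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """Direct Preference Optimization loss for one preference pair.

    logp_w / logp_l:         policy log-probs of winning / losing response
    ref_logp_w / ref_logp_l: reference-model log-probs of the same responses
    beta:                    strength of the implicit KL constraint
    """
    margin = beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l))
    return -math.log(1.0 / (1.0 + math.exp(-margin)))  # -log(sigmoid(margin))
```

The difference between the variants is where the responses come from: on-policy ACT scores pairs freshly sampled from the policy itself, while the off-policy ablation scores pre-built dataset responses the policy may never actually generate.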
Is trajectory simulation necessary? ACT is better-aligned with multi-turn conversations due to its trajectory simulation. Without multi-turn simulation, our approach can be viewed similarly to on-policy DPO variants like IRPO, but with a conversation-specific reward signal which accounts for conversation actions and task heuristics. In “ACT w/ sampling w/o simulation”, we find that this trajectory-level simulation is critical to improving multi-turn performance, especially the policy model’s ability to reason about its own clarification questions.
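A rough sketch of what trajectory-level simulation adds (the policy and user simulator below are hypothetical stubs, not the paper's implementation):

```python
def simulate_trajectory(policy, user_sim, question, max_turns=3):
    """Roll out a multi-turn conversation so a reward can score the whole
    trajectory, not just the next utterance.

    policy:   callable(history) -> (reply_text, action)
    user_sim: callable(history) -> simulated user reply to a clarification
    """
    history = [("user", question)]
    for _ in range(max_turns):
        reply, action = policy(history)
        history.append(("assistant", reply))
        if action == "ANSWER":  # terminal action: final answer was given
            break
        # The assistant asked a clarifying question; simulate the user's answer.
        history.append(("user", user_sim(history)))
    return history
```

A reward computed on the completed history can then credit a clarifying question for the answer it eventually enables, which single-turn sampling without simulation cannot do.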
Is ACT model agnostic? The base model in our main experiments, Zephyr, is obtained by aligning Mistral. In “ACT with unaligned foundation models”, we observe a gap of 6.5 Action F1 and 4.3 Trajectory F1 between the two models after ACT tuning. Our results thus demonstrate that ACT can improve performance regardless of whether the base model has pre-existing alignment from human feedback, although such alignment helps as a stronger model initialization. Overall, we find that ACT's improvements over the base model are model agnostic.
Microsoft announces public preview of deep research in Azure AI foundry
Synopsis
This tool aims to accelerate research and deployment of frontier AI technologies, further strengthening Microsoft’s position in enterprise-grade generative AI solutions.
Deep Research And More From New AI Models
Did you hate these annoying songs? Check out what new deep research AI tools can do.
It’s all well and good to tell people to get on ChatGPT, or Claude, or Gemini, and get their hands dirty using an AI model – but some people want more detail. They want to know how to use the best new tools in a more granular way.
Deep research is one of the newest and most impressive functions of today’s large language models. But most people haven’t heard of it yet, let alone used it.
Ethan Mollick is someone I associate with the vanguard of AI research. A Wharton professor with MIT roots, Mollick is a leading voice in this field. So I was reading his latest blog post about the different models and how to use them.
Mollick breaks down the most useful models from Anthropic (Claude), OpenAI (ChatGPT) and Google into the following categories: “chat,” “work” and “hard problems.”
Claude, he notes, is generally better for privacy; in ChatGPT you need to opt in to privacy protections, and with Google you may give up some functionality to get them.
Then Mollick describes the deep research capabilities baked into some of the newest models, which are not yet available for free. The feature is so new that it’s still paywalled, and rationed even for paying users.
The Promise of Deep Research Functionality
Deep research is available from OpenAI, and from Google, and from Anthropic, too.
Look up the unveiling of the research function and you’ll find that most of the coverage clusters around February.
So how do you access this kind of result?
I found a recent post about this from the “OpenAI cookbook” in June, where the poster talks about downloading a Python SDK, and API work.
It turns out, though, that you can select “deep research” in the prompt section of ChatGPT 4.5, if you are a paying Plus or Pro user, and get your deep research report that way.
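For those who would rather go the API route described in that cookbook post, a minimal sketch follows. It only assembles the request; the model name, tool type, and `background` flag are assumptions based on the cookbook's deep-research pattern and may differ in practice.

```python
# Sketch of requesting a deep research report via the OpenAI Python SDK's
# Responses API, following the pattern in the cookbook post mentioned above.
# The model name and tool configuration here are assumptions, not verified.

def build_deep_research_request(topic: str) -> dict:
    """Assemble kwargs for client.responses.create(**kwargs)."""
    return {
        "model": "o3-deep-research",  # assumed deep-research model name
        "input": f"Please create a report on {topic}.",
        "tools": [{"type": "web_search_preview"}],  # let the model browse
        "background": True,  # long research jobs run asynchronously
    }

req = build_deep_research_request(
    "the most annoying music from 1990 until 2025, segmented annually"
)
# With an API key configured, you would then run something like:
#   from openai import OpenAI
#   response = OpenAI().responses.create(**req)
```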
Asking for Deep Research from ChatGPT 4.5
I wanted to try this for myself, so, based on what I was experiencing in the moment, I settled on compiling a report of the most annoying songs from 1990 through 2025.
My query: “please create a report on the most annoying music from 1990 until 2025, segmented annually.”
Simple, right?
First, ChatGPT asked me to clarify a few basic questions:
“To create a detailed report on the ‘most annoying music’ from 1990 to 2025, could you please clarify a few things:
- What do you mean by ‘annoying’? Are you referring to songs widely criticized, overplayed, polarizing, or based on listener polls or memes?
- Should the list be global or focus on a specific region (e.g., U.S., UK, etc.)?
- Do you want one song per year, or multiple per year if applicable?
- Would you like the report to include explanations or cultural context for each selection?”
And I answered:
“1. all of the above 2. US-based 3. multiple songs per year 4. Yes.”
“Thanks!” GPT responded. “I’ll compile a U.S.-focused report covering the most widely criticized, overplayed, or culturally mocked songs from 1990 to 2025. This will include multiple songs per year, along with context explaining why they were considered annoying—drawing from music critics, listener polls, memes, radio overplay complaints, and public opinion.”
Then the report started. It took more than 10 minutes to compile, with CoT displays like “searching ‘Rico Suave’ worst song” and “looking for Macarena.”
Check Out GPT’s Chain of Thought
Here are some of the interesting CoT notes the system provided while doing research across dozens of sites (it spent a lot of time reading the Guardian!):
I’m mapping out recent headlines and thought pieces, like those from Global News and the Guardian, all noting “Baby Shark” as the most annoying song. Hm, that’s interesting.
I’m preparing the list of most annoying songs from 1993-1995, considering “What’s Up,” “Cotton Eye Joe,” “Macarena,” and “Wonderwall.” However, I’m thinking through the inclusion of other notable tracks like “Rico Suave” for earlier years.
(There’s that ‘Rico Suave’ song again.)
I’m mapping out the hits and misses of 2001-2006, with 2005’s viral sensations like “Crazy Frog” and “Photograph” by Nickelback taking center stage. Hm, that’s interesting—these tracks seem to evoke strong reactions.
(Note how the model is actually ‘taking notes’ while researching, giving us little bits of personality along with the actual meat of the report.)
Here’s a funny comparison:
I’m comparing the impact of Nickelback’s “Rockstar” in 2006 to Soulja Boy’s “Crank That” in 2007. The former was widely criticized, while the latter was loved by many, raising questions about musical taste over time.
A research-based note:
I’m digging through songs by genre and year, considering widespread opinion and specific examples to select the most frequently labeled annoying tracks.
I’m evaluating hits like Meghan Trainor’s “All About That Bass” (2014) and Tyga’s “Stimulated” (2015) for their ubiquity and frequency of being labeled annoying. Progressing through last five years gradually.
And then there was this:
“Noticing the challenge of time limitations, I’m figuring out how to balance inclusion of pivotal examples against the need for conciseness.”
So the model is actually trying to triage its work. This is next-level stuff.
Finally, GPT appeared to wrap up, dotting its I’s and crossing its Ts:
“I’m checking to see if the ‘Achy Breaky’ song contains any references to anthropomorphic sharks.”
Eventually, I got the full report. It runs to dozens of pages and provides great coverage of three and a half decades of horrible, terrible music. A few standouts, with years:
1990
- “Ice Ice Baby” – Vanilla Ice (1990): This hip-hop crossover hit became a pop culture punchline due to its cheesy lyrics and ubiquitous play. While it was initially a chart-topping success, critics later lambasted it; Spinner magazine ranked it the second-worst song in history, and Blender ranked it fifth (en.wikipedia.org). A Houston Press critic even dubbed it the worst song ever by a Texas artist, claiming it “set back the cause of white people in hip-hop by a decade” (en.wikipedia.org). Its overexposure turned the once-popular anthem into an annoyance for many listeners (craftyourhappyplace.com).
- “Rico Suave” – Gerardo (1990): A bilingual novelty rap track that peaked in 1991, “Rico Suave” was a one-hit wonder often cited on “worst song” lists. Its goofy, exaggerated Latin lover persona and repetitive chorus wore thin quickly. Blender magazine’s infamous list of the “50 Worst Songs Ever” placed “Rico Suave” among its ranks (in the high 30s), and retrospective pop culture forums frequently hold it up as an example of early ’90s pop cringe (culturecrossfire.com, forums.thefashionspot.com). The song’s catchphrase chorus (and its shirtless music video) made it memorable, but also widely mocked in the years since.
1994
- “Cotton Eye Joe” – Rednex (1994): This techno-hoedown remix of an old folk tune became a staple at sports games and parties – much to many people’s annoyance. With its frenetic fiddle loops and the grating “If it hadn’t been for Cotton Eye Joe” chorus, the track wears thin fast. CBS News put it at #1 on a “Worst Songs from Jock Jams” list, calling it an “unholy hybrid of country-fried fiddle kitsch and relentless hi-NRG drums” (en.wikipedia.org). A Toronto Star critic dubbed it the worst song of the ’90s, describing it as “straight drivel” (en.wikipedia.org). Even wrestler-turned-rocker Chris Jericho said it’s one of his least favorite songs ever – naming it alongside “Mambo No. 5” on his personal worst list (en.wikipedia.org). Despite its party ubiquity, few ever actually wanted an “EDM-hillbilly” mashup like this (en.wikipedia.org).
1997
- “Barbie Girl” – Aqua (1997): This novelty dance-pop hit was simultaneously a global chart-topper and a critical punching bag. Its bouncy, plastic sound and exaggerated vocals (“Come on Barbie, let’s go party!”) made it a staple of late-’90s radio and kids’ parties, but also an irritant to many. Rolling Stone readers voted it the worst song of the ’90s (en.wikipedia.org), and NME put it atop their worst-single list as well (en.wikipedia.org). While Aqua intended it as satire, the song was widely mocked; it even sparked a lawsuit from Mattel for tarnishing the Barbie brand. Decades later, it remains a guilty pleasure earworm that routinely tops “most annoying” song lists (craftyourhappyplace.com).
Fast-forward a bit:
2013
- “The Fox (What Does the Fox Say?)” – Ylvis (2013): An absurdist novelty song by a Norwegian comedy duo, “The Fox” unexpectedly became a global viral hit in late 2013. Its EDM-lite beat and litany of animal sounds (ring-ding-ding, wa-pa-pa-pow!) made it the “Macarena” of the YouTube era – funny at first, then torturous on repeat. Kids loved the silliness, while parents and many others quickly tired of it. The song was essentially one long joke about animal noises, and once the joke wore off, what remained was an insanely repetitive chorus of nonsense. Rolling Stone readers later voted it one of the most annoying songs of the 2010s, and USA Today called it “the year’s most aggravating meme song.” Like other meme hits, it got played at school events and even on radio as a curiosity, amplifying the annoyance for those who never wanted to hear “Hatee-hatee-hatee-ho!” again. Ylvis intended it as a parody of EDM pop – ironically achieving exactly the kind of ubiquitous earworm they were spoofing, to the annoyance of millions.
- “Baby Shark” (global viral hit – charted 2018/2019): {See 2016 above.} In 2018, “Baby Shark” finally chomped its way onto international music charts, hitting the US Top 40 by early 2019 (theguardian.com). This was the year the song truly went pandemic, crossing from kids’ YouTube into mainstream consciousness. By the fall of 2018, articles were calling it “the year’s most annoying song” outright (theguardian.com). Parents everywhere pleaded for mercy from the doo-doo-doos. The Guardian wryly noted how even people who hadn’t heard it knew of it through cultural osmosis – often via tales of toddlers playing it on endless repeat, driving their families mad (theguardian.com). Once it hit the radio and dance remixes appeared, there was no escape. “Baby Shark” in 2018 exemplified how a simple children’s ditty could unite millions in shared, head-throbbing frustration. (Fun fact: that year a viral social post joked that the Geneva Convention should classify repeated “Baby Shark” play as a form of torture – a sentiment many a worn-out babysitter would second.) (globalnews.ca, theguardian.com)
2023
- “Peaches” – Jack Black (Bowser) (2023): Featured in The Super Mario Bros. Movie, this over-the-top piano ballad sung by Jack Black (in character as Bowser) became a TikTok smash and even entered the Billboard Hot 100. It’s intentionally comedic – Bowser professing love for Princess Peach with a dramatic “Peaches, Peaches, Peaches” chorus. Kids and gamers found it hilarious and catchy; others quickly found it grating (especially if they hadn’t seen the movie context). The chorus is essentially one word (peaches) repeated incessantly with operatic bombast. Within weeks, clips of people annoyed by their friends or children endlessly singing “Peaches” popped up online. Though short, the song’s earworm power is strong. One AV Club article quipped that it’s the kind of novelty song “destined to annoy parents everywhere while their offspring replay it 1,000 times.” It’s a fresh example of how a song can go viral and charm its way into your brain – and then, thanks to sheer overuse, make you never want to hear the word “peach” again (sorry, Georgia).
Some Thoughts on Voice Mode
Let’s move back to the blog post: after delving into the above stuff, Mollick also covers the verbal capabilities of new systems that really change the interface. As he notes, you can talk to these models while you’re in the car, or on a walk, or anywhere that you have the space and time.
I think verbal communication is going to finally make LLM responses “real” to us. And that’s all I have to say about that – for now.
I think all of this is deeply informative for people who want to be practical about LLM use. The deep research function in particular is something people will be learning about for some time. How many of us will figure it out before something new comes along that makes it obsolete?
In any case, this was an excellent guide to how to make the most of AI in today’s environment.
AI can make anyone rich: Mark Cuban says it could turn ‘just one dude in a basement’ into a trillionaire
And thanks to artificial intelligence, the trend of developing blockbuster companies at home is sure to continue. Mark Cuban, the billionaire former principal owner of the Dallas Mavericks and Shark Tank star, recently said AI could help make the world’s first trillionaire.
“We haven’t seen the best or the craziest of what [AI is] going to be able to do,” Cuban told the High Performance podcast in an episode published June 29. “And not only do I think it’ll create a trillionaire, but it could be just one dude in the basement. That’s how crazy it could be.”
Take OpenAI, for example, which was formed in cofounder and President Greg Brockman’s living room in 2015. OpenAI is currently valued at $300 billion thanks to a $40 billion fund-raising deal in March. Sam Altman, the CEO of OpenAI, is worth at least $2 billion.
While that’s a far cry from AI creating the world’s first trillionaire, a September 2024 report by wealth-tracking service Informa Connect said Tesla CEO Elon Musk is on pace to become a trillionaire by 2027. Musk also has his own AI startup xAI, which has an estimated $80 billion valuation.
“There’s always something bigger and better that’s created by an innovative entrepreneur,” Cuban said. “But AI just dwarfs all that.”
How Mark Cuban uses AI
Cuban appears to firmly believe in the power of AI, and said we’re only in the “preseason” of what the technology can really do.
“As it becomes more advanced—and I’m not saying we’re going to get the Terminator— I’m not saying all of a sudden there’s going to be robots that are smarter than people, like the movie,” he told High Performance. “But we’ll find ways to make our lives better, more interesting, to work better, more effectively.”
Cuban also said he uses AI for nearly everything. He gave an example of how he uses it to track health metrics since he suffers from atrial fibrillation (A-Fib).
He had to track when he was taking medications and working out and used ChatGPT to help him record it. He also asked ChatGPT to let him know if there’s something he writes down that doesn’t seem right or that he should be concerned about.
“And damn if it doesn’t do it,” Cuban said. “Now you’ve got to be careful. It’s like talking to a friend who you think knows a lot about something. You’ve still got to be careful and talk to an expert, but being able to do the process and just having things identified that I might not have known to look for was insane—just insane.”
Cuban touches on one of the larger concerns when it comes to relying on AI for making important health- or life-related decisions. While ChatGPT has served as an accessible means for getting advice, and even therapy, experts have warned the technology still has limitations and makes mistakes.
To combat this, Cuban said he challenges the AI tools he’s using with more questions and does his own research.
“There’s times when I don’t think the answer is what it should be, or I think it’s wrong. And I’ll say, ‘I don’t think you’re right. I need you to show me the evidence of how you got to this decision,’” he explained. “And it’s no different than talking to a friend or a work colleague and just calling bullsh-t.”
And on the hot topic of whether AI will take everyone’s jobs, Cuban said no.
“I can just go on all the sh-t I’m trying with AI, and I’m not here to tell you that it’s going to replace everybody’s job. It won’t,” he predicted. “But the things that you like to do, if you’re creative, innovative, whatever it may be—or you’re just bored—it might end up being the best alternative to boredom.”