Events & Conferences

Amazon at ICML: Industry and academia meet at Expo Day

Published

4 years ago

July 22, 2021

At this year’s International Conference on Machine Learning (ICML), Alice Zheng, a senior manager of applied science in Amazon’s display advertising organization, was one of the cochairs of Expo Day, a chance for the conference’s corporate sponsors to report their latest research and demonstrate their latest technologies.

Alice Zheng, a senior manager of applied science in Amazon’s display advertising organization.

“ICML Expo talks are pretty much like research talks that you would see in the main conference, and here is where academic conferences differ a little bit from industry conferences, based on my observation.” Zheng says. “I used to be part of a small local startup, doing machine learning. And we attended a lot of industry vendor conferences, for marketing. I had come from an academic background, and I walk into this giant pavilion with row upon row upon row of vendors with booths that have people dancing aerobics to disco tunes every afternoon from 3:00 to 4:00 to draw a crowd. I was like, ‘What is happening here? This is such a circus.’”

At academic conferences like ICML, Zheng says, sponsor presentations have a much different tone. “In academic conferences, sponsors are generally companies trying to hire,” Zheng says. “That changes how they present themselves and what they talk about. The Expo chairs are responsible for reading proposals and making accept or reject decisions. We want to uphold a certain level of quality: They should be research talks. They should be informative. They should be relevant to the topics of the conference.”

ICML’s Expo Day, which took place on Sunday, was organized into three tracks, Zheng explains. One track consisted of hour-long research talks and demos. The other two consisted of four-hour workshops in which attendees could gain first-hand experience working with new technologies developed by conference sponsors.

But while Zheng and her cochair, Hsuan-Tien Lin of National Taiwan University, upheld rigorous scientific standards in their evaluation of submitted talks, there are, she says, differences in emphasis between industry and academic research.

“One of the disconnects that I see between industry and academia is that academia focuses way more on modeling and math,” she says. “The full cycle for machine learning development and operations starts with ideation: you come up with an idea, or there’s a problem that you want to solve. You formulate the problem, you propose a solution, you test it, both offline and online. You iterate on the process, and you eventually end up with something that works well. You work with engineering to get a robust implementation — and that can take a while — you deploy it, and you monitor it, to make sure that it’s continually working as it should.

“That’s the full cycle. In academia, I would say that oftentimes even the ideation is fairly rote — people work on pre-established problems. The focus is on the proposal of new methods, new solutions, followed by very light testing. It’s like one out of five steps of the entire cycle.

“Several years ago, I created a talk about operating machine learning models, and I highlighted the importance of evaluation and metrics. It’s tempting to say ‘Oh, I’ll just measure the AUC [area under the curve].’ But in reality, it’s more complicated than that, because in many, many application areas, it’s not one homogeneous set of data, one monolithic model. The data can be subdivided, and you should look at each slice separately. How do you slice the data? How do you create metrics that are stable enough to operate on and still sensitive enough that if something were to go wrong, it would appear abnormal?

“So I created this talk, and I gave it at our intern symposium, and people loved it. Students were raising their hands and saying, ‘How can we learn more? Where can we go to do more of this?’ This is the kind of thing that industry research excels at. Academia does not have the same amount or variety of application data.”

Explainable AI

Of course, industry researchers stay abreast of the latest academic research, and from her vantage as ICML Expo chair, Zheng can see which academic trends have recently begun to take hold in industry, as well. One of these, she says, is model explainability.

Model explainability is a question of fundamental scientific interest: neural networks are black boxes, and it’s natural to want to understand how they do the remarkable things they do. But, Zheng explains, it’s also a question with immediate business implications.

“Sometimes the business needs insights, in which case an explainable model is the thing that they’re looking for,” Zheng says. “Oftentimes you need some way to examine if something is going wrong. It gets back to model operations.”

The need for model explainability, Zheng says, frequently arises in the context of her work at Amazon, which focuses on the design of algorithms for automatically bidding on advertising space on other sites and responding to bids for space on Amazon sites.

“Digital advertising, or programmatic advertising, operates in a high-volume, high-velocity environment,” Zheng explains. “Websites send bid requests out to ad exchanges, and then the ad exchanges contract with different bidders. And the bidders say, ‘Okay, I’m willing to pay this much for an opportunity to show an ad.’ Our algorithms need to make split-second decisions about where to bid and how much to bid.

“The advertisers want to know, ‘What can I do to improve the performance of my ad?’ They’re trying to understand this black-box system. They are always looking for insights in terms of, ‘How can I better set up? What actually drives conversions?’

“And then when we operate our model, we are also interested in general model analysis. Say our conversion rate prediction model is giving a lot of weight to ads that are selling consumer products. Because customers may buy toilet paper every month, whereas they buy a camera maybe every two years. Is that helping advertisers reach the right audience? By being able to understand what the model is doing, we can detect problems, and we can improve it.”

Source link

Related Topics:Ad-related technologies Demand forecasting Explainable AI ICML

Up Next

Automatically identifying scene boundaries in movies and TV shows

Don't Miss

New take on hierarchical time series forecasting improves accuracy

Larry Hardesty

Click to comment

Events & Conferences

A New Ranking Framework for Better Notification Quality on Instagram

Published

3 days ago

September 2, 2025

Xian Sun

We’re sharing how Meta is applying machine learning (ML) and diversity algorithms to improve notification quality and user experience.
We’ve introduced a diversity-aware notification ranking framework to reduce uniformity and deliver a more varied and engaging mix of notifications.
This new framework reduces the volume of notifications and drives higher engagement rates through more diverse outreach.

Notifications are one of the most powerful tools for bringing people back to Instagram and enhancing engagement. Whether it’s a friend liking your photo, another close friend posting a story, or a suggestion for a reel you might enjoy, notifications help surface moments that matter in real time.

Instagram leverages machine learning (ML) models to decide who should get a notification, when to send it, and what content to include. These models are trained to optimize for user positive engagement such as click-through-rate (CTR) – the probability of a user clicking a notification – as well as other metrics like time spent.

However, while engagement-optimized models are effective at driving interactions, there’s a risk that they might overprioritize the product types and authors someone has previously engaged with. This can lead to overexposure to the same creators or the same product types while overlooking other valuable and diverse experiences.

This means people could miss out on content that would give them a more balanced, satisfying, and enriched experience. Over time, this can make notifications feel spammy and increase the likelihood that people will disable them altogether.

The real challenge lies in finding the right balance: How can we introduce meaningful diversity into the notification experience without sacrificing the personalization and relevance people on Instagram have come to expect?

To tackle this, we’ve introduced a diversity-aware notification ranking framework that helps deliver more diverse, better curated, and less repetitive notifications. This framework has significantly reduced daily notification volume while improving CTR. It also introduces several benefits:

The extensibility of incorporating customized soft penalty (demotion) logic for each dimension, enabling more adaptive and sophisticated diversity strategies.
The flexibility of tuning demotion strength across dimensions like content, author, and product type via adjustable weights.
The integration of balancing personalization and diversity, ensuring notifications remain both relevant and varied.

The Risks of Notifications without Diversity

The issue of overexposure in notifications often shows up in two major ways:

Overexposure to the same author: People might receive notifications that are mostly about the same friend. For example, if someone often interacts with content from a particular friend, the system may continue surfacing notifications from that person alone – ignoring other friends they also engage with. This can feel repetitive and one-dimensional, reducing the overall value of notifications.

Overexposure to the same product surface: People might mostly receive notifications from the same product surface such as Stories, even when Feed or Reels could provide value. For example, someone may be interested in both reel and story notifications but has recently interacted more often with stories. Because the system heavily prioritizes past engagement, it sends only story notifications, overlooking the person’s broader interests.

Introducing Instagram’s Diversity-Aware Notification Ranking Framework

Instagram’s diversity-aware notification ranking framework is designed to enhance the notification experience by balancing the predicted potential for user engagement with the need for content diversity. This framework introduces a diversity layer on top of the existing engagement ML models, applying multiplicative penalties to the candidate scores generated by these models, as figure1, below, shows.

The diversity layer evaluates each notification candidate’s similarity to recently sent notifications across multiple dimensions such as content, author, notification type, and product surface. It then applies carefully calibrated penalties—expressed as multiplicative demotion factors—to downrank candidates that are too similar or repetitive. The adjusted scores are used to re-rank the candidates, enabling the system to select notifications that maintain high engagement potential while introducing meaningful diversity. In the end, the quality bar selects the top-ranked candidate that passes both the ranking and diversity criteria.

Figure.1: Instagram’s diversity-aware ranking framework where the diversity layer sits on top of the existing modeling layer and penalizes notifications that are too similar to recently sent ones.

Mathematical Formulation

Within the diversity layer, we apply a multiplicative demotion factor to the base relevance score of each candidate. Given a notification candidate 𝑐, we compute its final score as the product of its base ranking score and a diversity demotion multiplier:

$\text{Score}(c) = R(c) \times D(c)$

where R(c) represents the candidate’s base relevance score, and D(c) ∈ [0,1] is a penalty factor that reduces the score based on similarity to recently sent notifications. We define a set of semantic dimensions (e.g., author, product type) along which we want to promote diversity. For each dimension i, we compute a similarity signal p_i(c) between candidate c and the set of historical notifications H, using a maximal marginal relevance (MMR) approach:

$p_i(c) = \mathrm{max}_{h \in H}\mathrm{sim}_i(c, h)$

where sim_i(·,·) is a predefined similarity function for dimension i. In our baseline implementation, p_i(c) is binary: it equals 1 if the similarity exceeds a threshold 𝜏_i and 0 otherwise.

The final demotion multiplier is defined as:

$D(c) = \prod_{i=1}^{m} \left( 1 - w_i \cdot p_i(c) \right)$

where each w_i∈ [0,1] controls the strength of demotion for its respective dimension. This formulation ensures that candidates similar to previously delivered notifications along one or more dimensions are proportionally down-weighted, reducing redundancy and promoting content variation. The use of a multiplicative penalty allows for flexible control across multiple dimensions, while still preserving high-relevance candidates.

The Future of Diversity-Aware Ranking

As we continue evolving our notification diversity-aware ranking system, a next step is to introduce more adaptive, dynamic demotion strategies. Instead of relying on static rules, we plan to make demotion strength responsive to notification volume and delivery timing. For example, as a user receives more notifications—especially of similar type or in rapid succession—the system progressively applies stronger penalties to new notification candidates, effectively mitigating overwhelming experiences caused by high notification volume or tightly spaced deliveries.

Longer term, we see an opportunity to bring large language models (LLMs) into the diversity pipeline. LLMs can help us go beyond surface-level rules by understanding semantic similarity between messages and rephrasing content in more varied, user-friendly ways. This would allow us to personalize notification experiences with richer language and improved relevance while maintaining diversity across topics, tone, and timing.

Source link

Events & Conferences

Simplifying book discovery with ML-powered visual autocomplete suggestions

Published

3 days ago

September 2, 2025

Mao Sheng Liu

Every day, millions of customers search for books in various formats (audiobooks, e-books, and physical books) across Amazon and Audible. Traditional keyword autocomplete suggestions, while helpful, usually require several steps before customers find their desired content. Audible took on the challenge of making book discovery more intuitive and personalized while reducing the number of steps to purchase.

We developed an instant visual autocomplete system that enhances the search experience across Amazon and Audible. As the user begins typing a query, our solution provides visual previews with book covers, enabling direct navigation to relevant landing pages instead of the search result page. It also delivers real-time personalized format recommendations and incorporates multiple searchable entities, such as book pages, author pages, and series pages.

1 of 2

Audible’s visual-autocomplete experience.

2 of 2

Amazon’s visual-autocomplete experience.

Our system needed to understand user intent from just a few keystrokes and determine the most relevant books to display, all while maintaining low latency for millions of queries. Using historical search data, we match keystrokes to products, transforming partial inputs into meaningful search suggestions. To ensure quality, we implemented confidence-based filtering mechanisms, which are particularly important for distinguishing between general queries like “mystery” and specific title searches. To reflect customers’ most recent interests, the system applies time-decay functions to long historical user interaction data.

Events & Conferences

Revolutionizing warehouse automation with scientific simulation

Published

1 week ago

August 26, 2025

Deniz Akyildiz

Modern warehouses rely on complex networks of sensors to enable safe and efficient operations. These sensors must detect everything from packages and containers to robots and vehicles, often in changing environments with varying lighting conditions. More important for Amazon, we need to be able to detect barcodes in an efficient way.