Vision-language models, which map images and text to a common representational space, have demonstrated remarkable performance on a wide range of multimodal AI tasks. But they’re...
Zongyi (Joe) Liu is a principal scientist in the Amazon Customer Experience and Business Trends (CXBT) organization, which evaluates the customer experience across Amazon’s products and...
Most state-of-the-art computer vision models depend on supervised learning, in which labeled data is used for training. But labeling is costly, and the cost is compounded...
Logo recognition is the task of identifying a specific logo and its location in images or videos. It helps create a safe and trustworthy shopping experience,...
Streaming video can suffer from defects introduced during recording, encoding, packaging, or transmission, so most subscription video services — such as Amazon Prime Video — continually...
Image matching has many practical applications. For instance, image retrieval systems like Amazon’s StyleSnap or the Amazon Shopping app’s Camera Search let customers upload photos to...
A product page in the Amazon Store will often include links to product variants, which differ by color, size, style, and so on. Sometimes, however, errors...
Joe Tighe, senior manager for computer vision at Amazon Web Services, is a coauthor on two papers being presented at this year’s Winter Conference on Applications...
Like all of Amazon’s major technology groups, Amazon Prime Video has a dedicated team of scientists who are working constantly to find new ways to delight...
Gérard Medioni, an Amazon vice president and distinguished scientist, is the general chair at this year’s IEEE Winter Conference on Applications of Computer Vision (WACV), and...