Connect with us

AI Research

NVIDIA aims to solve AI’s issues with many languages

Published

on


While AI might feel ubiquitous, it primarily operates in a tiny fraction of the world’s 7,000 languages, leaving a huge portion of the global population behind. NVIDIA aims to fix this glaring blind spot, particularly within Europe.

The company has just released a powerful new set of open-source tools aimed at giving developers the power to build high-quality speech AI for 25 different European languages. This includes major languages, but more importantly, it offers a lifeline to those often overlooked by big tech, such as Croatian, Estonian, and Maltese.

The goal is to let developers create the kind of voice-powered tools many of us take for granted, from multilingual chatbots that actually understand you to customer service bots and translation services that work in the blink of an eye.

The centrepiece of this initiative is Granary, an enormous library of human speech. It contains around a million hours of audio, all curated to help teach AI the nuances of speech recognition and translation.

To make use of this speech data, NVIDIA is also providing two new AI models designed for language tasks:

  • Canary-1b-v2, a large model built for high accuracy on complex transcription and translation jobs.
  • Parakeet-tdt-0.6b-v3, which is designed for real-time applications where speed is everything.

If you’re keen to dive into the science behind it, the paper on Granary will be presented at the Interspeech conference in the Netherlands this month. For the developers eager to get their hands dirty, the dataset and both models are already available on Hugging Face.

The real magic, however, lies in how this data was created. We all know that training AI requires vast amounts of data, but getting it is usually a slow, expensive, and frankly tedious process of human annotation.

To get around this, NVIDIA’s speech AI team – working with researchers from Carnegie Mellon University and Fondazione Bruno Kessler – built an automated pipeline. Using their own NeMo toolkit, they were able to take raw, unlabelled audio and whip it into high-quality, structured data that an AI can learn from.

This isn’t just a technical achievement; it’s a huge leap for digital inclusivity. It means a developer in Riga or Zagreb can finally build voice-powered AI tools that properly understand their local languages. And they can do it more efficiently. The research team found that their Granary data is so effective that it takes about half the amount of it to reach a target accuracy level compared to other popular datasets.

The two new models demonstrate this power. Canary is frankly a beast, offering translation and transcription quality that rivals models three times its size, but with up to ten times the speed. Parakeet, meanwhile, can chew through a 24-minute meeting recording in one go, automatically figuring out what language is being spoken. Both models are smart enough to handle punctuation, capitalisation, and provide word-level timestamps, which is required for building professional-grade applications.

By putting these powerful tools and the methods behind them into the hands of the global developer community, NVIDIA isn’t just releasing a product. It’s kickstarting a new wave of innovation, hoping to create a world where AI speaks your language, no matter where you’re from.

(Photo by Aedrian Salazar)

See also: DeepSeek reverts to Nvidia for R2 model after Huawei AI chip fails

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with other leading events including Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Explore other upcoming enterprise technology events and webinars powered by TechForge here.



Source link

AI Research

OpenAI business to burn $115 billion through 2029 The Information

Published

on


OpenAI CEO Sam Altman walks on the day of a meeting of the White House Task Force on Artificial Intelligence (AI) Education in the East Room at the White House in Washington, D.C., U.S., September 4, 2025.

Brian Snyder | Reuters

OpenAI has sharply raised its projected cash burn through 2029 to $115 billion as it ramps up spending to power the artificial intelligence behind its popular ChatGPT chatbot, The Information reported on Friday.

The new forecast is $80 billion higher than the company previously expected, the news outlet said, without citing a source for the report.

OpenAI, which has become one of the world’s biggest renters of cloud servers, projects it will burn more than $8 billion this year, some $1.5 billion higher than its projection from earlier this year, the report said.

The company did not immediately respond to Reuters request for comment.

To control its soaring costs, OpenAI will seek to develop its own data center server chips and facilities to power its technology, The Information said.

OpenAI is set to produce its first artificial intelligence chip next year in partnership with U.S. semiconductor giant Broadcom, the Financial Times reported on Thursday, saying OpenAI plans to use the chip internally rather than make it available to customers.

The company deepened its tie-up with Oracle in July with a planned 4.5-gigawatts of data center capacity, building on its Stargate initiative, a project of up to $500 billion and 10 gigawatts that includes Japanese technology investor SoftBank. OpenAI has also added Alphabet’s Google Cloud among its suppliers for computing capacity.

The company’s cash burn will more than double to over $17 billion next year, $10 billion higher than OpenAI’s earlier projection, with a burn of $35 billion in 2027 and $45 billion in 2028, The Information said.

Read the complete report by The Information here.



Source link

Continue Reading

AI Research

Who is Shawn Shen? The Cambridge alumnus and ex-Meta scientist offering $2M to poach AI researchers

Published

on


Shawn Shen, co-founder and Chief Executive Officer of the artificial intelligence (AI) startup Memories.ai, has made headlines for offering compensation packages worth up to $2 million to attract researchers from top technology companies. In a recent interview with Business Insider, Shen explained that many scientists are leaving Meta, the parent company of Facebook, due to constant reorganisations and shifting priorities.“Meta is constantly doing reorganizations. Your manager and your goals can change every few months. For some researchers, it can be really frustrating and feel like a waste of time,” Shen told Business Insider, adding that this is a key reason why researchers are seeking roles at startups. He also cited Meta Chief Executive Officer Mark Zuckerberg’s philosophy that “the biggest risk is not taking any risks” as a motivation for his own move into entrepreneurship.With Memories.ai, a company developing AI capable of understanding and remembering visual data, Shen is aiming to build a niche team of elite researchers. His company has already recruited Chi-Hao Wu, a former Meta research scientist, as Chief AI Officer, and is in talks with other researchers from Meta’s Superintelligence Lab as well as Google DeepMind.

From full scholarships to Cambridge classrooms

Shen’s academic journey is rooted in engineering, supported consistently by merit-based scholarships. He studied at Dulwich College from 2013 to 2016 on a full scholarship, completing his A-Level qualifications.He then pursued higher education at the University of Cambridge, where he was awarded full scholarships throughout. Shen earned a Bachelor of Arts (BA) in Engineering (2016–2019), followed by a Master of Engineering (MEng) at Trinity College (2019–2020). He later continued at Cambridge as a Meta PhD Fellow, completing his Doctor of Philosophy (PhD) in Engineering between 2020 and 2023.

Early career: Internships in finance and research

Alongside his academic pursuits, Shen gained early experience through internships and analyst roles in finance. He worked as a Quantitative Research Summer Analyst at Killik & Co in London (2017) and as an Investment Banking Summer Analyst at Morgan Stanley in Shanghai (2018).Shen also interned as a Research Scientist at the Computational and Biological Learning Lab at the University of Cambridge (2019), building the foundations for his transition into advanced AI research.

From Meta’s Reality Labs to academia

After completing his PhD, Shen joined Meta (Reality Labs Research) in Redmond, Washington, as a Research Scientist (2022–2024). His time at Meta exposed him to cutting-edge work in generative AI, but also to the frustrations of frequent corporate restructuring. This experience eventually drove him toward building his own company.In April 2024, Shen began his academic career as an Assistant Professor at the University of Bristol, before launching Memories.ai in October 2024.

Betting on talent with $2M offers

Explaining his company’s aggressive hiring packages, Shen told Business Insider: “It’s because of the talent war that was started by Mark Zuckerberg. I used to work at Meta, and I speak with my former colleagues often about this. When I heard about their compensation packages, I was shocked — it’s really in the tens of millions range. But it shows that in this age, AI researchers who make the best models and stand at the frontier of technology are really worth this amount of money.”Shen noted that Memories.ai is looking to recruit three to five researchers in the next six months, followed by up to ten more within a year. The company is prioritising individuals willing to take a mix of equity and cash, with Shen emphasising that these recruits would be treated as founding members rather than employees.By betting heavily on talent, Shen believes Memories.ai will be in a strong position to secure additional funding and establish itself in the competitive AI landscape.His bold $2 million offers may raise eyebrows, but they also underline a larger truth: in today’s technology race, the fiercest competition is not for customers or capital, it’s for talent.





Source link

Continue Reading

AI Research

The Energy Monster AI Is Creating

Published

on


We don’t really know how much energy artificial intelligence is consuming. There aren’t any laws currently on the books requiring AI companies to disclose their energy usage or environmental impact, and most firms therefore opt to keep that controversial information close to the vest. Plus, large language models are evolving all the time, increasing in both complexity and efficiency, complicating outside efforts to quantify the sector’s energy footprint. But while we don’t know exactly how much electricity data centers are eating up to power ever-increasing AI integration, we do know that it’s a whole lot. 

“AI’s integration into almost everything from customer service calls to algorithmic “bosses” to warfare is fueling enormous demand,” the Washington Post recently reported. “Despite dramatic efficiency improvements, pouring those gains back into bigger, hungrier models powered by fossil fuels will create the energy monster we imagine.”

And that energy monster is weighing heavily on the minds of policymakers around the world. Global leaders are busily wringing their hands over the potentially disastrous impact AI could have on energy security, especially in countries like Ireland, Saudi Arabia, and Malaysia, where planned data center development outpaces planned energy capacity. 

In a rush to keep ahead of a critical energy shortage, public and private entities involved on both the tech and energy sides of the issue have been rushing to increase energy production capacities by any means. Countries are in a rush to build new power plants as well as to keep existing energy projects online beyond their planned closure dates. Many of these projects are fossil fuel plants, causing outcry that indiscriminate integration of artificial intelligence is undermining the decarbonization goals of nations and tech firms the world over. 

“From the deserts of the United Arab Emirates to the outskirts of Ireland’s capital, the energy demands of AI applications and training running through these centres are driving the surge of investment into fossil fuels,” reports the Financial Times. Globally, more than 85 gas-powered facilities are currently being built to meet AI’s energy demand according to figures from Global Energy Monitor.

In the United States, the demand surge is leading to the resurrection of old coal plants. Coal has been in terminal decline for years now in the U.S., and a large number of defunct plants are scattered around the country with valuable infrastructure that could lend itself to a speedy new power plant hookup. Thanks to the AI revolution, many of these plants are now set to come back online as natural gas-fired plants. While gas is cleaner than coal, the coal-to-gas route may come at the expense of clean energy projects that could have otherwise used the infrastructure and coveted grid hookups of defunct coal-fired power plants. 

“Our grid isn’t short on opportunity — it’s short on time,” Carson Kearl, Enverus senior analyst for energy and AI, recently told Fortune. “These grid interconnections are up for grabs for new power projects when these coal plants roll off. The No. 1 priority for Big Tech has changed to [speed] to energy, and this is the fastest way to go in a lot of cases,” Kearl continued.

Last year, Google stated that the company’s carbon emissions had skyrocketed by a whopping 48 percent over the last five years thanks to its AI integration. “AI-powered services involve considerably more computer power – and so electricity – than standard online activity, prompting a series of warnings about the technology’s environmental impact,” the BBC reported last summer. Google had previously pledged to reach net zero greenhouse gas emissions by 2030, but the company now concedes that “as we further integrate AI into our products, reducing emissions may be challenging.”

By Haley Zaremba for Oilprice.com 

More Top Reads From Oilprice.com





Source link

Continue Reading

Trending