AI Research

AI chatbots struggle to function beyond English: ‘They know a lot … but they miss the culture’

Published

1 month ago

July 25, 2025

The world’s leading AI chatbots can now generate everything from emails to research papers—in English. But shift to a different language, and AI’s performance begins to slip.

Most large language models are “a bit like a Fulbright scholar who is interested in Asia as their area of study,” said Kalika Bali, a senior principal researcher at Microsoft Research India at the Fortune Brainstorm AI Singapore conference on Wednesday. “They know a lot about the [subject], but they miss the culture. It’s an outsider’s gaze into the culture of a country.”

Bali pointed to a classic math question—”John and Mary have a key lime pie which they need to divide into five parts”—to show the trouble of using a culturally clueless AI.

Generic AI models will translate the prompt directly. But as Bali pointed out, “in a country like India, most people don’t know what a pie is, [let alone] a key lime pie.”

To develop models that better understand local culture, more data is needed in local languages. But getting that data is not always simple.

Roughly half of all web content is in English, meaning there’s no shortage of high-quality digital resources for LLMs to learn English from. For other languages that do not enjoy this same abundance, developers have to explore different methods of getting training data.

Kasima Tharnpipitchai, head of AI strategy at SCB 10X, highlighted the foundational work by native speakers needed to build a training dataset.

Tharnpipitchai led SCB 10X’s project to launch the Thai LLM Typhoon. To build a dataset in Thai, Tharnpipitchai said that native speakers had to sift through open large datasets by hand, determining which Thai data sources were high-quality and which were not.

“There are no tricks here, you really have to do the work,” he said. “It really is just effort. It’s almost brute force.”

SCB 10X launched Typhoon a year and a half ago. Tharnpipitchai said Typhoon was able to outperform GPT-3.5 in Thai, a fact which “says more about how poorly GPT-3.5 was performing in Thai” than their own work.

Yet scraping non-English web data is beginning to raise legal concerns.

Khalil Nooh, cofounder and CEO of Malaysian startup Mesolitica, which is developing a Malay LLM, said that the company has had data owners request their sources be removed from the training dataset, which is available online since they are an open-source model.

This has further limited the already small pool of high-quality data they have in Malay. To solve this, “the challenge for us is to work with private dataset owners,” Nooh said.

Both Nooh and Bali are exploring synthetic data generation to help create more high-quality data in their target languages. Machines can translate the abundant English content online into other languages to supplement their limited datasets. This is especially useful for LLMs trying to work in regional dialects that have almost no digital presence otherwise.

“How we are able to capture all the 16 dialects in Malaysia is through synthetic [data],” said Nooh.

But there are some obstacles to getting data that neither “brute force” nor machine generation can overcome. In many communities, researchers must balance getting a full picture with managing cultural sensitivities when collecting data in local languages.

While “on the whole, India is very tech positive,” Bali noted, “there are things that you would not ask” when doing on-the-ground data collection. Local communities may not want to share information on certain topics, even if it is widely known among people in the region.

Nooh added that in Malaysia, the three Rs—“race, religion, and royalty”—are all subjects of regional sensitivity.

Although there are currently no regulations on what LLMs can “say” in Malaysia, Nooh said that Mesolitica has “gone ahead to prepare the components that are needed if ever that is required to be implemented.”

To tackle cultural sensitivities in Thailand, Tharnpipitchai similarly explained that SCB 10X released a “safety model” for public sector use, in addition to their regular Typhoon model.

Source link

Related Topics:Brainstorm AI Editor's Picks Featured language Microsoft Southeast Asia 500

Up Next

How Artificial Intelligence Can Enhance Healthcare Research

Don't Miss

Like Dividends? These 3 AI Stocks Pay Investors – July 25, 2025

Cecilia Hult

Click to comment

AI Research

UCLA Researchers Enable Paralyzed Patients to Control Robots with Thoughts Using AI – CHOSUNBIZ – Chosun Biz

Published

44 minutes ago

September 1, 2025

The Editors

UCLA Researchers Enable Paralyzed Patients to Control Robots with Thoughts Using AI – CHOSUNBIZ Chosun Biz

Source link

AI Research

Hackers exploit hidden prompts in AI images, researchers warn

Published

2 hours ago

September 1, 2025

News Desk

Cybersecurity firm Trail of Bits has revealed a technique that embeds malicious prompts into images processed by large language models (LLMs). The method exploits how AI platforms compress and downscale images for efficiency. While the original files appear harmless, the resizing process introduces visual artifacts that expose concealed instructions, which the model interprets as legitimate user input.

In tests, the researchers demonstrated that such manipulated images could direct AI systems to perform unauthorized actions. One example showed Google Calendar data being siphoned to an external email address without the user’s knowledge. Platforms affected in the trials included Google’s Gemini CLI, Vertex AI Studio, Google Assistant on Android, and Gemini’s web interface.

The approach builds on earlier academic work from TU Braunschweig in Germany, which identified image scaling as a potential attack surface in machine learning. Trail of Bits expanded on this research, creating “Anamorpher,” an open-source tool that generates malicious images using interpolation techniques such as nearest neighbor, bilinear, and bicubic resampling.

From the user’s perspective, nothing unusual occurs when such an image is uploaded. Yet behind the scenes, the AI system executes hidden commands alongside normal prompts, raising serious concerns about data security and identity theft. Because multimodal models often integrate with calendars, messaging, and workflow tools, the risks extend into sensitive personal and professional domains.

Also Read: Nvidia CEO Jensen Huang says AI boom far from over

Traditional defenses such as firewalls cannot easily detect this type of manipulation. The researchers recommend a combination of layered security, previewing downscaled images, restricting input dimensions, and requiring explicit confirmation for sensitive operations.

“The strongest defense is to implement secure design patterns and systematic safeguards that limit prompt injection, including multimodal attacks,” the Trail of Bits team concluded.

Source link

AI Research

When AI Freezes Over | Psychology Today

Published

2 hours ago

September 1, 2025

John Nosta

A phrase I’ve often clung to regarding artificial intelligence is one that is also cloaked in a bit of techno-mystery. And I bet you’ve heard it as part of the lexicon of technology and imagination: “emergent abilities.” It’s common to hear that large language models (LLMs) have these curious “emergent” behaviors that are often coupled with linguistic partners like scaling and complexity. And yes, I’m guilty too.

In AI research, this phrase first took off after a 2022 paper that described how abilities seem to appear suddenly as models scale and tasks that a small model fails at completely, a larger model suddenly handles with ease. One day a model can’t solve math problems, the next day it can. It’s an irresistible story as machines have their own little Archimedean “eureka!” moments. It’s almost as if “intelligence” has suddenly switched on.

But I’m not buying into the sensation, at least not yet. A newer 2025 study suggests we should be more careful. Instead of magical leaps, what we’re seeing looks a lot more like the physics of phase changes.

Ice, Water, and Math

Think about water. At one temperature it’s liquid, at another it’s ice. The molecules don’t become something new—they’re always two hydrogens and an oxygen—but the way they organize shifts dramatically. At the freezing point, hydrogen bonds “loosely set” into a lattice, driven by those fleeting electrical charges on the hydrogen atoms. The result is ice, the same ingredients reorganized into a solid that’s curiously less dense than liquid water. And, yes, there’s even a touch of magic in the science as ice floats. But that magic melts when you learn about Van der Waals forces.

The same kind of shift shows up in LLMs and is often mislabeled as “emergence.” In small models, the easiest strategy is positional, where computation leans on word order and simple statistical shortcuts. It’s an easy trick that works just enough to reduce error. But scale things up by using more parameters and data, and the system reorganizes. The 2025 study by Cui shows that, at a critical threshold, the model shifts into semantic mode and relies on the geometry of meaning in its high-dimensional vector space. It isn’t magic, it’s optimization. Just as water molecules align into a lattice, the model settles into a more stable solution in its mathematical landscape.

The Mirage of “Emergence”

That 2022 paper called these shifts emergent abilities. And yes, tasks like arithmetic or multi-step reasoning can look as though they “switch on.” But the model hasn’t suddenly “understood” arithmetic. What’s happening is that semantic generalization finally outperforms positional shortcuts once scale crosses a threshold. Yes, it’s a mouthful. But happening here is the computational process that is shifting from a simple “word position” in a prompt (like, the cat in the _____) to a complex, hyperdimensional matrix where semantic associations across thousands of dimensions create amazing strength to the computation.

And those sudden jumps? They’re often illusions. On simple pass/fail tests, a model can look stuck at zero until it finally tips over the line and then it seems to leap forward. In reality, it was improving step by step all along. The so-called “light-bulb moment” is really just a quirk of how we measure progress. No emergence, just math.

Why “Emergence” Is So Seductive

Why does the language of “emergence” stick? Because it borrows from biology and philosophy. Life “emerges” from chemistry as consciousness “emerges” from neurons. It makes LLMs sound like they’re undergoing cognitive leaps. Some argue emergence is a hallmark of complex systems, and there’s truth to that. So, to a degree, it does capture the idea of surprising shifts.

But we need to be careful. What’s happening here is still math, not mind. Calling it emergence risks sliding into anthropomorphism, where sudden performance shifts are mistaken for genuine understanding. And it happens all the time.

A Useful Imitation

The 2022 paper gave us the language of “emergence.” The 2025 paper shows that what looks like emergence is really closer to a high-complexity phase change. It’s the same math and the same machinery. At small scales, positional tricks (word sequence) dominate. At large scales, semantic structures (multidimensional linguistic analysis) win out.

No insight, no spark of consciousness. It’s just a system reorganizing under new constraints. And this supports my larger thesis: What we’re witnessing isn’t intelligence at all, but anti-intelligence, a powerful, useful imitation that mimics the surface of cognition without the interior substance that only a human mind offers.

Artificial Intelligence Essential Reads

So the next time you hear about an LLM with “emergent ability,” don’t imagine Archimedes leaping from his bath. Picture water freezing. The same molecules, new structure. The same math, new mode. What looks like insight is just another phase of anti-intelligence that is complex, fascinating, even beautiful in its way, but not to be mistaken for a mind.

Source link