AI Research
AI could be about to completely change the way we do mathematics
AI is getting better at handling mathematical research
lucadp/Getty Images
Is an artificial intelligence revolution about to transform mathematics? Some prominent mathematicians think so, thanks to automated tools that can help write proofs suddenly showing impressive leaps in capability, with the potential to change the way maths research is done.
Around 100 of the world’s top mathematicians gathered at the University of Cambridge in June for a conference whose theme was based on whether computers might help mathematicians resolve some long-standing problems over how to check that their proofs were correct. This process, known as formalisation, doesn’t necessarily have to involve artificial intelligence, and indeed a similar meeting held at Cambridge in 2017 made no mention of AI.
But eight years later, AI has come on by leaps and bounds, most notably with the success of large language models powering tools like ChatGPT. This has attracted new interest to the question of how AI might affect mathematics, from automatically translating human-written proofs into a formal, computer-checkable language, to constructing proofs themselves.
“It’s a little bit overwhelming,” says Jeremy Avigad at Carnegie Mellon University in Pennsylvania, who helped organise the meeting. “It’s nice; I’ve been doing this for a long time and it used to be kind of a fringe, niche thing. All of a sudden, I find myself popular.”
Two of the talks were put on by Google DeepMind, which last year made headlines when its AI system, AlphaProof, achieved a silver medal score at the International Mathematical Olympiad (IMO), a prestigious competition for young mathematicians and a long-standing target for AI systems. “If you talk to mathematicians and ask them about the [AlphaProof] IMO results, you would have had different reactions. I think most would say that these are pretty hard high school problems, but maybe some other mathematicians would call them trivial,” says Thomas Hubert, a research engineer at DeepMind.
Hubert and his team showed that AlphaProof could go beyond the IMO competition, to aid in formalising a small part of the prime number theorem, an important result in number theory. The mathematics had already been converted to Lean, a programming language, but AlphaProof was able to prove the theorem and then check it was correct. “I wanted to do a demo of how AlphaProof could be used in real life,” says Hubert.
Morph Labs, a US-based AI startup, also demonstrated an AI tool called Trinity, which is designed to translate proofs completely automatically, starting from the handwritten mathematical notation and generating a fully formalised and checked proof in Lean. Bhavik Mehta at Imperial College London, who has consulted with Morph Labs, showed an example of Trinity proving a theorem relating to the ABC conjecture, which infamously remains the subject of intense debate about whether it is true or not – something that formalisation could address.
Though this proof was only a tiny building block of the entire proof needed for the ABC conjecture, and Trinity required a slightly more detailed version of the handwritten proof than the original paper, many people were surprised at how much correct mathematical code was generated by the tool.
“The difference between what Morph did and what had gone before was that they took an entire maths paper, albeit one from 1962 that was only four pages long, a human [then] broke the argument down into small pieces and then a machine just translated the entire thing into Lean,” says Kevin Buzzard at Imperial College London. “I’m not sure we’ve seen anything like that before.”
However, it still isn’t clear how this will work for other areas of mathematics, says Mehta. “This was essentially the first attempt, and the first attempt worked, maybe it got lucky.”
Christian Szegedy at Morph Labs claims that once the tool is fully up and running, it will quickly progress. “Once a feedback loop is established and we don’t need the amount of hand-holding that this theorem needed… then it becomes basically a chain reaction and we can do all of mathematics at once,” he says.
Tools like these are already at the point that they could be incredibly helpful to mathematicians, says Timothy Gowers at the University of Cambridge. “It will just take some work to create them, and there seem to be a lot of people who are keen to do precisely that, so I think that over the next few, and I really mean few, anything from one to five years, there will have been changes to how we do maths that will rival in importance the changes to mathematical practice brought about by email, LaTeX [the standardised maths notation], arXiv [an online paper repository] and Google.”
But not all mathematicians agree that the Morph Labs paper was so impressive. Rodrigo Ochigame at Leiden University in the Netherlands cautions that we don’t know the full details of the work. “They posted only a single, possibly cherry-picked, output of their system without writing a paper or sharing basic information about their methods. They didn’t even say if they tested their system on any other theorems,” he says. “When asked by the audience about the amount of compute used by the model, they repeatedly declined to answer, making it difficult to assess the significance of their result.”
And scepticism remains about just how useful AI tools can be. Most mathematicians still work without the use of automated tools, and it is unclear whether they will change their minds as the tools improve, says Minhyong Kim at the International Centre for Mathematical Sciences in the UK. “Mathematics and mathematicians are incredibly diverse in their inclinations. I imagine some people will end up using AI tools very effectively and creatively, while some will try to maintain a distance.”
“People underestimate the complexity, creativity and subtlety of mathematical research,” says Ochigame, which is why so much research is still done using pen, paper and deep thought. “There is a huge gap between high school math competitions, such as the IMO, and cutting-edge research,” he says.
Topics:
AI Research
The new frontier of medical malpractice
Although the beginnings of modern artificial intelligence (AI) can be traced
as far back as 1956, modern generative AI, the most famous example of which is
arguably ChatGPT, only began emerging in 2019. For better or worse, the steady
rise of generative AI has increasingly impacted the medical field. At this time, AI has begun to advance in a way that creates
potential liability…
AI Research
Pharmaceutical Innovation Rises as Global Funding Surges and AI Reshapes Clinical Research – geneonline.com
AI Research
Radiomics-Based Artificial Intelligence and Machine Learning Approach for the Diagnosis and Prognosis of Idiopathic Pulmonary Fibrosis: A Systematic Review – Cureus
-
Funding & Business7 days ago
Kayak and Expedia race to build AI travel agents that turn social posts into itineraries
-
Jobs & Careers7 days ago
Mumbai-based Perplexity Alternative Has 60k+ Users Without Funding
-
Mergers & Acquisitions7 days ago
Donald Trump suggests US government review subsidies to Elon Musk’s companies
-
Funding & Business7 days ago
Rethinking Venture Capital’s Talent Pipeline
-
Jobs & Careers6 days ago
Why Agentic AI Isn’t Pure Hype (And What Skeptics Aren’t Seeing Yet)
-
Funding & Business4 days ago
Sakana AI’s TreeQuest: Deploy multi-model teams that outperform individual LLMs by 30%
-
Jobs & Careers6 days ago
Astrophel Aerospace Raises ₹6.84 Crore to Build Reusable Launch Vehicle
-
Jobs & Careers6 days ago
Telangana Launches TGDeX—India’s First State‑Led AI Public Infrastructure
-
Funding & Business1 week ago
From chatbots to collaborators: How AI agents are reshaping enterprise work
-
Jobs & Careers4 days ago
Ilya Sutskever Takes Over as CEO of Safe Superintelligence After Daniel Gross’s Exit