Connect with us

AI Research

How artificial intelligence is tackling mathematical problem-solving

Published

on


The International Mathematical Olympiad (IMO) is arguably the leading mathematical problem-solving competition. Every year, high school students from around the world attempt six problems over the span of three hours. Students whose scores cross a threshold, roughly corresponding to solving five of the six problems, obtain Gold medals, with Silver and Bronze medals for those crossing other thresholds. The problems do not require advanced mathematical knowledge, but instead test for mathematical creativity. They are always new, and it is ensured that no similar problems are online or in the literature.

The AI gold medallist

IMO 2025 had some unusual participants. Even before the Olympiad closed, OpenAI, the maker of ChatGPT, announced that an experimental reasoning model of theirs had answered the Olympiad at the Gold medal level, following the same time limits as the human participants. Remarkably, this was not a model specifically trained or designed for the IMO, but a general-purpose reasoning model with reasoning powers good enough for an IMO Gold.

The OpenAI announcement raised some issues. Many felt that announcing an AI result while the IMO had not concluded overshadowed the achievements of the human participants. Also, the Gold medal score was graded and given by former IMO medalists hired by OpenAI, and some disputed whether the grading was correct. However, a couple of days later, another announcement came. Google-DeepMind attempted the IMO officially, with an advanced version of Gemini Deep Think. Three days after the Olympiad, with the permission of the IMO organisers, they announced that they had obtained a score at the level of a Gold medal. The IMO president Prof. Gregor Dolinar stated, “We can confirm that Google DeepMind has reached the much-desired milestone, earning 35 out of a possible 42 points — a gold medal score. Their solutions were astonishing in many respects. IMO graders found them to be clear, precise and most of them easy to follow.”

Stages of development

Even as it became a popular sensation, ChatGPT was infamous both for hallucinations (making up facts) and for simple arithmetic mistakes. Both these would make solving even modest mathematical problems mostly impossible.

The first advance that greatly reduced these errors, which came a few months after the launch of ChatGPT, was the use of so-called agents. Specifically, models were now able to use web searches to gather accurate information, and Python interpreters to run programs to perform calculations and check reasoning using numerical experiments. These made the models dramatically more accurate, and good enough to solve moderately hard mathematical problems. However, as a single error in a mathematical solution makes the solution invalid, these were not yet accurate enough to reach IMO (or research) level.

Greater accuracy can be obtained by pairing language models with formal proof systems such as the Lean prover — a computer software that can understand and check proofs. Indeed, for IMO 2024 such a system from Google-DeepMind called AlphaProof obtained a silver medal score (but it ran for two days).

Finally, a breakthrough came with the so-called reasoning models, such as o3 from OpenAI and Google-DeepMind’s Gemini-2.5-pro. These models are perhaps better described as internal monologue models. Before answering a complex question, they generate a monologue considering approaches, carrying them out, revisiting their proposed solutions, sometimes dithering and starting all over again, before finally giving a solution with which they are satisfied. It were such models, with some additional advances, that got Olympiad Gold medal scores.

Analogical reasoning and combining ingredients from different sources gives language models some originality, but probably not enough for hard and novel problems. However, verification either through the internal consistency of reasoning models or, better still, checking by the Lean prover, allows training by trying a large number of things and seeing what works, in the same way that AI systems became chess champions starting with just the rules.

Such reinforcement learning has allowed recent models to go beyond training data by creating their own synthetic data.

The implications

Olympiad problems, for both humans and AIs, are not ends in themselves but tests of mathematical problem-solving ability. There are other aspects of research besides problem-solving.

Growing anecdotal experiences suggest that AI systems have excellent capabilities in many of these too, such as suggesting approaches and related problems.

However, the crucial difference between problem-solving and research/development is scale. Research involves working for months or years without errors creeping in, and without wandering off in fruitless directions. As mentioned earlier, coupling models with the Lean prover can prevent errors. Indications are that it is only a matter of time before this is successful.

In the meantime, these models can act as powerful collaborators with human researchers, greatly accelerating research and development in all areas involving mathematics. The era of the super-scientist is here.

Siddhartha Gadgil is a professor in the Department of Mathematics, IISc 

Published – August 11, 2025 08:30 am IST



Source link

Continue Reading
Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

AI Research

StockGro launches AI stock research engine for retail investors

Published

on


By Vriti Gothi

Today

  • AI
  • Cross Border Payments
  • Digital Lending

Stockgro

StockGro, has launched of Stoxo, an AI-powered stock-market research engine designed exclusively for retail investors to bridge the gap between sophisticated market intelligence and everyday investors.

Stoxo harnesses advanced artificial intelligence to transform the way retail participants access, interpret, and act on market information. With its ability to analyse real-time trends, compare stocks across multiple parameters, and deliver actionable insights in an intuitive format, the platform offers retail investors a level of research capability once reserved for institutional players. Developed with an emphasis on accessibility and user-friendly design, Stoxo ensures that complex financial data is presented with clarity, empowering users to make confident, informed investment decisions.

The introduction of Stoxo positions StockGro at the forefront of India’s rapidly evolving investment ecosystem. The platform’s AI-driven architecture is built for scalability, enabling it to adapt seamlessly to shifting market conditions while maintaining the speed and precision required in modern trading environments. For customers, the impact is immediate greater transparency, enhanced decision-making power, and the ability to participate in the markets with a degree of insight previously out of reach for many retail investors.

Beyond individual benefit, Stoxo represents a step forward for the broader financial sector by fostering inclusivity and boosting retail participation. By providing institutional-grade research capabilities in a digital-first, user-friendly environment, StockGro is advancing financial literacy and enabling more Indians to take an active role in wealth creation.

With the launch of Stoxo, StockGro continues to redefine the boundaries of FinTech innovation, merging advanced technology with a deep understanding of investor needs to shape a more informed, empowered, and inclusive investing future for India.

Previous Article

Brex gets EU payment licence to expand across Europe

Read More


IBSi FinTech Journal

  • Most trusted FinTech journal since 1991
  • Digital monthly issue
  • 60+ pages of research, analysis, interviews, opinions, and rankings
  • Global coverage


Subscribe Now



Source link

Continue Reading

AI Research

Did Bill Gates Predict GPT-5’s Disappointment Before Launch?

Published

on


There had been a lot of hype and anticipation building around GPT-5 prior to its recent launch. OpenAI touted the tool as the smartest AI model while comparing it to an entire team of PhD-level experts. GPT-5 ships with a plethora of next-gen features across a wide range of categories, including coding, writing, and medicine.

The ChatGPT maker’s CEO, Sam Altman, previously claimed that something “smarter than the smartest person you know” will soon be running on a device in your pocket, potentially referring to GPT-5. However, the AI firm has received backlash from users following the model’s launch and its abrupt decision to deprecate the model’s predecessors.





Source link

Continue Reading

AI Research

Better Artificial Intelligence Stock: ASML vs. AMD

Published

on


ASML and AMD are pivotal players in the booming AI market, helping both to see strong sales so far this year.

Artificial intelligence (AI) remains a hot area to invest in, as seen in Nvidia‘s share price, which is up over 30% this year through Aug. 6. Two AI businesses to consider are ASML Holding (ASML 1.33%) and Advanced Micro Devices (AMD 0.17%), since they provide key hardware to the industry.

The former makes cutting-edge lithography machines, which are necessary for producing the advanced microchips that power AI systems. AMD, one of Nvidia’s top competitors, sells AI chips to cloud computing companies such as Microsoft.

ASML and AMD are both strong businesses. But determining which is a better AI investment isn’t simple. So let’s evaluate them in more detail.

Image source: Getty Images.

A look into ASML

ASML’s lithography equipment is essential for manufacturing AI microchips because the technology demands immense computing power. This necessitates shrinking chip components to minuscule dimensions. For instance, a microchip the size of your fingernail contains billions of transistors. ASML’s machines support this.

Although the Dutch company plays an important role in AI, its stock has struggled in 2025, remaining essentially flat through Aug. 6. Part of this is because management anticipates economic uncertainty ahead as a result of factors such as President Donald Trump’s aggressive tariff policies.

Even so, ASML expects 2025 sales to rise 15% over 2024’s 28.3 billion euros ($33 billion). This is significant since 2024’s revenue represents only a 2.6% year-over-year increase. And so far this year, the company is doing well.

Through two quarters, revenue stood at $18 billion, up from the prior year’s $13.4 billion. Operating income rose to $5.8 billion from 2024’s $3.7 billion. This robust growth resulted in net income of $5.4 billion, a strong increase over the previous year’s $3.3 billion.

The excellent first-half results were tempered by a third-quarter revenue forecast between $8.6 billion and $9.2 billion. This outlook, when compared to the prior year’s sales of $8.9 billion, suggests the current trend of strong year-over-year growth may be slowing down, which contributed to ASML’s tepid stock performance.

How AMD is faring

Like rival Nvidia, AMD stock is having a stellar year. Shares are up 35% in 2025 through Aug. 6. This performance is understandable following the company’s second-quarter earnings results. The quarter’s revenue reached a record $7.7 billion, a 32% year-over-year increase.

CEO Lisa Su said, “We are seeing robust demand across our computing and AI product portfolio and are well positioned to deliver significant growth in the second half of the year.” In that second half, AMD expects revenue of $8.7 billion, a strong increase over the previous year’s $6.8 billion.

Despite the sales growth, AMD exited the second quarter with an operating loss of $134 million compared to operating income of $269 million in the previous year. The substantial drop was due to new U.S. government restrictions introduced earlier this year on the sale of AI chips to China. As a result, AMD could not sell chips it had intended for Chinese customers, forcing the company to write off that inventory by $800 million.

Yet this makes its second-quarter sales growth all the more impressive. In the quarter, net income was $872 million, up 229% year over year. Consequently, diluted earnings per share soared 238% to $0.54 in a boon to shareholders.

AMD is working to get government approval to sell AI chips to China again. When that OK is obtained, the company is in a position to deliver more outsize sales growth.

Deciding between ASML and AMD

AMD’s outstanding performance, its anticipated third-quarter revenue growth, and an eventual return of sales to China point to it being the superior AI stock versus ASML.

However, an important consideration is share price valuation. The price-to-earnings ratio (P/E) tells you how much investors are willing to pay for a dollar’s worth of earnings based on the trailing 12 months.

ASML PE Ratio Chart

Data by YCharts.

The top chart shows ASML’s P/E ratio has declined over the past year, indicating its stock’s valuation has improved. Compared to AMD’s recently rising earnings multiple, as seen in the bottom chart, ASML shares look like a bargain.

ASML’s short-term sales may slow due to the current macroeconomic uncertainty, but over the long run, it’s likely to benefit from the rise of AI. The company sees the technology as a significant chance for growth in semiconductors, similar to previous opportunities like PCs, the internet, and smartphones.

Industry forecasts support ASML’s perspective. The AI sector is projected to grow from $244 billion in 2025 to $1 trillion by 2031. While this market growth is a tailwind for both companies, ASML’s attractive valuation makes it look like the more compelling AI stock to buy right now.

Robert Izquierdo has positions in ASML, Advanced Micro Devices, Microsoft, and Nvidia. The Motley Fool has positions in and recommends ASML, Advanced Micro Devices, Microsoft, and Nvidia. The Motley Fool recommends the following options: long January 2026 $395 calls on Microsoft and short January 2026 $405 calls on Microsoft. The Motley Fool has a disclosure policy.



Source link

Continue Reading

Trending