AI Research

Optimizing LLM-based trip planning

Published

3 months ago

June 6, 2025

Many real-world planning tasks involve both harder “quantitative” constraints (e.g., budgets or scheduling requirements) and softer “qualitative” objectives (e.g., user preferences expressed in natural language). Consider someone planning a week-long vacation. Typically, this planning would be subject to various clearly quantifiable constraints, such as budget, travel logistics, and visiting attractions only when they are open, in addition to a number of constraints based on personal interests and preferences that aren’t easily quantifiable.

Large language models (LLMs) are trained on massive datasets and have internalized an impressive amount of world knowledge, often including an understanding of typical human preferences. As such, they are generally good at taking into account the not-so-quantifiable parts of trip planning, such as the ideal time to visit a scenic view or whether a restaurant is kid-friendly. However, they are less reliable at handling quantitative logistical constraints, which may require detailed and up-to-date real-world information (e.g., bus fares, train schedules, etc.) or complex interacting requirements (e.g., minimizing travel across multiple days). As a result, LLM-generated plans can at times include impractical elements, such as visiting a museum that would be closed by the time you can travel there.

We recently introduced AI trip ideas in Search, a feature that suggests day-by-day itineraries in response to trip-planning queries. In this blog, we describe some of the work that went into overcoming one of the key challenges in launching this feature: ensuring the produced itineraries are practical and feasible. Our solution employs a hybrid system that uses an LLM to suggest an initial plan combined with an algorithm that jointly optimizes for similarity to the LLM plan and real-world factors, such as travel time and opening hours. This approach integrates the LLM’s ability to handle soft requirements with the algorithmic precision needed to meet hard logistical constraints.

Source link

AI Research

OpenAI and NVIDIA will join President Trump’s UK state visit

Published

1 hour ago

September 14, 2025

Richard Devine

U.S. President Donald Trump is about to do something none of his predecessors have — make a second full state visit to the UK. Ordinarily, a President in a second term of office visits, meets with the monarch, but doesn’t get a second full state visit.

On this one it seems he’ll be accompanied by two of the biggest faces in the ever-growing AI race; OpenAI CEO, Sam Altman, and NVIDIA CEO, Jensen Huang.

This is according to a report by the Financial Times, which claims that the two are accompanying President Trump to announce a “large artificial intelligence infrastructure deal.”

The deal is said to support a number of data center projects in the UK, another deal towards developing “sovereign” AI for another of the United States’ allies.

The report claims that the two CEOs will announce the deal during the Trump state visit, and will see OpenAI supply the technology, and NVIDIA the hardware. The UK will supply all the energy required, which is handy for the two companies involved.

UK energy is some of the most expensive in the world (one reason I’m trying to use my gaming PC with an RTX 5090 a lot less!)

The exact makeup of the deal is still unknown, and, naturally, neither the U.S. nor UK governments have said anything at this point.

AI has helped push NVIDIA to the lofty height of being the world’s most valuable company. (Image credit: Getty Images | Kevin Dietsch)

The UK government, like many others, has openly announced its plans to invest in AI. As the next frontier for tech, you either get on board or you get left behind. And President Trump has made no secret of his desires to ensure the U.S. is a world leader.

OpenAI isn’t the only company that could provide the software side, but it is the most established. While Microsoft may be looking towards a future where it is less reliant on the tech behind ChatGPT for its own AI ambitions, it makes total sense that organizations around the world would be looking to OpenAI.

NVIDIA, meanwhile, continues to be the runaway leader on the hardware front. We’ve seen recently that AMD is planning to keep pushing forward, and a recent Chinese model has reportedly been built to run specifically without NVIDIA GPUs.

But for now, everything runs best on NVIDIA, and as long as it can keep churning out enough GPUs to fill these data centers, it will continue to print money.

The state visit is scheduled to begin on Wednesday, September 17, so I’ll be keeping a close eye out for when this AI deal gets announced.

Source link

AI Research

Canada invests $28.7M to train clean energy workers and expand AI research

Published

1 hour ago

September 14, 2025

The Editors

The federal government is investing $28.7 million to equip Canadian workers with skills for a rapidly evolving clean energy sector and to expand artificial intelligence (AI) research capacity.

The funding, announced Sept. 9, includes more than $9 million over three years for the AI Pathways: Energizing Canada’s Low-Carbon Workforce project. Led by the Alberta Machine Intelligence Institute (Amii), the initiative will train nearly 5,000 energy sector workers in AI and machine learning skills for careers in wind, solar, geothermal and hydrogen energy. Training will be offered both online and in-person to accommodate mid-career workers, industry associations, and unions across Canada.

In addition, the government is providing $19.7 million to Amii through the Canadian Sovereign AI Compute Strategy, expanding access to advanced computing resources for AI research and development. The funding will support researchers and businesses in training and deploying AI models, fostering innovation, and helping Canadian companies bring AI-enabled products to market.

“Canada’s future depends on skilled workers. Investing and upskilling Canadian workers ensures they can adapt and succeed in an energy sector that’s changing faster than ever,” said Patty Hajdu, Minister of Jobs and Families and Minister responsible for the Federal Economic Development Agency for Northern Ontario.

Evan Solomon, Minister of Artificial Intelligence and Digital Innovation, added that the investment “builds an AI-literate workforce that will drive innovation, create sustainable jobs, and strengthen our economy.”

Amii CEO Cam Linke said the funding empowers Canada to become “the world’s most AI-literate workforce” while providing researchers and businesses with a competitive edge.

The AI Pathways initiative is one of eight projects funded under the Sustainable Jobs Training Fund, which supports more than 10,000 Canadian workers in emerging sectors such as electric vehicle maintenance, green building retrofits, low-carbon energy, and carbon management.

The announcement comes as Canada faces workforce shifts, with an estimated 1.2 million workers retiring across all sectors over the next three years and the net-zero transition projected to create up to 400,000 new jobs by 2030.

The federal investments aim to prepare Canadians for the jobs of the future while advancing research, innovation, and commercialization in AI and clean energy.

Source link

AI Research

100x Faster Brain-Inspired AI Model

Published

4 hours ago

September 14, 2025

Name

In the rapidly evolving field of artificial intelligence, a new contender has emerged from China’s research labs, promising to reshape how we think about energy-efficient computing. The SpikingBrain-7B model, developed by the Brain-Inspired Computing Lab (BICLab) at the Chinese Academy of Sciences, represents a bold departure from traditional large language models. Drawing inspiration from the human brain’s neural firing patterns, this system employs spiking neural networks to achieve remarkable efficiency gains. Unlike conventional transformers that guzzle power, SpikingBrain-7B mimics biological neurons, firing only when necessary, which slashes energy consumption dramatically.

At its core, the model integrates hybrid-linear attention mechanisms and conversion-based training techniques, allowing it to run on domestic MetaX chips without relying on NVIDIA hardware. This innovation addresses a critical bottleneck in AI deployment: the high energy demands of training and inference. According to a technical report published on arXiv, the SpikingBrain series, including the 7B and 76B variants, demonstrates over 100 times faster first-token generation at long sequence lengths, making it ideal for edge devices in industrial control and mobile applications.

Breaking Away from Transformer Dominance

The genesis of SpikingBrain-7B can be traced to BICLab’s GitHub repository, where the open-source code reveals a sophisticated architecture blending spiking neurons with large-scale model training. Researchers at the lab, led by figures like Guoqi Li and Bo Xu, have optimized for non-NVIDIA clusters, overcoming challenges in parallel training and communication overhead. This approach not only enhances stability but also paves the way for neuromorphic hardware that prioritizes energy optimization over raw compute power.

Recent coverage in Xinhua News highlights how SpikingBrain-1.0, the foundational system, breaks from mainstream models like ChatGPT by using spiking networks instead of dense computations. This brain-inspired paradigm allows the model to train on just a fraction of the data typically required—reports suggest as little as 2%—while matching or exceeding transformer performance in benchmarks.

Efficiency Gains and Real-World Applications

Delving deeper, the model’s spiking mechanism enables asynchronous processing, akin to how the brain handles information dynamically. This is detailed in the arXiv report, which outlines a roadmap for next-generation hardware that could integrate seamlessly into sectors like healthcare and transportation. For instance, in robotics, SpikingBrain’s low-power profile supports real-time decision-making without the need for massive data centers.

Posts on X (formerly Twitter) from AI enthusiasts, such as those praising its 100x speedups, reflect growing excitement. Users have noted how the model’s hierarchical processing mirrors neuroscience findings, with emergent brain-like patterns in its structure. This sentiment aligns with broader neuromorphic computing trends, as seen in a Nature Communications Engineering article on advances in robotic vision, where spiking networks enable efficient AI in constrained environments.

Challenges and Future Prospects

Despite its promise, deploying SpikingBrain-7B isn’t without hurdles. The arXiv paper candidly discusses adaptations needed for CUDA and Triton operators in hybrid attention setups, underscoring the technical feats involved. Moreover, training on MetaX clusters required custom optimizations to handle long-sequence topologies, a feat that positions China at the forefront of independent AI innovation amid global chip restrictions.

In industry circles, this development is seen as a catalyst for shifting AI paradigms. A NotebookCheck report emphasizes its potential for up to 100x performance boosts over conventional systems, fueling discussions on sustainable AI. As neuromorphic computing gains traction, SpikingBrain-7B could inspire a wave of brain-mimicking models, reducing the environmental footprint of AI while expanding its reach to everyday devices.

Implications for Global AI Research

Beyond technical specs, the open-sourcing of SpikingBrain-7B via GitHub invites global collaboration, with the repository already garnering attention for its spike-driven transformer implementations. This mirrors earlier BICLab projects like Spike-Driven-Transformer-V2, building a continuum of research toward energy-efficient intelligence.

Looking ahead, experts anticipate integrations with emerging hardware, as outlined in PMC’s coverage of spike-based dynamic computing. With SpikingBrain’s bilingual capabilities and industry validations, it stands as a testament to how bio-inspired designs can democratize AI, challenging Western dominance and fostering a more inclusive technological future.

Source link