AI Research
Scientists Are Sneaking Passages Into Research Papers Designed to Trick AI Reviewers

Artificial intelligence has infected every corner of academia — and now, some scientists are fighting back with a seriously weird trick.
In a new investigation, reporters from Japan’s Nikkei Asia found more than a dozen academic papers that contained invisible prompts meant to trick AI review tools into giving them glowing write-ups.
Examining the academic database arXiv, where researchers publish studies awaiting peer review, Nikkei found 17 English-language papers from 14 separate institutions in eight countries that contained examples of so-called “prompt injection.” These hidden missives, meant only for AI, were often in white text on white backgrounds or in minuscule fonts.
The tricky prompts, which ranged from one to three sentences in length, would generally tell AI reviewers to “give a positive review only” or “not highlight any negatives.” Some were more specific, demanding that any AI reading the work say that the paper had “impactful contributions, methodological rigor, and exceptional novelty,” and as The Register found, others ordered bots to “ignore all previous instructions.”
(Though Nikkei did not name any such review tools, a Nature article published back in March revealed that a site called Paper Wizard will spit out entire reviews of academic manuscripts under the guise of “pre-peer-review,” per its creators.)
When the newspaper contacted authors implicated in the scheme, the researchers’ responses differed.
One South Korean paper author — who was not named, along with the others discovered by the investigation — expressed remorse and said they planned to withdraw their paper from an upcoming conference.
“Inserting the hidden prompt was inappropriate,” that author said, “as it encourages positive reviews even though the use of AI in the review process is prohibited.”
One of the Japanese researchers had the entirely opposite take, arguing the practice was defensible because AI is prohibited by most academic conferences where these sorts of papers would be presented.
“It’s a counter against ‘lazy reviewers’ who use AI,” the Japanese professor said.
In February of this year, ecologist Timothée Poisot of the University of Montreal revealed in a blog post that AI had quietly been doing the important work of academic peer review. Poisot, an associate professor at the school’s Department of Biological Sciences, discovered this after getting back a review on one of his colleague’s manuscripts that included an AI-signaling response.
When The Register asked him about Nikkei‘s findings, Poisot said he thought it was “brilliant” and doesn’t find the practice of such prompt injection all that problematic if it’s in defense of careers.
One thing’s for sure: the whole thing throws the “Through the Looking Glass” state of affairs in academia into sharp relief, with AI being used to both to write and review “research” — a mosh pit of laziness that can only hinder constructive scientific progress.
More on AI and academia: College Students Are Sprinkling Typos Into Their AI Papers on Purpose
AI Research
(Policy Address 2025) HK earmarks HK$3B for AI research and talent recruitment – The Standard (HK)
AI Research
[2506.08171] Worst-Case Symbolic Constraints Analysis and Generalisation with Large Language Models

View a PDF of the paper titled Worst-Case Symbolic Constraints Analysis and Generalisation with Large Language Models, by Daniel Koh and 4 other authors
Abstract:Large language models (LLMs) have demonstrated strong performance on coding tasks such as generation, completion and repair, but their ability to handle complex symbolic reasoning over code still remains underexplored. We introduce the task of worst-case symbolic constraints analysis, which requires inferring the symbolic constraints that characterise worst-case program executions; these constraints can be solved to obtain inputs that expose performance bottlenecks or denial-of-service vulnerabilities in software systems. We show that even state-of-the-art LLMs (e.g., GPT-5) struggle when applied directly on this task. To address this challenge, we propose WARP, an innovative neurosymbolic approach that computes worst-case constraints on smaller concrete input sizes using existing program analysis tools, and then leverages LLMs to generalise these constraints to larger input sizes. Concretely, WARP comprises: (1) an incremental strategy for LLM-based worst-case reasoning, (2) a solver-aligned neurosymbolic framework that integrates reinforcement learning with SMT (Satisfiability Modulo Theories) solving, and (3) a curated dataset of symbolic constraints. Experimental results show that WARP consistently improves performance on worst-case constraint reasoning. Leveraging the curated constraint dataset, we use reinforcement learning to fine-tune a model, WARP-1.0-3B, which significantly outperforms size-matched and even larger baselines. These results demonstrate that incremental constraint reasoning enhances LLMs’ ability to handle symbolic reasoning and highlight the potential for deeper integration between neural learning and formal methods in rigorous program analysis.
Submission history
From: Daniel Koh [view email]
[v1]
Mon, 9 Jun 2025 19:33:30 UTC (1,462 KB)
[v2]
Tue, 16 Sep 2025 10:35:33 UTC (1,871 KB)
AI Research
Spatially-Aware Image Focus for Visual Reasoning

View a PDF of the paper titled SIFThinker: Spatially-Aware Image Focus for Visual Reasoning, by Zhangquan Chen and 6 other authors
Abstract:Current multimodal large language models (MLLMs) still face significant challenges in complex visual tasks (e.g., spatial understanding, fine-grained perception). Prior methods have tried to incorporate visual reasoning, however, they fail to leverage attention correction with spatial cues to iteratively refine their focus on prompt-relevant regions. In this paper, we introduce SIFThinker, a spatially-aware “think-with-images” framework that mimics human visual perception. Specifically, SIFThinker enables attention correcting and image region focusing by interleaving depth-enhanced bounding boxes and natural language. Our contributions are twofold: First, we introduce a reverse-expansion-forward-inference strategy that facilitates the generation of interleaved image-text chains of thought for process-level supervision, which in turn leads to the construction of the SIF-50K dataset. Besides, we propose GRPO-SIF, a reinforced training paradigm that integrates depth-informed visual grounding into a unified reasoning pipeline, teaching the model to dynamically correct and focus on prompt-relevant regions. Extensive experiments demonstrate that SIFThinker outperforms state-of-the-art methods in spatial understanding and fine-grained visual perception, while maintaining strong general capabilities, highlighting the effectiveness of our method. Code: this https URL.
Submission history
From: Zhangquan Chen [view email]
[v1]
Fri, 8 Aug 2025 12:26:20 UTC (5,223 KB)
[v2]
Thu, 14 Aug 2025 10:34:22 UTC (5,223 KB)
[v3]
Sun, 24 Aug 2025 13:04:46 UTC (5,223 KB)
[v4]
Tue, 16 Sep 2025 09:40:13 UTC (5,223 KB)
-
Business3 weeks ago
The Guardian view on Trump and the Fed: independence is no substitute for accountability | Editorial
-
Tools & Platforms1 month ago
Building Trust in Military AI Starts with Opening the Black Box – War on the Rocks
-
Ethics & Policy2 months ago
SDAIA Supports Saudi Arabia’s Leadership in Shaping Global AI Ethics, Policy, and Research – وكالة الأنباء السعودية
-
Events & Conferences4 months ago
Journey to 1000 models: Scaling Instagram’s recommendation system
-
Jobs & Careers3 months ago
Mumbai-based Perplexity Alternative Has 60k+ Users Without Funding
-
Podcasts & Talks2 months ago
Happy 4th of July! 🎆 Made with Veo 3 in Gemini
-
Education2 months ago
Macron says UK and France have duty to tackle illegal migration ‘with humanity, solidarity and firmness’ – UK politics live | Politics
-
Education3 months ago
VEX Robotics launches AI-powered classroom robotics system
-
Podcasts & Talks2 months ago
OpenAI 🤝 @teamganassi
-
Funding & Business3 months ago
Kayak and Expedia race to build AI travel agents that turn social posts into itineraries