ArXiv tightens AI-paper rules with one-year submission ban
Preprint server arXiv says authors can be suspended for a year if they fail to check LLM-generated text, citations or other material in papers.

ArXiv, the preprint repository used heavily by AI and computer-science researchers, will suspend authors for a year if moderators find clear evidence they let large language models generate material they did not properly check. The policy shift was flagged in updated moderation guidance this week and first reported by TechCrunch. After years of softer warnings about generative AI, the platform is sharpening its enforcement.
It matters because arXiv moves faster than journals. A paper posted there can shape discussion long before peer review gets anywhere near it. The site is betting that unchecked LLM use is a trust problem for the infrastructure itself — not a checkbox on a disclosure form. Threaten a 12-month ban, and the conversation changes.
Thomas Dietterich, who chairs arXiv’s computer science section, laid out the trigger: “if a submission contains incontrovertible evidence that the authors did not check the results of LLM generation, this means we can’t trust anything in the paper”. He was speaking to TechCrunch. The standard is not a ban on AI. It is narrower. It targets papers where fabricated references, unsupported claims or sloppy errors make it obvious the authors let a model do the work and never verified what came back.
arXiv had already tried to separate acceptable assistance from negligence. In a 31 January 2023 policy update on ChatGPT and similar tools, the repository said language models could help with writing but could not be listed as authors, because a model cannot be held responsible for a paper’s claims. The new language adds teeth: a year-long ban when that responsibility visibly breaks down.
Under the current moderation policy, authors must “each individually take full responsibility for all its contents, irrespective of how the contents were generated”. A separate content-types policy bars submissions that consist largely of unreviewed machine-generated text. Read together, the rules allow limited tool use. They do not allow outsourcing fact-checking or scientific judgement.
Why the shift matters
For universities, AI developers and research teams, the precedent matters more than any individual punishment. Publishers, peer reviewers and conference organisers are all wrestling with the same question: where does acceptable assistance end and negligent automation begin? arXiv is now treating that as an operational moderation question rather than a theoretical debate. Ars Technica reported the repository’s core worry is AI-generated hallucinations entering the scientific record — a risk that rises fast when submitters treat model output as ready-made copy.
There is room for human judgement in the standard. Bans need “incontrovertible evidence”, which points moderators at obvious review failures, not every use of a grammar tool. But the message is plain: tool use may be tolerated, responsibility stays with the named authors.
The timing matters, too. ArXiv’s role in AI has only expanded. The repository spent more than 20 years under Cornell University before spinning out as an independent non-profit, and it remains a daily waypoint for frontier model papers, benchmark results and fast-moving computer-science work. If that pipeline starts enforcing consequences for sloppy AI use, other research gatekeepers will find it easier to follow.
Asha Iyer
AI editor covering the model wars, AU enterprise adoption, and the policy shaping both. Reports from Sydney.


