XJTU IAIR publishes research in Nature Communications

The schematic diagram of the Hyper-RAG architecture.
A research team from Tsinghua University, the Institute of Artificial Intelligence and Robotics (IAIR) at Xi'an Jiaotong University (XJTU), Shanghai University, and the Beijing Institute of Technology has proposed Hyper-RAG, a hypergraph-driven retrieval-augmented generation method designed to mitigate hallucinations in large language models (LLMs) within specialized domains.
The findings, titled Hyper-RAG: combating LLM hallucinations using hypergraph-driven retrieval-augmented generation, were published in the world-renowned journal Nature Communications.
When addressing queries in specialized fields such as medicine, law, or finance, LLMs often generate responses that deviate from the facts or are overly vague, posing potential application risks. Retrieval-Augmented Generation (RAG) addresses this by building domain-specific knowledge bases and using vector-based retrieval to extract relevant information, enabling LLMs to produce more accurate and reliable content.
Traditional RAG and existing graph-based RAG methods are limited by their reliance on pairwise relationship representations and struggle to characterize complex, high-order correlations among multiple entities, such as disease mechanisms influenced by multiple factors, complex chains of evidence, or multi-person interactions. This limitation often leads to the weakening or loss of critical knowledge during the structuring process, which in turn affects the completeness and reliability of the generated content.
To overcome this, the research team developed Hyper-RAG, which integrates hypergraph computing into the RAG framework. Unlike traditional graphs, a single hyperedge in a hypergraph can connect multiple entities at once. This allows it to capture both low-order pairwise associations and high-order group correlations within the original data, building a more comprehensive, structured domain-knowledge representation while minimizing information loss.
Hyper-RAG achieves precise recall and diffusion of relevant knowledge, providing LLMs with a richer factual basis to support high-reliability applications like medical diagnostics and financial analysis.
The researchers conducted extensive experiments across nine diverse datasets using six different LLMs and multi-dimensional evaluation metrics. The results showed that missing critical knowledge decreased by 60.7 percent, and LLM hallucinations decreased by 48.5 percent.
By breaking the constraints of traditional graph structures, Hyper-RAG establishes a domain knowledge base and knowledge hypergraph that covers all scales of association.
This research provides a new paradigm for knowledge representation and retrieval to mitigate LLM hallucinations. With hypergraph computing, Hyper-RAG provides robust support for LLM applications in high-stakes scenarios such as medical diagnosis, financial analysis, and scientific Q&A. It also paves the way for a new technical path to implement more reliable AI4Science and AI4Health solutions.

