GRAPHRAG AND ROLE OF GRAPH DATABASES IN ADVANCING AI

Authors

  • Ravi Kiran Magham Osmania University, India. Author

Keywords:

GraphRAG, Graph Databases, Artificial Intelligence, Knowledge Representation, Large Language Models

Abstract

This article explores the synergy between GraphRAG and graph databases in advancing AI capabilities. Graph RAG, a novel approach to query-focused summarization that combines knowledge graph generation, retrieval-augmented generation (RAG), and query-focused summarization (QFS) to support human sensemaking over large text corpora. While traditional RAG methods excel at answering specific questions, they struggle with global queries about entire datasets. Graph RAG addresses this limitation by using an LLM to build a graph-based text index in two stages: first deriving an entity knowledge graph from source documents, then pre-generating community summaries for groups of related entities. Given a query, each community summary generates a partial response, which is then synthesized into a final answer. Evaluations on datasets in the 1 million token range show that Graph RAG substantially improves the comprehensiveness and diversity of answers compared to naïve RAG baselines, while also demonstrating favorable performance against global text summarization approaches. The hierarchical nature of the graph index allows for efficient querying at different levels of granularity. This approach offers a scalable solution for global sensemaking tasks over large private document collections, with potential applications in scientific discovery, intelligence analysis, and other domains requiring complex information synthesis.

References

T. L. Scao et al., "Bloom: A 176B-Parameter Open-Access Multilingual Language Model," arXiv preprint arXiv:2211.05100, 2022. [Online]. Available: https://arxiv.org/abs/2211.05100

G. Izacard, P. Lewis, E. Lomeli, L. Hosseini, F. Petroni, T. Schick, J. Dwivedi, A. Cancedda, S. Riedel, and S. Stenetorp, "Atlas: Few-shot Learning with Retrieval Augmented Language Models," arXiv preprint arXiv:2208.03299, 2022. [Online]. Available: https://arxiv.org/abs/2208.03299

Robinson, J. Webber, and E. Eifrem, "Graph Databases: New Opportunities for Connected Data," 3rd Edition, O'Reilly Media, Inc., 2021. [Online]. Available: https://www.oreilly.com/library/view/graph-databases/9781492044062/

Sara AlMahri, Liming Xu, Alexandra Brintrup. "Enhancing Supply Chain Visibility with Knowledge Graphs and Large Language Models." arXiv:2408.07705v1 [cs.IR], Aug 2024. https://arxiv.org/html/2408.07705v1

Boci Peng, Yun Zhu, et al. "Graph Retrieval-Augmented Generation: A Survey." arXiv:2408.08921, Sep 2024. https://arxiv.org/pdf/2408.08921

Ben Lorica, Prashanth Rao. "GraphRAG: Design Patterns, Challenges, Recommendations." Gradient Flow, May 2024. https://gradientflow.com/graphrag-design-patterns-challenges-recommendations/

Y. Hu, Z. Zhang, Y. Lei, G. Pan, C. Ling, and L. Zhao, "GRAG: Graph Retrieval-Augmented Generation," in 2024 IEEE International Conference on Data Engineering (ICDE), 2024, pp. 1-12. https://ieeexplore.ieee.org/document/10409137

J. Zhang, X. Zhang, J. Yu, J. Tang, J. Tang, C. Li, and H. Chen, "Subgraph Retrieval Enhanced Model for Multi-hop Knowledge Base Question Answering," in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022, pp. 5773-5784. https://aclanthology.org/2022.acl-long.396/

J. Webber and I. Robinson, "Graph Databases: New Opportunities for Connected Data," O'Reilly Media, Inc., 2nd Edition, 2021. https://www.oreilly.com/library/view/graph-databases-new/9781098155308/

I. Robinson, J. Webber, and E. Eifrem, "Graph Databases: New Opportunities for Connected Data," O'Reilly Media, Inc., 3rd Edition, 2023.

https://www.oreilly.com/library/view/graph-databases-3rd/9781098150013/

M. Galkin, X. Yuan, H. Mostafa, J. Tang, and Z. Zhu, "Towards Foundation Models for Knowledge Graph Reasoning," in The Twelfth International Conference on Learning Representations, 2024. https://openreview.net/forum?id=V8N9bxJAh8

Y. Zhu, X. Wang, J. Chen, S. Qiao, Y. Ou, Y. Yao, S. Deng, H. Chen, and N. Zhang, "LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities," arXiv:2305.13168 [cs.AI], May 2023. https://arxiv.org/abs/2305.13168

Downloads

Published

2024-10-09

How to Cite

Ravi Kiran Magham. (2024). GRAPHRAG AND ROLE OF GRAPH DATABASES IN ADVANCING AI. INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONS AND INFORMATION TECHNOLOGY (IJRCAIT), 7(2), 98-110. https://ijrcait.com/index.php/home/article/view/IJRCAIT_07_02_007