GraphRAG

Explore the GraphRAG implementation. This section details how to reproduce the GraphRAG results.

Indexing with GraphRAG

To keep experiments consistent and make sure the commands below are available, install GraphRAG and initialize a project folder as follows:

conda create -n graphrag python=3.9
conda activate graphrag
pip install graphrag==1.2.0
graphrag init --root path/to/main_folder

The graphrag init command creates two files in the main_folder directory:

  • .env: Contains environment variables for the GraphRAG pipeline

    • Defines GRAPHRAG_API_KEY=<API_KEY> for OpenAI/Azure OpenAI authentication
    • Replace <API_KEY> with your own API key
    • Delete this file if you use alternate authentication (e.g. managed identity)
  • settings.yaml: Contains configurable pipeline settings

    • Modify to customize pipeline behavior

More details about GraphRAG configuration and usage can be found in the official documentation.
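
Before indexing, it can help to confirm that the two generated files are in place and that the API key placeholder has been replaced. The following is a minimal sketch using only the Python standard library; the helper names read_env and check_root are hypothetical and not part of GraphRAG, and the check assumes the .env file uses plain KEY=VALUE lines.

```python
from pathlib import Path


def read_env(env_path):
    """Parse KEY=VALUE lines from a .env file, skipping blanks and comments."""
    values = {}
    for line in Path(env_path).read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def check_root(root):
    """Return a list of problems found in a GraphRAG project folder."""
    root = Path(root)
    problems = []
    if not (root / "settings.yaml").is_file():
        problems.append("settings.yaml is missing")
    env_file = root / ".env"
    if env_file.is_file():
        key = read_env(env_file).get("GRAPHRAG_API_KEY", "")
        # "<API_KEY>" is the placeholder value described above.
        if key in ("", "<API_KEY>"):
            problems.append("GRAPHRAG_API_KEY is still a placeholder")
    # A missing .env is not reported: the file may have been deleted on
    # purpose because another authentication method is in use.
    return problems


if __name__ == "__main__":
    import sys

    folder = sys.argv[1] if len(sys.argv) > 1 else "path/to/main_folder"
    for problem in check_root(folder):
        print("warning:", problem)
```

An empty result from check_root only means the files look plausible; it does not verify that the key is valid or that settings.yaml is correct.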

Then build the index by running:

graphrag index --root path/to/main_folder

Answer and Evaluation

First, prepare your test questions according to the benchmark format. You’ll need to create a test set parquet file containing questions and their corresponding answer keys. Once ready, you can run the evaluation with:
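
One way to produce such a file is sketched below. The column names "question" and "answer" are assumptions made for illustration, as are the helper names build_rows and write_parquet; use whatever columns the benchmark format actually specifies. Writing the file assumes pandas is installed together with a parquet engine such as pyarrow.

```python
def build_rows(pairs):
    """Turn (question, answer) pairs into row dicts, rejecting empty fields."""
    rows = []
    for i, (question, answer) in enumerate(pairs):
        question, answer = question.strip(), answer.strip()
        if not question or not answer:
            raise ValueError(f"pair {i} has an empty question or answer")
        # Column names are placeholders; use the ones the benchmark expects.
        rows.append({"question": question, "answer": answer})
    return rows


def write_parquet(rows, out_path):
    """Write the rows to a parquet file (needs pandas and a parquet engine)."""
    import pandas as pd  # imported lazily so build_rows works without pandas

    pd.DataFrame(rows).to_parquet(out_path, index=False)
```

For example, write_parquet(build_rows(pairs), "path/to/question_parquet") writes one row per pair, where pairs is a list of (question, answer) tuples.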

python -m eval.eval_graph -f path/to/main_folder -q path/to/question_parquet

Last modified April 5, 2025: update reproduce (f23a25c)