NaiveRAG

Explore the NaiveRAG baseline implementation This section details how to reproduce NaiveRAG results.

Index of NaiveRAG

NaiveRAG can use the NodeRAG environment. If you have installed the NodeRAG conda environment, you can directly use NaiveRAG for indexing. If you haven’t installed the NodeRAG environment yet, please refer to the Quick Start guide in the documentation.

You need a folder structure similar to NodeRAG. Create a main working directory called main_folder and place an input folder inside it. Put the files you want to index in the input folder.

main_folder/
├── input/
│   ├── file1.md
│   ├── file2.txt
│   ├── file3.docx
│   └── ...

Then run

python -m NaiveRAG.build -f path/to/main_folder

Answer and Evaluation

First, prepare your test questions according to the benchmark format. You’ll need to create a test set parquet file containing questions and their corresponding answer keys. Once ready, you can run the evaluation with:

python -m /eval/eval_naive -f path/to/main_folder -q path/to/question_parquet