This is the multi-page printable view of this section. Click here to print.

Return to the regular view of this page.

NaiveRAG

Explore the NaiveRAG baseline implementation This section details how to reproduce NaiveRAG results.

    Index of NaiveRAG

    NaiveRAG can use the NodeRAG environment. If you have installed the NodeRAG conda environment, you can directly use NaiveRAG for indexing. If you haven’t installed the NodeRAG environment yet, please refer to the Quick Start guide in the documentation.

    You need a folder structure similar to NodeRAG. Create a main working directory called main_folder and place an input folder inside it. Put the files you want to index in the input folder.

    main_folder/
    ├── input/
    │   ├── file1.md
    │   ├── file2.txt
    │   ├── file3.docx
    │   └── ...
    

    Then run

    python -m NaiveRAG.build -f path/to/main_folder
    

    Answer and Evaluation

    First, prepare your test questions according to the benchmark format. You’ll need to create a test set parquet file containing questions and their corresponding answer keys. Once ready, you can run the evaluation with:

    python -m /eval/eval_naive -f path/to/main_folder -q path/to/question_parquet