RAG Evaluation using RAGAS

This is a repository for paper: Evaluating Open-Source LLMs in RAG Systems: A Benchmark on Diploma Theses Abstracts Using RAGAS

📚 Antal, M., Buza, K. Evaluating Open-Source LLMs in RAG Systems: A Benchmark on Diploma Theses Abstracts Using Ragas. Acta Univ. Sapientiae Inform. 17, 5 (2025). https://doi.org/10.1007/s44427-025-00006-3

🎯 Presentation Slides

🎯 Presentation Slides (PDF)

Installation

Prerequisites

Python 3.11 or higher
Git
OpenAI API key

Steps

Clone the repository

git clone https://github.com/margitantal68/rag_paper

Navigate to the project directory
```
cd rag_paper
```

Create and activate a virtual environment

On Linux/macOS:

python3 -m venv venv
source venv/bin/activate

On Windows:

python -m venv venv
venv\Scripts\activate

Set Up Elasticsearch

Install Elasticsearch using Docker:

docker run -d -p 9200:9200 -e "discovery.type=single-node" -e "xpack.security.enabled=false" docker.elastic.co/elasticsearch/elasticsearch:8.9.0

Set Up Ollama
- Install Ollama and pull the required models
Install dependencies
```
pip install -r requirements.txt
```

Usage

This project requires an OpenAI API key. Follow these steps to set it up:

Obtain your OpenAI API key from OpenAI's website.
Copy the .env.example file in the project directory:
```
cp .env.example .env
```
Set the API key in the .env file:
```
OPENAI_API_KEY=your_api_key_here
```
Run the scripts in the following order:

Create the Elasticsearch index
```
python theses_create_index.py
```
Evaluate the Retriever
```
python theses_retrieval_evaluation.py
```
Evaluate the Generation
```
python theses_rag_evaluation.py
```

⚠️ Do not run the script for testset creation theses_testset_creation_ragas_single_hop.py as it is not needed for the evaluation. The testset is already created and included in the repository theses\TESTSET\test_dataset.csv.

⚠️ Do not run the script for question classification theses_testset_question_classification.py as it is not needed for the evaluation. The classification is already done and included in the repository theses\TESTSET\test_dataset.csv.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
docs		docs
theses		theses
.DS_Store		.DS_Store
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml
plots.py		plots.py
requirements.txt		requirements.txt
test_reranker.py		test_reranker.py
theses_create_index.py		theses_create_index.py
theses_rag_evaluation_oll A5C6 ama.py		theses_rag_evaluation_ollama.py
theses_retrieval_evaluation.py		theses_retrieval_evaluation.py
theses_testset_creation_ragas_single_hop.py		theses_testset_creation_ragas_single_hop.py
theses_testset_question_classification.py		theses_testset_question_classification.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RAG Evaluation using RAGAS

Installation

Prerequisites

Steps

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Languages

margitantal68/rag_paper

Folders and files

Latest commit

History

Repository files navigation

RAG Evaluation using RAGAS

Installation

Prerequisites

Steps

Usage

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages