https://github.com/firstlink/haystack/blob/main/advanced_topics/evaluating_rag_pipeline.py