Building End-to-End Reliable RAG Applications

Track:: PyData: LLMs (2024)
Type:: Poster
Level:: intermediate
Room:: Exhibit Hall
Start:: 13:00 on 10 July 2024
Duration:: 60 minutes

Abstract

Retrieval-Augmented Generation (RAG) is an effective way to mitigate limitations of Large Language Models (LLMs), such as hallucinations and outdated training data. However, RAG on its own is not enough, particularly when you work with domain-specific data or need to verify that a response is adequate. Neglecting these scenarios can cost time, money, and customer satisfaction. That’s why, as you develop an application, it’s crucial to evaluate your retrieval process, improve it with advanced techniques if necessary, and account for edge cases, for example by handling out-of-domain queries and implementing fallback mechanisms. Doing so makes your system both resilient and flexible. This poster explains some problems you may encounter in practice and the steps to take to build reliable and resilient RAG applications that you can safely use in production, using the open source LLM framework Haystack.
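
As a rough illustration of two of the ideas above, evaluating retrieval and falling back on out-of-domain queries, here is a minimal framework-agnostic sketch in plain Python. It does not use Haystack's API; the names ScoredDoc, recall_at_k, answer_or_fallback, and the min_score threshold of 0.5 are hypothetical and chosen only for this example, and the retriever scores are assumed to lie in [0, 1].

```python
from dataclasses import dataclass


@dataclass
class ScoredDoc:
    """Hypothetical container for one retrieved document."""
    doc_id: str
    text: str
    score: float  # retriever similarity score, assumed to lie in [0, 1]


def recall_at_k(retrieved_ids, relevant_ids, k):
    """Fraction of the relevant documents that appear in the top-k results."""
    relevant = set(relevant_ids)
    if not relevant:
        return 0.0
    top_k = set(retrieved_ids[:k])
    return len(top_k & relevant) / len(relevant)


def answer_or_fallback(docs, min_score=0.5):
    """Route to generation only when at least one document scores high enough.

    A query whose best matches all score below min_score is treated as
    out-of-domain and sent to a fallback branch (for example a web search
    or an "I don't know" reply) instead of the LLM.
    """
    confident = [d for d in docs if d.score >= min_score]
    if not confident:
        return {"route": "fallback", "context": []}
    return {"route": "generate", "context": [d.text for d in confident]}
```

In a real application the threshold would need to be tuned against labeled queries, and the fallback branch could be anything from a canned reply to a second retrieval source.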