So, you’re building a RAG system or using an LLM to chat with documents. But users often ask: how can we trust the answers?
Moreover, we frequently hear about hallucinations, which undermine users’ trust.
If we build an application but fail to show users where the answers come from, the application may become unusable in some cases.
In this article, I’ll share an approach to address this concern. By linking every answer generated by the LLM to its source text in the document, we can build transparency and trust. This method not only provides clear evidence for the answers but also lets users verify the results directly within the PDF.
Sometimes the generated answer may not be perfectly accurate, but being able to locate the correct source text is already helpful for the user.
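To make the idea of linking an answer to its source concrete, here is a minimal sketch. It is not the implementation used later in this article. It assumes the LLM has been asked to return a verbatim quote alongside its answer, and that the PDF text has already been split into blocks tagged with a page number. The `Block` class, the `locate_source` helper, and the `min_score` threshold are hypothetical names chosen for illustration, and the sample blocks are made up.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass
class Block:
    """One piece of extracted PDF text and its page (hypothetical schema)."""
    page: int
    text: str


def normalize(text: str) -> str:
    # Lowercase and collapse whitespace so PDF line breaks do not block a match.
    return " ".join(text.lower().split())


def locate_source(quote: str, blocks: list[Block], min_score: float = 0.6):
    """Return (block, score) for the block that best contains the quote, or None.

    The score is the length of the longest contiguous run shared by the quote
    and the block, divided by the quote length (an assumption of this sketch).
    """
    q = normalize(quote)
    best, best_score = None, 0.0
    for block in blocks:
        b = normalize(block.text)
        matcher = SequenceMatcher(None, q, b, autojunk=False)
        match = matcher.find_longest_match(0, len(q), 0, len(b))
        score = match.size / len(q) if q else 0.0
        if score > best_score:
            best, best_score = block, score
    return (best, best_score) if best_score >= min_score else None


# Made-up sample blocks, standing in for text extracted from a PDF.
blocks = [
    Block(page=1, text="Attention mechanisms let the model\nfocus on relevant tokens."),
    Block(page=2, text="We train on 8 GPUs for 12 hours."),
]
hit = locate_source("focus on  relevant tokens", blocks)
```

A contiguous match is strict on purpose: a paraphrased quote scores low and is reported as not found, rather than being attached to the wrong passage. A real system would also keep each block’s coordinates so the match can be highlighted on the page.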
Let’s take this paper from arxiv.org as an example. We can imagine this use case:
The first step in this approach is to extract the text from the PDF in a structured format.