Retrieval-augmented generation (RAG), as a name, has done a lot of damage. It frames the problem as “generation that happens to use retrieval,” when in fact almost every interesting failure mode lives in retrieval, indexing, or the feedback loop between them.
This article argues that RAG should be treated as a systems problem first and a generation problem second, and walks through the postmortems that led me to that view.