Softlandia Ltd.’s Post

There are plenty of different RAG architectures! Do you know what to use and when?

View profile for Olli-Pekka Heinisuo, graphic

Applied AI for your KPI // Co-founder, Senior Software Architect @ Softlandia

The current state of the RAG ecosystem and my bold predictions for 2025. 1. Vanilla RAG aka naive RAG The "original" RAG architecture gained popularity in 2022. It indexes static chunks of text data into a vector database, and the top n results from a text similarity search are injected into the final prompt. 2. Advanced RAG Better indexing and search strategies emerged in 2023 and the Advanced RAG was born. Techniques like semantic chunking and dynamic chunking replaced static chunking and search accuracy was further improved via reranking. 3. Advanced Agentic RAG In 2024, agents are everywhere and you can't build a RAG without them. Agentic patterns enabled RAGs to access services, databases, and APIs. Agents select the retrieval sources and combine the results into the final answer. 4. Advanced Agentic Vision RAG In the second half of 2024, vision-based retrieval with specialized models such as ColPali emerged. Multimodal vision RAGs understand text, images, tables, graphs, and other visual elements and could replace chunking completely in most RAG implementations. 2025 will be the year of multimodality and vision models.

  • No alternative text description for this image

To view or add a comment, sign in

Explore topics