
DeepMind’s RAG System with Animesh Chatterji and Ivan Solovyev


Retrieval-augmented generation, or RAG, has become a foundational approach to building production AI systems. However, deploying RAG in practice can be complex and costly. Developers typically have to manage vector databases, chunking strategies, embedding models, and indexing infrastructure. Designing effective RAG systems is also a moving target, as techniques and best practices evolve in step with rapidly advancing language models.


Google DeepMind recently released the File Search Tool, a fully managed RAG system built directly into the Gemini API. File Search abstracts away the retrieval pipeline, allowing developers to upload documents, code, and other text data, automatically generate embeddings, and query their knowledge base. We wanted to understand how the DeepMind team designed a general-purpose RAG system that maintains high retrieval quality.


Animesh Chatterji is a Software Engineer and Ivan Solovyev is a Product Manager at Google DeepMind, where they both worked on the File Search Tool. They joined the podcast with Sean Falconer to discuss the evolution of RAG, why simplicity and pricing transparency matter, how embedding models have improved retrieval quality, the tradeoffs between configurability and ease of use, and what’s next for multimodal retrieval across text, images, and beyond.



Sean’s been an academic, startup founder, and Googler. He has published works covering a wide range of topics from AI to quantum computing. Currently, Sean is an AI Entrepreneur in Residence at Confluent where he works on AI strategy and thought leadership. You can connect with Sean on LinkedIn.



 


Please click here to see the transcript of this episode.



Sponsorship inquiries: sponsor@softwareengineeringdaily.com



The post DeepMind’s RAG System with Animesh Chatterji and Ivan Solovyev appeared first on Software Engineering Daily.


