
DeepMind’s RAG System with Animesh Chatterji and Ivan Solovyev


Retrieval-augmented generation, or RAG, has become a foundational approach to building production AI systems. However, deploying RAG in practice can be complex and costly. Developers typically have to manage vector databases, chunking strategies, embedding models, and indexing infrastructure. Designing effective RAG systems is also a moving target, as techniques and best practices evolve in step with rapidly advancing language models.


Google DeepMind recently released the File Search Tool, a fully managed RAG system built directly into the Gemini API. File Search abstracts away the retrieval pipeline: developers upload documents, code, and other text data, and the tool automatically generates embeddings and lets them query the resulting knowledge base. We wanted to understand how the DeepMind team designed a general-purpose RAG system that maintains high retrieval quality.


Animesh Chatterji is a Software Engineer at Google DeepMind and Ivan Solovyev is a Product Manager at Google DeepMind; both worked on the File Search Tool. They joined the podcast with Sean Falconer to discuss the evolution of RAG, why simplicity and pricing transparency matter, how embedding models have improved retrieval quality, the tradeoffs between configurability and ease of use, and what’s next for multimodal retrieval across text, images, and beyond.



Sean’s been an academic, startup founder, and Googler. He has published works covering a wide range of topics from AI to quantum computing. Currently, Sean is an AI Entrepreneur in Residence at Confluent where he works on AI strategy and thought leadership. You can connect with Sean on LinkedIn.



 


Please click here to see the transcript of this episode.



Sponsorship inquiries: sponsor@softwareengineeringdaily.com



The post DeepMind’s RAG System with Animesh Chatterji and Ivan Solovyev appeared first on Software Engineering Daily.

