
Many people are skeptical about the usefulness of 1M tokens because LLMs often start to degrade after about 100k. But this is big for Claude 4 because it uses automatic RAG once the context grows large. With retrieval narrowing the context to what's actually relevant, we'll be able to make good use of those 1M tokens.
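Roughly the idea, as a sketch (the threshold and helper functions are my guesses, not anything Anthropic has documented):

    # A guess at the routing, not Anthropic's actual logic: stuff the full
    # context into the prompt when it fits, fall back to retrieval when not.
    TOKEN_BUDGET = 100_000  # roughly where long-context quality starts to slip

    def count_tokens(text: str) -> int:
        # Crude stand-in for a real tokenizer.
        return len(text.split())

    def retrieve(query: str, documents: list[str], k: int) -> list[str]:
        # Placeholder ranking by naive word overlap; a real system
        # would use vector embeddings instead.
        words = set(query.lower().split())
        ranked = sorted(documents,
                        key=lambda d: -len(words & set(d.lower().split())))
        return ranked[:k]

    def build_prompt(query: str, documents: list[str]) -> str:
        if sum(count_tokens(d) for d in documents) <= TOKEN_BUDGET:
            context = "\n\n".join(documents)  # fits: use everything
        else:
            context = "\n\n".join(retrieve(query, documents, k=20))  # too big: RAG
        return f"{context}\n\nQuestion: {query}"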


How does this work under the hood? Does it build an in-memory vector database of the input sources and run queries on top of that data to supplement the context window?


No idea how it's implemented because it's proprietary. Details here: https://support.anthropic.com/en/articles/11473015-retrieval...


RAG commonly implies building some sort of vector database, which is then queried to augment the response. If it operates over the repo, I believe it indexes your codebase using vector embeddings.
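A minimal sketch of that approach (the chunking strategy, model choice, and repo path here are illustrative assumptions, not Anthropic's actual pipeline):

    # Hypothetical embedding-based retrieval over a repo.
    from pathlib import Path
    import numpy as np
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer("all-MiniLM-L6-v2")

    def chunk(text: str, size: int = 40) -> list[str]:
        # Naive fixed-size line chunks; real systems often chunk by scope/AST.
        lines = text.splitlines()
        return ["\n".join(lines[i:i + size]) for i in range(0, len(lines), size)]

    # Index: embed every chunk of every source file.
    chunks = []
    for path in Path("my_repo").rglob("*.py"):
        chunks.extend(chunk(path.read_text(errors="ignore")))
    index = model.encode(chunks, normalize_embeddings=True)  # unit vectors

    def top_k(query: str, k: int = 5) -> list[str]:
        # Cosine similarity reduces to a dot product on unit vectors.
        q = model.encode([query], normalize_embeddings=True)[0]
        best = np.argsort(index @ q)[::-1][:k]
        return [chunks[i] for i in best]

    # The top-k chunks get prepended to the prompt -- that's the "augmentation".
    print(top_k("where is the auth token validated?")[0])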



