Skip to content

Latest commit

 

History

History
 
 

rag-grounding

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

RAG and Grounding

This directory provides a curated list of notebooks that explore Retrieval Augmented Generation (RAG), grounding techniques, knowledge bases, grounded generation, and related topics like vector search and semantic search.

All of these links are notebooks or other examples in this repository, but are indexed here for your convenience.

What is RAG and Grounding?

Animated GIF showing "what is grounding"

  • Ungrounded generation relies on the LLM training data alone and is prone to hallucinations when it doesn't have all the right facts
  • Grounding a LLM with relevant facts provides fresh and potentially private data to the model as part of it's input or prompt
  • RAG is a technique which retrieves relevant facts, often via search, and provides them to the LLM

Using RAG and Grounding to improve generations and reduce hallucinations is becoming commonplace. Doing so well and generating extremely high quality results which are entirely grounded on the most relevant facts, potentially from a very large corpus of information and at high scale - is an art. Vertex AI provides a platform of tools and APIs which help you build and maintain a great search engine and RAG application, and the evaluations needed to hill climb "quality".

Measuring RAG/Grounding Quality

See this blog post: How to evaluate generated answers from RAG at scale on Vertex AI for a walkthrough.

Out of the Box RAG/Grounding

Build your own RAG/Grounding

We have several notebooks and examples for specific use cases or types of data which may require a custom RAG and Grounding. We have many products which can be used to build a RAG/Grounding pipeline of your own, or which you can add to an existing RAG and Grounding solution.

Search

Vertex AI Search is an end-to-end Search engine which delivers high quality grounded generation and RAG at scale, built-in.

Vertex AI Vector Search is a extremely performant Vector Database which powers Vertex AI Search. Other databases like AlloyDB and BigQuery also have vector searches, each with different performance characteristics and retrieval performance.

Embeddings

Gemini

Open Models

Agents on top of RAG

Use Cases

These notebooks offer a valuable resource to understand and implement RAG and grounding techniques in various applications. Feel free to dive into the notebooks that pique your interest and start building your own RAG-powered solutions.