LLM Foundations: Vector Databases for Caching and Retrieval Augmented Generation (RAG)

1h 33mAdvanced2024-02-23

Authors

Kumaran Ponnambalam

Working with data for 20+ years

Course details

As large language models grow in popularity, the infrastructure to be used around them also becomes vital to reduce costs, generate accurate responses, and improve efficiency. Vector databases play a vital role in several LLM use cases to help alleviate LLM shortcomings, reduce costs and latency. Knowledge of its basics and applications are vital for any engineer building applications with LLMs, and in this course, Kumaran Ponnambalam teaches you the basics of vector databases and how to use them in LLM caching and retrieval-augmented generation (RAG).

Kumaran begins with a discussion on the basics of vector databases and their applications. He then explores specialized databases for storing vectors and uses the Milvus database as the reference example, and demonstrates read and write operations with the Milvus database. Learn how to use vector databases for LLM caching, with an example use case, along with examples of RAG use cases. Finally, Kumaran concludes with a discussion on optimizing vector databases.

Skills covered

Natural Language Processing (NLP)Machine LearningDatabase DevelopmentArtificial Intelligence FoundationsDatabase ManagementFoundationsArtificial Intelligence (AI)Software Development

Concepts

0. Introduction

01 - GenAI with vector databases
02 - Course coverage and prerequisites

1. Introduction to Vector Databases

03 - What is a vector
04 - Vectorization in NLP
05 - Vector similarity search
06 - Vector databases
07 - Pros and cons of vector databases

2. Milvus Database Concepts

08 - Introduction to Milvus DB
09 - Milvus architecture
10 - Collections in Milvus
11 - Partitions in Milvus
12 - Indexes in Milvus
13 - Managing data in Milvus
14 - Query and search in Milvus
15 - Set up Milvus and exercise files

3. Milvus Database Operations

16 - Create a connection
17 - Create databases and users
18 - Create collections
19 - Insert data into Milvus
20 - Build an index
21 - Query scalar data
22 - Search vector fields
23 - Delete objects and entities

4. Vector DB for LLM Query Caching

24 - LLMs and caching
25 - Prompt caching workflow
26 - Set up the Milvus cache
27 - Inference process and caching
28 - Cache management

5. Introduction to Retrieval Augmented Generation (RAG)

29 - LLMs as a knowledge source
30 - Introduction to retrieval augmented generation
31 - RAG - Knowledge curation process
32 - RAG question-answering process
33 - Applications of RAG

6. Implementing RAG with Milvus

34 - Set up Milvus for RAG
35 - Prepare data for the knowledge base
36 - Populate the Milvus database
37 - Answer questions with RAG

7. Vector Databases Best Practices

38 - Choose a vector database
39 - Combine vector and scalar data
40 - Distance measure considerations
41 - Tune vector DB performance

Conclusion

42 - Continue with LLMs