LLM Foundations: Vector Databases for Caching and Retrieval Augmented Generation (RAG)
1h 33mAdvanced2024-02-23
Authors

Kumaran Ponnambalam
Working with data for 20+ years
Course details
As large language models grow in popularity, the infrastructure to be used around them also becomes vital to reduce costs, generate accurate responses, and improve efficiency. Vector databases play a vital role in several LLM use cases to help alleviate LLM shortcomings, reduce costs and latency. Knowledge of its basics and applications are vital for any engineer building applications with LLMs, and in this course, Kumaran Ponnambalam teaches you the basics of vector databases and how to use them in LLM caching and retrieval-augmented generation (RAG).
Kumaran begins with a discussion on the basics of vector databases and their applications. He then explores specialized databases for storing vectors and uses the Milvus database as the reference example, and demonstrates read and write operations with the Milvus database. Learn how to use vector databases for LLM caching, with an example use case, along with examples of RAG use cases. Finally, Kumaran concludes with a discussion on optimizing vector databases.
Kumaran begins with a discussion on the basics of vector databases and their applications. He then explores specialized databases for storing vectors and uses the Milvus database as the reference example, and demonstrates read and write operations with the Milvus database. Learn how to use vector databases for LLM caching, with an example use case, along with examples of RAG use cases. Finally, Kumaran concludes with a discussion on optimizing vector databases.
Skills covered
Natural Language Processing (NLP)Machine LearningDatabase DevelopmentArtificial Intelligence FoundationsDatabase ManagementFoundationsArtificial Intelligence (AI)Software Development
Concepts
0. Introduction
- 01 - GenAI with vector databases
- 02 - Course coverage and prerequisites
1. Introduction to Vector Databases
- 03 - What is a vector
- 04 - Vectorization in NLP
- 05 - Vector similarity search
- 06 - Vector databases
- 07 - Pros and cons of vector databases
2. Milvus Database Concepts
- 08 - Introduction to Milvus DB
- 09 - Milvus architecture
- 10 - Collections in Milvus
- 11 - Partitions in Milvus
- 12 - Indexes in Milvus
- 13 - Managing data in Milvus
- 14 - Query and search in Milvus
- 15 - Set up Milvus and exercise files
3. Milvus Database Operations
- 16 - Create a connection
- 17 - Create databases and users
- 18 - Create collections
- 19 - Insert data into Milvus
- 20 - Build an index
- 21 - Query scalar data
- 22 - Search vector fields
- 23 - Delete objects and entities
4. Vector DB for LLM Query Caching
- 24 - LLMs and caching
- 25 - Prompt caching workflow
- 26 - Set up the Milvus cache
- 27 - Inference process and caching
- 28 - Cache management
5. Introduction to Retrieval Augmented Generation (RAG)
- 29 - LLMs as a knowledge source
- 30 - Introduction to retrieval augmented generation
- 31 - RAG - Knowledge curation process
- 32 - RAG question-answering process
- 33 - Applications of RAG
6. Implementing RAG with Milvus
- 34 - Set up Milvus for RAG
- 35 - Prepare data for the knowledge base
- 36 - Populate the Milvus database
- 37 - Answer questions with RAG
7. Vector Databases Best Practices
- 38 - Choose a vector database
- 39 - Combine vector and scalar data
- 40 - Distance measure considerations
- 41 - Tune vector DB performance
Conclusion
- 42 - Continue with LLMs
Related courses
- RAG, AI Apps, and AI Agents for Cybersecurity and Networking
- Build with AI: Create Custom Chatbots with n8n
- Hands-On AI: Building Your First Conversational AI Chatbot
- Build LLM Evaluation Applications with LangChain
- GraphRAG Essential Training
- Complete Guide to Evaluating Large Language Models (LLMs)
- Hands-On AI: Build an AI Chatbot with GPT-4o and Next.js
- Hands-On AI: Build Your Own GPTs