Special offers now — see discounted courses.
day
:
hour
:
min
:
sec
See special offers
LLM Foundations: Vector Databases for Caching and Retrieval Augmented Generation (RAG)

LLM Foundations: Vector Databases for Caching and Retrieval Augmented Generation (RAG)

1h 33mAdvanced2024-02-23

Authors

Kumaran Ponnambalam

Kumaran Ponnambalam

Working with data for 20+ years

Course details

As large language models grow in popularity, the infrastructure to be used around them also becomes vital to reduce costs, generate accurate responses, and improve efficiency. Vector databases play a vital role in several LLM use cases to help alleviate LLM shortcomings, reduce costs and latency. Knowledge of its basics and applications are vital for any engineer building applications with LLMs, and in this course, Kumaran Ponnambalam teaches you the basics of vector databases and how to use them in LLM caching and retrieval-augmented generation (RAG).

Kumaran begins with a discussion on the basics of vector databases and their applications. He then explores specialized databases for storing vectors and uses the Milvus database as the reference example, and demonstrates read and write operations with the Milvus database. Learn how to use vector databases for LLM caching, with an example use case, along with examples of RAG use cases. Finally, Kumaran concludes with a discussion on optimizing vector databases.

Skills covered

Natural Language Processing (NLP)Machine LearningDatabase DevelopmentArtificial Intelligence FoundationsDatabase ManagementFoundationsArtificial Intelligence (AI)Software Development

Concepts

0. Introduction

  • 01 - GenAI with vector databases
  • 02 - Course coverage and prerequisites

1. Introduction to Vector Databases

  • 03 - What is a vector
  • 04 - Vectorization in NLP
  • 05 - Vector similarity search
  • 06 - Vector databases
  • 07 - Pros and cons of vector databases

2. Milvus Database Concepts

  • 08 - Introduction to Milvus DB
  • 09 - Milvus architecture
  • 10 - Collections in Milvus
  • 11 - Partitions in Milvus
  • 12 - Indexes in Milvus
  • 13 - Managing data in Milvus
  • 14 - Query and search in Milvus
  • 15 - Set up Milvus and exercise files

3. Milvus Database Operations

  • 16 - Create a connection
  • 17 - Create databases and users
  • 18 - Create collections
  • 19 - Insert data into Milvus
  • 20 - Build an index
  • 21 - Query scalar data
  • 22 - Search vector fields
  • 23 - Delete objects and entities

4. Vector DB for LLM Query Caching

  • 24 - LLMs and caching
  • 25 - Prompt caching workflow
  • 26 - Set up the Milvus cache
  • 27 - Inference process and caching
  • 28 - Cache management

5. Introduction to Retrieval Augmented Generation (RAG)

  • 29 - LLMs as a knowledge source
  • 30 - Introduction to retrieval augmented generation
  • 31 - RAG - Knowledge curation process
  • 32 - RAG question-answering process
  • 33 - Applications of RAG

6. Implementing RAG with Milvus

  • 34 - Set up Milvus for RAG
  • 35 - Prepare data for the knowledge base
  • 36 - Populate the Milvus database
  • 37 - Answer questions with RAG

7. Vector Databases Best Practices

  • 38 - Choose a vector database
  • 39 - Combine vector and scalar data
  • 40 - Distance measure considerations
  • 41 - Tune vector DB performance

Conclusion

  • 42 - Continue with LLMs

Related courses

Related learn paths

About us

LyndaKade is a leading learning platform that helps people learn business, software, technology, and creative skills to achieve personal and professional goals.

Phone numberAparat ChannelTelegram SupportTelegram ChannelInstagram Page

All rights to this site belong to LyndaKade.

Terms of Service|Privacy Policy

نماد الکترونیک enamad در صورت اتصال با آی‌پی داخل کشور، نمایش داده خواهد شد.
logo-samandehi - لوگو ساماندهی
zarinpal
zibal