E5C2 GitHub - ksm26/Prompt-Compression-and-Query-Optimization: Enhance the performance and cost-efficiency of large-scale Retrieval Augmented Generation (RAG) applications. Learn to integrate vector search with traditional database operations and apply techniques like prefiltering, postfiltering, projection, and prompt compression. · GitHub
[go: up one dir, main page]

Skip to content

ksm26/Prompt-Compression-and-Query-Optimization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🔍 Welcome to the "Prompt Compression and Query Optimization" course! Course will equip you with the skills to optimize the performance and cost-efficiency of large-scale Retrieval Augmented Generation (RAG) applications by integrating traditional database features with vector search capabilities.

Course Summary

In this course, you'll learn to optimize large-scale RAG applications by integrating vector search capabilities with traditional database operations. Here’s what you can expect to learn and experience:

  1. 📋 Prefiltering and Postfiltering: Filter results based on specific conditions. Prefiltering is done at the database index creation stage, while postfiltering is applied after the vector search is performed.
  2. 📊 Projection: Select a subset of fields returned from a query to minimize the size of the output, enhancing performance and security.
  3. 🔄 Reranking: Reorder search results based on other data fields to improve the relevance and quality of information retrieval.
  4. ✂️ Prompt Compression: Reduce the length of prompts, which can be expensive to process in large-scale applications, optimizing both performance and cost.

Key Points

  • 🌐 Vector Search and Database Operations: Combine vector search capabilities with traditional database operations to build efficient and cost-effective RAG applications.
  • 🚀 Optimized Query Processing: Use prefiltering, postfiltering, and projection techniques for faster query processing and optimized query output.
  • 💡 Prompt Compression: Implement prompt compression techniques to reduce the length of prompts, making them more efficient to process in large-scale applications.

About the Instructor

🌟 Richmond Alake is a Developer Advocate at MongoDB, bringing extensive expertise in database optimization and vector search capabilities to guide you through this course.

🔗 To enroll in the course or for further information, visit deeplearning.ai.

Releases

No releases published

Packages

 
 
 

Contributors

0