Community Session IndexingChaining

The document discusses LLMs as builders and the concepts of chaining and indexing. It introduces LangChain as an interface for chaining LLMs, vector databases, and documents. Chaining involves connecting different LLMs and data sources to perform useful tasks. Indexing involves splitting documents into chunks, creating embeddings, and storing them in a vector database for retrieval. The document provides examples of chaining applications and companies working on vector databases and chaining tools.

Uploaded by

Sani Kamal

LLM Indexing and Chaining

Community Session
Outline
Introductory Talk - Thinking about LLMs as Builders
● Chaining
● Indexing
● LLMs, Vector DBs, and LLM Ops

Community Breakout Discussion Activities

Live Interactive Build Demo!

● Document querying with LangChain

● LangChain (also LlamaIndex, Haystack, others) provide a standard interface for Chains

● Chains = main abstraction innovation

“LLMs in isolation is often insufficient for creating a truly powerful app - the real power comes when you can combine them with other sources of computation or knowledge.” ~ Harrison Chase, Creator of LangChain
Creating an Index (with a Data-Indexing Chain)
1. Splitting the document into chunks
2. Creating an embedding for each chunk
3. Storing the chunks and their embeddings in a vectorstore
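
The three steps above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not LangChain's API: `split_into_chunks`, `toy_embed`, and `build_index` are hypothetical names, the "embedding" is a simple word-count vector standing in for a real embedding model, and an in-memory list stands in for the vectorstore.

```python
import re
from collections import Counter


def split_into_chunks(text, chunk_size=8, overlap=2):
    """Step 1: split a document into overlapping chunks of `chunk_size` words."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last chunk already reaches the end of the document
    return chunks


def toy_embed(text):
    """Step 2 (stand-in): a sparse word-count vector, NOT a real embedding model."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def build_index(document, chunk_size=8, overlap=2):
    """Step 3 (stand-in): a plain list plays the role of the vectorstore."""
    return [
        {"text": chunk, "embedding": toy_embed(chunk)}
        for chunk in split_into_chunks(document, chunk_size, overlap)
    ]


if __name__ == "__main__":
    sample = (
        "The guide says a towel is the most useful item "
        "a traveler can carry across the galaxy and beyond"
    )
    for entry in build_index(sample):
        print(entry["text"])
```

In a real pipeline, the word-count vector would be replaced by a call to an embedding model and the list by a vector database such as the ones named later in this deck.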

Same chains…
● Prompt Chain
● Tools Chain
● Data Indexing Chain

Same primary components…

● LLM
● Vector Database
● Document(s)

● fetching the most relevant documents for a particular query
○ → those whose embeddings are most similar to the embedding of the query
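
As a rough sketch of that retrieval step, the snippet below ranks documents by cosine similarity between their embeddings and the query embedding. Cosine similarity is one common choice of similarity measure and is an assumption here, as are the helper names `toy_embed`, `cosine_similarity`, and `retrieve`. The word-count "embedding" is only a stand-in for a real embedding model.

```python
import math
import re
from collections import Counter


def toy_embed(text):
    """Stand-in embedding: sparse word-count vector (not a real model)."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse vectors stored as Counters."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, documents, k=2):
    """Return the k (score, document) pairs most similar to the query."""
    query_vec = toy_embed(query)
    scored = [(cosine_similarity(query_vec, toy_embed(doc)), doc) for doc in documents]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]


if __name__ == "__main__":
    docs = [
        "always know where your towel is",
        "the answer is forty two",
        "vogon poetry is terrible",
    ]
    for score, doc in retrieve("where is my towel", docs):
        print(f"{score:.3f}  {doc}")
```

A vector database performs the same ranking, but over precomputed embeddings and usually with an approximate nearest-neighbor index rather than a full scan.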

How does all of this fit together?

LLM Ops
● Definition

How we store, index, and retrieve knowledge that we need to perform useful LLM tasks

https://every.to/chain-of-thought/a-few-things-i-believe-about-ai

● LlamaIndex ~ Open Source

● Haystack ~ $9.2M seed funding/debt financing

○ Extractive QA

● AgentGPT ~ Open Source Project by Level AI ($20M Series B, 2022)

○ Call centers

© 2023 FourthBrain
The more mature infrastructure layer is…
Vector Store DB Companies (LangChain Support)
● Chroma ~ $18M seed round
● FAISS (Facebook)
● Elasticsearch (established company)
● Milvus ~ $60M Series B (extension), $43M in ’21
● Pinecone ~ $100M Series B
● Qdrant ~ $7.5M seed round
● Weaviate ~ $50M Series B

Ex Project Ideas from “Building with LLMs” Course
Simple (1-step)
● Natural Language Website Search: Scrape all text from {hotel}.com webpages and index it in a vector store so that any information can be searched with an LLM

Mild (2-step)
● Technical Q&A (“AI Tech Support”): Create a fine-tuned LLM to answer FAQs about technical documentation, and then if there is no answer use a non-fine-tuned LLM to search all relevant documentation to find an answer.
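
One possible shape for this 2-step chain is a primary step with a fallback. In the sketch below, every name is hypothetical: `faq_model` is a dictionary lookup standing in for the fine-tuned LLM, and `doc_search_model` is a keyword-overlap search standing in for the non-fine-tuned LLM plus documentation search. Only the control flow is the point.

```python
import re
from typing import Callable, Optional


def make_fallback_chain(
    primary: Callable[[str], Optional[str]],
    fallback: Callable[[str], str],
) -> Callable[[str], str]:
    """Build a chain that tries `primary` first and uses `fallback` if it has no answer."""
    def chain(question: str) -> str:
        answer = primary(question)
        return answer if answer else fallback(question)
    return chain


# Hypothetical stand-in for the fine-tuned FAQ model.
FAQ = {
    "how do i reset my password?": "Use the 'Forgot password' link on the sign-in page.",
}


def faq_model(question: str) -> Optional[str]:
    return FAQ.get(question.strip().lower())


# Hypothetical stand-in for the general LLM searching the documentation.
DOCS = [
    "Passwords must be at least 12 characters long.",
    "API keys are rotated from the account settings page.",
]


def _tokens(text: str) -> set:
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def doc_search_model(question: str) -> str:
    best = max(DOCS, key=lambda doc: len(_tokens(question) & _tokens(doc)))
    return f"From the documentation: {best}"


tech_support = make_fallback_chain(faq_model, doc_search_model)

if __name__ == "__main__":
    print(tech_support("How do I reset my password?"))
    print(tech_support("Where are API keys rotated?"))
```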

Ex Project Ideas from “Building with LLMs” Course
Medium (2+step)
● Qualitative + Quantitative Q&A (“The AI VP”): Create a fine-tuned LLM to generate SQL queries for your database structure using common queries useful to your product/sales/etc. team, then perform the SQL query and return the quantitative result. Compare the result against the question asked, and combine into a holistic response.
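
The flow of this chain (question → SQL → result → combined response) can be sketched with Python's built-in `sqlite3` module. The assumptions are loud ones: `generate_sql` returns canned queries and merely stands in for the fine-tuned LLM, and the `orders` table and its rows are invented demo data.

```python
import sqlite3


def generate_sql(question: str) -> str:
    """Stand-in for the fine-tuned LLM: returns a canned query (hypothetical)."""
    if "revenue" in question.lower():
        return "SELECT SUM(amount) FROM orders"
    return "SELECT COUNT(*) FROM orders"


def run_query(conn: sqlite3.Connection, sql: str):
    """Execute the generated SQL and return the single scalar result."""
    return conn.execute(sql).fetchone()[0]


def answer(conn: sqlite3.Connection, question: str) -> str:
    """Chain the steps: generate SQL, run it, combine everything into one response."""
    sql = generate_sql(question)
    value = run_query(conn, sql)
    return f"Q: {question}\nSQL: {sql}\nResult: {value}"


def make_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with invented demo rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL)")
    conn.executemany(
        "INSERT INTO orders (amount) VALUES (?)",
        [(120.0,), (80.0,), (50.0,)],
    )
    return conn


if __name__ == "__main__":
    print(answer(make_demo_db(), "What is our total revenue?"))
```

With a real model generating the SQL, it would be prudent to validate the query and run it with read-only access before trusting the result.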

Breakouts! What are you interested in building solutions for?
(20 minutes)
10 per room. Assign ONE person from your room to take notes and share!
Indexing, Chaining, and LLM Ops

● Dataset: Hitchhiker’s Guide to the Galaxy

● Chaining Tool: LangChain

● Vector DB Tool: ChromaDB
