Yeah, that’s query. And perhaps simply to type of body this with a little bit of context – I feel it’s helpful to consider sure use instances for every index. So the factor about vector index, or having the ability to use a vector retailer, is that they’re usually well-suited for functions the place you need to ask type of fact-based questions. And so if you wish to ask a query about particular info in your information corpus, utilizing a vector retailer tends to be fairly efficient.
[00:26:13.08] For example, let’s say your information corpus is about American historical past, or one thing, and your query is, “Hey, what occurred within the 12 months of 1780?” That sort of query tends to lend properly to utilizing a vector retailer, as a result of the best way the general system works is you’d take this question, you’d generate an embedding for the question, you’d first do retrieval from the specter retailer with a view to fetch again essentially the most related chunks to the question, and you then would put this into the enter immediate of the language mannequin.
So the set of retrieved objects that you’d get could be these which might be most semantically much like your question by way of embedding distance. So once more, going again to embeddings – the nearer completely different embeddings are between your question and your context, the extra related that context is, and the farther aside it’s, then the much less related. So that you get again essentially the most related context or question, feed it to a language mannequin, get again a solution.
There are different settings the place customary Prime-Okay embedding base lookup – and I can dive into this in as a lot technical depth that you just guys would need to, however there’s a setting that’s actually customary, type of like Prime-Okay embedding-based retrieval doesn’t work properly. And one instance the place it doesn’t usually work properly – and it is a very fundamental instance – is for those who simply need to get a abstract of a whole doc or a whole set of paperwork. Let’s say as an alternative of asking a query a couple of particular reality, like “What occurred and 1776?” perhaps you simply need to ask the language mannequin “Are you able to simply give me a whole abstract of American historical past within the 1800s?” That sort of query tends to not lend properly to embedding-based lookup, since you usually repair a Prime-Okay worth if you do embedding-based lookup, and you’d get again very particular context. However typically you really need the language mannequin to undergo all of the completely different contexts inside your knowledge.
So a vector index, storing it with embeddings would create a question interface the place you possibly can solely fetch the ok most related nodes. In the event you retailer it, as an example, with like an inventory index, you may retailer the objects in a manner such that it’s similar to a flat checklist. So if you question this checklist index, you truly get again all of the related objects inside this checklist, and you then’d feed it to our synthesis module to synthesize the ultimate reply. So the best way you do retrieval over completely different indices truly is determined by the character of those indices.
One other very fundamental instance is that we even have a key phrase desk index, the place you possibly can type of lookup particular objects by key phrases, as an alternative of by way of embedding-based essence. Key phrases, as an example, are usually good for stuff that requires excessive precision, and slightly bit decrease recall. So you actually need to fetch particular objects that match precisely to the key phrases. This has the benefit of truly permitting you to retrieve a bit extra exact context than one thing that factor-based embedding lookup doesn’t.
The best way I take into consideration it is a lot of what Llama Index needs to offer is that this general question interface over your knowledge. Given any class of queries that you just may need to ask, whether or not it’s like a fact-based query, whether or not it’s a abstract query, or whether or not it’s some extra attention-grabbing questions, we need to present the instrument units as a way to reply these questions. And indices, defining the fitting construction of your knowledge is only one step of this general course of, and serving to us obtain this imaginative and prescient of a really journalizable question interface over your knowledge.
Some examples of several types of queries that we help – there’s the fact-based query lookup, which is semantic search utilizing vector embeddings, which you can ask summarization questions by way of utilizing our checklist index. You might truly run a structured question, so you probably have a SQL database, you may truly run structured analytics over your database, and do text-to-SQL. You are able to do examine and distinction sort queries, the place you possibly can truly have a look at completely different paperwork inside your assortment, after which have a look at the variations between them. You might even have a look at temporal queries, the place you possibly can cause about time, after which go forwards and backwards, and principally type of say “Hey, this occasion truly occurred after this occasion. Right here’s the fitting reply to this query that you just’re asking about.”
And so numerous what Llama Index does present is a set of instruments, the indices, the info ingesters, the question interface to resolve any of those queries that you just may need to reply.


