A blazing fast, lightweight, and simple vector database made with numpy and llama.cpp in ~1k lines of code.
- 🔥 fastest vector db retrieval with binary embeddings and int8 rescoring
- 🏎️ accelerated embedding generation with llama.cpp
- 🍪 OMOM (pronounced "om-nom") file format, a novel abstraction for storing user context similar to browser cookies
- ingest text, PDF, CSV, PPTX, and webpages
- batteries-included chunking, metadata filtering, and PDF OCR support for extracting text from scanned PDFs
- 77.95% faster than Chroma on indexing, and 422% faster on retrieval
```bash
pip install vlite
```
To enable PDF OCR support (with surya), install the `vlite[ocr]` extra:
```bash
pip install vlite[ocr]
```
```python
from vlite import VLite
from vlite.utils import process_pdf

vdb = VLite()

# Add raw text with optional metadata.
vdb.add("hello world", metadata={"artist": "adele"})

# Add a PDF, using OCR to extract text from scanned pages.
vdb.add(process_pdf("attention-is-all-you-need.pdf", use_ocr=True))

# Retrieve the most relevant chunks for a query.
results = vdb.retrieve("how do transformers work?")
print(results)
```
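For longer documents, the built-in chunking splits text into pieces before embedding. As a rough illustration of the idea only (a generic sketch, not vlite's internal chunker; `chunk_text`, `chunk_size`, and `overlap` are illustrative names), fixed-size chunking with overlap looks like this:

```python
def chunk_text(text: str, chunk_size: int = 512, overlap: int = 64) -> list[str]:
    """Split text into overlapping windows of roughly chunk_size characters."""
    step = chunk_size - overlap
    return [text[start:start + chunk_size]
            for start in range(0, max(len(text) - overlap, 1), step)]
```

Each chunk can then be passed to `vdb.add`; the overlap helps preserve context that would otherwise be cut at chunk boundaries.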
vlite is a vector database built for agents, ChatGPT Plugins, and other AI apps that need a fast, simple store for vectors. It was developed to support the billions of embeddings generated, indexed, and sorted for the ChatWith+ ChatGPT Plugins, which serve millions of users; most existing vector databases either crashed daily under that load or were too expensive for the throughput required.
vlite introduces the OMOM (pronounced "om-nom") file format, which acts like a browser cookie for user embeddings: it provides efficient storage and retrieval of embeddings, plus composability, portability, and user context.
Under the hood, vlite uses llama.cpp for accelerated embedding generation and defaults to binary embeddings with INT8 rescoring, giving it the fastest retrieval among in-memory vector databases. It beats Chroma on both indexing and retrieval, including 77.95% faster indexing.
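The two-stage retrieval idea can be sketched in plain numpy. This is an illustration of the general technique (binary quantization for a cheap first pass, int8 rescoring of a shortlist), not vlite's actual implementation; the function names and the `rescore_factor` parameter are made up for the example:

```python
import numpy as np

def to_binary(x: np.ndarray) -> np.ndarray:
    """Pack the sign bits of float embeddings into uint8 codes."""
    return np.packbits(x > 0, axis=-1)

def to_int8(x: np.ndarray) -> np.ndarray:
    """Linearly scale float embeddings into the int8 range."""
    scale = max(float(np.abs(x).max()), 1e-8)
    return np.round(x / scale * 127).astype(np.int8)

def search(query: np.ndarray, db: np.ndarray, top_k: int = 5, rescore_factor: int = 4):
    """Binary first pass over the whole index, int8 rescoring of a shortlist."""
    q_bin, db_bin = to_binary(query), to_binary(db)
    q_int8, db_int8 = to_int8(query), to_int8(db)

    # Stage 1: Hamming distance over packed bits (cheap scan of every vector).
    hamming = np.unpackbits(db_bin ^ q_bin, axis=-1).sum(axis=-1)
    shortlist = np.argsort(hamming)[: top_k * rescore_factor]

    # Stage 2: int8 dot products only on the shortlist, then final ranking.
    scores = db_int8[shortlist].astype(np.int32) @ q_int8.astype(np.int32)
    return shortlist[np.argsort(-scores)][:top_k]
```

Hamming distance over packed sign bits is far cheaper than float32 dot products, and rescoring only a small shortlist with int8 arithmetic recovers most of the full-precision ranking quality.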
AGPL-3.0 License
Thanks to Claude, Ray, and Howard for their contributions to vlite. If you'd like to contribute, please open an issue or a pull request.