This project implements a contextual chatbot that can answer questions based on uploaded documents. It uses FastAPI for the backend, Milvus for vector storage, and GPT-Neo for text generation.
- Upload a document:
- Click on the "Choose File" button in the "Upload Document" section.
- Select a PDF or DOCX file from your computer.
- Click the "Upload" button to process and store the document.
- Ask questions:
- Enter your question in the text box in the "Ask a Question" section.
- Click the "Submit" button to get a response from the chatbot.
- View chat history:
- The chat history will be displayed in the section below the query form.
docker-compose up
python -m venv .venv
source .venv/bin/activate
pip install -r requirement.txt
sh startup.sh
- PDF -> Extract Text -> Chunking -> Embedding -> Save to DB
- Query -> Embedding -> Search in DB -> Retrieve Relevant Chunks -> LLM -> Response
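A minimal, self-contained sketch of both flows, assuming sentence-transformers for embeddings and an in-memory store in place of Milvus; all names here are illustrative, not the project's actual code.

```python
from pypdf import PdfReader
from sentence_transformers import SentenceTransformer
import numpy as np

embedder = SentenceTransformer("all-MiniLM-L6-v2")   # embedding model from the config
store: list[tuple[str, np.ndarray]] = []             # (chunk text, embedding)

def ingest(pdf_path: str, chunk_size: int = 1000) -> None:
    # PDF -> Extract Text -> Chunking -> Embedding -> Save to (in-memory) DB
    text = "".join(page.extract_text() or "" for page in PdfReader(pdf_path).pages)
    chunks = [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]
    for chunk, vec in zip(chunks, embedder.encode(chunks, normalize_embeddings=True)):
        store.append((chunk, vec))

def retrieve(query: str, top_k: int = 3) -> list[str]:
    # Query -> Embedding -> Search in DB -> Retrieve Relevant Chunks (context for the LLM)
    q = embedder.encode([query], normalize_embeddings=True)[0]
    ranked = sorted(store, key=lambda item: -float(np.dot(item[1], q)))
    return [chunk for chunk, _ in ranked[:top_k]]
```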
- FastAPI - used for the backend with two endpoints (sketched below):
  - POST /upload: upload and process a document (PDF or DOCX)
  - POST /query: get a response from the LLM for a given query
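A minimal sketch of the two endpoints, assuming FastAPI with pydantic; field names and the omitted processing steps are illustrative, not the project's actual schemas.

```python
from fastapi import FastAPI, File, UploadFile
from pydantic import BaseModel

app = FastAPI()

class QueryRequest(BaseModel):
    question: str

@app.post("/upload")
async def upload(file: UploadFile = File(...)):
    content = await file.read()  # raw PDF/DOCX bytes
    # ... extract text, chunk, embed, and store in the vector DB ...
    return {"status": "ok", "filename": file.filename}

@app.post("/query")
async def query(req: QueryRequest):
    # ... embed the question, retrieve top-k chunks, call the LLM ...
    return {"answer": "..."}
```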
- Front-End
- A very simple index.html file is kept, which can be used to upload documents and submit queries. The conversation history is also visible.
- Performance Evaluation
- Used ragas for synthetic data generation and performance evaluation
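A hedged sketch of a ragas evaluation call; the ragas API changes between versions, and the metrics and column names below are assumptions, not necessarily what eval.py uses.

```python
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import faithfulness, answer_relevancy

data = Dataset.from_dict({
    "question":     ["What is the notice period?"],
    "answer":       ["The notice period is 30 days."],                   # service response
    "contexts":     [["Either party may terminate with 30 days notice."]],
    "ground_truth": ["30 days."],                                        # from the synthetic testset
})

result = evaluate(data, metrics=[faithfulness, answer_relevancy])
print(result.to_pandas())
```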
- Vector DB
- Used Milvus as the vector DB
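A minimal pymilvus sketch of a collection for chunk storage and a similarity search; the collection name, field names, and index parameters are illustrative assumptions.

```python
from pymilvus import (
    Collection, CollectionSchema, DataType, FieldSchema, connections,
)

connections.connect(alias="default", host="localhost", port="19530")

schema = CollectionSchema([
    FieldSchema("id", DataType.INT64, is_primary=True, auto_id=True),
    FieldSchema("text", DataType.VARCHAR, max_length=65535),
    FieldSchema("embedding", DataType.FLOAT_VECTOR, dim=384),  # matches embedding_dim
])
collection = Collection("doc_chunks", schema)
collection.create_index(
    "embedding",
    {"index_type": "IVF_FLAT", "metric_type": "L2", "params": {"nlist": 128}},
)
collection.load()

# Insert chunks (text column, embedding column), then search with a query embedding:
collection.insert([["chunk text ..."], [[0.0] * 384]])
hits = collection.search(
    data=[[0.0] * 384], anns_field="embedding",
    param={"metric_type": "L2", "params": {"nprobe": 10}},
    limit=3, output_fields=["text"],
)
```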
- Document Processing
- Used LangChain for parsing and processing the documents.
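A hedged sketch of document parsing with LangChain loaders and a text splitter; import paths differ between LangChain versions, and the loaders, splitter, and chunk_overlap value below are assumptions rather than the project's exact setup.

```python
from langchain_community.document_loaders import Docx2txtLoader, PyPDFLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter

def load_and_chunk(path: str) -> list[str]:
    # Pick a loader by file type, then split into chunks sized per the config.
    loader = PyPDFLoader(path) if path.endswith(".pdf") else Docx2txtLoader(path)
    docs = loader.load()
    splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
    return [chunk.page_content for chunk in splitter.split_documents(docs)]
```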
- LLM
- OpenAI is used to generate synthetic data (as the ragas backend), since this is a one-time task.
- As the RAG retriever you can currently use Hugging Face, Ollama, or OpenAI models. However, the module can easily be modified to support other models.
- Three models were tested: (1) Ollama Llama 3.2, (2) OpenAI models, (3) HF GPT-Neo.
- Ollama latency is poor locally, but deploying Ollama on a server would make it faster. Accuracy-wise it is good.
- The OpenAI model is accurate with good latency.
- To use Ollama, set up Ollama locally.
- To use OpenAI, set the API key in the environment.
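A hedged sketch of how the three generation backends could be switched; the function and the model names are illustrative, not the project's actual module.

```python
def generate(prompt: str, backend: str = "openai") -> str:
    if backend == "openai":      # requires OPENAI_API_KEY in the environment
        from openai import OpenAI
        client = OpenAI()
        resp = client.chat.completions.create(
            model="gpt-4o-mini",
            messages=[{"role": "user", "content": prompt}],
        )
        return resp.choices[0].message.content
    if backend == "ollama":      # requires a local Ollama server with the model pulled
        import ollama
        resp = ollama.chat(model="llama3.2", messages=[{"role": "user", "content": prompt}])
        return resp["message"]["content"]
    if backend == "gpt-neo":     # Hugging Face model, runs locally
        from transformers import pipeline
        gen = pipeline("text-generation", model="EleutherAI/gpt-neo-1.3B")
        return gen(prompt, max_new_tokens=200)[0]["generated_text"]
    raise ValueError(f"unknown backend: {backend}")
```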
- Keep all the PDFs in the /data/pdf folder.
- python evaluation/systhetic_data_generation.py
- python evaluation/eval.py
python evaluation/synthetic_data_generate.py --pdf_directory="data/pdfs" --num_questions=12
- Run python evaluation/systhetic_data_generation.py to generate the synthetic test question-answer pairs.
- Run the service and then python evaluation/eval.py to evaluate the service accuracy.
New PDF -> Generate QA pairs -> Evaluate -> Update the config -> Evaluate
1. Generate QA pairs (evaluation/systhetic_data_generation.py)
2. Evaluate (evaluation/eval.py)
3. Check the score (score.csv) and update the config (config.py)
4. Repeat steps 2 and 3
- testset.csv -> evaluation/systhetic_data_generation.py -> generates the question-answer pairs (GT)
- question_answer.csv -> intermediate file
- score.csv -> evaluation/eval.py -> predicts an answer and evaluates the model by comparing it with the GT
- config.json -> config.py -> stores the configuration
{
  "chunk_size": 1000,                 # size of each text chunk
  "model_name": "all-MiniLM-L6-v2",   # embedding model
  "embedding_dim": 384,
  "top_k": 3,                         # number of relevant chunks used as context for the retriever
  "retrival_model": "openai"          # openai/ollama/gpt-neo
}
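A minimal sketch of how config.py could expose config.json, assuming a flat JSON file with the keys shown above (the # comments above are explanatory and would not appear in actual JSON); the real config.py may differ.

```python
import json
from pathlib import Path

CONFIG = json.loads(Path("config.json").read_text())

CHUNK_SIZE      = CONFIG["chunk_size"]       # 1000
EMBEDDING_MODEL = CONFIG["model_name"]       # "all-MiniLM-L6-v2"
EMBEDDING_DIM   = CONFIG["embedding_dim"]    # 384
TOP_K           = CONFIG["top_k"]            # 3
RETRIEVAL_MODEL = CONFIG["retrival_model"]   # "openai" / "ollama" / "gpt-neo"
```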
- Flow: PDF -> Extract Text -> Chunking -> Embedding -> Save to DB -> Query -> Embedding -> Search in DB -> Retrieve Chunks -> Generate Response
- Build a boilerplate application
- Refactor the code to separate the DB and model
- Add logger and timer
- Setup evaluation pipeline
- Prepare question answer pairs for evaluation
- Setup mlops pipeline, versioning:
- Things to track (Store in DB)
- Chunk Size
- Embedding Model name
- Embedding dimension
- Things to track (Retrieve):
- LLM model name
- Latency
- Accuracy
- Cost
- Streaming response setup
- Unit Testing Response
- gpt x - accuracy not good, response time ~30 s, tokens/s = 24.27
- ollama (llama 2.1) - accuracy good, response time ~30 s, tokens/s = 1.9
- gpt-4o-mini - accuracy good, response time 4.6 s, tokens/s = 10.6