Files-DB-MCP: Vector Search for Code Projects

A local vector database system that gives LLM coding agents fast, efficient search over software projects via the Model Context Protocol (MCP).

Features

Zero Configuration - Auto-detects project structure with sensible defaults
Real-Time Monitoring - Continuously watches for file changes
Vector Search - Semantic search for finding relevant code
MCP Interface - Compatible with Claude Code and other LLM tools
Open Source Models - Uses Hugging Face models for code embeddings
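To make the idea of vector search concrete, here is a minimal sketch of ranking files by embedding similarity. It is not the project's implementation: the three-dimensional vectors are hand-written stand-ins for real model output, the `cosine_similarity` and `search` helpers are hypothetical, and cosine similarity is assumed here only because it is a common choice for semantic search.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector has no direction, treat as unrelated
    return dot / (norm_a * norm_b)


def search(query_vector, index, top_k=2):
    """Return the top_k file paths whose vectors are closest to the query."""
    scored = [(cosine_similarity(query_vector, vec), path)
              for path, vec in index.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [path for _, path in scored[:top_k]]


# Toy "embeddings" (assumption): a real model produces hundreds of dimensions.
index = {
    "src/auth.py": [0.9, 0.1, 0.0],
    "src/db.py": [0.1, 0.9, 0.1],
    "README.md": [0.3, 0.3, 0.8],
}

results = search([1.0, 0.2, 0.0], index)
print(results)
```

A query vector pointing mostly along the first dimension ranks `src/auth.py` first, because that file's vector points in nearly the same direction.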

Installation

Option 1: Clone and Setup (Recommended)

# Using SSH (recommended if you have SSH keys set up with GitHub)
git clone git@github.com:randomm/files-db-mcp.git ~/.files-db-mcp && bash ~/.files-db-mcp/install/setup.sh

# Using HTTPS (if you don't have SSH keys set up)
git clone https://github.com/randomm/files-db-mcp.git ~/.files-db-mcp && bash ~/.files-db-mcp/install/setup.sh

Option 2: Automated Installation Script

curl -fsSL https://raw.githubusercontent.com/randomm/files-db-mcp/main/install/install.sh | bash

Usage

After installation, run in any project directory:

files-db-mcp

The service will:

Detect your project files
Start indexing in the background
Begin responding to MCP search queries immediately
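The behavior described above (indexing in the background while queries are already being answered) can be sketched with a worker thread and a lock. This is an illustration under stated assumptions, not the service's actual code: the `BackgroundIndex` class is hypothetical, and plain substring matching stands in for vector search to keep the example self-contained.

```python
import threading


class BackgroundIndex:
    """Hypothetical index that can be queried while it is still being built."""

    def __init__(self):
        self._lock = threading.Lock()
        self._docs = {}
        self.done = threading.Event()

    def _index_files(self, files):
        for path, text in files.items():
            with self._lock:
                self._docs[path] = text.lower()
        self.done.set()  # signal that the initial pass has finished

    def start(self, files):
        """Begin indexing on a worker thread and return immediately."""
        worker = threading.Thread(target=self._index_files, args=(files,), daemon=True)
        worker.start()
        return worker

    def search(self, term):
        """Search whatever has been indexed so far (substring match, an assumption)."""
        with self._lock:
            return sorted(p for p, t in self._docs.items() if term.lower() in t)


files = {"auth.py": "def login(user): ...", "db.py": "def connect(url): ..."}
index = BackgroundIndex()
worker = index.start(files)      # does not block
early = index.search("login")    # may be empty or partial while indexing runs
worker.join()
final = index.search("login")    # complete once the worker has finished
```

Early queries may return partial results; the trade-off is that the agent never has to wait for the full index before it can start searching.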

Requirements

Docker
Docker Compose

Configuration

Files-DB-MCP works without configuration, but you can customize it with environment variables:

EMBEDDING_MODEL - Change the embedding model (default: 'jinaai/jina-embeddings-v2-base-code' or project-specific model)
FAST_STARTUP - Set to 'true' to use a smaller model for faster startup (default: 'false')
QUANTIZATION - Enable/disable quantization (default: 'true')
BINARY_EMBEDDINGS - Enable/disable binary embeddings (default: 'false')
IGNORE_PATTERNS - Comma-separated list of files/dirs to ignore
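As a rough sketch of how these variables could be read, the snippet below loads them into a small settings object. The variable names and defaults come from the list above; everything else is an assumption for illustration, including the `Settings`, `load_settings`, and `is_ignored` names, the accepted boolean spellings, and the use of glob-style matching for `IGNORE_PATTERNS`.

```python
import fnmatch
import os
from dataclasses import dataclass, field


def _as_bool(value, default):
    """Parse a boolean-like string; the accepted spellings are an assumption."""
    if value is None:
        return default
    return value.strip().lower() in ("1", "true", "yes", "on")


@dataclass
class Settings:
    # Defaults mirror the documented ones.
    embedding_model: str = "jinaai/jina-embeddings-v2-base-code"
    fast_startup: bool = False
    quantization: bool = True
    binary_embeddings: bool = False
    ignore_patterns: list = field(default_factory=list)


def load_settings(env=None):
    """Build Settings from a mapping of environment variables."""
    env = os.environ if env is None else env
    defaults = Settings()
    raw_patterns = env.get("IGNORE_PATTERNS", "")
    patterns = [p.strip() for p in raw_patterns.split(",") if p.strip()]
    return Settings(
        embedding_model=env.get("EMBEDDING_MODEL", defaults.embedding_model),
        fast_startup=_as_bool(env.get("FAST_STARTUP"), defaults.fast_startup),
        quantization=_as_bool(env.get("QUANTIZATION"), defaults.quantization),
        binary_embeddings=_as_bool(env.get("BINARY_EMBEDDINGS"), defaults.binary_embeddings),
        ignore_patterns=patterns,
    )


def is_ignored(path, patterns):
    """Glob-style matching (an assumption; the real matching rules may differ)."""
    return any(fnmatch.fnmatch(path, pattern) for pattern in patterns)


settings = load_settings({"FAST_STARTUP": "true", "IGNORE_PATTERNS": "node_modules/*, *.pyc"})
print(settings)
```

Passing a plain dictionary instead of `os.environ` keeps the sketch easy to try without changing your shell environment.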

First-Time Startup

On first run, Files-DB-MCP downloads the embedding models, which may take several minutes depending on:

The size of the selected model (300-500MB for high-quality models)
Your internet connection speed

Subsequent startups will be much faster as models are cached in a persistent Docker volume. For faster initial startup, you can:

# Use a smaller, faster model (90MB)
EMBEDDING_MODEL=sentence-transformers/all-MiniLM-L6-v2 files-db-mcp

# Or enable fast startup mode
FAST_STARTUP=true files-db-mcp

Model Caching

Files-DB-MCP automatically persists downloaded embedding models, so you only need to download them once:

Models are stored in a Docker volume called model_cache
This volume persists between container restarts
The cache is shared by all projects using Files-DB-MCP on your machine, so the model is not downloaded again for each project

Claude Code Integration

Add to your Claude Code configuration:

{
  "mcpServers": {
    "files-db-mcp": {
      "command": "python",
      "args": ["/path/to/src/claude_mcp_server.py", "--host", "localhost", "--port", "6333"]
    }
  }
}

For details, see Claude MCP Integration.
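If you prefer to generate this entry rather than edit it by hand, a small script along these lines could produce it. This is only a sketch: the `build_mcp_entry` helper is hypothetical, `/path/to` is kept as the same placeholder used above and must be replaced with the real location of your checkout, and the host and port simply repeat the values from the example configuration.

```python
import json
from pathlib import Path


def build_mcp_entry(repo_dir, host="localhost", port=6333):
    """Return the mcpServers entry as a dict, given the checkout directory."""
    script = Path(repo_dir) / "src" / "claude_mcp_server.py"
    return {
        "mcpServers": {
            "files-db-mcp": {
                "command": "python",
                # Every argument is passed as a string, matching the JSON above.
                "args": [str(script), "--host", host, "--port", str(port)],
            }
        }
    }


# "/path/to" is a placeholder, not a real location.
config = build_mcp_entry("/path/to")
print(json.dumps(config, indent=2))
```

The printed JSON can then be merged into your Claude Code configuration file.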

Documentation

Installation Guide - Detailed setup instructions
API Reference - Complete API documentation
Configuration Guide - Configuration options

Repository Structure

/src - Source code
/tests - Unit and integration tests
/docs - Documentation
/scripts - Utility scripts
/install - Installation scripts
/.docker - Docker configuration
/config - Configuration files
/ai-assist - AI assistance files

License

MIT License

Contributing

Contributions welcome! Please feel free to submit a pull request.

