A lightweight FastAPI server for generating 384-dimensional dense vector embeddings from text using the MiniLM-L6 model (`all-MiniLM-L6-v2`). Useful for semantic search, text similarity, and other NLP tasks.
- Simple REST API for text embedding
- Fast inference with Sentence Transformers
- Docker and direct Python support
- Returns embeddings as JSON arrays
- Handles invalid input with clear error messages
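For orientation, `embed_server.py` can be assumed to look roughly like the sketch below. Only the `/embed` route, the model name, the 384-dimensional output, port 6700, and the error behavior come from this README; the class and variable names are illustrative:

```python
# Minimal sketch of the server, assuming fastapi, uvicorn, pydantic,
# and sentence-transformers are installed. Names other than the route,
# port, and model are illustrative, not taken from the actual source.
from fastapi import FastAPI, HTTPException
from pydantic import BaseModel
from sentence_transformers import SentenceTransformer

app = FastAPI()
model = SentenceTransformer("all-MiniLM-L6-v2")  # produces 384-dim vectors

class EmbedRequest(BaseModel):
    text: str  # a missing or non-string field triggers FastAPI's 422 response

@app.post("/embed")
def embed(request: EmbedRequest):
    try:
        vector = model.encode(request.text)  # numpy array of length 384
    except Exception as exc:
        # surface model failures as a 500, as described under error handling
        raise HTTPException(status_code=500, detail=str(exc))
    return {"embedding": vector.tolist()}

if __name__ == "__main__":
    import uvicorn
    uvicorn.run(app, host="0.0.0.0", port=6700)
```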
To run with Docker:

- Build the Docker image:

  ```bash
  docker build -t balena-embed .
  ```

- Run the server:

  ```bash
  docker run -p 6700:6700 balena-embed
  ```
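The build step assumes a `Dockerfile` in the repository root; a plausible sketch (not necessarily the project's actual file) is:

```dockerfile
# Hypothetical Dockerfile sketch; the project's real file may differ.
FROM python:3.11-slim
WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY . .
EXPOSE 6700
CMD ["python", "embed_server.py"]
```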
To run directly with Python:

- Install Python 3.11 and pip if not already installed.
- Install the dependencies (a sketch of a plausible `requirements.txt` follows this list):

  ```bash
  pip install -r requirements.txt
  ```

- Start the server:

  ```bash
  python embed_server.py
  ```
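A plausible `requirements.txt` for this setup (unpinned here; the actual file may pin versions):

```text
fastapi
uvicorn
sentence-transformers
```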
Send a POST request to `http://localhost:6700/embed` with a JSON body:

```json
{
  "text": "your text here"
}
```
The response is a JSON object with a 384-dimensional embedding vector:

```json
{
  "embedding": [0.123, ...]
}
```
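For example, calling the endpoint from Python with the `requests` library (an illustrative client, not part of this repo):

```python
# Illustrative client; assumes the server is running on localhost:6700
# and that the requests library is installed.
import requests

resp = requests.post(
    "http://localhost:6700/embed",
    json={"text": "your text here"},
    timeout=30,
)
resp.raise_for_status()
embedding = resp.json()["embedding"]
print(len(embedding))  # 384
```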
- If the `text` field is missing or not a string, the server returns a 422 error with details (see the example below).
- If the model fails to generate an embedding, a 500 error is returned.
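For reference, FastAPI's default 422 body looks roughly like this (the exact `loc`/`msg`/`type` values vary with the FastAPI/Pydantic version):

```json
{
  "detail": [
    {
      "loc": ["body", "text"],
      "msg": "field required",
      "type": "value_error.missing"
    }
  ]
}
```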
- Python 3.11 (or the `python:3.11-slim` Docker base image)
- pip (for direct install)
- Docker (for containerized install)
Contributions are welcome! Please open an issue or submit a pull request.
This project is licensed under the GPL-3.0 License. See the LICENSE file for details.