This repository provides a RESTful API, built with Flask, for interacting with NGILlama3, a custom language model based on the Llama architecture. The API performs text generation using a pre-trained language model from Hugging Face and is designed for a range of natural language processing tasks.
- Text Generation: Generate responses based on user input.
- Custom Model: The API uses the NGILlama3 model, a fine-tuned version of Llama, for improved natural language understanding and generation.
- Hugging Face Integration: Uses the Hugging Face `transformers` library for easy access to pre-trained models and tokenizers.
To use the application, ensure the following dependencies are installed:
- Docker: Required for running the application in a containerized environment.
- Python 3.8+: If running the application outside of Docker, you need Python and the associated libraries.
- Clone this repository:

  ```bash
  git clone https://github.com/HeReFanMi/NGI_LLM.git
  cd NGI_LLM
  ```
- Build the Docker image:

  ```bash
  docker build -t ngillama3-flask-api .
  ```
Once the image is built, you can run the container using:

```bash
docker run -d -p 5002:5002 ngillama3-flask-api
```

This will start the Flask application on port 5002 inside the container and expose it to your host machine.
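For reference, a representative Dockerfile for this setup might look like the sketch below. This is not necessarily the file shipped in the repository; the `app.py` entry point and the `requirements.txt` file are assumptions.

```dockerfile
# Sketch of a Dockerfile for the Flask API; the repository's actual file may differ.
FROM python:3.10-slim

WORKDIR /app

# Install Python dependencies first to take advantage of Docker layer caching.
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# Copy the application code (entry point assumed to be app.py).
COPY . .
ENV FLASK_APP=app.py

EXPOSE 5002
CMD ["flask", "run", "--host=0.0.0.0", "--port=5002"]
```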
The Flask API exposes a single endpoint:

- `POST /predict`: Takes a JSON payload with a text input and returns a generated response.
Make a POST request to `/predict` with the following JSON payload:

```json
{
  "text": "Your input text here."
}
```
The API will respond with a JSON object containing the generated text:
```json
{
  "response": "The model-generated text here."
}
```
Example request using curl (note that this example sends a retrieval-style payload of text chunks plus a question, rather than the single `text` field shown above):

```bash
curl -X POST http://127.0.0.1:5002/predict \
  -H "Content-Type: application/json" \
  -d '{"chunks": ["A new study has shown that regular exercise can help reduce the risk of chronic diseases such as diabetes and heart disease.", "Research also indicates that physical activity improves mental health and overall quality of life."], "question": "What are the health benefits of regular exercise?"}'
```
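For orientation, here is a minimal sketch of how a `/predict` handler matching the documented `text` payload could be implemented. It is illustrative only, not the repository's actual code; the model name is taken from the Model Details section below, and the generation parameters are arbitrary defaults.

```python
# Minimal sketch of a /predict handler; the repository's implementation may differ.
from flask import Flask, jsonify, request
from transformers import AutoModelForCausalLM, AutoTokenizer

app = Flask(__name__)

MODEL_NAME = "a-hamdi/NGILlama3-merged"
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME)

@app.route("/predict", methods=["POST"])
def predict():
    # Read the "text" field from the incoming JSON payload.
    text = request.get_json().get("text", "")
    inputs = tokenizer(text, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=256)
    response = tokenizer.decode(outputs[0], skip_special_tokens=True)
    return jsonify({"response": response})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5002)
```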
- Model Name: `a-hamdi/NGILlama3-merged`
- Architecture: Fine-tuned Llama model.
- Hugging Face Model: [NGILlama3-merged](https://huggingface.co/a-hamdi/NGILlama3-merged) on Hugging Face
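For a quick sanity check outside the API, the model can be loaded directly with the `transformers` library. A minimal sketch follows; the prompt and generation length are illustrative, not the API's settings.

```python
# Minimal sketch: load NGILlama3-merged directly via the transformers pipeline.
from transformers import pipeline

generator = pipeline("text-generation", model="a-hamdi/NGILlama3-merged")
result = generator("What are the health benefits of regular exercise?", max_new_tokens=128)
print(result[0]["generated_text"])
```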
To run the application locally without Docker, follow these steps:
- Clone the repository:

  ```bash
  git clone https://github.com/HeReFanMi/NGI_LLM.git
  cd NGI_LLM
  ```
- Set up the Conda environment:

  ```bash
  conda create --name unsloth_env python=3.10 pytorch-cuda=11.8 pytorch cudatoolkit xformers -c pytorch -c nvidia -c xformers -y
  conda activate unsloth_env
  ```
- Install the required Python dependencies:

  ```bash
  pip install "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
  pip install --no-deps "trl<0.9.0" peft accelerate bitsandbytes
  ```
- Run the Flask app:

  ```bash
  flask run --port 5002
  ```

  By default, `flask run` serves on port 5000, so pass `--port 5002` to match the Docker setup. If Flask cannot locate the application, set the `FLASK_APP` environment variable to the application module.

This will start the application at http://127.0.0.1:5002.
The application relies on the following Python libraries:
- `transformers==4.33.2`: Hugging Face Transformers library for working with pre-trained models.
- `torch==2.0.1`: PyTorch for model inference.
- `flask==2.3.2`: Flask web framework for building the API.
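If you prefer to pin these with pip, a minimal `requirements.txt` matching the versions above would look like this (the repository may ship its own file):

```text
transformers==4.33.2
torch==2.0.1
flask==2.3.2
```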
- Model Loading Issues: Ensure the model is available on Hugging Face and the internet connection is stable.
- Out of Memory Errors: If you are running the app locally and hit memory limits, consider using a machine with a more powerful GPU or reducing the model's memory footprint, for example by loading it in 8-bit (see the sketch below).
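Since `bitsandbytes` is installed in the Conda setup above, one way to shrink the footprint is 8-bit loading. A minimal sketch, not the repository's actual loading code:

```python
# Sketch: load the model in 8-bit to reduce GPU memory usage.
# Assumes bitsandbytes is available (installed in the Conda setup above).
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_name = "a-hamdi/NGILlama3-merged"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",  # spread layers across available devices
)
```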