This project demonstrates how a Large Language Model (LLM) can be fine-tuned to perform specialised tasks. In this example, the base model used is Gemma 3 1B (pretrained), fine-tuned specifically to translate between English and Morse code.
Note: This LLM is intended solely for demonstration and educational purposes. It does not have practical real-world applications beyond being a teaching example.
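For context on the task itself: Morse code maps each letter and digit to a fixed sequence of dots and dashes, so a rule-based translator is trivial to write; the point of this project is teaching an LLM to learn the mapping instead. A minimal reference implementation (ITU table, letters and digits only, punctuation omitted):

```python
# ITU Morse code table for letters and digits (punctuation omitted)
MORSE = {
    "A": ".-", "B": "-...", "C": "-.-.", "D": "-..", "E": ".", "F": "..-.",
    "G": "--.", "H": "....", "I": "..", "J": ".---", "K": "-.-", "L": ".-..",
    "M": "--", "N": "-.", "O": "---", "P": ".--.", "Q": "--.-", "R": ".-.",
    "S": "...", "T": "-", "U": "..-", "V": "...-", "W": ".--", "X": "-..-",
    "Y": "-.--", "Z": "--..", "0": "-----", "1": ".----", "2": "..---",
    "3": "...--", "4": "....-", "5": ".....", "6": "-....", "7": "--...",
    "8": "---..", "9": "----.",
}
REVERSE = {code: char for char, code in MORSE.items()}

def encode(text: str) -> str:
    # letters separated by spaces, words by " / "
    return " / ".join(
        " ".join(MORSE[c] for c in word if c in MORSE)
        for word in text.upper().split()
    )

def decode(morse: str) -> str:
    return " ".join(
        "".join(REVERSE[code] for code in word.split())
        for word in morse.split(" / ")
    )

print(encode("SOS"))          # → ... --- ...
print(decode("... --- ..."))  # → SOS
```

The letter-space/word-separator convention (`" / "` between words) is one common choice; the project's own library may use a different one.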
- Python >3.11 and <3.13 (there are known compatibility issues with Python 3.13)
- VS Code
- VS Code Jupyter Notebook extension
- A Hugging Face account
- Visit Hugging Face and log in to your account.
- Navigate to your profile settings by clicking on your avatar in the top-right corner and selecting "Settings."
- In the settings menu, select "Access Tokens."
- Click on "New Token," provide a name for the token, and set the role to "write."
- Copy the generated token.
Run the following command in your terminal to log in:

```shell
huggingface-cli login
```

Paste your token when prompted.
The notebooks in this project are best run in a Linux or WSL2 environment. Running them natively on Windows can present challenges. I used WSL2 with Debian.
It is recommended to create a Python virtual environment before installing the project requirements. This ensures that dependencies are isolated and do not interfere with other projects.
To create and activate a virtual environment, follow these steps:
- Create a virtual environment:

  ```shell
  python -m venv .venv
  ```

- Activate the virtual environment:

  ```shell
  source .venv/bin/activate
  ```

- Install the requirements:

  ```shell
  pip install -r requirements.txt
  ```
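Before installing anything, you can confirm the virtual environment is actually active: inside a venv, Python's `sys.prefix` differs from `sys.base_prefix`. A quick check:

```python
import sys

# inside an activated virtual environment, sys.prefix points at .venv
# while sys.base_prefix still points at the system interpreter
in_venv = sys.prefix != sys.base_prefix
print("virtual environment active:", in_venv)
```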
To use the virtual environment in a Jupyter Notebook within VS Code:
- Open the Command Palette (`Ctrl+Shift+P`, or `Cmd+Shift+P` on Mac).
- Search for and select `Python: Select Interpreter`.
- Choose the interpreter located in the `.venv` directory (e.g., `./.venv/bin/python` or `.\.venv\Scripts\python.exe`).
- Open your notebook, and in the top-right corner, select the kernel corresponding to the virtual environment.
Tip: Run the notebooks sequentially, from `00-test-env.ipynb` through `02-fine-tune-bi.ipynb`.
- `00-test-env.ipynb`: Verifies that the custom Morse code library is installed and confirms that the Jupyter Notebook widgets are functioning as expected.
- `01-build-dataset.ipynb`: Creates the training dataset by preparing English phrases and their Morse code translations. This notebook includes data normalisation, encoding, deduplication, and uploading the dataset to Hugging Face.
- `02-fine-tune-bi.ipynb`: Fine-tunes the Gemma 3 model for bidirectional translation between English and Morse code. It uses the dataset prepared in `01-build-dataset.ipynb` and trains the model for both directions of translation.
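The notebook's exact code isn't reproduced here, but the dataset steps described above (normalisation, encoding, deduplication) can be sketched in plain Python. In this sketch, `to_morse`, its abbreviated table, and the example phrases are illustrative placeholders; the real notebook uses the project's Morse library and also uploads the result to Hugging Face:

```python
# Hedged sketch of the dataset-building steps; to_morse stands in for
# the project's Morse encoding library.
MORSE = {"H": "....", "I": "..", "O": "---", "K": "-.-"}  # abbreviated table

def to_morse(text: str) -> str:
    return " ".join(MORSE[c] for c in text.upper() if c in MORSE)

def build_dataset(phrases):
    seen = set()
    rows = []
    for phrase in phrases:
        # normalise: collapse whitespace, uppercase
        normalised = " ".join(phrase.upper().split())
        if normalised in seen:  # deduplicate
            continue
        seen.add(normalised)
        rows.append({"english": normalised, "morse": to_morse(normalised)})
    return rows

print(build_dataset(["hi", "HI", "ok"]))
```

The duplicate `"HI"` is dropped, leaving two rows of English/Morse pairs ready to be pushed as a dataset.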
To evaluate the model using TensorBoard, follow these steps:

- Run the following command to start TensorBoard:

  ```shell
  tensorboard --logdir outputs
  ```

- Ensure your training configuration includes the following settings:

  ```python
  SFTConfig(
      ...
      output_dir="outputs",
      report_to="tensorboard",
  )
  ```
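Assuming the snippet above comes from TRL's `SFTConfig` (which inherits Hugging Face's `TrainingArguments`), a slightly fuller, hypothetical version with explicit logging settings might look like:

```python
from trl import SFTConfig

# hypothetical values; only output_dir and report_to come from this guide
config = SFTConfig(
    output_dir="outputs",      # TensorBoard reads event files from here
    report_to="tensorboard",   # write training metrics as TensorBoard events
    logging_steps=10,          # how often scalars (loss, grad_norm) are logged
)
```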
Once TensorBoard is running, watch these metrics:

- Training Loss: This metric indicates how well the model is learning during training. A decreasing training loss generally signifies that the model is improving. However, if the loss plateaus or increases, it may indicate overfitting or learning issues.
- Gradient Norm (grad_norm): This measures the magnitude of gradients during backpropagation. Large gradient norms can lead to instability, while very small norms may indicate vanishing gradients. Monitor this value to ensure stable and effective training.
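The `grad_norm` value shown in TensorBoard is the global L2 norm taken over all parameter gradients together. As a quick illustration of the arithmetic (not the trainer's actual code):

```python
import math

def global_grad_norm(grad_tensors):
    # L2 norm across every gradient value in every parameter tensor
    return math.sqrt(sum(g * g for tensor in grad_tensors for g in tensor))

# two toy "parameter gradients", flattened: sqrt(3^2 + 4^2)
print(global_grad_norm([[3.0], [4.0]]))  # → 5.0
```

Gradient clipping (e.g. the `max_grad_norm` training argument) caps exactly this quantity to keep updates stable.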
After creating a GGUF file and hosting it on Hugging Face, you can download and use it in LM Studio. LM Studio is a user-friendly interface for interacting with language models, allowing you to test and deploy your fine-tuned model efficiently. Simply follow the instructions in LM Studio to load the GGUF file and start using your model.
It is also possible to build a GGUF file and serve it with Ollama. Ollama provides a platform for deploying and managing language models with ease. For detailed instructions on how to set this up, refer to the ollama/README.md file included in this repository.
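As a rough sketch of that workflow (the authoritative steps are in ollama/README.md; the GGUF file name and model name below are placeholders):

```shell
# write a minimal Modelfile pointing at the downloaded GGUF (path is a placeholder)
cat > Modelfile <<'EOF'
FROM ./model.gguf
EOF

# register the model with Ollama and run it (model name is illustrative)
ollama create morse-gemma -f Modelfile
ollama run morse-gemma
```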