A visual tool for constructing LLM (Large Language Model) training components and generating PyTorch code.
- Visual Component Builder: Drag and drop LLM components to create your architecture
- PyTorch Code Generation: Generate ready-to-use PyTorch code from your visual design
- Component Library: Access embeddings, positional encodings, QKV blocks, and more
- Optimization Options: Configure training optimizations like FSDP, Flash Attention, MoE, and more
- Training Hyperparameters: Fine-tune batch size, learning rate, model dimensions, and more
- Device Detection: Automatically detect and use the best available hardware (CUDA, MPS, CPU)
- Experiment Runner: Run small-scale experiments with synthetic data to test your model
Available Components:
- Embedding Layers: Convert token IDs to embeddings
- Positional Encodings: Add position information to embeddings (Sinusoidal, Learned, Rotary, ALiBi)
- Multi-Head Attention: Self-attention mechanisms with configurable parameters
- Feed Forward Networks: Process features with non-linearity
- Output Layers: Final projection layers with various activation functions
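To illustrate how these components typically fit together, here is a minimal PyTorch sketch; the class and parameter names are illustrative, not the tool's actual generated code:

```python
import torch
import torch.nn as nn

class Block(nn.Module):
    """One transformer block: multi-head self-attention + feed-forward network."""
    def __init__(self, n_embd: int, n_head: int, dropout: float = 0.1):
        super().__init__()
        self.ln1 = nn.LayerNorm(n_embd)
        self.attn = nn.MultiheadAttention(n_embd, n_head, dropout=dropout, batch_first=True)
        self.ln2 = nn.LayerNorm(n_embd)
        self.ffn = nn.Sequential(  # position-wise feed-forward with non-linearity
            nn.Linear(n_embd, 4 * n_embd), nn.GELU(), nn.Linear(4 * n_embd, n_embd),
        )

    def forward(self, x):
        h = self.ln1(x)
        a, _ = self.attn(h, h, h, need_weights=False)  # causal masking omitted for brevity
        x = x + a
        return x + self.ffn(self.ln2(x))

class TinyLM(nn.Module):
    """Embeddings -> positional encoding -> attention/FFN blocks -> output projection."""
    def __init__(self, vocab_size=1000, block_size=128, n_embd=64, n_head=4, n_layer=2):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, n_embd)  # token IDs -> embeddings
        self.pos_emb = nn.Embedding(block_size, n_embd)  # learned positional encoding
        self.blocks = nn.Sequential(*[Block(n_embd, n_head) for _ in range(n_layer)])
        self.head = nn.Linear(n_embd, vocab_size)        # output layer: projection to logits

    def forward(self, idx):  # idx: (batch, seq) of token IDs
        pos = torch.arange(idx.size(1), device=idx.device)
        x = self.tok_emb(idx) + self.pos_emb(pos)
        return self.head(self.blocks(x))
```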
Training Hyperparameters:
- Batch size, block size (context length), and maximum iterations
- Learning rate and evaluation intervals
- Model architecture parameters (embedding dimension, number of heads/layers)
- Dropout rate for regularization
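For illustration, these hyperparameters map naturally onto a small config object; a hypothetical sketch (field names and default values are invented, not the tool's output):

```python
from dataclasses import dataclass

@dataclass
class TrainConfig:
    batch_size: int = 32         # sequences per optimizer step
    block_size: int = 256        # context length in tokens
    max_iters: int = 5000        # total training iterations
    learning_rate: float = 3e-4
    eval_interval: int = 500     # iterations between evaluations
    n_embd: int = 384            # embedding dimension
    n_head: int = 6              # attention heads
    n_layer: int = 6             # transformer layers
    dropout: float = 0.2         # dropout rate for regularization
```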
Distributed Training:
- Fully Sharded Data Parallel (FSDP) with configurable sharding strategies
- DeepSpeed ZeRO with CPU offloading options
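As a minimal sketch of the FSDP path, assuming a distributed process group launched with torchrun and reusing the hypothetical TinyLM from the component sketch above:

```python
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP, ShardingStrategy

dist.init_process_group("nccl")  # one process per GPU, launched e.g. with torchrun

model = TinyLM().cuda()          # hypothetical model from the component sketch
model = FSDP(
    model,
    # FULL_SHARD shards parameters, gradients, and optimizer state;
    # SHARD_GRAD_OP and NO_SHARD are other configurable strategies.
    sharding_strategy=ShardingStrategy.FULL_SHARD,
)
```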
Mixture of Experts (MoE):
- Configure number of experts and routing strategy
- Set top-k experts per token (k=1 gives Switch Transformer-style routing, k=2 is standard MoE)
- Adjust capacity factors for training and evaluation
- Enable expert parallelism for multi-GPU setups
- Control expert dropout for better generalization
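A minimal top-k routing sketch, not the tool's generated code, with capacity limits and load-balancing losses omitted:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoELayer(nn.Module):
    # top_k=1 corresponds to Switch-style routing, top_k=2 to standard MoE.
    def __init__(self, n_embd: int, n_experts: int = 8, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(n_embd, n_experts)  # routing logits per token
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(n_embd, 4 * n_embd), nn.GELU(), nn.Linear(4 * n_embd, n_embd))
            for _ in range(n_experts)
        )

    def forward(self, x):                           # x: (batch, seq, n_embd)
        logits = self.router(x)                     # (batch, seq, n_experts)
        weights, idx = logits.topk(self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)        # normalize over the chosen experts
        out = torch.zeros_like(x)
        for k in range(self.top_k):                 # weighted sum of top-k expert outputs
            for e, expert in enumerate(self.experts):
                mask = (idx[..., k] == e)           # tokens routed to expert e at rank k
                if mask.any():
                    out[mask] += weights[..., k][mask].unsqueeze(-1) * expert(x[mask])
        return out
```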
Attention Optimizations:
- Flash Attention for faster, memory-efficient attention
- xFormers memory-efficient attention mechanisms
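In stock PyTorch 2.x, Flash Attention is reachable through `torch.nn.functional.scaled_dot_product_attention`, which selects the fastest eligible backend automatically; a sketch:

```python
import torch
import torch.nn.functional as F

# (batch, heads, seq, head_dim); FP16/BF16 on CUDA is required for the Flash kernel.
q = torch.randn(4, 8, 1024, 64, device="cuda", dtype=torch.float16)
k, v = torch.randn_like(q), torch.randn_like(q)

# Dispatches to Flash Attention or a memory-efficient kernel when eligible.
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
```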
Memory Optimizations:
- Gradient checkpointing to reduce memory usage
- Mixed precision training (FP16/BF16)
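Both techniques are available in stock PyTorch; a hedged sketch combining them, again using the hypothetical TinyLM (the loss is a placeholder):

```python
import torch
from torch.utils.checkpoint import checkpoint

model = TinyLM().cuda()               # hypothetical model from the component sketch
opt = torch.optim.AdamW(model.parameters(), lr=3e-4)
scaler = torch.cuda.amp.GradScaler()  # loss scaling, needed for FP16 (not BF16)

idx = torch.randint(0, 1000, (8, 128), device="cuda")
with torch.autocast("cuda", dtype=torch.float16):
    # Gradient checkpointing: recompute activations in backward instead of storing them.
    logits = checkpoint(model, idx, use_reentrant=False)
    loss = logits.float().mean()      # placeholder loss, just for the sketch

scaler.scale(loss).backward()
scaler.step(opt)
scaler.update()
```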
Compilation:
- PyTorch 2.0 torch.compile() with different compilation modes
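Compilation itself is a one-liner; for example:

```python
import torch

model = TinyLM()  # hypothetical model from the component sketch
compiled = torch.compile(model, mode="reduce-overhead")  # also: "default", "max-autotune"
```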
Device Detection:
- Automatic detection of CUDA GPUs
- Support for Apple Silicon GPUs via Metal Performance Shaders (MPS)
- Fallback to CPU when no GPU is available
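The detection cascade amounts to a few lines of standard PyTorch:

```python
import torch

def best_device() -> torch.device:
    if torch.cuda.is_available():          # NVIDIA (or ROCm) GPUs
        return torch.device("cuda")
    if torch.backends.mps.is_available():  # Apple Silicon via Metal Performance Shaders
        return torch.device("mps")
    return torch.device("cpu")             # fallback when no GPU is available
```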
Experiment Features:
- Run small-scale experiments with synthetic data
- Configure batch size, epochs, and sequence length
- Track and visualize training metrics (loss, timing)
- Save model checkpoints during training
Prerequisites:
- Node.js 18+ and npm
Installation:
- Clone the repository:

  ```bash
  git clone https://github.com/your-username/llm-graph-trainer.git
  cd llm-graph-trainer
  ```

- Install dependencies:

  ```bash
  npm install
  ```

- Run the development server:

  ```bash
  npm run dev
  ```

- Open http://localhost:3000 in your browser.
Usage:
- Navigate to the Builder page
- Drag components from the left panel onto the canvas
- Connect components by dragging from one node's output handle to another node's input handle
- Configure component parameters by clicking on them
- Go to the Optimizations tab to configure training optimizations
- Configure device detection and experiment settings in the Experiment tab
- Click "Generate Code" to create PyTorch code for your model
- Copy or download the generated code for use in your PyTorch projects
The generated code includes functionality to run small-scale experiments with your model:
- Configure experiment settings in the Experiment tab:
  - Set batch size, epochs, and sequence length
  - Enable metrics tracking and checkpoint saving
  - Configure synthetic dataset size
- The generated code will include a `run_experiment()` function that:
  - Automatically detects the best available device (CUDA, MPS, CPU)
  - Generates synthetic data for training
  - Trains the model for the specified number of epochs
  - Tracks and visualizes training metrics
  - Saves model checkpoints
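As a rough, illustrative outline of what such a function does (this sketch is not the tool's actual output, and it reuses the hypothetical `best_device()` helper from the device-detection sketch above):

```python
import os
import torch
import torch.nn.functional as F

def run_experiment(model, vocab_size=1000, batch_size=8, epochs=3, seq_len=128):
    device = best_device()             # detection cascade sketched earlier
    model = model.to(device)
    opt = torch.optim.AdamW(model.parameters(), lr=3e-4)
    os.makedirs("experiment_results", exist_ok=True)
    losses = []
    for epoch in range(epochs):
        # Synthetic next-token data: random token IDs, targets shifted by one position.
        data = torch.randint(0, vocab_size, (batch_size, seq_len + 1), device=device)
        x, y = data[:, :-1], data[:, 1:]
        logits = model(x)              # (batch, seq, vocab_size)
        loss = F.cross_entropy(logits.reshape(-1, vocab_size), y.reshape(-1))
        opt.zero_grad(); loss.backward(); opt.step()
        losses.append(loss.item())     # tracked metric
        torch.save(model.state_dict(), f"experiment_results/checkpoint_{epoch}.pt")
    return losses
```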
- Run the generated Python code:

  ```bash
  python your_model.py
  ```
- View the results in the `experiment_results` directory:
  - Training loss plots
  - Performance metrics
  - Model checkpoints
The LLM Graph Trainer includes a comprehensive test suite to ensure the application works as expected. The tests focus on verifying that:
- The synchronization between the nodes array and the selectedNode state works correctly
- Parameter updates from the properties panel are reflected in the node data
- Changes to nodes from other sources (like validation) are reflected in the properties panel
- Multiple parameter updates in sequence are handled correctly
To run the tests, first install the dependencies:
```bash
npm install
```
Then run the tests using one of the following commands:
```bash
# Run tests in watch mode
npm test

# Run tests with UI
npm run test:ui

# Run tests with coverage
npm run test:coverage
```
The tests are organized into several files:
- `FlowEditor.test.tsx`: Tests for the main FlowEditor component
- `NodeProperties.test.tsx`: Tests for the NodeProperties component
- `NodeSynchronization.test.tsx`: Tests specifically for the node synchronization mechanism
- `Integration.test.tsx`: Integration tests between FlowEditor and NodeProperties
Key Testing Areas:
- State Synchronization: Tests verify that when a node is updated through any means, both the nodes array and the selectedNode state are kept in sync.
- Parameter Updates: Tests check that parameter changes in the properties panel are correctly applied to the node data.
- Conditional Rendering: Tests ensure that conditional UI elements (like the MoE settings when useMoE is enabled) appear and disappear correctly.
- Multiple Updates: Tests confirm that multiple parameter updates in sequence are all applied correctly.
Built With:
- Next.js
- React
- TypeScript
- Tailwind CSS
- Shadcn UI
- React Flow
- Monaco Editor
This project is licensed under the MIT License - see the LICENSE file for details.
- Inspired by the need for easier LLM architecture experimentation
- Built with modern web technologies for a smooth user experience