-
sigseg.com
- Huntington Beach, CA
- johnnylambada.com
- @johnnylambada
Stars
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
Automagically reverse-engineer REST APIs via capturing traffic
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
Termux - a terminal emulator application for Android OS extendible by variety of packages.
Generates a bitmap from html by rendering the content inside an off screen webview
Official inference framework for 1-bit LLMs
CLI Jellyfin Controller Utility for Linux and Windows
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
Exploring Hacker News by mapping and analyzing 40 million posts and comments for fun
Distribute and run LLMs with a single file.
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…
JHipster is a development platform to quickly generate, develop, & deploy modern web applications & microservice architectures.
Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.
If you are using Googles App Engine and want to use secrets in the app.yaml file, you can store them as Secrets in your repository and have them replaced during deployment.
🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corporate oracle. Supports open-source LLMs like Llama 2, Falcon, a…
Universal LLM Deployment Engine with ML Compilation
Automated Dollar Cost Averaging Leveraged & Inverse Funds
A verbosely commented minimal progressive web app
📦 Workbox: JavaScript libraries for Progressive Web Apps
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
The Remix Stack for deploying to Fly with PostgreSQL, authentication, testing, linting, formatting, etc.
The simplest way to run LLaMA on your local machine
Code and documentation to train Stanford's Alpaca models, and generate the data.