- Seekr Technologies LLC
- Tempe, AZ
- @rvoleti89
Stars
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
This recipe is dedicated to helping you make the best possible pizza dough for Neapolitan pizza.
A Git-compatible VCS that is both simple and powerful
A high-throughput and memory-efficient inference and serving engine for LLMs
HabanaAI / vllm-fork
Forked from vllm-project/vllm. A high-throughput and memory-efficient inference and serving engine for LLMs
This project upgrades a Gaggia espresso machine with smart controls to improve your coffee-making experience. By adding a display and custom electronics, you can monitor and control the machine mor…
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.
Machine Learning and Computer Vision Engineer - Technical Interview Questions
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
Android app to improve Xbox Cloud Gaming (xCloud) and Remote Play experiences
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
Comparison of Language Model Inference Engines
Grokking the Coding Interview: Patterns for Coding Questions Alternative
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
A command-line productivity tool, powered by AI large language models like GPT-4, that helps you accomplish your tasks faster and more efficiently.
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
Enabling easy statistical significance testing for deep neural networks.
Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.
✨✨ Latest Advances on Multimodal Large Language Models
🦜🔗 Build context-aware reasoning applications