8000 albertoperdomo2 (alberto) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View albertoperdomo2's full-sized avatar
🎯
🎯

Block or report albertoperdomo2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1 Updated Jun 23, 2025

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 380 49 Updated Jun 27, 2025

Gateway API Inference Extension

Jupyter Notebook 363 108 Updated Jun 29, 2025

From the Transistor to the Web Browser, a rough outline for a 12 week course

6,238 484 Updated Oct 12, 2021

Simplified model deployment on llm-d

Go 24 11 Updated Jun 23, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 47,068 5,380 Updated Jun 29, 2025

Ollama Python library

Python 7,918 729 Updated Jun 18, 2025

Containerization is a Swift package for running Linux containers on macOS.

Swift 7,366 159 Updated Jun 27, 2025

A tool for creating and running Linux containers using lightweight virtual machines on a Mac. It is written in Swift, and optimized for Apple silicon.

Swift 16,111 313 Updated Jun 27, 2025

Everything we actually know about the Apple Neural Engine (ANE)

2,230 82 Updated Mar 7, 2025

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…

Python 19,642 1,917 Updated Jun 28, 2025

A repository to unravel the language of GPUs, making their kernel conversations easy to understand

Python 186 7 Updated Jun 1, 2025

llm-d is a Kubernetes-native high-performance distributed LLM inference framework

Makefile 1,284 97 Updated Jun 25, 2025

An AI Hedge Fund Team

Python 37,457 6,533 Updated Jun 29, 2025

LLM inference in C/C++

C++ 82,372 12,224 Updated Jun 30, 2025

A CNI meta-plugin for multi-homed pods in Kubernetes

Go 2,610 610 Updated Jun 5, 2025

mutatio.dev is an open source platform to systematically test, measure, and optimize LLM prompts.

TypeScript 2 Updated May 15, 2025

rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.

C++ 91 22 Updated Jun 27, 2025

A debugger for Linux

Rust 1,245 25 Updated Jun 9, 2025

A lightweight design for computation-communication overlap.

Cuda 144 5 Updated Jun 20, 2025

real time face swap and one-click video deepfake with only a single image

Python 71,396 10,210 Updated Jun 29, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 2,269 273 Updated Jun 30, 2025

A course of learning LLM inference serving on Apple Silicon for systems engineers.

Python 2,660 137 Updated Jun 14, 2025

A dynamic library providing Virtualization-based process isolation capabilities

Rust 1,173 101 Updated Jun 24, 2025

Accelerate LLM preference tuning via prefix sharing with a single line of code

Python 41 Updated Apr 30, 2025

BLAS routines written in Triton

Python 2 3 Updated Jun 25, 2025

macOS: mount any linux-supported filesystem read/write using NFS and a microVM

Rust 271 5 Updated Jun 29, 2025

FlatBuffers: Memory Efficient Serialization Library

C++ 24,389 3,353 Updated Jun 29, 2025

CLI stuff for Jira

Python 4 5 Updated Jun 23, 2025
Next
0