EricLBuehler (Eric Buehler) / Starred · GitHub

Organizations

@huggingface


A Datacenter Scale Distributed Inference Serving Framework

Rust 4,025 359 Updated May 16, 2025

Code for BLT research paper

Python 1,644 132 Updated May 15, 2025

Rust Workspace Bootstrapper

Rust 10 Updated May 14, 2025

Transformers provides a simple, intuitive interface for Rust developers who want to work with Large Language Models locally, powered by the Candle crate. It offers an API inspired by Python's Trans…

Rust 16 1 Updated May 13, 2025

Exploration work on executing CUDA kernels on Apple Silicon (Metal-compatible code).

Rust 29 1 Updated Apr 27, 2025
Rust 5 1 Updated May 14, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 2,553 188 Updated May 16, 2025
Rust 36 1 Updated May 11, 2025

Inference engine for GLiNER models, in Rust

Rust 58 9 Updated Mar 30, 2025

DFloat11: Lossless LLM Compression for Efficient GPU Inference

Python 352 20 Updated May 6, 2025

Official inference framework for 1-bit LLMs

C++ 19,533 1,451 Updated May 16, 2025

A powerful validation library for Rust

Rust 687 34 Updated Jan 19, 2025

Experimental compiler for deep learning models

Rust 67 2 Updated Apr 15, 2025
Rust 4 Updated Feb 27, 2025

👷 Build compute kernels

Nix 39 8 Updated May 13, 2025

Load compute kernels from the Hub

Python 119 6 Updated May 16, 2025
Go 4 Updated Apr 2, 2025

Fast, Lightweight, Unified Engine for Text2Image Diffusion Models

C++ 20 3 Updated Apr 13, 2025

Rust standalone inference of Namo-500M series models. Extremely tiny, running VLM on CPU.

Rust 24 Updated Mar 12, 2025
Dockerfile 2 Updated Oct 14, 2024

Model Context Protocol (MCP) implementation in Rust

Rust 308 21 Updated Mar 21, 2025

GPU based FFT written in Rust and CubeCL

Rust 22 Updated Mar 13, 2025

A modular diffusion pipeline for synthesis of post-treatment glioma MR images.

Jupyter Notebook 3 Updated May 14, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,658 768 Updated May 12, 2025

Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning

Jupyter Notebook 215 30 Updated Feb 24, 2025

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

Zig 2,262 80 Updated May 16, 2025

Open-source framework that builds customized & randomized JSON files for use in endpoint load testing.

C++ 55 9 Updated Feb 15, 2025

Rust bindings to LLVM. (Mirror of https://gitlab.com/taricorp/llvm-sys.rs/)

Rust 197 41 Updated Apr 28, 2025
Rust 8 Updated Mar 5, 2025

Code to make working with CUDA, via the CUDARC lib, easier.

Rust 7 Updated May 13, 2025