10000 spectrometerHBH (Bohan Hou) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View spectrometerHBH's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@mlc-ai

Block or report spectrometerHBH

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

better `adb shell`

Shell 192 6 Updated Apr 15, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 3,266 357 Updated Jun 30, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 83,964 61,081 Updated Jun 30, 2025

Easy-to-use headless React Hooks to run LLMs in the browser with WebGPU. Just useLLM().

TypeScript 692 30 Updated Jun 27, 2023

🗣️ Chat with LLM like Vicuna totally in your browser with WebGPU, safely, privately, and with no server. Powered by web llm.

JavaScript 633 47 Updated Aug 8, 2024

Universal LLM Deployment Engine with ML Compilation

Python 20,881 1,757 Updated Jun 25, 2025

High-performance In-browser LLM Inference Engine

TypeScript 15,798 1,036 Updated May 5, 2025

Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.

Jupyter Notebook 3,670 232 Updated Mar 12, 2024

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,805 239 Updated Jun 28, 2025

Transformer related optimization, including BERT, GPT

C++ 6,222 909 Updated Mar 27, 2024

Development repository for the Triton language and compiler

MLIR 15,991 2,078 Updated Jun 30, 2025

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,650 381 Updated Apr 1, 2025

The Legion Parallel Programming System

C++ 727 153 Updated Jun 9, 2025

A collection of resources and papers on Diffusion Models

HTML 11,842 987 Updated Aug 1, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 32,658 3,072 Updated Jun 30, 2025
Python 611 66 Updated Jun 4, 2024
Jupyter Notebook 207 69 Updated Nov 22, 2024

Training and serving large-scale neural networks with auto parallelization.

Python 3,137 355 Updated Dec 9, 2023
Python 420 48 Updated Oct 16, 2024
Python 40 7 Updated Mar 31, 2022

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 91,121 24,554 Updated Jun 30, 2025

An Open Source Machine Learning Framework for Everyone

C++ 190,525 74,731 Updated Jun 30, 2025
Python 194 56 Updated Mar 28, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 39,148 4,445 Updated Jun 30, 2025

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

C 6,842 1,570 Updated Jun 29, 2025

The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs

C++ 1,305 195 Updated Apr 14, 2025

中文的C++ Template的教学指南。与知名书籍C++ Templates不同,该系列教程将C++ Templates作为一门图灵完备的语言来讲授,以求帮助读者对Meta-Programming融会贯通。(正在施工中)

C++ 10,254 1,612 Updated Aug 20, 2024

xv6 OS

C 8,574 4,225 Updated Aug 13, 2024
Next
0