junrushao (Junru Shao) / Starred · GitHub

Organizations

@apache @dmlc @tlc-pack @mlc-ai

C++ 35 7 Updated May 25, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

C++ 1,300 103 Updated Jun 19, 2025

A responsive, good-looking documentation theme for Sphinx with a modern design and strong support for many Sphinx extensions.

CSS 228 11 Updated Jun 11, 2025

Tile primitives for speedy kernels

Cuda 2,461 157 Updated Jun 18, 2025

🚀 Kick-start your C++! A template for modern C++ projects using CMake, CI, code coverage, clang-format, reproducible dependency management and much more.

CMake 4,873 425 Updated Mar 12, 2025

AI Assistant running within your browser.

TypeScript 68 14 Updated Dec 3, 2024

An Easy-to-understand TensorOp Matmul Tutorial

C++ 364 47 Updated Sep 21, 2024

An extension of TVMScript for writing simple, high-performance GPU kernels with Tensor Cores.

Python 50 2 Updated Jul 23, 2024
Python 541 45 Updated Oct 29, 2024

MLC-LLM fork of oobabooga/text-generation-webui

Python 1 Updated Oct 17, 2023

Vercel and web-llm template to run wasm models directly in the browser.

TypeScript 152 21 Updated Nov 21, 2023

OpenGVLab / OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 818 64 Updated May 22, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 3,211 343 Updated Jun 19, 2025

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Python 11 1 Updated Aug 19, 2023

Neovim plugin for interacting with LLMs and building editor-integrated prompts.

Lua 375 27 Updated Jun 8, 2025

LLM powered development for Neovim

Lua 1,029 56 Updated Jan 9, 2025

Serving multiple LoRA fine-tuned LLMs as one

Python 1,066 52 Updated May 8, 2024
Python 7 Updated Sep 13, 2023
C++ 62 20 Updated Dec 18, 2024

Run Large Language Models on RK3588 with GPU-acceleration

105 5 Updated Aug 16, 2023

Structured inference with Llama 2 in your browser

TypeScript 52 2 Updated Nov 1, 2024

The documents for TVM Unity

Shell 8 2 Updated Aug 9, 2024

ad-llama demo with Vite

HTML 4 Updated Aug 3, 2023

Python interface for the MLC chat CLI

Python 15 1 Updated May 7, 2023

My personal list of Neovim plugins

HTML 754 9 Updated Jun 19, 2025

Status column plugin that provides a configurable 'statuscolumn' and click handlers.

Lua 578 27 Updated Jun 2, 2025

NVIDIA CUPTI samples mirror.

Shell 7 3 Updated Jun 7, 2025

A template for modern C++ projects using CMake, Clang-Format, CI, unit testing and more, with support for downstream inclusion.

CMake 1,820 221 Updated Mar 16, 2024