NVJiangShao StudyingShao

😅

8 followers · 7 following

NVIDIA

Stars

StudyingShao / cutlass

Forked from NVIDIA/cutlass

CUDA Templates for Linear Algebra Subroutines

C++ 2 Updated Apr 1, 2025

NVIDIA / cutlass

CUDA Templates for Linear Algebra Subroutines

C++ 7,654 1,258 Updated Jun 7, 2025

fanshiqing / moe_grouped_gemm

A PyTorch Toolbox for Grouped GEMM in MoE Model Training

5 1 Updated May 28, 2024

NVIDIA / TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,464 428 Updated Jun 7, 2025

triton-inference-server / tensorrtllm_backend

The Triton TensorRT-LLM Backend

Shell 846 122 Updated Jun 5, 2025

fanshiqing / grouped_gemm

Forked from tgale96/grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 125 39 Updated Jan 2, 2025

StudyingShao / TensorRT-LLM

Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 3 Updated May 19, 2025

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…

C++ 10,679 1,481 Updated Jun 8, 2025

pyscf / gpu4pyscf

A plugin to use Nvidia GPU in PySCF package

Cuda 204 37 Updated Jun 7, 2025

RayTracing / raytracing.github.io

Main Web Site (Online Books)

HTML 9,517 923 Updated Apr 28, 2025

Tencent / secguide

A code security guide compiled for developers

13,447 1,947 Updated Mar 20, 2023
