yuguo yuguo-Jack

😮

2 followers · 2 following

Sugon
ZhengZhou
23:44 (UTC +08:00)
yuguo960516@outlook.com

TransformerEngine Public
Forked from NVIDIA/TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python Apache License 2.0 Updated Mar 17, 2025
Pai-Megatron-Patch Public
Forked from alibaba/Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python Apache License 2.0 Updated Feb 10, 2025
Megatron-LM Public
Forked from NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

Python Other Updated Jan 23, 2025
sglang Public
Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python Apache License 2.0 Updated Nov 6, 2024
flux Public
Forked from bytedance/flux

A fast communication-overlapping library for tensor parallelism on GPUs.

C++ Apache License 2.0 Updated Oct 30, 2024
flashinfer Public
Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda Apache License 2.0 Updated Sep 10, 2024
FastDeploy Public
Forked from PaddlePaddle/FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end…

C++ Apache License 2.0 Updated Aug 30, 2024
Paddle Public
Forked from PaddlePaddle/Paddle

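PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (the core "PaddlePaddle" framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning & machine learning)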

C++ Apache License 2.0 Updated Jul 25, 2024
oneflow-hip Public
Forked from Oneflow-Inc/oneflow-hip

C++ Apache License 2.0 Updated Jul 18, 2024
flash-attention Public
Forked from PaddlePaddle/flash-attention

Fast and memory-efficient exact attention

C++ BSD 3-Clause "New" or "Revised" License Updated Jun 25, 2024
PaddleNLP Public
Forked from PaddlePaddle/PaddleNLP

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…

Python Updated Jun 18, 2024
flash-attention-hip Public

Flash Attention 2 C API for Paddle-ROCM

C++ 1 1 BSD 3-Clause "New" or "Revised" License Updated Jun 18, 2024
composable_kernel Public
Forked from ROCm/composable_kernel

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

C++ Other Updated Dec 7, 2023
hipBLASLt Public
Forked from ROCm/hipBLASLt

hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library

Assembly 2 MIT License Updated Dec 1, 2023
ChatGLM-6B-in-DeepSpeed-Chat Public

ChatGLM-6B in DeepSpeed-Chat for DCU

Python 8 Apache License 2.0 Updated Sep 14, 2023
GLM-Pretrain-in-Megatron-DeepSpeed Public

GLM-Pretrain in Megatron-Deepspeed for DCU

Python 8 1 Apache License 2.0 Updated Aug 31, 2023
VkFFT Public
Forked from DTolm/VkFFT

Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library

C++ MIT License Updated Aug 22, 2023
Tensile Public
Forked from ROCm/Tensile

Stretching GPU performance for GEMMs and tensor contractions.

Python MIT License Updated Aug 22, 2023
docs Public
Forked from PaddlePaddle/docs

Documentations for PaddlePaddle

Python Apache License 2.0 Updated Jul 18, 2023
oneflow Public
Forked from Oneflow-Inc/oneflow

OneFlow is a performance-centered and open-source deep learning framework.

C++ Apache License 2.0 Updated Jul 18, 2023
