- SNUCSE 18
- Seoul, South Korea
Starred repositories
Accommodating Large Language Model Training over Heterogeneous Environment.
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
Latency and Memory Analysis of Transformer Models for Training and Inference
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
This repository is established to store personal notes and annotated papers during daily research.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
[ATC '24] Metis: Fast automatic distributed training on heterogeneous GPUs (https://www.usenix.org/conference/atc24/presentation/um)
🔥Highlighting the top ML papers every week.
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its applications
Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.
Training and serving large-scale neural networks with auto parallelization.
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
Hands-On GPU Programming with Python and CUDA, published by Packt
2024-2-CID-TEAM-A / nntrainer
Forked from nnstreamer/nntrainer
NNtrainer is a software framework for training neural network models on devices.
Large Language Model (LLM) Systems Paper List
📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism etc.
Study parallel programming - CUDA, OpenMP, MPI, Pthread
Transformer: PyTorch Implementation of "Attention Is All You Need"
Implementation of TDConvED for video captioning
RoboGrammar: Graph Grammar for Terrain-Optimized Robot Design (SIGGRAPH Asia 2020)
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.