Shanghai, China (UTC +08:00) · weigao266.github.io
LASP Public
Forked from OpenNLPLab/LASP: Linear Attention Sequence Parallelism (LASP)
Python · Updated Mar 13, 2025
Linear-MoE Public
Forked from OpenSparseLLMs/Linear-MoE
Megatron-LM Public
Forked from NVIDIA/Megatron-LM: Ongoing research training transformer models at scale
Python · Other · Updated Dec 12, 2024
LLaMA-MoE-v2 Public
Forked from OpenSparseLLMs/LLaMA-MoE-v2: LLaMA-MoE v2: Exploring Sparsity of LLaMA from the Perspective of Mixture-of-Experts with Post-Training
Python · Apache License 2.0 · Updated Nov 26, 2024
DeepSpeed-LASP Public
Forked from deepspeedai/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python · Apache License 2.0 · Updated May 23, 2024
fairscale-CO2 Public
The Fairscale framework with CO2 integrated.
Python · MIT License · Updated Apr 29, 2024
fairseq-CO2 Public
Forked from facebookresearch/fairseq: Example of using CO2 within Fairseq.
Python · MIT License · Updated Apr 28, 2024
ring-attention-pytorch Public
Forked from lucidrains/ring-attention-pytorch: Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch
Python · MIT License · Updated Apr 17, 2024
TNL-MoE Public
Forked from pjlab-sys4nlp/llama-moe: TNL-MoE: Building Mixture-of-Experts from TransNormerLLM (TNL) with Continual Pre-training
Python · Apache License 2.0 · Updated Feb 29, 2024
DataDriven-POPF Public
Data-driven probabilistic optimal power flow (POPF) using probabilistic methods
Hard Public
Exercise code for "Learn Python the Hard Way"
Python · Apache License 2.0 · Updated Mar 29, 2020