A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
Minimal GRPO implementation from scratch
Minimal reproduction of DeepSeek R1-Zero
📰 Must-read papers and blogs on LLM-based Long Context Modeling 🔥
A Python library that transfers PyTorch tensors between CPU and NVMe
Mini versions of GPT-2, Llama 3, etc., for pre-training
Everything about the SmolLM2 and SmolVLM family of models
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
OLMoE: Open Mixture-of-Experts Language Models
VideoSys: An easy and efficient system for video generation
Development repository for the Triton language and compiler
Latency and Memory Analysis of Transformer Models for Training and Inference
RTP: Rethinking Tensor Parallelism with Memory Deduplication
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
A validation and profiling tool for AI infrastructure
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)