- DCST, Tsinghua University
- Beijing, China
- https://blog.sengxian.com/
Stars
A Distributed Attention Mechanism Towards Linear Scalability for Ultra-Long-Context, Heterogeneous Data Training
DeepEP: an efficient expert-parallel communication library
Node.js + JavaScript reference client for the Realtime API (beta)
CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
GLM-4 series: Open Multilingual Multimodal Chat LMs
FP16xINT4 LLM inference kernel that achieves near-ideal ~4x speedups at small-to-medium batch sizes of 16-32 tokens.
AgentTuning: Enabling Generalized Agent Abilities for LLMs
The official repo of Qwen (通义千问), the chat and pretrained large language models proposed by Alibaba Cloud.
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Fast and memory-efficient exact attention
ChatGLM2-6B: An Open Bilingual Chat LLM
🩺 The first Chinese multimodal medical LLM that can read chest X-rays (chest radiograph summarization).
Chinese and English multimodal conversational language model
A new markup-based typesetting system that is powerful and easy to learn.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
WireGuard client that exposes itself as a SOCKS5 proxy
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
🗂️ A file list/WebDAV program that supports multiple storage backends, powered by Gin and Solidjs.
VDI Stream Client is a tiny, low-latency, GPU-accelerated client for connecting to a Windows machine running a Parsec host.
Live-streaming player for iOS and Android supporting RTMP/HTTP-FLV/HLS/WebRTC, built with Flutter and SRS.