8000 Sengxian (Aohan Zeng) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Sengxian's full-sized avatar

Highlights

  • Pro

Block or report Sengxian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 351 16 Updated May 13, 2025

DeepEP: an efficient expert-parallel communication library 8000

Cuda 7,638 764 Updated May 12, 2025

Complex Function Calling Benchmark.

Python 100 11 Updated Jan 20, 2025

GLM-4-Voice | 端到端中英语音对话模型

Python 2,908 245 Updated Dec 5, 2024

Node.js + JavaScript reference client for the Realtime API (beta)

JavaScript 947 280 Updated Nov 7, 2024

CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.

Python 1,948 167 Updated Aug 25, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,247 243 Updated May 14, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 6,544 551 Updated Apr 19, 2025

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 819 66 Updated Sep 4, 2024

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Python 1,434 101 Updated Oct 31, 2023

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 18,263 1,495 Updated Apr 29, 2025

CodeGeeX2: A More Powerful Multilingual Code Generation Model

Python 7,618 528 Updated Jul 10, 2024

Fast and memory-efficient exact attention

Python 17,349 1,680 Updated May 8, 2025

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,739 1,843 Updated Jun 27, 2024

🩺 首个会看胸部X光片的中文多模态医学大模型 | The first Chinese Medical Multimodal Model that Chest Radiographs Summarization.

Python 986 133 Updated Nov 20, 2024

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Python 4,153 424 Updated Aug 23, 2024

A new markup-based typesetting system that is powerful and easy to learn.

Rust 40,312 1,099 Updated May 12, 2025

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 16,126 2,704 Updated Dec 18, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 41,045 5,222 Updated Jun 27, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,407 420 Updated May 14, 2025

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,682 606 Updated Jul 25, 2023

Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥

Python 1,307 119 Updated Dec 1, 2023

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,402 173 Updated Jul 12, 2024

Wireguard client that exposes itself as a socks5 proxy

Go 4,923 303 Updated Apr 16, 2025

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,633 377 Updated Apr 1, 2025

python3实现互信息和左右熵的新词发现

Python 592 165 Updated Aug 1, 2019

速度更快、效果更好的中文新词发现

Python 511 102 Updated Mar 15, 2024

🗂️A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs. / 一个支持多存储的文件列表/WebDAV程序,使用 Gin 和 Solidjs。

Go 49,247 6,677 Updated May 11, 2025

VDI Stream Client is a very tiny, low latency and GPU accelerated client to connect to Windows running Parsec Host.

C 127 9 Updated Feb 22, 2022

Live streaming player, iOS+Android, RTMP/HTTP-FLV/HLS/WebRTC, by Flutter+SRS.

JavaScript 355 108 Updated May 10, 2024
Next
0