8000 xinjinhan (Jinhan Xin) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View xinjinhan's full-sized avatar
  • HUAWEI
  • Shenzhen, China

Block or report xinjinhan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

FlashInfer: Kernel Library for LLM Serving

Cuda 3,173 332 Updated Jun 13, 2025

a lightweight LLM model inference framework

C++ 730 93 Updated Apr 7, 2024

Community maintained hardware plugin for vLLM on Ascend

Python 758 196 Updated Jun 15, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,598 845 Updated Apr 29, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 49,670 7,996 Updated Jun 15, 2025

The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning

Python 29 3 Updated Jul 16, 2023

[ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation

Python 171 16 Updated Mar 1, 2024

Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.

Python 383 20 Updated Feb 12, 2024

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 45,999 6,978 Updated Jun 14, 2025

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Python 9,668 561 Updated Sep 7, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 145,622 29,352 Updated Jun 15, 2025

Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.

Java 257 58 Updated May 12, 2024

NoSQL Redis and Memcache traffic generation and benchmarking tool.

C++ 971 234 Updated May 28, 2025

cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 1,773 427 Updated Apr 11, 2025

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

Java 955 397 Updated Jun 13, 2025

Apache Kyuubi is a distributed and multi-tenant g 97F7 ateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,203 943 Updated Jun 13, 2025

🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统

C++ 3,453 457 Updated Jun 13, 2025
Scala 6 3 Updated May 19, 2023

Patched version of dbgen

C 28 28 Updated Feb 25, 2024

TPC-DS benchmark kit with some modifications/fixes

C 98 74 Updated Aug 13, 2024

HiBench is a big data benchmark suite.

Java 1,474 771 Updated Dec 10, 2024

PMM dashboards for database monitoring

JavaScript 2,799 1,550 Updated Jun 13, 2025

The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many mo…

TypeScript 68,472 12,696 Updated Jun 15, 2025

The Prometheus monitoring system and time series database.

Go 58,977 9,592 Updated Jun 13, 2025

Nightingale for monitoring and alerting, just as Grafana for visualization.

Go 10,994 1,528 Updated Jun 13, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,078 1,847 Updated Jun 14, 2025

A prototype implementation of Bao for PostgreSQL

C 198 56 Updated Sep 17, 2024

Flink 中文视频课程(持续更新...)

1 Updated Jun 18, 2020
Next
0