8000 Shwai-He (shwaihe) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Shwai-He's full-sized avatar

Block or report Shwai-He

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling

Python 354 14 Updated May 17, 2025

MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.

Python 11 1 Updated Apr 8, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,044 235 Updated May 28, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,527 285 Updated May 29, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 5,420 579 Updated May 29, 2025

A curated list for Efficient Large Language Models

Python 1,689 135 Updated Apr 23, 2025

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 432 39 Updated Feb 1, 2024

The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"

Python 720 44 Updated May 13, 2025

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 1,662 120 Updated May 26, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,719 779 Updated May 28, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,575 837 Updated Apr 29, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 48,507 7,668 Updated May 29, 2025

Retrieval and Retrieval-augmented LLMs

Python 9,785 716 Updated May 28, 2025

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 39,595 3,129 Updated May 29, 2025

Fast State-of-the-Art Static Embeddings

Python 1,689 88 Updated May 29, 2025

Fully open reproduction of DeepSeek-R1

Python 24,610 2,271 Updated May 28, 2025

Code, documentation, and discussion around the MIMIC-CXR database

Jupyter Notebook 279 58 Updated Jul 13, 2020

CryptoNets is a demonstration of the use of Neural-Networks over data encrypted with Homomorphic Encryption. Homomorphic Encryptions allow performing operations such as addition and multiplication …

C# 292 75 Updated Jul 16, 2024

LLM training code for Databricks foundation models

Python 4,246 561 Updated May 29, 2025
Python 87 18 Updated May 27, 2020

A framework for the evaluation of autoregressive code generation language models.

Python 946 243 Updated Oct 31, 2024

A framework for few-shot evaluation of language models.

Python 2 Updated Apr 21, 2024

AlphaFold 3 inference pipeline.

Python 6,527 823 Updated May 27, 2025

The missing star history graph of GitHub repos - https://star-history.com

TypeScript 7,403 286 Updated May 28, 2025

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.

411 18 Updated May 28, 2025

Awesome LLM compression research papers and tools.

1 Updated Nov 5, 2024

A simple and effective LLM pruning approach.

Python 751 104 Updated Aug 9, 2024
Next
0