8000 kkwuthu (Xuang Wu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View kkwuthu's full-sized avatar
  • Tsinghua University

Highlights

  • Pro

Block or report kkwuthu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 1,048 67 Updated Jun 13, 2025

📚 从零开始的大语言模型原理与实践教程

3,209 241 Updated Jun 15, 2025

Github Pages template based upon HTML and Markdown for personal, portfolio-based websites.

HTML 14,455 43,707 Updated Jun 8, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,277 191 Updated Jun 4, 2025

Official inference framework for 1-bit LLMs

Python 20,063 1,510 Updated Jun 3, 2025

About Awesome things towards foundation agents. Papers / Repos / Blogs / ...

1,429 143 Updated Jun 8, 2025
Python 156 19 Updated Apr 9, 2025

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 1,742 123 Updated Jun 5, 2025

The Abstraction and Reasoning Corpus

JavaScript 4,429 664 Updated Apr 4, 2025

Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'

Python 205 22 Updated Dec 2, 2024

Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models

Python 170 21 Updated May 21, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,374 1,025 Updated Jun 11, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 26,665 2,586 Updated Apr 30, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,126 60 Updated Feb 25, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,355 158 Updated Mar 20, 2025

Genome modeling and design across all domains of life

Jupyter Notebook 2,857 310 Updated May 28, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,768 791 Updated Jun 13, 2025

这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优 8970 化,模型拥有1B参数,支持中英文。

Python 426 59 Updated Feb 18, 2025

Code for BLT research paper

Python 1,683 142 Updated May 22, 2025

Large Concept Models: Language modeling in a sentence representation space

Python 2,226 201 Updated Jan 29, 2025
Python 69 4 Updated Nov 19, 2024

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,687 193 Updated Jun 14, 2025

A More Fair and Comprehensive Comparison between KAN and MLP

Jupyter Notebook 168 11 Updated Aug 17, 2024

Pytorch Lightning入门中文教程,转载请注明来源。(当初是写着玩的,建议看完MNIST这个例子再上手)

Jupyter Notebook 217 19 Updated Dec 6, 2020

深度学习系统笔记,包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解。

Python 474 67 Updated Jun 6, 2025

阿里云盘命令行客户端,支持JavaScript插件,支持同步备份功能。

Go 4,642 373 Updated Apr 15, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 14,813 1,080 Updated Mar 17, 2025

An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.

Python 725 74 Updated Oct 20, 2024
Next
0