Stars
[ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
[RA-L 2025, accepted without revision] A stereo visual-inertial odometry system based on a voxel map
The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[ICCV 2023] You Only Look at One Partial Sequence
A Unified Driving World Model for Future Generation and Perception
Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.
Official Repository for "HydraViT: Stacking Heads for a Scalable ViT" (NeurIPS'24)
MoBA: Mixture of Block Attention for Long-Context LLMs
[ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning
PyTorch implementation of the PEER block from the paper "Mixture of A Million Experts" by Xu Owen He at DeepMind
This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity"
Implementation of the conditionally routed attention from the CoLT5 architecture, in PyTorch
A partial Chinese translation of the book "Mathematics for Machine Learning".
Companion webpage to the book "Mathematics For Machine Learning"
⚡️Optimizing einsum functions in NumPy, TensorFlow, Dask, and more with contraction order optimization (a usage sketch follows this list).
Fast and memory-efficient exact attention
Development repository for the Triton language and compiler
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
[CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving
[AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention
(CVPR 2023 / TPAMI 2024) Integrally Pre-Trained Transformer Pyramid Networks: A Hierarchical Vision Transformer for Masked Image Modeling
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsely activated memory layers complement compute-heavy dense feed-forward layers (see the sketch after this list).
[CVPR 2025] MINIMA: Modality Invariant Image Matching
This project aims to share the technical principles behind large language models along with hands-on experience (LLM engineering and real-world LLM application deployment).
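For the einsum-optimization item above, a minimal usage sketch assuming the standard `opt_einsum` Python API (`contract`, `contract_path`); the array shapes are illustrative only:

```python
import numpy as np
import opt_einsum as oe

a = np.random.rand(32, 64)
b = np.random.rand(64, 128)
c = np.random.rand(128, 16)

# Equivalent to np.einsum('ij,jk,kl->il', a, b, c), but opt_einsum first
# chooses a cheap pairwise contraction order instead of evaluating the
# chain naively from left to right.
out = oe.contract('ij,jk,kl->il', a, b, c)

# Inspect the chosen contraction path and its estimated FLOP cost.
path, info = oe.contract_path('ij,jk,kl->il', a, b, c)
print(info)
```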
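For the memory-layers item above, a minimal PyTorch sketch of the idea, not the repo's implementation: `SimpleMemoryLayer`, the slot count, and `top_k` are all hypothetical, and this naive version scores every slot for clarity, whereas real memory layers use a product-key index to keep the lookup itself cheap.

```python
import torch
import torch.nn as nn

class SimpleMemoryLayer(nn.Module):
    """Trainable key-value lookup: many stored parameters, few active per token."""
    def __init__(self, dim: int, num_slots: int = 4096, top_k: int = 4):
        super().__init__()
        # The parameter count grows with num_slots, independent of top_k.
        self.keys = nn.Parameter(torch.randn(num_slots, dim) * dim ** -0.5)
        self.values = nn.Parameter(torch.randn(num_slots, dim) * dim ** -0.5)
        self.top_k = top_k

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, dim). Score all slots (a product-key index would
        # avoid this full scoring), then keep only the top-k slots per
        # token, so only k value rows are mixed into the output.
        scores = x @ self.keys.t()                    # (b, s, num_slots)
        topv, topi = scores.topk(self.top_k, dim=-1)  # (b, s, k)
        gates = topv.softmax(dim=-1)                  # sparse gating weights
        picked = self.values[topi]                    # (b, s, k, dim)
        return (gates.unsqueeze(-1) * picked).sum(dim=-2)

x = torch.randn(2, 16, 512)
print(SimpleMemoryLayer(512)(x).shape)  # torch.Size([2, 16, 512])
```

The point of the k-sparse lookup is that capacity scales with the number of slots while only k value vectors contribute per token; with a product-key index the scoring cost also becomes sublinear in the slot count, which is what lets memory layers add parameters without a matching FLOP increase.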