-
KuiperLLama Public
Forked from zjhellofss/KuiperLLama校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。
-
lite_llama Public
Forked from harleyszhang/lite_llamaA light llama-like llm inference framework based on the triton kernel.
-
ai-infra-hpc Public
Forked from jinbooooom/ai-infra-hpchpc 教程,包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等
-
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.
-
awesome-cuda-and-hpc Public
🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.
-
LeetCUDA Public
Forked from xlite-dev/LeetCUDA📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥
-
-
awesome-llm-and-aigc Public
🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applic…
-
Qwen3 Public
Forked from QwenLM/Qwen3Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
-
TensorRT-YOLO Public
Forked from laugh12321/TensorRT-YOLO🚀 Easier & Faster YOLO Deployment Toolkit for NVIDIA 🛠️
-
SageAttention Public
Forked from thu-ml/SageAttentionQuantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.
-
VLM-R1 Public
Forked from om-ai-lab/VLM-R1Solve Visual Understanding with Reinforced VLMs
-
MAYE Public
Forked from GAIR-NLP/MAYERethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme
-
Video-R1 Public
Forked from tulerfeng/Video-R1Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]
-
awesome-ai4science Public
This repository lists some awesome public projects about AI4Science.
2 UpdatedApr 5, 2025 -
yoloe Public
Forked from THU-MIG/yoloeYOLOE: Real-Time Seeing Anything
-
Visual-RFT Public
Forked from Liuziyu77/Visual-RFTOfficial repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’
-
-
chitu Public
Forked from thu-pacman/chituHigh-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
-
-
OpenManus Public
Forked from FoundationAgents/OpenManusNo fortress, purely open ground. OpenManus is Coming.
-
fast.cu Public
Forked from pranjalssh/fast.cuFastest kernels written from scratch
-
VisualThinker-R1-Zero Public
Forked from turningpoint-ai/VisualThinker-R1-ZeroExplore the Multimodal “Aha Moment” on 2B Model
-
DeepGEMM Public
Forked from deepseek-ai/DeepGEMMDeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
-
FlashMLA Public
Forked from deepseek-ai/FlashMLAFlashMLA: Efficient MLA Decoding Kernel for Hopper GPUs
-
awesome-deepseek-integration Public
Forked from deepseek-ai/awesome-deepseek-integration -
X-AnyLabeling Public
Forked from CVHub520/X-AnyLabelingEffortless data labeling with AI support from Segment Anything and other awesome models.
-
edgeyolo Public
Forked from LSH9832/edgeyoloan edge-real-time anchor-free object detector with decent performance
-
TensorRT-Model-Optimizer Public
Forked from NVIDIA/TensorRT-Model-OptimizerTensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…
-
NuMojo Public
Forked from Mojo-Numerics-and-Algorithms-group/NuMojoNuMojo is a library for numerical computing in Mojo 🔥 similar to numpy in Python.