coderonion

coderonion

235 followers · 1.8k following

Achievements

x2 x3 x2

Achievements

x2 x3 x2

KuiperLLama Public
Forked from zjhellofss/KuiperLLama

校招、秋招、春招、实习好项目，带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。

C++ 1 Updated Jun 11, 2025
lite_llama Public
Forked from harleyszhang/lite_llama

A light llama-like llm inference framework based on the triton kernel.

Python 1 Updated Jun 8, 2025
ai-infra-hpc Public
Forked from jinbooooom/ai-infra-hpc

hpc 教程，包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等

Cuda 1 MIT License Updated Jun 5, 2025
awesome-yolo-object-detection Public

🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.

gui cuda yolo llama object-detection datasets vlm

1,513 208 Updated May 31, 2025
awesome-cuda-and-hpc Public

🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.

awesome hpc gpu cuda pytorch cublas triton

288 31 Updated May 31, 2025
LeetCUDA Public
Forked from xlite-dev/LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

Cuda 1 GNU General Public License v3.0 Updated May 28, 2025
triton_course Public
Forked from zjhellofss/triton_course

Python 1 Updated May 11, 2025
awesome-llm-and-aigc Public

🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applic…

reinforcement-learning cuda yolo triton awesome-list llama gpt

709 62 Updated May 3, 2025
Qwen3 Public
Forked from QwenLM/Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 4 Updated Apr 30, 2025
TensorRT-YOLO Public
Forked from laugh12321/TensorRT-YOLO

🚀 Easier & Faster YOLO Deployment Toolkit for NVIDIA 🛠️

C++ 1 GNU General Public License v3.0 Updated Apr 17, 2025
SageAttention Public
Forked from thu-ml/SageAttention

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.

Cuda 1 Apache License 2.0 Updated Apr 15, 2025
VLM-R1 Public
Forked from om-ai-lab/VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 1 Apache License 2.0 Updated Apr 9, 2025
MAYE Public
Forked from GAIR-NLP/MAYE

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Python 1 Updated Apr 9, 2025
Video-R1 Public
Forked from tulerfeng/Video-R1

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 1 Updated Apr 8, 2025
awesome-ai4science Public

This repository lists some awesome public projects about AI4Science.

science math vla vlm ai4science llm qwen

2 Updated Apr 5, 2025
yoloe Public
Forked from THU-MIG/yoloe

YOLOE: Real-Time Seeing Anything

Python 1 GNU Affero General Public License v3.0 Updated Mar 21, 2025
Visual-RFT Public
Forked from Liuziyu77/Visual-RFT

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1 Apache License 2.0 Updated Mar 19, 2025
simplegemm Public
Forked from bertmaher/simplegemm

Cuda 1 MIT License Updated Mar 17, 2025
chitu Public
Forked from thu-pacman/chitu

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1 Apache License 2.0 Updated Mar 15, 2025
coderonion Public

1 Updated Mar 13, 2025
OpenManus Public
Forked from FoundationAgents/OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 1 MIT License Updated Mar 7, 2025
fast.cu Public
Forked from pranjalssh/fast.cu

Fastest kernels written from scratch

Cuda 1 MIT License Updated Mar 7, 2025
VisualThinker-R1-Zero Public
Forked from turningpoint-ai/VisualThinker-R1-Zero

Explore the Multimodal “Aha Moment” on 2B Model

Python 1 Updated Mar 1, 2025
DeepGEMM Public
Forked from deepseek-ai/DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 1 MIT License Updated Feb 27, 2025
FlashMLA Public
Forked from deepseek-ai/FlashMLA

FlashMLA: Efficient MLA Decoding Kernel for Hopper GPUs

C++ 1 MIT License Updated Feb 27, 2025
awesome-deepseek-integration Public
Forked from deepseek-ai/awesome-deepseek-integration

1 Creative Commons Zero v1.0 Universal Updated Feb 21, 2025
X-AnyLabeling Public
Forked from CVHub520/X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 1 GNU General Public License v3.0 Updated Feb 19, 2025
edgeyolo Public
Forked from LSH9832/edgeyolo

an edge-real-time anchor-free object detector with decent performance

Python 1 Apache License 2.0 Updated Feb 19, 2025
TensorRT-Model-Optimizer Public
Forked from NVIDIA/TensorRT-Model-Optimizer

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…

Python 1 Other Updated Feb 19, 2025
NuMojo Public
Forked from Mojo-Numerics-and-Algorithms-group/NuMojo

NuMojo is a library for numerical computing in Mojo 🔥 similar to numpy in Python.

Mojo 2 Apache License 2.0 Updated Feb 15, 2025

coderonion

Achievements

Achievements

KuiperLLama Public

Uh oh!

lite_llama Public

Uh oh!

ai-infra-hpc Public

Uh oh!

awesome-yolo-object-detection Public

Uh oh!

awesome-cuda-and-hpc Public

Uh oh!

LeetCUDA Public

Uh oh!

triton_course Public

Uh oh!

awesome-llm-and-aigc Public

Uh oh!

Qwen3 Public

Uh oh!

TensorRT-YOLO Public

Uh oh!

SageAttention Public

Uh oh!

VLM-R1 Public

Uh oh!

MAYE Public

Uh oh!

Video-R1 Public

Uh oh!

awesome-ai4science Public

Uh oh!

yoloe Public

Uh oh!

Visual-RFT Public

Uh oh!

simplegemm Public

Uh oh!

chitu Public

Uh oh!

coderonion Public

Uh oh!

OpenManus Public

Uh oh!

fast.cu Public

Uh oh!

VisualThinker-R1-Zero Public

Uh oh!

DeepGEMM Public

Uh oh!

FlashMLA Public

Uh oh!

awesome-deepseek-integration Public

Uh oh!

X-AnyLabeling Public

Uh oh!

edgeyolo Public

Uh oh!

TensorRT-Model-Optimizer Public

Uh oh!

NuMojo Public

Uh oh!