10000 0ct0cat (test) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View 0ct0cat's full-sized avatar
:octocat:
:octocat:

Block or report 0ct0cat

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code release for book "Efficient Training in PyTorch"

Python 70 13 Updated Apr 10, 2025

A tutorial for CUDA&PyTorch

C++ 147 28 Updated Jan 21, 2025

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

15,361 1,466 Updated Feb 13, 2023

RDMA core userspace libraries and daemons

C 1,841 756 Updated Jun 24, 2025

Experimental GStreamer plugin for encrypting / decrypting H264 streams with AES

C 7 1 Updated Sep 7, 2024

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,140 160 Updated Jun 5, 2025

Curated list of datasets and tools for post-training.

3,209 270 Updated Jan 29, 2025

[NeurIPS 2020] Official code for the paper "DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation". Includes a PyTorch library for deep learning with SVG data.

Jupyter Notebook 1,081 107 Updated Aug 26, 2024

(CVPR 2025) Code of "Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models"

Python 160 14 Updated Apr 2, 2025

A benchmark dataset for evaluating LLM's SVG editing capabilities

Python 33 4 Updated Oct 17, 2024

通义千问的DPO训练

Jupyter Notebook 49 4 Updated Sep 21, 2024

cnn

Python 135 23 Updated Sep 8, 2019

A collection of modern/faster/saner alternatives to common unix commands.

32,087 803 Updated Sep 10, 2024

PyTorch distributed training from scratch (for educational purposes only)

Python 14 2 Updated Apr 12, 2025

《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。

Jupyter Notebook 3,730 412 Updated Jan 27, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 15,593 2,224 Updated Jul 1, 2025

deepstream_tools will serve as a parent repo to hold various tools to be released for DeepStream SDK.

Python 11 1 Updated Jan 24, 2025

🗂️A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs. / 一个支持多存储的文件列表/WebDAV程序,使用 Gin 和 Solidjs。

Go 48,650 8,019 Updated Jul 1, 2025

CVPR 2025 论文和开源项目合集

20,346 2,696 Updated Jun 5, 2025

flash attention tutorial written in python, triton, cuda, cutlass

Cuda 377 40 Updated May 14, 2025

LLM inference in C/C++

C++ 82,408 12,223 Updated Jun 30, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 145,195 12,254 Updated Jul 1, 2025

动手学Ollama,CPU玩转大模型部署,在线阅读地址:https://datawhalechina.github.io/handy-ollama/

Jupyter Notebook 1,762 225 Updated Jun 23, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 53,230 6,524 Updated Jun 29, 2025

AutoMQ is a stateless/diskless Kafka on S3. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency. Multi-AZ Availability.

Java 6,746 465 Updated Jun 30, 2025

Unofficial description of the CUDA assembly (SASS) instruction sets.

Python 104 11 Updated Mar 10, 2025

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 5,876 546 Updated Jan 22, 2025

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

C++ 2,529 232 Updated May 21, 2025

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,895 277 Updated Jun 9, 2025

fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tps,多并发可达60+。

C++ 3,724 379 Updated Jun 26, 2025
Next
0