8000 ctgushiwei (Vector) / Starred · GitHub

More Web Proxy on the site http://driver.im/

ctgushiwei

Follow

Vector ctgushiwei

Follow

6 followers · 48 following

Guangdong Shenzhen

Achievements

Achievements

Lists (1)

Sort

🔮 Future ideas

Stars

skyzh / tiny-llm

A course of learning LLM inference serving on Apple Silicon for systems engineers.

Python 2,733 151 Updated Jun 14, 2025

bubbliiiing / clip-pytorch

这是一个clip-pytorch的模型，可以训练自己的数据集。

Python 233 30 Updated Apr 5, 2023

YaoFANGUK / video-subtitle-extractor

视频硬字幕提取，生成srt文件。无需申请第三方API，本地实现文本识别。基于深度学习的视频字幕提取框架，包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Python 7,529 775 Updated Jun 23, 2025

dennybritz / reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,417 6,142 Updated Jul 13, 2023

xmed-lab / CLIPN

ICCV 2023: CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No

Python 138 12 Updated Dec 2, 2023

udacity / deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

Jupyter Notebook 5,072 2,375 Updated Nov 16, 2023

PaddlePaddle / PARL

A high-performance distributed training framework for Reinforcement Learning

Python 3,388 822 Updated Jan 24, 2025

XiaomiMiMo / MiMo-VL

449 22 Updated Jul 2, 2025

PacktPublishing / Deep-Reinforcement-Learning-Hands-On

Hands-on Deep Reinforcement Learning, published by Packt

Python 2,978 1,312 Updated May 9, 2025

quantumiracle / Popular-RL-Algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook 1,261 138 Updated Mar 13, 2025

ericyangyu / PPO-for-Beginners

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

Python 1,026 143 Updated Oct 1, 2024

AXERA-TECH / CLIP-ONNX-AX650-CPP

C++ 27 5 Updated Jun 30, 2025

deepspeedai / DeepSpeedExamples

Example models using DeepSpeed

Python 6,566 1,100 Updated Jul 8, 2025

MrSyee / pg-is-all-you-need

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

Jupyter Notebook 942 130 Updated May 30, 2025

NaiboWang / EasySpider

A visual no-code/code-free web crawler/spider易采集：一个可视化浏览器自动化测试/数据采集/爬虫软件，可以无代码图形化的设计和执行爬虫任务。别名：ServiceWrapper面向Web应用的智能化服务封装系统。

JavaScript 40,341 4,937 Updated Jul 8, 2025

boyu-ai / Hands-on-RL

https://hrl.boyuai.com/

Jupyter Notebook 3,644 700 Updated Nov 22, 2022

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 10,431 1,015 Updated Jun 24, 2025

JiuhaiChen / BLIP3o

Python 1,271 48 Updated Jul 6, 2025

chunhuizhang / llm_rl

llm & rl

Jupyter Notebook 156 16 Updated Jul 6, 2025

apple / ml-fastvlm

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 4,316 235 Updated May 5, 2025

deezertidal / freevpn

免费公益机场节点分享

667 29 Updated Jul 10, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 3,697 331 Updated Jul 9, 2025

nikhilbarhate99 / PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Python 2,102 385 Updated Jul 9, 2024

Kejifaxian / welcome

翻墙软件下载（Windows/Android/macOS/iOS）

1,315 234 Updated Jun 16, 2025

Victorwz / Open-Qwen2VL

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Python 231 8 Updated May 17, 2025

policy-gradient / GRPO-Zero

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,466 68 Updated Apr 18, 2025

Byaidu / PDFMathTranslate

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译，支持 Google/DeepL/Ollama/OpenAI 等服务，提供 CLI/GUI/MCP/Docker/Zotero

Python 25,607 447C 2,214 Updated Jun 30, 2025

Anduin2017 / HowToCook

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 91,015 10,364 Updated Jul 8, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 14,542 2,031 Updated Jul 9, 2025

RethinkFun / trian_ppo

Python 87 9 Updated Sep 29, 2024

0