8000 ctgushiwei (Vector) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View ctgushiwei's full-sized avatar
  • Guangdong Shenzhen

Block or report ctgushiwei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A course of learning LLM inference serving on Apple Silicon for systems engineers.

Python 2,733 151 Updated Jun 14, 2025

这是一个clip-pytorch的模型,可以训练自己的数据集。

Python 233 30 Updated Apr 5, 2023

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Python 7,529 775 Updated Jun 23, 2025

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,417 6,142 Updated Jul 13, 2023

ICCV 2023: CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No

Python 138 12 Updated Dec 2, 2023

Repo for the Deep Reinforcement Learning Nanodegree program

Jupyter Notebook 5,072 2,375 Updated Nov 16, 2023

A high-performance distributed training framework for Reinforcement Learning

Python 3,388 822 Updated Jan 24, 2025

Hands-on Deep Reinforcement Learning, published by Packt

Python 2,978 1,312 Updated May 9, 2025

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook 1,261 138 Updated Mar 13, 2025

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

Python 1,026 143 Updated Oct 1, 2024

Example models using DeepSpeed

Python 6,566 1,100 Updated Jul 8, 2025

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

Jupyter Notebook 942 130 Updated May 30, 2025

A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

JavaScript 40,341 4,937 Updated Jul 8, 2025

https://hrl.boyuai.com/

Jupyter Notebook 3,644 700 Updated Nov 22, 2022

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 10,431 1,015 Updated Jun 24, 2025
Python 1,271 48 Updated Jul 6, 2025

llm & rl

Jupyter Notebook 156 16 Updated Jul 6, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 4,316 235 Updated May 5, 2025

免费公益机场节点分享

667 29 Updated Jul 10, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 3,697 331 Updated Jul 9, 2025

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Python 2,102 385 Updated Jul 9, 2024

翻墙软件下载(Windows/Android/macOS/iOS)

1,315 234 Updated Jun 16, 2025

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Python 231 8 Updated May 17, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,466 68 Updated Apr 18, 2025

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

Python 25,607 447C 2,214 Updated Jun 30, 2025

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 91,015 10,364 Updated Jul 8, 2025

Train transformer language models with reinforcement learning.

Python 14,542 2,031 Updated Jul 9, 2025
Python 87 9 Updated Sep 29, 2024
Next
0