8000 nangeblog / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View nangeblog's full-sized avatar

Block or report nangeblog

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Distributed RL System for LLM Reasoning

Python 1,283 61 Updated May 30, 2025

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 686 29 Updated Mar 19, 2025

DeepSeek 系列工作解读、扩展和复现。

Python 651 53 Updated Mar 29, 2025

Simple RL training for reasoning

Python 3,598 267 Updated Apr 10, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,838 1,488 Updated Apr 24, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,857 1,751 Updated Feb 26, 2025

DeepSeek Coder: Let the Code Write Itself

Python 21,626 2,474 Updated May 21, 2024

Fully open reproduction of DeepSeek-R1

Python 24,622 2,275 Updated May 28, 2025

A powerful tool for creating fine-tuning datasets for LLM

JavaScript 7,904 785 Updated May 30, 2025

Official Repo for Open-Reasoner-Zero

Python 1,936 101 Updated Apr 8, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,790 296 Updated Mar 10, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,655 416 Updated Mar 5, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,781 107 Updated Apr 3, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,393 606 Updated May 27, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,725 782 Updated May 28, 2025

PI Controller vs Reinforcement Learning to control temperature inside a room. This repo is the updated version of https://github.com/NasimKaveh/Thermal-HVAC-model.

Python 1 Updated Apr 4, 2024

Apply reinforcement learning to a building emulator to intelligently control HVAC systems.

Jupyter Notebook 12 4 Updated Jun 13, 2024

Enhancing HVAC Control Efficiency: A Hybrid Approach Using Imitation and Reinforcement Learning

Python 3 Updated Oct 9, 2024

This repository contains the implementation of reinforcement learning algorithms for optimizing energy demand response in commercial buildings. The project focuses on reducing peak loads and improv…

Jupyter Notebook 3 1 Updated Dec 13, 2024

It utilizes a defined environment to improve HVAC energy efficiency using Reinforcement Learning Control Algorithms. Several parameters including temperature, humidity and windspeed were used to tr…

Jupyter Notebook 4 Updated Dec 23, 2024

Accelerating Reinforcement Learning for HVAC Systems Using an LSTM-based Simulator

Python 2 Updated Feb 2, 2025

交易模块

Python 6,213 1,401 Updated May 13, 2024
Python 153 51 Updated Mar 10, 2022

[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant

Jupyter Notebook 11,648 1,676 Updated May 11, 2025

wtpy是基于wondertrader为底层的针对python的子框架

Python 1,241 326 Updated Aug 19, 2024

WonderTrader——量化研发交易一站式框架

C++ 5,098 985 Updated Feb 20, 2025

基于python的量化交易平台

Python 1,667 648 Updated May 2, 2020

30天掌握量化交易 (持续更新)

Python 6,301 1,408 Updated May 19, 2025

天勤量化开发包, 期货量化, 实时行情/历史数据/实盘交易

Python 4,025 690 Updated May 25, 2025

阿布量化交易系统(股票,期权,期货,比特币,机器学习) 基于python的开源量化交易,量化投资架构

Python 13,802 4,055 Updated Mar 11, 2025
Next
0