Starred repositories
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
Minimal reproduction of DeepSeek R1-Zero
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
DeepSeek Coder: Let the Code Write Itself
Fully open reproduction of DeepSeek-R1
A powerful tool for creating fine-tuning datasets for LLMs
Official Repo for Open-Reasoner-Zero
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
A lightweight data processing framework built on DuckDB and 3FS.
MoBA: Mixture of Block Attention for Long-Context LLMs
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepEP: an efficient expert-parallel communication library
PI Controller vs Reinforcement Learning to control temperature inside a room. This repo is the updated version of https://github.com/NasimKaveh/Thermal-HVAC-model.
Apply reinforcement learning to a building emulator to intelligently control HVAC systems.
Enhancing HVAC Control Efficiency: A Hybrid Approach Using Imitation and Reinforcement Learning
This repository contains the implementation of reinforcement learning algorithms for optimizing energy demand response in commercial buildings. The project focuses on reducing peak loads and improv…
It utilizes a defined environment to improve HVAC energy efficiency using Reinforcement Learning Control Algorithms. Several parameters including temperature, humidity, and wind speed were used to tr…
Accelerating Reinforcement Learning for HVAC Systems Using an LSTM-based Simulator
[🔥updating ...] AI automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
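Abu quantitative trading system (stocks, options, futures, Bitcoin, machine learning). An open-source, Python-based framework for quantitative trading and quantitative investment.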