-
ai-hedge-fund Public
Forked from virattt/ai-hedge-fundAn AI Hedge Fund Team
Python MIT License UpdatedMay 16, 2025 -
Reading_Notes Public
Forked from 0917Ray/Reading_NotesSome reading notes edited in LaTeX. 一些学习笔记,使用LaTeX编辑.
Jupyter Notebook UpdatedMay 12, 2025 -
DRL-Pytorch Public
Forked from XinJingHao/DRL-PytorchClean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Python UpdatedMay 10, 2025 -
ai-hedge-fund-crypto Public
Forked from 51bitquant/ai-hedge-fund-cryptoAI-Hedge-Fund for Crypto 🚀 AI-powered hedge fund for cryptocurrency trading, leveraging LLM agents for intelligent decision-making.
Python MIT License UpdatedMay 5, 2025 -
MINI_LLM Public
Forked from jiahe7ay/MINI_LLMThis is a repository used by individuals to experiment and reproduce the pre-training process of LLM.
Python UpdatedMay 1, 2025 -
RLHF-Reward-Modeling Public
Forked from RLHFlow/RLHF-Reward-ModelingRecipes to train reward model for RLHF.
Python Apache License 2.0 UpdatedApr 24, 2025 -
-
MiniLM2 Public
Forked from SwarmClone/MiniLM2计划的核心——大语言模型
Python GNU General Public License v3.0 UpdatedMar 28, 2025 -
machine-learning-notes Public
Forked from luweiagi/machine-learning-notesThis is the notes of the way of machine learning study. You may find something useful in it.
UpdatedMar 24, 2025 -
EmoLLM Public
Forked from SmartFlowAI/EmoLLM心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2、LLama3.1
Python MIT License UpdatedMar 23, 2025 -
Slow_Thinking_with_LLMs Public
Forked from RUCAIBox/Slow_Thinking_with_LLMsA series of technical report on Slow Thinking with LLM
Python UpdatedMar 21, 2025 -
simpleRL-reason Public
Forked from hkust-nlp/simpleRL-reasonThis is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Python MIT License UpdatedMar 17, 2025 -
llm_related Public
Forked from wyf3/llm_related记录大模型相关的一些知识和方法
Jupyter Notebook UpdatedMar 15, 2025 -
X-R1 Public
Forked from dhcode-cpp/X-R1minimal-cost for training 0.5B R1-Zero
Python Apache License 2.0 UpdatedMar 10, 2025 -
Building-a-Small-LLM-from-Scratch Public
Forked from KaihuaTang/Building-a-Small-LLM-from-Scratch该系列的目的是让读者可以在基础的pytorch上,不依赖任何其他现成的外部库,从零开始理解并实现一个大语言模型的所有组成部分,以及训练微调代码,因此读者仅需python,pytorch和最基础深度学习背景知识即可。
Python UpdatedMar 7, 2025 -
vit-pytorch Public
Forked from lucidrains/vit-pytorchImplementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Python MIT License UpdatedMar 5, 2025 -
simple_GRPO Public
Forked from lsdefine/simple_GRPOA very simple GRPO implement for reproducing r1-like LLM thinking.
Python UpdatedFeb 28, 2025 -
R1-Onevision Public
Forked from Fancy-MLLM/R1-OnevisionR1-onevision, a visual language model capable of deep CoT reasoning.
Apache License 2.0 UpdatedFeb 25, 2025 -
Open-Reasoner-Zero Public
Forked from Open-Reasoner-Zero/Open-Reasoner-ZeroOfficial Repo for Open-Reasoner-Zero
Python MIT License UpdatedFeb 24, 2025 -
Logic-RL Public
Forked from Unakar/Logic-RLReproduce R1 Zero on Logic Puzzle
Python Apache License 2.0 UpdatedFeb 21, 2025 -
r1-reasoning-rag Public
Forked from deansaco/r1-reasoning-ragrecursive rag with r1 reasoning
Python UpdatedFeb 20, 2025 -
mini_qwen Public
Forked from qiufengqijun/mini_qwen这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。
Python UpdatedFeb 18, 2025 -
R1-Nature Public
Forked from StarRing2022/R1-Nature最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。
Python UpdatedFeb 8, 2025 -
SkyThought Public
Forked from NovaSky-AI/SkyThoughtSky-T1: Train your own O1 preview model within $450
Python Apache License 2.0 UpdatedJan 26, 2025 -
llm-course Public
Forked from mlabonne/llm-courseCourse to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Jupyter Notebook Apache License 2.0 UpdatedJan 22, 2025 -
-
nanoGPT Public
Forked from karpathy/nanoGPTThe simplest, fastest repository for training/finetuning medium-sized GPTs.
Python MIT License UpdatedDec 9, 2024 -
Administrative-divisions-of-China Public
Forked from modood/Administrative-divisions-of-China中华人民共和国行政区划:省级(省份)、 地级(城市)、 县级(区县)、 乡级(乡镇街道)、 村级(村委会居委会) ,中国省市区镇村二级三级四级五级联动地址数据。
JavaScript Do What The F*ck You Want To Public License UpdatedNov 28, 2024 -
ProposalLLM Public
Forked from William-GuoWei/ProposalLLM标书大模型(Proposal-LLM Chinese version )
Python Apache License 2.0 UpdatedNov 14, 2024 -