javaxiong

javaxiong

2 followers · 41 following

ai-hedge-fund Public
Forked from virattt/ai-hedge-fund

An AI Hedge Fund Team

Python MIT License Updated May 16, 2025
Reading_Notes Public
Forked from 0917Ray/Reading_Notes

Some reading notes edited in LaTeX. 一些学习笔记，使用LaTeX编辑.

Jupyter Notebook Updated May 12, 2025
DRL-Pytorch Public
Forked from XinJingHao/DRL-Pytorch

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Python Updated May 10, 2025
ai-hedge-fund-crypto Public
Forked from 51bitquant/ai-hedge-fund-crypto

AI-Hedge-Fund for Crypto 🚀 AI-powered hedge fund for cryptocurrency trading, leveraging LLM agents for intelligent decision-making.

Python MIT License Updated May 5, 2025
MINI_LLM Public
Forked from jiahe7ay/MINI_LLM

This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.

Python Updated May 1, 2025
RLHF-Reward-Modeling Public
Forked from RLHFlow/RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Python Apache License 2.0 Updated Apr 24, 2025
glm-4-voice-finetune Public
Forked from anthony-wss/glm-4-voice-finetune

Python Updated Apr 4, 2025
MiniLM2 Public
Forked from SwarmClone/MiniLM2

计划的核心——大语言模型

Python GNU General Public License v3.0 Updated Mar 28, 2025
machine-learning-notes Public
Forked from luweiagi/machine-learning-notes

This is the notes of the way of machine learning study. You may find something useful in it.

Updated Mar 24, 2025
EmoLLM Public
Forked from SmartFlowAI/EmoLLM

心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2、LLama3.1

Python MIT License Updated Mar 23, 2025
Slow_Thinking_with_LLMs Public
Forked from RUCAIBox/Slow_Thinking_with_LLMs

A series of technical report on Slow Thinking with LLM

Python Updated Mar 21, 2025
simpleRL-reason Public
Forked from hkust-nlp/simpleRL-reason

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python MIT License Updated Mar 17, 2025
llm_related Public
Forked from wyf3/llm_related

记录大模型相关的一些知识和方法

Jupyter Notebook Updated Mar 15, 2025
X-R1 Public
Forked from dhcode-cpp/X-R1

minimal-cost for training 0.5B R1-Zero

Python Apache License 2.0 Updated Mar 10, 2025
Building-a-Small-LLM-from-Scratch Public
Forked from KaihuaTang/Building-a-Small-LLM-from-Scratch

该系列的目的是让读者可以在基础的pytorch上，不依赖任何其他现成的外部库，从零开始理解并实现一个大语言模型的所有组成部分，以及训练微调代码，因此读者仅需python，pytorch和最基础深度学习背景知识即可。

Python Updated Mar 7, 2025
vit-pytorch Public
Forked from lucidrains/vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python MIT License Updated Mar 5, 2025
simple_GRPO Public
Forked from lsdefine/simple_GRPO

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python Updated Feb 28, 2025
R1-Onevision Public
Forked from Fancy-MLLM/R1-Onevision

R1-onevision, a visual language model capable of deep CoT reasoning.

Apache License 2.0 Updated Feb 25, 2025
Open-Reasoner-Zero Public
Forked from Open-Reasoner-Zero/Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python MIT License Updated Feb 24, 2025
Logic-RL Public
Forked from Unakar/Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python Apache License 2.0 Updated Feb 21, 2025
r1-reasoning-rag Public
Forked from deansaco/r1-reasoning-rag

recursive rag with r1 reasoning

Python Updated Feb 20, 2025
mini_qwen Public
Forked from qiufengqijun/mini_qwen

这是一个从头训练大语言模型的项目，包括预训练、微调和直接偏好优化，模型拥有1B参数，支持中英文。

Python Updated Feb 18, 2025
R1-Nature Public
Forked from StarRing2022/R1-Nature

最简易的R1结果在小模型上的复现，阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证，对于强推理能力，think思考过程性内容是AGI/ASI的核心。

Python Updated Feb 8, 2025
SkyThought Public
Forked from NovaSky-AI/SkyThought

Sky-T1: Train your own O1 preview model within $450

Python Apache License 2.0 Updated Jan 26, 2025
llm-course Public
Forked from mlabonne/llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook Apache License 2.0 Updated Jan 22, 2025
Foundations-of-LLMs Public
Forked from ZJU-LLMs/Foundations-of-LLMs

Other Updated Jan 14, 2025
nanoGPT Public
Forked from karpathy/nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python MIT License Updated Dec 9, 2024
Administrative-divisions-of-China Public
Forked from modood/Administrative-divisions-of-China

中华人民共和国行政区划：省级（省份）、地级（城市）、县级（区县）、乡级（乡镇街道）、村级（村委会居委会），中国省市区镇村二级三级四级五级联动地址数据。

JavaScript Do What The F*ck You Want To Public License Updated Nov 28, 2024
ProposalLLM Public
Forked from William-GuoWei/ProposalLLM

标书大模型（Proposal-LLM Chinese version )

Python Apache License 2.0 Updated Nov 14, 2024
TinyRAG Public
Forked from KMnO4-zx/TinyRAG

TinyRAG

Python Updated Oct 30, 2024

javaxiong

ai-hedge-fund Public

Uh oh!

Reading_Notes Public

Uh oh!

DRL-Pytorch Public

Uh oh!

ai-hedge-fund-crypto Public

Uh oh!

MINI_LLM Public

Uh oh!

RLHF-Reward-Modeling Public

Uh oh!

glm-4-voice-finetune Public

Uh oh!

MiniLM2 Public

Uh oh!

machine-learning-notes Public

Uh oh!

EmoLLM Public

Uh oh!

Slow_Thinking_with_LLMs Public

Uh oh!

simpleRL-reason Public

Uh oh!

llm_related Public

Uh oh!

X-R1 Public

Uh oh!

Building-a-Small-LLM-from-Scratch Public

Uh oh!

vit-pytorch Public

Uh oh!

simple_GRPO Public

Uh oh!

R1-Onevision Public

Uh oh!

Open-Reasoner-Zero Public

Uh oh!

Logic-RL Public

Uh oh!

r1-reasoning-rag Public

Uh oh!

mini_qwen Public

Uh oh!

R1-Nature Public

Uh oh!

SkyThought Public

Uh oh!

llm-course Public

Uh oh!

Foundations-of-LLMs Public

Uh oh!

nanoGPT Public

Uh oh!

Administrative-divisions-of-China Public

Uh oh!

ProposalLLM Public

Uh oh!

TinyRAG Public

Uh oh!