Stars
Extended crypt library for descrypt, md5crypt, bcrypt, and others
Simulation of a self-balancing robot using an LQR controller implemented in MuJoCo
Tutorial on how to get started with MuJoCo Simulation Platform. MuJoCo stands for Multi-Joint dynamics with Contact. It was acquired and made freely available by DeepMind in October 2021, and open …
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
Various *nix tools built as statically-linked binaries
A powerful tool for creating fine-tuning datasets for LLM
🤗 smolagents: a barebones library for agents that think in code.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Everything about the SmolLM2 and SmolVLM family of models
Train transformer language models with reinforcement learning.
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
An implementation of JSON Schema, drafts v4, v6, and v7, in Go
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
Minimal reproduction of DeepSeek R1-Zero
Implementing DeepSeek R1's GRPO algorithm from scratch
OrangeX4 / latex2sympy
Forked from purdue-tlt/latex2sympy
Parse LaTeX math expressions
The simplest, fastest repository for training/finetuning medium-sized GPTs.
From a nobody to a large language model (LLM) hero. Stay tuned for updates!
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
A high-performance distributed training framework for Reinforcement Learning
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Open source implementation of OPC UA (OPC Unified Architecture) aka IEC 62541 licensed under Mozilla Public License v2.0