Lists (18)
Alignment
DSP gang
efficient moe rl-tuning
LLM && Agents
LLM inference
LLM internal
How do you think?
LLM PC
LLM pretraining
llm reasoning
LLM tuning
LLM4Sci
side projects --> startup
look-a-look
mamba(ssm)
multimodal
non-Trans LLMs
LLMs with non-Transformer architectures
Triton && MLX && JAX
tts
text to speech
workflow
max efficiency
Stars
ACE-Step: A Step Towards Music Generation Foundation Model
Latest Advances on Long Chain-of-Thought Reasoning
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme
Accelerate LLM preference tuning via prefix sharing with a single line of code (see the mask sketch below)
LeanRL is a fork of CleanRL, with selected PyTorch scripts optimized for performance using torch.compile and CUDA graphs (see the compile sketch below)
Fast Matrix Multiplications for Lookup Table-Quantized LLMs (see the naive LUT sketch below)
📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) o…
VPTQ, a flexible and extremely low-bit quantization algorithm
prime-rl is a codebase for decentralized RL training at scale
Code implementation of GPTQv2 (https://arxiv.org/abs/2504.02692)
Unleashing the Power of Reinforcement Learning for Math and Code Reasoners
GLM-4 series: Open Multilingual Multimodal Chat LMs
Technical report of Kimina-Prover Preview.
Democratizing Reinforcement Learning for LLMs
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
Code for data-aware compression of DeepSeek models
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
Official Implementation of QUAD: Quantization and Parameter-Efficient Tuning of LLM with Activation Decomposition
Official Implementation of LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
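The prefix-sharing entry above can be illustrated with a minimal sketch. In preference tuning (e.g. DPO), the chosen and rejected responses share the same prompt, so a pair can be packed into one sequence [prompt | chosen | rejected] with a block attention mask that lets each response attend to the prompt but not to the other response, encoding the shared prefix once. The function below is a hypothetical illustration of that mask, not the repository's actual API.

```python
import torch

def prefix_sharing_mask(prompt_len: int, chosen_len: int, rejected_len: int) -> torch.Tensor:
    """Boolean attention mask (True = may attend) for a packed sequence
    [prompt | chosen | rejected], so the shared prompt is encoded once.

    - Prompt tokens attend causally within the prompt.
    - Chosen tokens attend to the prompt and causally to earlier chosen tokens.
    - Rejected tokens attend to the prompt and causally to earlier rejected
      tokens, but NOT to any chosen tokens.
    """
    total = prompt_len + chosen_len + rejected_len
    # Start from a standard causal mask over the packed sequence.
    mask = torch.tril(torch.ones(total, total, dtype=torch.bool))
    # Block the rejected segment from attending to the chosen segment.
    c0, c1 = prompt_len, prompt_len + chosen_len   # chosen span
    r0 = prompt_len + chosen_len                   # rejected start
    mask[r0:, c0:c1] = False
    return mask

# Example: prompt of 4 tokens, chosen of 2, rejected of 3.
m = prefix_sharing_mask(4, 2, 3)
assert m[6, 3] and not m[6, 4]  # rejected sees the prompt, not the chosen tokens
```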
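For the LeanRL entry, the speedups come from standard PyTorch features rather than a custom runtime. A minimal sketch of the pattern (not LeanRL's actual code): wrap the model in torch.compile with mode="reduce-overhead", which captures CUDA graphs so each training step replays one captured graph instead of issuing many small kernel launches.

```python
import torch
import torch.nn as nn

# A small policy network standing in for an RL actor (hypothetical example).
policy = nn.Sequential(nn.Linear(8, 64), nn.Tanh(), nn.Linear(64, 2))
device = "cuda" if torch.cuda.is_available() else "cpu"
policy = policy.to(device)

# "reduce-overhead" asks the compiler to use CUDA graphs, replaying the whole
# forward pass as one captured graph to cut per-step launch overhead.
fast_policy = torch.compile(policy, mode="reduce-overhead")

obs = torch.randn(256, 8, device=device)
for _ in range(3):                  # the first iterations warm up / capture the graph
    actions = fast_policy(obs)
```

CUDA-graph capture requires static input shapes, which is one reason such optimized scripts keep batch sizes fixed.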
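For the lookup-table-quantized matmul entry, here is a naive reference sketch of the storage scheme under simplifying assumptions (one shared codebook, no per-group scaling): each weight is stored as a small integer index into a codebook of centroids, and a matmul dequantizes by table lookup. The repository's fast kernels fuse this lookup into the GEMM; this sketch only shows the semantics.

```python
import torch

def lut_quantize(w: torch.Tensor, num_centroids: int = 16):
    """Scalar LUT quantization: map each weight to its nearest centroid.
    Centroids are uniform over the weight range for simplicity
    (real methods fit them, e.g. with k-means)."""
    codebook = torch.linspace(w.min(), w.max(), num_centroids)  # (K,)
    idx = (w.unsqueeze(-1) - codebook).abs().argmin(-1)         # (out, in) indices
    return codebook, idx.to(torch.uint8)                        # 16 centroids = 4-bit

def lut_matmul(x: torch.Tensor, codebook: torch.Tensor, idx: torch.Tensor) -> torch.Tensor:
    """Naive LUT matmul: dequantize via table lookup, then multiply."""
    w_hat = codebook[idx.long()]   # the lookup is the dequantization
    return x @ w_hat.T

w = torch.randn(32, 16)
codebook, idx = lut_quantize(w)
x = torch.randn(4, 16)
y = lut_matmul(x, codebook, idx)   # approximates x @ w.T
```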