Stars
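MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and every other form of plain-text Chinese data.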
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
A generative world for general-purpose robotics & embodied AI learning.
Bringing BERT into modernity via both architecture changes and scaling
A beautiful, simple, clean, and responsive Jekyll theme for academics
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Fast and memory-efficient exact attention
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation for the paper "Token-wise Influential Training Data Retrieval for Large Language Models" (accepted at ACL 2024).
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
🔊 Text-Prompted Generative Audio Model
100+ Chinese Word Vectors (over a hundred sets of pre-trained Chinese word vectors)
A PyTorch implementation of Match-LSTM, R-NET and M-Reader for Machine Reading Comprehension
Can large language models provide useful feedback on research papers? A large-scale empirical analysis.
Representation Engineering: A Top-Down Approach to AI Transparency
A website displaying hundreds of charts made with Python
Instruct-tune LLaMA on consumer hardware
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Python scripts for preprocessing the Penn Treebank and Chinese Treebank
Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
Resource, Evaluation and Detection Papers for ChatGPT