Tsinghua University, Beijing
Starred repositories
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
Running inference on the ZeroSCROLLS benchmark
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Paper list on multimodal and large language models, kept only as a personal record of papers I read from the daily arXiv listings.
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
Chinese-language langchain project | 小必应, Q.Talk, 强聊, QiangTalk
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
ChatGLM-6B: An Open Bilingual Dialogue Language Model
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to make it do what you say.
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Making large AI models cheaper, faster and more accessible
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.