Stars
Fully open reproduction of DeepSeek-R1
Generative Models by Stability AI
GPT4V-level open-source multi-modal model based on Llama3-8B
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。
shibing624 / SearchGPT
Forked from leptonai/search_with_leptonSearchGPT: Building a quick conversation-based search engine with LLMs.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
ChatGPT爆火,开启了通往AGI的关键一步,本项目旨在汇总那些ChatGPT的开源平替们,包括文本大模型、多模态大模型等,为大家提供一些便利
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
收录NLP竞赛策略实现、各任务baseline、相关竞赛经验贴(当前赛事、往期赛事、训练赛)、NLP会议时间、常用自媒体、GPU推荐等,持续更新中
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Code and documentation to train Stanford's Alpaca models, and generate the data.
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
ChatYuan: Large Language Model for Dialogue in Chinese and English
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
Making large AI models cheaper, faster and more accessible
DeepIE: Deep Learning for Information Extraction
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation