Stars
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Repository of DISC-MedLLM, it is a comprehensive solution that leverages Large Language Models (LLMs) to provide accurate and truthful medical response in end-to-end conversational healthcare servi…
Chinese medical dialogue data 中文医疗对话数据集
中文通用大模型开放域多轮测评基准 | An Open Domain Benchmark for Foundation Models in Chinese
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Open-Sora: Democratizing Efficient Video Production for All
TencentLLMEval is a comprehensive and extensive benchmark for artificial evaluation of large models that includes task trees, standards, data verification methods, and more.
An In-depth Analysis of Diffusion Probability Model
A series of large language models developed by Baichuan Intelligent Technology
Deep Reinforcement Learning Lab, a platform designed to make DRL technology and fun for everyone
Serve, optimize and scale PyTorch models in production
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Collections of resources from Joint Laboratory of HIT and iFLYTEK Research (HFL)
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
Dynamic Thresholding (CFG Scale Fix) for Stable Diffusion (SwarmUI, ComfyUI, and Auto WebUI)
AI绘画资料合集(包含国内外可使用平台、使用教程、参数教程、部署教程、业界新闻等等) Stable diffusion、AnimateDiff、Stable Cascade 、Stable SDXL Turbo