Stars
Video+code lecture on building nanoGPT from scratch
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Medical o1, Towards medical complex reasoning with LLMs
A step by step guide to fine-tuning the DeepSeek R1 Distilled models on Apple Silicon machines.
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。
An annotated implementation of the Transformer paper.
A PyTorch implementation of the Transformer model in "Attention is All You Need".
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
The official GitHub page for the survey paper "A Survey of Large Language Models".
A collection of research papers, datasets and software related to knowledge graphs for drug discovery. Accompanies the paper "A review of biomedical datasets relating to drug discovery: a knowledge…
A collection of GPT system prompts and various prompt injection/leaking knowledge.
Awesome AI GPTs, OpenAI GPTs, GPT-4, ChatGPT, GPTs, Prompts, plugins, Prompts leaking
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版
Train transformer language models with reinforcement learning.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
The official Python library for the OpenAI API
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Strategies for Pre-training Graph Neural Networks
Few-Shot Graph Learning for Molecular Property Prediction
Must-read papers on graph neural networks (GNN)
Code for "Enhance Information Propagation for Graph Neural Network by Heterogeneous Aggregations"