-
Tsinghua University
- Shenzhen, China
-
01:12
(UTC -12:00) - louieworth.github.io
- @louieworth
More
Lists (4)
Sort Name ascending (A-Z)
Stars
MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases
Scalable toolkit for efficient model alignment
The homework assignments of the course Introduction to Optimization Theory
A recipe for online RLHF and online iterative DPO.
A collection of tips for scientific research
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Generative Representational Instruction Tuning
Robust recipes to align language models with human and AI preferences
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
Recipes to train reward model for RLHF.
[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
✨✨Latest Advances on Multimodal Large Language Models
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Representation Engineering: A Top-Down Approach to AI Transparency
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)
Meditron is a suite of open-source medical Large Language Models (LLMs).
Doctor Dignity is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.