8000 MilchstraB (Chengzhi Yu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View MilchstraB's full-sized avatar
🎯
Focusing
🎯
Focusing
  • University of Science and Technology of China

Highlights

  • Pro

Block or report MilchstraB

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 860 49 Updated Jun 15, 2025

This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or rejection sampling fine-tuning.

Python 32 3 Updated Sep 22, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.

Python 129 6 Updated Apr 7, 2025

Codebase for Iterative DPO Using Rule-based Rewards

Python 247 31 Updated Apr 11, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 770 46 Updated May 14, 2025

Aligning Large Language Models with Human: A Survey

730 31 Updated Sep 11, 2023

The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models

Python 15 2 Updated Oct 4, 2024
27 Updated Sep 27, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 8,007 868 Updated Apr 30, 2025

A family of lightweight multimodal models.

Python 1,023 74 Updated Nov 18, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,037 4,050 Updated Jul 17, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,912 2,227 Updated Jul 29, 2024

Example models using DeepSpeed

Python 6,534 1,094 Updated Jun 14, 2025

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,944 767 Updated May 31, 2024

Approaching (Almost) Any Machine Learning Problem

7,977 1,116 Updated Mar 25, 2023

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version in translation

Java 113,388 14,086 Updated Jun 12, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 50,479 5,928 Updated Sep 18, 2024

[ECCVW 2022] The codes for the work "Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation"

Python 2,052 344 Updated May 9, 2025

Domain Adaptive Mitochondria Segmentation via Enforcing Inter-Section Consistency

Python 16 3 Updated Jul 16, 2023

A playbook for systematically maximizing the performance of deep learning models.

28,827 2,372 Updated Jun 18, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 34,450 4,931 Updated Jun 15, 2025

LaTeX中文模板收集

TeX 37 16 Updated Aug 15, 2018
Python 6,842 2,013 Updated Jun 3, 2025

Integral Human Pose Regression

Cuda 480 76 Updated Apr 4, 2019

The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."

Python 9,165 1,576 Updated Jun 26, 2024

PyTorch implementation of DSNT

Python 306 57 Updated Sep 8, 2020

MedicalSeg is an easy-to-use 3D medical image segmentation toolkit that supports the whole segmentation process. Specially, We provide data preprocessing acceleration, high precision model on COVID…

Python 74 16 Updated Dec 14, 2022

超轻量级bert的pytorch版本,大量中文注释,容易修改结构,持续更新

Python 413 66 Updated Apr 18, 2022

Google AI 2018 BERT pytorch implementation

Python 6,416 1,324 Updated Sep 15, 2023
0