-
Beijing Jiaotong University & Georigia Tech
- Beijing
-
05:23
(UTC +08:00) - rbrq03.github.io
- in/jiannan-huang-79a9a4290
- @Jiannan03
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
[Preprint] UCGM: Unified Continuous Generative Models
nnDetection is a self-configuring framework for 3D (volumetric) medical object detection which can be applied to new data sets without manual intervention. It includes guides for 12 data sets that …
[CVPR 2024] VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis
[CVPR 2024 Extension] 160K volumes (42M slices) datasets, new segmentation datasets, 31M-1.2B pre-trained models, various pre-training recipes, 50+ downstream tasks implementation
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
GenEval: An object-focused framework for evaluating text-to-image alignment
🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A pipeline parallel training script for diffusion models.
Karras et al. (2022) diffusion models for PyTorch
基于深度学习的肿瘤辅助诊断系统,以图像分割为核心,利用人工智能完成肿瘤区域的识别勾画并提供肿瘤区域的特征来辅助医生进行诊断。有完整的模型构建、后端架设、工业级部署和前端访问功能。TensorRT、PyTorch 、OpenCV 、Flask、Vue
🚀从聊天记录创造数字分身的一站式解决方案💡 使用聊天记录微调大语言模型,让大模型有“那味儿”,并绑定到聊天机器人,实现自己的数字分身。 数字克隆/数字分身/数字永生/LLM/聊天机器人/LoRA
Efficient Mixture of Experts for LLM Paper List
Official repository for our work on micro-budget training of large-scale diffusion models.
CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
[CVPRW 2022] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
Quality-Aware Image-Text Alignment for Opinion-Unaware Image Quality Assessment
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
[ICML2025] Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity