Nanjing University
Stars
Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
📷 A Website created using Tailwind CSS, HTML, CSS and JavaScript that can be used as a Photographer Portfolio.
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
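AIInfra (AI infrastructure) refers to the AI system stack, from underlying hardware such as chips up to the upper-layer software stack, that supports training and inference of large AI models.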
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
WebGazer.js: Scalable Webcam EyeTracking Using User Interactions
[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"
💥 Blazing fast terminal file manager written in Rust, based on async I/O.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
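Welcome to LLM-Dojo, an open-source place for learning about large models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, and GLM), RLHF frameworks (DPO/CPO/KTO/PPO), and other features. 👩🎓👨🎓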
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Access latex source of any arxiv.org paper directly on overleaf
5D Diplomacy With Multiverse Time Travel
This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving," held at ECCV 2024.
Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR 2025 Oral)
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*