Stars
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
[CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas
A Code Release for Mip-NeRF 360, Ref-NeRF, and RawNeRF
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
PITI: Pretraining is All You Need for Image-to-Image Translation
A flexible, high-performance 3D simulator for Embodied AI research.
A curated list of awesome neural radiance fields papers
Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)
Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image genera…
PyTorch implementations of Generative Adversarial Networks.
Instant neural graphics primitives: lightning fast NeRF and more
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Dynamic, Non-Prehensile, and Underactuated Object Locomotion through Reinforcement Learning
nazir-hk / learn_rockwalk
Forked from JS-RML/learn_rockwalk. Dynamic, Non-Prehensile, and Underactuated Object Locomotion through Reinforcement Learning
[ACM MM 2021 Oral] Unsupervised Portrait Shadow Removal via Generative Priors
Curated list of awesome GAN applications and demos
zhangtianjia / CVPR2021-Paper-Code-Interpretation
Forked from extreme-assistant/CVPR2024-Paper-Code-Interpretation. CVPR 2021/2020/2019/2018/2017 collection of papers, code, explainers, and livestreams, compiled by the 极市 team
PyTorch implementation of paper "Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes"
A collection of resources on neural rendering.
zhangtianjia / ICCV2021-Paper-Code-Interpretation
Forked from extreme-assistant/ICCV2023-Paper-Code-Interpretation. ICCV 2021/2019/2017 collection of papers, code, explainers, and livestreams, compiled by the 极市 team