- EPFL
- Switzerland
- https://hansunhayden.github.io/
- https://people.epfl.ch/han.sun/?lang=en
Stars
GIS system based on ArcGIS C# Engine
Official implementation of "Dynalign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation" (ICLR 2025)
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
🗂️ A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs.
This is a repository for listing papers on scene graph generation and application.
Official implementation of "Unseen Visual Anomaly Generation" (CVPR 2025)
[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model
SkyReels-A2: Compose anything in video diffusion transformers
A collection of awesome video generation studies.
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Official implementation of OneDiffusion paper (CVPR 2025)
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Character Animation (AnimateAnyone, Face Reenactment)
[ECCV 2024] The Official Implementation for "AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection"
A curated publication list on open vocabulary semantic segmentation and related areas (e.g. zero-shot semantic segmentation).
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
(TPAMI 2024) A Survey on Open Vocabulary Learning
Anomaly detection related books, papers, videos, and toolboxes
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
LightSeq: A High Performance Library for Sequence Processing and Generation
✨✨Latest Advances on Multimodal Large Language Models
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
SimCLRv2 - Big Self-Supervised Models are Strong Semi-Supervised Learners
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"