8000 yhZhai (Yuanhao Zhai) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View yhZhai's full-sized avatar
😶‍🌫️
😶‍🌫️
  • State University of New York at Buffalo
  • New York

Block or report yhZhai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

YOLOv12: Attention-Centric Real-Time Object Detectors

Python 1,886 249 Updated Jun 4, 2025

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 11,759 1,640 Updated Apr 7, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,170 1,449 Updated Jun 13, 2025

Framework agnostic sliced/tiled inference + interactive ui + error analysis plots

Python 4,603 659 Updated May 15, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,787 161 Updated May 28, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,015 514 Updated Jun 9, 2025

A reading list of video generation

589 37 Updated Jun 13, 2025

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,961 297 Updated Dec 21, 2024

High-resolution models for human tasks.

Python 5,048 293 Updated Nov 18, 2024

Open-source toolbox for visual fashion analysis based on PyTorch

Python 1,312 294 Updated May 10, 2024
C++ 2 Updated Nov 9, 2024

Annotated Flow Matching paper

Jupyter Notebook 187 10 Updated Sep 14, 2024

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 5,741 531 Updated Jan 22, 2025

Official inference repo for FLUX.1 models

Python 22,365 1,582 Updated Jun 5, 2025

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 542 47 Updated Jun 12, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 15,801 1,825 Updated Dec 25, 2024

[ECCV 2024] Prompting Language-Informed Distribution for Compositional Zero-Shot Learning

Python 13 Updated Jan 4, 2025

[NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation

Python 203 15 Updated May 27, 2025

The official implementation of "Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation" (CVPR2024).

Python 4 Updated Aug 12, 2024

A Collection of BM25 Algorithms in Python

Python 1,186 95 Updated Oct 8, 2024

State-of-the-Art Text Embeddings

Python 16,904 2,617 Updated Jun 11, 2025

Header-only C++/python library for fast approximate nearest neighbors

C++ 4,748 711 Updated Apr 20, 2025

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,056 358 Updated Aug 7, 2024

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Jupyter Notebook 456 38 Updated Jan 19, 2024

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,513 354 Updated May 13, 2025

Code of "Seesaw: Compensating for Nonlinear Reduction with Linear Computations for Private Inference" in ICML'24

Python 5 Updated Jul 9, 2024
Next
0