8000 daizuozhuo (Dai Zuozhuo) / Starred · GitHub

More Web Proxy on the site http://driver.im/

daizuozhuo

Follow

Dai Zuozhuo daizuozhuo

Follow

198 followers · 15 following

Alibaba
Hangzhou
https://daizuozhuo.github.io

Achievements

Achievements

Lists (1)

Sort

🚀 My stack

Stars

ruanyf / weekly

科技爱好者周刊，每周五发布

67,605 3,432 Updated Jun 6, 2025

Junyi42 / monst3r

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 1,181 68 Updated May 6, 2025

Tencent-Hunyuan / HunyuanVideo-I2V

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,483 138 Updated May 20, 2025

Tencent-Hunyuan / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 10,263 914 Updated Jun 3, 2025

Vchitect / VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,030 58 Updated May 29, 2025

PixArt-alpha / PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,814 90 Updated Oct 31, 2024

JianhongBai / UniEdit

UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing

Python 108 4 Updated Apr 16, 2025

KKallidromitis / SA-1B-Downloader

Simple script to parallelize download and extract files for SA-1B Dataset.

Python 36 4 Updated Oct 11, 2024

NUS-HPC-AI-Lab / VideoSys

VideoSys: An easy and efficient system for video generation

Python 1,974 130 Updated Mar 9, 2025

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,615 1,037 Updated Nov 18, 2024

alibaba / animate-anything

Fine-Grained Open Domain Image Animation with Motion Guidance

Python 924 70 Updated Oct 18, 2024

AnimationAI / AnimateAnything

Fine-Grained Open Domain Image Animation with Motion Guidance

10 Updated Dec 8, 2023

ddPn08 / Radiata

Stable diffusion webui based on diffusers.

Python 979 68 Updated Sep 29, 2023

extreme-assistant / CVPR2024-Paper-Code-Interpretation

cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 论文/代码/解读/直播合集，极市团队整理

12,497 2,280 Updated Apr 25, 2024

Zeqiang-Lai / Mini-DALLE3

Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

Python 313 30 Updated Dec 28, 2023

Breakthrough / PySceneDetect

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 3,939 441 Updated May 3, 2025

ExponentialML / Text-To-Video-Finetuning

Finetune ModelScope's Text To Video model using Diffusers 🧨

Python 687 110 Updated Dec 14, 2023

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 41,745 6,967 Updated Dec 9, 2024

52CV / CVPR-2023-Papers

941 75 Updated Nov 1, 2023

microsoft / CodeT

Python 655 87 Updated Nov 1, 2024

yizhongw / self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Python 4,389 509 Updated Mar 27, 2023

yokoffing / ChatGPT-Prompts

ChatGPT and Bing AI prompt curation

872 83 Updated Mar 6, 2024

THUDM / GLM-130B

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,684 605 Updated Jul 25, 2023

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 82,975 10,054 Updated May 13, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 10,913 1,098 Updated May 28, 2025

zengyan-97 / CCLM

Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023))

Python 90 9 Updated Jun 12, 2023

wolfgarbe / SymSpell

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

C# 3,246 303 Updated Mar 29, 2025

microsoft / GLIP

Grounded Language-Image Pre-training

Python 2,419 212 Updated Jan 24, 2024

zdou0830 / METER

METER: A Multimodal End-to-end TransformER Framework

Python 370 33 Updated Nov 16, 2022

facebookresearch / mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python 5,568 938 Updated Apr 24, 2025

0