daizuozhuo (Dai Zuozhuo) / Starred · GitHub

Tech Enthusiast Weekly, published every Friday

67,605 3,432 Updated Jun 6, 2025

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 1,181 68 Updated May 6, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,483 138 Updated May 20, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 10,263 914 Updated Jun 3, 2025

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,030 58 Updated May 29, 2025

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,814 90 Updated Oct 31, 2024

UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing

Python 108 4 Updated Apr 16, 2025

Simple script to parallelize download and extract files for SA-1B Dataset.

Python 36 4 Updated Oct 11, 2024

VideoSys: An easy and efficient system for video generation

Python 1,974 130 Updated Mar 9, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,615 1,037 Updated Nov 18, 2024

Fine-Grained Open Domain Image Animation with Motion Guidance

Python 924 70 Updated Oct 18, 2024

Fine-Grained Open Domain Image Animation with Motion Guidance

10 Updated Dec 8, 2023

Stable diffusion webui based on diffusers.

Python 979 68 Updated Sep 29, 2023

cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 collection of papers, code, paper walkthroughs, and livestreams, compiled by the 极市 team

12,497 2,280 Updated Apr 25, 2024

Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

Python 313 30 Updated Dec 28, 2023

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 3,939 441 Updated May 3, 2025

Finetune ModelScope's Text To Video model using Diffusers 🧨

Python 687 110 Updated Dec 14, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 41,745 6,967 Updated Dec 9, 2024
Python 655 87 Updated Nov 1, 2024

Aligning pretrained language models with instruction data generated by themselves.

Python 4,389 509 Updated Mar 27, 2023

ChatGPT and Bing AI prompt curation

872 83 Updated Mar 6, 2024

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,684 605 Updated Jul 25, 2023

Robust Speech Recognition via Large-Scale Weak Supervision

Python 82,975 10,054 Updated May 13, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 10,913 1,098 Updated May 28, 2025

Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023)

Python 90 9 Updated Jun 12, 2023

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

C# 3,246 303 Updated Mar 29, 2025

Grounded Language-Image Pre-training

Python 2,419 212 Updated Jan 24, 2024

METER: A Multimodal End-to-end TransformER Framework

Python 370 33 Updated Nov 16, 2022

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python 5,568 938 Updated Apr 24, 2025