wuliwuliy (Gambel) / Starred · GitHub

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Python 8 Updated Apr 23, 2025
Python 404 28 Updated Mar 10, 2025

[ArXiv 2025] WorldMem: Long-term Consistent World Simulation with Memory

Python 175 3 Updated May 27, 2025

Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"

Python 227 8 Updated Apr 23, 2025

Domain Generalization through Distilling CLIP with Language Guidance

Python 30 2 Updated Oct 18, 2023

Knowledge Distillation using Contrastive Language-Image Pretraining (CLIP) without a teacher model.

Python 18 1 Updated Sep 6, 2024

(TMM 2025) Official repository of paper "A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection"

Python 18 1 Updated Mar 14, 2025

[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

Python 68 3 Updated Jan 22, 2025

The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)

Python 76 2 Updated Apr 23, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 15,544 2,094 Updated Jul 6, 2025

Simple large-scale training of stable diffusion with multi-node support.

Python 133 9 Updated May 8, 2023

4M: Massively Multimodal Masked Modeling

Python 1,740 108 Updated Jun 2, 2025

GenEval: An object-focused framework for evaluating text-to-image alignment

HTML 315 21 Updated Mar 3, 2025

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,319 310 Updated Jul 3, 2025

Rare-to-Frequent (R2F), ICLR'25, Spotlight

Python 47 Updated Apr 23, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,790 84 Updated Aug 15, 2024

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,528 356 Updated May 13, 2025

[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,354 72 Updated Jun 24, 2025

[ICLR 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,562 67 Updated Jul 5, 2025

Kolors Team

Python 4,485 331 Updated Nov 13, 2024

Multimodal Models in Real World

Jupyter Notebook 519 21 Updated Feb 24, 2025

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral

Python 396 25 Updated Jun 2, 2025

[ICCV 2025] LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs

Python 14 Updated Jul 3, 2025

[NeurIPS 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation

Python 269 11 Updated Apr 10, 2025
Python 35 Updated Feb 6, 2025

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,377 490 Updated Mar 22, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,123 191 Updated Oct 31, 2024
Python 3,990 375 Updated Jun 13, 2025

Based on the original iikira/BaiduPCS-Go, with added support for saving share links and rapid-upload links to your own storage

Go 3,684 512 Updated Apr 7, 2025

A One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 2,712 327 Updated Jul 5, 2025