DeminYu98 (Demin Yu) / Starred · GitHub
🎯
Focusing
  • Harbin Institute of Technology
  • Shenzhen, China

Highlights

  • Pro

Starred repositories

[CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution'

Python 266 2 Updated Jun 8, 2025

[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization

Python 225 8 Updated Apr 7, 2025

Code release for "Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models" (ICLR 2025)

13 Updated Mar 1, 2025

Enjoy the magic of Diffusion models!

Python 8,881 809 Updated Jun 25, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 955 28 Updated Jun 12, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,789 210 Updated Jun 24, 2025

[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"

Python 521 63 Updated Apr 5, 2025

Python 245 11 Updated Mar 10, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 26,754 2,604 Updated Apr 30, 2025

Python 42 3 Updated Mar 11, 2025

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

445 10 Updated Jan 17, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,276 515 Updated May 18, 2025

PDF scientific paper translation with preserved formats: AI-based full-text bilingual translation of PDF documents that fully preserves the layout, supports services such as Google/DeepL/Ollama/OpenAI, and provides CLI/GUI/MCP/Docker/Zotero

Python 24,921 2,149 Updated Jun 25, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,672 204 Updated Jun 20, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,459 1,512 Updated Jun 13, 2025

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 898 34 Updated Feb 19, 2025

Python 62 5 Updated Mar 17, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,962 189 Updated May 19, 2025

A differentiable PDE solving framework for machine learning

Python 1,627 209 Updated Apr 15, 2025

Python 7 1 Updated Feb 4, 2025

PyTorch implementation of a collection of scalable Video Transformer Benchmarks.

Python 298 38 Updated May 4, 2022

LangGPT: Empowering everyone to become a prompt expert!🚀 Structured Prompt, Language of GPT, Created by 「云中江树」

Jupyter Notebook 10,065 807 Updated Jun 7, 2025

The official codebase of ECCV2024 paper: PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines.

Python 33 4 Updated Sep 28, 2024

Python 14 1 Updated Oct 17, 2024

ML Datasets Catalog

Python 58 11 Updated Jul 16, 2023

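Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; supports project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; supports querying multiple LLMs in parallel; supports local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…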

Python 68,829 8,355 Updated Jun 24, 2025

An improved generative adversarial network based on an evolutionary network.

Jupyter Notebook 9 1 Updated Sep 28, 2023

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 34,552 4,939 Updated Jun 24, 2025

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,979 297 Updated Dec 21, 2024