8000 xiaosean (YONG-XIANG LIN) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View xiaosean's full-sized avatar

Highlights

  • Pro

Block or report xiaosean

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Ke-Omni-R is an advanced audio reasoning model and achieved SOTA on MMAU

Python 30 1 Updated Jun 11, 2025

Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 2,749 183 Updated May 15, 2025

Lets make video diffusion practical!

Python 14,678 1,320 Updated May 4, 2025

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,054 61 Updated Jun 25, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,554 105 Updated Jun 2, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 8,413 648 Updated May 29, 2025

fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing d…

Python 1,695 83 Updated Jan 16, 2025

Paper list and datasets for industrial image anomaly/defect detection (updating). 工业异常/瑕疵检测论文及数据集检索库(持续更新)。

E6F0 2,387 211 Updated Jun 20, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 4,295 277 Updated Jun 4, 2025

The Desktop AgentOS.

Python 7,430 914 Updated Jun 11, 2025

[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Python 883 47 Updated Apr 30, 2025

Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs

Python 807 43 Updated Apr 27, 2025

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 40,585 3,806 Updated Jun 25, 2025

A lightweight, powerful framework for multi-agent workflows

Python 11,956 1,790 Updated Jun 25, 2025

cursor-workshop

TypeScript 25 Updated Feb 19, 2025

Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python 635 94 Updated Jun 24, 2025

[ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation

Python 495 35 Updated Jun 17, 2025

👾 Fast and simple video download library and CLI tool written in Go

Go 29,773 3,159 Updated May 19, 2025

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 17,841 1,933 Updated Apr 4, 2025

🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 1,154 78 Updated May 24, 2025

Autonomous agents for everyone

TypeScript 16,128 5,249 Updated Jun 26, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,027 516 Updated Jun 9, 2025

Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning

Python 303 13 Updated Jul 11, 2024

[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

Python 280 14 Updated Apr 14, 2025

Fine-Grained Open Domain Image Animation with Motion Guidance

Python 928 70 Updated Oct 18, 2024

Official repository of In-Context LoRA for Diffusion Transformers

1,924 92 Updated Dec 20, 2024

This is the official code repository of the AAAI2025 oral paper "VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis"

4 Updated Mar 17, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 25,344 2,282 Updated Jun 25, 2025

Official inference repo for FLUX.1 models

Python 22,563 1,605 Updated Jun 25, 2025
Next
0