8000 carlxwz / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View carlxwz's full-sized avatar

Block or report carlxwz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official repository for "Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment"

Python 633 63 Updated Jun 2, 2025

🔥 [CVPR 2020] STEFANN: Scene Text Editor using Font Adaptive Neural Network (official code).

Python 272 42 Updated Apr 30, 2024

教育各种资料,从幼儿园到小学、中学,涵盖学而思,万维、猿辅导等多个机构,持续增加中

JavaScript 2,125 402 Updated Jun 26, 2025

DreamO: A Unified Framework for Image Customization

Python 1,572 118 Updated Jun 26, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 2,565 258 Updated Jun 4, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 17,156 1,398 Updated May 28, 2025

RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.

Python 2,280 249 Updated Jun 26, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,195 726 Updated May 27, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 12,442 1,787 Updated Jun 24, 2025

一款提示词优化器,助力于编写高质量的提示词

TypeScript 7,840 990 Updated Jun 24, 2025

Spark-TTS Inference Code

Python 9,885 1,048 Updated Apr 9, 2025

Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"

Python 1,570 124 Updated May 27, 2025

Thera: Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields

Python 807 55 Updated Apr 30, 2025

Deezer source separation library including pretrained models.

Python 27,035 2,963 Updated Apr 2, 2025

A list of AI autonomous agents

18,955 1,455 Updated Feb 26, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 2,218 225 Updated Mar 10, 2025

Enable AI models for video production in the browser

TypeScript 1,872 223 Updated Jun 12, 2025

坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.

38,668 4,032 Updated Mar 20, 2025

Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)

Python 1,168 73 Updated Apr 1, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 46,503 4,448 Updated Jun 25, 2025

Official implementation of OneDiffusion paper (CVPR 2025)

Python 640 19 Updated Dec 14, 2024

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,859 449 Updated Mar 18, 2025

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 19,485 2,636 Updated Jun 18, 2025

StoryMaker: Towards consistent characters in text-to-image generation

Python 703 58 Updated Dec 2, 2024

[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high …

Python 1,259 111 Updated Jun 2, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 11,603 1,123 Updated Jun 17, 2025

Fast stable diffusion on CPU

Python 1,721 154 Updated Jun 12, 2025

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Python 4,947 477 Updated May 7, 2025

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…

Python 2,749 445 Updated Feb 24, 2025
Next
0