zhanzhanmiao / Starred · GitHub

🍒 Cherry Studio is a desktop client that supports multiple LLM providers.

TypeScript 29,796 2,610 Updated Jul 10, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,673 539 Updated Feb 26, 2025

Digital-Employee for all

Python 1 Updated Apr 30, 2025

AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents

Python 16,932 2,341 Updated Jul 9, 2025

[ICCV 2023] TransFace: Calibrating Transformer Training for Face Recognition from a Data-Centric Perspective

Python 92 11 Updated May 19, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,798 1,440 Updated Jun 30, 2025

Espressif IoT Library. IoT Device Drivers, Documentations and Solutions.

C 2,280 870 Updated Jun 26, 2025

OpenShot Video Editor is an award-winning free and open-source video editor for Linux, Mac, and Windows, and is dedicated to delivering high quality video editing and animation solutions to the world.

Python 4,893 596 Updated Jul 10, 2025

Dockerfile containing FFmpeg, OpenCV4 and Python2/3, based on Ubuntu LTS

Dockerfile 77 30 Updated Mar 26, 2025

YOLOv8 TensorRT C++ Implementation

C++ 661 83 Updated Feb 9, 2025

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,922 130 Updated Oct 30, 2024

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,519 1,109 Updated Sep 14, 2024

Real time interactive streaming digital human

Python 5,931 916 Updated Jul 5, 2025

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.

Python 12,164 2,584 Updated Jun 22, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 54,005 6,606 Updated Jul 8, 2025

Real time streaming talking head

Python 480 64 Updated May 17, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 48,606 5,347 Updated Jul 10, 2025

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,065 1,927 Updated Jun 26, 2025

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,549 1,365 Updated Dec 6, 2023

C++ 389 82 Updated Jul 3, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 10,726 1,047 Updated Nov 18, 2024

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 35,537 5,952 Updated Mar 25, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 8,425 851 Updated Aug 12, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 84,694 10,343 Updated Jun 26, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,830 2,220 Updated Jul 9, 2025

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,927 3,439 Updated May 18, 2024

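An innovative cross-platform tool for slacking off at work, with modes for novels, stocks, web pages, videos, live streams, PDFs, games, and more. Built as a workplace essential for office workers, it makes the workday feel much lighter and keeps you out of the ICU.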

JavaScript 5,870 552 Updated Feb 27, 2024

A Robust, Real-time, RGB-colored, LiDAR-Inertial-Visual tightly-coupled state Estimation and mapping package

C++ 2,217 456 Updated May 28, 2024

Official repository of NeuMan: Neural Human Radiance Field from a Single Video (ECCV 2022)

Python 1,284 148 Updated May 23, 2023