SEMLLYCAT / Starred · GitHub

A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions (Interspeech 2025)

39 2 Updated May 27, 2025

A fast and lightweight framework for creating decentralized agents with ease.

Python 1,454 320 Updated Jul 2, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,289 50 Updated Jun 14, 2025

✨✨Latest Advances on Multimodal Large Language Models

15,734 1,023 Updated Jul 1, 2025

Voice Activity Detector (VAD) from TEN: low-latency, high-performance, and lightweight

C 866 77 Updated Jul 3, 2025

A lightweight Python package for managing multi-agent orchestration. Easily define agents with custom instructions, tools, containers, and models, and orchestrate their interactions seamlessly. Per…

Python 46 8 Updated Jun 24, 2025

GTCRN(ncnn).

Python 10 1 Updated May 22, 2025

Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools,…

Python 51,251 8,391 Updated Jul 5, 2025

Like Manus, Computer Use Agent (CUA), and Omniparser, we are computer-using agents. An AI-driven local automation assistant that uses natural language to make computers work by themselves.

Python 3,551 447 Updated May 14, 2025

Utilizes ONNX Runtime for audio denoising.

Python 57 8 Updated Jul 5, 2025

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Python 649 49 Updated Jul 5, 2025

A WebUI app for Music-Source-Separation-Training, with UVR packaged in!

Python 620 41 Updated Jun 28, 2025

adaptive acoustic feedback cancellation, howling suppression, AI noise reduction, low latency

C++ 2 Updated Apr 7, 2025

Python 8 3 Updated Feb 13, 2025

Python library for extracting chords from multiple sound file formats

Python 181 30 Updated Jun 15, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 22,574 1,897 Updated Mar 26, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 2,856 207 Updated Jul 5, 2025

Code base for rectified flow

Python 172 11 Updated May 20, 2025

AEC3 Extracted From WebRTC

C++ 178 84 Updated Feb 24, 2022

Implementation of the proposed minGRU in PyTorch

Python 301 22 Updated Mar 13, 2025

Official inference framework for 1-bit LLMs

Python 20,434 1,529 Updated Jun 3, 2025

This is the official implementation of LiSenNet

Python 97 10 Updated Nov 15, 2024

Official code for Dense-TSNet

12 Updated Sep 17, 2024

Target Speaker Extraction Toolkit

Python 179 20 Updated Jul 4, 2025

Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec…

Python 63 5 Updated Jul 29, 2024

Port of FunASR's SenseVoice model in C/C++

C 393 41 Updated Jun 24, 2025

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

683 42 Updated Aug 3, 2024

This code accompanies the Bilibili video https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7.

Python 69 3 Updated Oct 7, 2023

This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamics"

Python 48 4 Updated Oct 4, 2024