Stars
A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions (Interspeech 2025)
A fast and lightweight framework for creating decentralized agents with ease.
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
✨✨Latest Advances on Multimodal Large Language Models
Voice Activity Detector (VAD) from TEN: low-latency, high-performance, and lightweight
A lightweight Python package for managing multi-agent orchestration. Easily define agents with custom instructions, tools, containers, and models, and orchestrate their interactions seamlessly. Per…
Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra-lightweight OCR system, supports recognition of 80+ languages, provides data annotation and synthesis tools,…
Like Manus, Computer Use Agent (CUA), and Omniparser, this is a computer-using agent: an AI-driven local automation assistant that uses natural language to make computers work by themselves
Utilizes ONNX Runtime for audio denoising.
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
A WebUI app for Music-Source-Separation-Training, with UVR bundled in!
Adaptive acoustic feedback cancellation, howling suppression, AI noise reduction, low latency
Python library for extracting chords from multiple sound file formats
A simple screen parsing tool towards pure vision based GUI agent
🚀 Efficient implementations of state-of-the-art linear attention models
Implementation of the proposed minGRU in PyTorch
Official inference framework for 1-bit LLMs
This is the official implementation of LiSenNet
Applies score-based diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec…
Port of FunASR's SenseVoice model in C/C++
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
This code accompanies the Bilibili video at https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7.
This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamics"