basahiy

wakeup

0 followers · 1 following

Stars

salu133445 / musegan

An AI for Music Generation

Python 1,944 387 Updated Jun 7, 2024

microsoft / muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,789 474 Updated Oct 12, 2024

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,542 6,541 Updated Jun 10, 2025

TencentGameMate / chinese_speech_pretrain

Chinese speech pretrained models

Shell 1,136 89 Updated Aug 23, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,133 2,355 Updated Mar 13, 2025

ARM-software / ML-KWS-for-MCU

Keyword spotting on Arm Cortex-M Microcontrollers

C 1,183 424 Updated Apr 10, 2019

wenet-e2e / wekws

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Python 569 121 Updated Feb 24, 2025

tensorflow / tflite-micro

Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digital signal processors).

C++ 2,323 898 Updated Jun 17, 2025

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,157 862 Updated Jul 6, 2024

kaituoxu / Conv-TasNet

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Python 712 155 Updated Apr 6, 2023

JusperLee / Conv-TasNet

Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation (PyTorch implementation)

Python 481 77 Updated May 26, 2023

kaijieshi7 / Dynamic-convolution-Pytorch

PyTorch implementation of Dynamic Convolution: Attention over Convolution Kernels (CVPR 2020)

Python 580 90 Updated May 22, 2022

breizhn / DTLN

TensorFlow 2.x implementation of the DTLN real-time speech denoising model, with TFLite, ONNX, and real-time audio processing support.

Python 624 160 Updated Jul 28, 2023

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 9,991 1,514 Updated Jun 10, 2025

SRPOL-AUI / spectrum-correction

Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"

Python 12 3 Updated Feb 22, 2022

PaddlePaddle / PASSL

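PASSL includes image self-supervised learning algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, SimSiam, SwAV, BEiT, and MAE, as well as foundational vision algorithms such as Vision Transformer, DeiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PVTv2.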

Python 283 65 Updated Aug 1, 2023