basahiy / Starred · GitHub

An AI for Music Generation

Python 1,944 387 Updated Jun 7, 2024

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,789 474 Updated Oct 12, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,542 6,541 Updated Jun 10, 2025

chinese speech pretrained models

Shell 1,136 89 Updated Aug 23, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,133 2,355 Updated Mar 13, 2025

Keyword spotting on Arm Cortex-M Microcontrollers

C 1,183 424 Updated Apr 10, 2019

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Python 569 121 Updated Feb 24, 2025

Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digital signal processors).

C++ 2,323 898 Updated Jun 17, 2025

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,157 862 Updated Jul 6, 2024

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Python 712 155 Updated Apr 6, 2023

A PyTorch implementation of "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

Python 481 77 Updated May 26, 2023

Pytorch!!!Pytorch!!!Pytorch!!! Dynamic Convolution: Attention over Convolution Kernels (CVPR-2020)

Python 580 90 Updated May 22, 2022

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

Python 624 160 Updated Jul 28, 2023

A PyTorch-based Speech Toolkit

Python 9,991 1,514 Updated Jun 10, 2025

Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"

Python 12 3 Updated Feb 22, 2022

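PASSL includes image self-supervised learning algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, simsiam, SwAV, BEiT, and MAE, as well as foundational vision algorithms such as Vision Transformer, DEiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PVTv2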

Python 283 65 Updated Aug 1, 2023

Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"

Python 203 45 Updated Apr 22, 2024

Few-Shot Keyword Spotting

Jupyter Notebook 64 17 Updated Apr 11, 2021

This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).

260 40 Updated May 23, 2022

Densely Connected Convolutional Networks, In CVPR 2017 (Best Paper Award).

Lua 4,812 1,070 Updated Jan 9, 2024

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 2,043 576 Updated Oct 27, 2023

The Implementation of FastSpeech based on pytorch.

Python 872 214 Updated Jul 6, 2023

An Open Source Tool for Speaker Recognition

Python 618 130 Updated Aug 5, 2024

Caffe: a fast open framework for deep learning.

C++ 34,422 18,619 Updated Jul 31, 2024

Faster and elegant TensorFlow Implementation of paper: Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

Python 208 70 Updated May 15, 2022

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,563 1,138 Updated Jun 11, 2025

An implementation of deep-voice-conversion using pytorch

Python 19 3 Updated Dec 10, 2021

Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.

Python 28 4 Updated Mar 3, 2022

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)

Python 143 42 Updated Jul 6, 2023