10000 lienz (Neil Zeghidour) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View lienz's full-sized avatar

Block or report lienz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

735 59 Updated Feb 25, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,202 689 Updated May 7, 2025

Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/2102.02611

Python 121 16 Updated Nov 29, 2022

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 7,499 877 Updated May 8, 2025

A Very Low-Bitrate Codec for Speech Compression

C++ 3,862 360 Updated Aug 20, 2024

TF/Keras code for DiffStride, a pooling layer with learnable strides.

Python 124 7 Updated Feb 7, 2022

A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems

Python 206 36 Updated Feb 19, 2025
Python 214 17 Updated Jan 31, 2022

LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a ve…

Python 509 53 Updated Mar 1, 2022

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python 3,043 605 Updated Jul 19, 2024

Fast PyTorch based DSP for audio and 1D signals

Python 437 25 Updated Feb 17, 2025

Analyze and manipulate EEG data using PyEEGLab.

Python 61 23 Updated Dec 5, 2020

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,742 232 Updated Oct 16, 2024

A list of all public EEG-datasets

2,528 582 Updated Aug 5, 2024

ICU Bed Activity Monitor

JavaScript 33 11 Updated Aug 14, 2023

Pytorch implementation of 2D Discrete Wavelet (DWT) and Dual Tree Complex Wavelet Transforms (DTCWT) and a DTCWT based ScatterNet

Python 1,066 151 Updated Aug 2, 2023

95.47% on CIFAR10 with PyTorch

Python 6,182 2,159 Updated Feb 24, 2023

NIST SPH File reader (e.g. for TEDLIUM Corpus)

Python 25 8 Updated May 2, 2020

HTK features in Python

Jupyter Notebook 74 18 Updated Nov 6, 2018

Wavelet scattering transforms in Python with GPU acceleration

Python 793 140 Updated Jan 28, 2025
TeX 1 Updated Nov 20, 2018

Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.

MATLAB 221 68 Updated Jun 29, 2023

Soundcloud Music Downloader

Python 3,634 356 Updated Mar 14, 2025

ABNet is a "same/different"-based loss trained neural net.

Python 7 6 Updated Mar 27, 2015
0