Stars
AI Audio Datasets (AI-ADS) 🎵: speech, music, and sound-effect datasets that can provide training data for generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/2102.02611
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
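Speech activity detection, the first of those building blocks, can be illustrated with a toy energy-threshold detector. This is a hedged NumPy sketch of the idea, not pyannote's API; all names and parameters here are illustrative:

```python
import numpy as np

def energy_vad(signal, frame_len=400, hop=160, threshold_db=-30.0):
    """Label each frame as speech (True) or non-speech (False) by
    comparing its log energy to a threshold relative to the loudest
    frame. A toy stand-in for a learned speech activity detector."""
    frames = [signal[i:i + frame_len]
              for i in range(0, len(signal) - frame_len + 1, hop)]
    energies = np.array([np.mean(f ** 2) + 1e-12 for f in frames])
    log_e = 10.0 * np.log10(energies)
    return log_e > (log_e.max() + threshold_db)

# toy signal: 1 s silence, 1 s of a 440 Hz tone, 1 s silence
sr = 16000
t = np.arange(sr) / sr
sig = np.concatenate([np.zeros(sr), np.sin(2 * np.pi * 440 * t), np.zeros(sr)])
mask = energy_vad(sig)  # True only on frames inside the tone
```

Real systems replace the energy heuristic with a trained neural frame classifier, but the framing/thresholding scaffolding is the same.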
TF/Keras code for DiffStride, a pooling layer with learnable strides.
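The trick that makes strides learnable is pooling in the Fourier domain, where the crop size can vary smoothly. A minimal fixed-stride version of that spectral pooling, in plain NumPy (illustrative sketch, not the DiffStride code, which additionally makes `stride` a trainable parameter):

```python
import numpy as np

def spectral_pool_1d(x, stride=2.0):
    """Downsample a signal by cropping its high frequencies in the
    DFT domain. DiffStride's insight is that the crop size (i.e. the
    stride) can be a continuous, learnable parameter; here it is
    fixed for illustration."""
    X = np.fft.rfft(x)
    keep = int(np.ceil(len(X) / stride))   # low-frequency bins to keep
    out_len = int(round(len(x) / stride))  # downsampled length
    # inverse transform at the reduced length; divide by the stride
    # to preserve amplitude under numpy's 1/n normalization
    return np.fft.irfft(X[:keep], n=out_len) / stride

# a pure low-frequency cosine survives 2x spectral pooling exactly
x = np.cos(2 * np.pi * 4 * np.arange(64) / 64)
y = spectral_pool_1d(x, stride=2.0)
```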
A toolkit for reproducible evaluation, diagnosis, and error analysis of speaker diarization systems
LEAF is a learnable alternative to audio features such as mel-filterbanks: it can be initialized as an approximation of mel-filterbanks and then trained for the task at hand, while using a ve…
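The fixed baseline that such learnable frontends approximate at initialization can be built in a few lines. A sketch of a standard triangular mel filterbank over FFT bins (function names are mine, not LEAF's):

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters=40, n_fft=512, sr=16000):
    """Triangular mel filters over FFT bins: the classic fixed
    filterbank that a learnable frontend is initialized to match
    before being trained end to end."""
    mels = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mels) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):           # rising edge
            fb[i - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):          # falling edge
            fb[i - 1, k] = (right - k) / max(right - center, 1)
    return fb

fb = mel_filterbank()  # (40 filters) x (257 frequency bins)
```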
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Fast PyTorch based DSP for audio and 1D signals
Analyze and manipulate EEG data using PyEEGLab.
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
PyTorch implementation of the 2D Discrete Wavelet Transform (DWT) and Dual-Tree Complex Wavelet Transform (DTCWT), plus a DTCWT-based ScatterNet
Wavelet scattering transforms in Python with GPU acceleration
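The simplest instance of the transforms these two libraries build on is a single-level Haar DWT: split a signal into coarse and detail coefficients, then invert exactly. A self-contained sketch (real libraries cascade longer filters over many levels):

```python
import numpy as np

def haar_dwt(x):
    """One level of the Haar discrete wavelet transform: averages of
    adjacent sample pairs give the approximation, differences give
    the detail, each scaled by 1/sqrt(2) to keep the transform
    orthonormal (energy-preserving)."""
    x = np.asarray(x, dtype=float)
    even, odd = x[0::2], x[1::2]
    approx = (even + odd) / np.sqrt(2.0)
    detail = (even - odd) / np.sqrt(2.0)
    return approx, detail

def haar_idwt(approx, detail):
    """Inverse of one Haar level: reconstructs the input exactly."""
    even = (approx + detail) / np.sqrt(2.0)
    odd = (approx - detail) / np.sqrt(2.0)
    out = np.empty(2 * len(approx))
    out[0::2], out[1::2] = even, odd
    return out

x = np.arange(8.0)
a, d = haar_dwt(x)
```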
Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.
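The gammatone impulse response itself is compact: a gamma-shaped envelope modulating a tone at the center frequency, with bandwidth set by the ERB scale. A hedged NumPy sketch, independent of that repository's API (constants follow the common Glasberg & Moore ERB formula):

```python
import numpy as np

def gammatone_ir(fc, sr=16000, duration=0.05, order=4):
    """Impulse response of a gammatone filter at center frequency fc:
    t^(n-1) * exp(-2*pi*b*t) * cos(2*pi*fc*t), normalized to unit
    peak. Banks of these at ERB-spaced frequencies model cochlear
    filtering."""
    t = np.arange(int(duration * sr)) / sr
    erb = 24.7 + fc / 9.265   # equivalent rectangular bandwidth (Hz)
    b = 1.019 * erb           # gammatone bandwidth factor
    g = t ** (order - 1) * np.exp(-2 * np.pi * b * t) * np.cos(2 * np.pi * fc * t)
    return g / np.max(np.abs(g))

ir = gammatone_ir(1000.0)  # 50 ms response of a 1 kHz channel
```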
ABNet is a neural network trained with a "same/different"-based loss.
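One common form of such a loss pulls the cosine similarity of "same" pairs toward 1 and pushes "different" pairs toward orthogonality. A minimal sketch of that idea (the exact loss ABNet uses may differ; names here are illustrative):

```python
import numpy as np

def same_diff_loss(e1, e2, same):
    """Pairwise 'same/different' loss on two embedding vectors:
    for a 'same' pair the loss is 0 when the embeddings are aligned
    (cosine = 1); for a 'different' pair the loss is 0 when they are
    orthogonal (cosine = 0). Squaring the cosine for different pairs
    penalizes both positive and negative alignment."""
    cos = np.dot(e1, e2) / (np.linalg.norm(e1) * np.linalg.norm(e2))
    if same:
        return (1.0 - cos) / 2.0
    return cos ** 2
```

In training, a shared encoder maps both inputs to embeddings and this loss is averaged over a batch of labeled same/different pairs.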