-
IRIT
- Toulouse, France
-
03:49
(UTC +02:00) - labbeti.github.io
- https://orcid.org/0000-0002-7219-5463
Stars
Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems
Performance and Energy Balance: A Comprehensive Study of State-of-the-Art Sound Event Detection Systems
ML has an impact on the climate. But not all models are born equal. Compute your model's emissions with our calculator and add the results to your paper with our generated latex template
This package aims at simplifying the download of the AudioCaps dataset.
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Language Modeling with the H3 State Space Model
High-fidelity performance metrics for generative models in PyTorch
Python bindings for FFmpeg - with complex filtering support
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
A personal experimental C++ Syntax 2 -> Syntax 1 compiler
Python library for downloading, loading & working with sound datasets
✒️ Cedille is a large French language model (6B), released under an open-source license
Code for CVSSP submission to DCASE 2021 Task 6
A list of papers about audio captioning