-
CSC @ Unipd
- Padua, Italy
-
11:02
(UTC +02:00) - matteospanio.github.io
- https://orcid.org/0000-0002-2436-7208
- in/matteo-spanio
Highlights
- Pro
Starred repositories
Diffused Responsibility: Analyzing the Energy Consumption of Generative Text-to-Audio Diffusion Models
A Python package providing a common interface for running machine learning models for audio classification tasks.
Code for <Mel2Word: A Text-based Melody Representation for Symbolic Music Analysis, Music and Science, 2024>
⚡️ OpenAI PHP for Symfony is a supercharged PHP API client that allows you to interact with OpenAI API
Music remixer based on MusicGen-Chord
InspireMusic: A toolkit designed for music, song, and audio generation
BioMachineLearning / openpom
Forked from ARY2260/openpomReplication of the Principal Odor Map paper by Lee et al (2022). The model is implemented such that it integrates with DeepChem
Replication of the Principal Odor Map paper by Brian K. Lee et al. (2023).
A library for audio and music analysis, feature extraction.
Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders
A unified ensemble framework for PyTorch to improve the performance and robustness of your deep learning model.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Integration of Hotwire's Turbo library with Flask.
A teaching and research repository for exploring generative latent flow matching
UX DataTables is a Symfony bundle integrating the DataTables library in Symfony applications.
Best practices & guides on how to write distributed pytorch training code
Unified automatic quality assessment for speech, music, and sound.
This repo hosts the code and models of "Masked Autoencoders that Listen".
[ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
Simple & elegant figures for research papers, with examples in Julia and Python
High quality training free inpaint for every stable diffusion model. Supports ComfyUI