8000 GitHub - cai525/Transformer4SED: This repository aims to collect Transformer-based sound event detection (SED) algorithms.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

This repository aims to collect Transformer-based sound event detection (SED) algorithms.

Notifications You must be signed in to change notification settings

cai525/Transformer4SED

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Transformer4SED

Transformer4SED is a repository which aims to collect Transformer-based sound event detection (SED) algorithms.

Features

  • Implemented using pytorch, refactored from the DCASE official pytorch-lighting baseline
  • Kaldi style recipes;
  • [TODO] Support for commonly used datasets in the sound event detection field, including DESED, MAESTRO, audioset-strong, etc.

recipes

MAT-SED (Masked Audio Transformer for Sound Event Detection) is a pure Transformer-based SED model with masked-reconstruction-based pre-training.

Prototype based Masked Audio Model (PMAM) is a self-supervised representation learning algorithm designed for frame-level audio tasks like sound event detection, to better exploit unlabeled data.

About

This repository aims to collect Transformer-based sound event detection (SED) algorithms.

Resources

Stars

Watchers

Forks

Packages

No packages published
0