-
CVSSP @ University of Surrey
- Guildford
- xinhaomei.github.io
-
XinhaoMei.github.io Public
Forked from academicpages/academicpages.github.ioGithub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
JavaScript MIT License UpdatedAug 28, 2024 -
-
WavCaps Public
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
-
audioldm_eval Public
Forked from haoheliu/audioldm_evalThis toolbox aims to unify audio generation model evaluation for easier comparison.
Python MIT License UpdatedJan 7, 2024 -
multimodal Public
Forked from facebookresearch/multimodalTorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 12, 2022 -
DCASE2021_task6_v2 Public
Code for CVSSP submission to DCASE 2021 Task 6
-
LAVIS Public
Forked from salesforce/LAVISLAVIS - A One-stop Library for Language-Vision Intelligence
Python BSD 3-Clause "New" or "Revised" License UpdatedOct 28, 2022 -
audio-text_retrieval Public
Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'
-
ACT Public
Source code for the paper 'Audio Captioning Transformer'
-
machine-learning-notes Public
Forked from roboticcam/machine-learning-notesMy continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides) 我不间断更新的机器学习,概率模型和深度学习的讲义(2000+页)和视频链接
Jupyter Notebook UpdatedDec 11, 2021 -
audioset_tagging_cnn Public
Forked from qiuqiangkong/audioset_tagging_cnnPython MIT License UpdatedAug 31, 2021 -
audiocaps Public
Forked from cdjkim/audiocaps🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps
Python MIT License UpdatedNov 13, 2020 -
dcase_2020_T6 Public
Forked from lukewys/dcase_2020_T62nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning-results#wuyusong2020_t6
Python UpdatedSep 28, 2020