CAV-MAE (Audio-Visual)
|
Contrastive Audio-Visual Masked Autoencoder
|
0.51
|
2022-10-02
|
|
mn40_as (Ensemble)
|
Efficient Large-scale Audio Tagging via Transform…
|
0.50
|
2022-11-09
|
|
PaSST
|
Efficient Training of Audio Transformers with Pat…
|
0.50
|
2021-10-11
|
|
DyMN-L (Audio-Only, Single)
|
Dynamic Convolutional Neural Networks as Efficien…
|
0.49
|
2023-10-24
|
|
Audio Spectrogram Transformer
|
AST: Audio Spectrogram Transformer
|
0.49
|
2021-04-05
|
|
mn40_as (Single)
|
Efficient Large-scale Audio Tagging via Transform…
|
0.48
|
2022-11-09
|
|
PSLA
|
PSLA: Improving Audio Tagging with Pretraining, S…
|
0.47
|
2021-02-02
|
|
ST-SED
|
Zero-shot Audio Source Separation through Query-b…
|
0.47
|
2021-12-15
|
|
CAV-MAE (Audio-Only)
|
Contrastive Audio-Visual Masked Autoencoder
|
0.47
|
2022-10-02
|
|