MSM-MAE
|
Masked Modeling Duo: Towards a Universal Audio Pr…
|
96.60
|
2024-04-09
|
|
M2D/0.6
|
Masked Modeling Duo: Towards a Universal Audio Pr…
|
96.50
|
2024-04-09
|
|
M2D/0.7
|
Masked Modeling Duo: Towards a Universal Audio Pr…
|
96.30
|
2024-04-09
|
|
AudioMAE (local)
|
Masked Autoencoders that Listen
|
94.80
|
2022-07-13
|
|
M2D ratio=0.6
|
Masked Modeling Duo: Learning Representations by …
|
94.80
|
2022-10-26
|
|
ATST Base (ours)
|
ATST: Audio Representation Learning with Teacher-…
|
94.30
|
2022-04-26
|
|
AudioMAE (global)
|
Masked Autoencoders that Listen
|
94.10
|
2022-07-13
|
|
AutoSpeech (N=8,C=128)
|
AutoSpeech: Neural Architecture Search for Speake…
|
87.66
|
2020-05-07
|
|
SSAST-FRAME
|
SSAST: Self-Supervised Audio Spectrogram Transfor…
|
80.80
|
2021-10-19
|
|
SSAMBA
|
SSAMBA: Self-Supervised Audio Representation Lear…
|
70.10
|
2024-05-20
|
|
SSAST-PATCH
|
SSAST: Self-Supervised Audio Spectrogram Transfor…
|
64.20
|
2021-10-19
|
|
COLA
|
Contrastive Learning of General-Purpose Audio Rep…
|
37.70
|
2020-10-21
|
|