| PaLI-X-VPD | Visual Program Distillation: Distilling Tools and… | 0.89 | 2023-12-05 |
| Flamingo (fine-tuned) | Flamingo: a Visual Language Model for Few-Shot Le… | 0.87 | 2022-04-29 |
| Hate-CLIPper - Align | Hate-CLIPper: Multimodal Hateful Meme Classificat… | 0.86 | 2022-10-12 |
| ISSUES | Mapping Memes to Words for Multimodal Hateful Mem… | 0.86 | 2023-10-12 |
| Human | The Hateful Memes Challenge: Detecting Hate Speec… | 0.85 | 2020-05-10 |
| RA-HMD (Qwen2-VL-7B) | Robust Adaptation of Large Multimodal Models for … | 0.82 | 2025-02-18 |
| RA-HMD (LLaVA-1.5-7B) | Robust Adaptation of Large Multimodal Models for … | 0.81 | 2025-02-18 |
| RA-HMD (Qwen2-VL-2B) | Robust Adaptation of Large Multimodal Models for … | 0.79 | 2025-02-18 |
| RGCL (CLIP) | Improving Hateful Meme Detection through Retrieva… | 0.79 | 2023-11-14 |
| HateDetectron27 | Detecting Hate Speech in Memes Using Multimodal D… | 0.77 | 2020-12-23 |
| SEER (RegNet10B) | Vision Models Are More Robust And Fair When Pretr… | 0.73 | 2022-02-16 |
| Ron Zhu | Enhance Multimodal Transformer With External Labe… | 0.73 | 2020-12-15 |
| Pro-Cap | Pro-Cap: Leveraging a Frozen Vision-Language Mode… | 0.72 | 2023-08-16 |
| Flamingo (few-shot:32) | Flamingo: a Visual Language Model for Few-Shot Le… | 0.70 | 2022-04-29 |
| Vilio | Vilio: State-of-the-art Visio-Linguistic Models a… | 0.70 | 2020-12-14 |
| Visual BERT COCO | The Hateful Memes Challenge: Detecting Hate Speec… | 0.70 | 2020-05-10 |
| CLIP (zero-shot) | Learning Transferable Visual Models From Natural … | 0.66 | 2021-02-26 |