ML Research Wiki / Benchmarks / Open Vocabulary Panoptic Segmentation / ADE20K

ADE20K

Open Vocabulary Panoptic Segmentation Benchmark

Performance Over Time

📊 Showing 10 results | 📏 Metric: PQ

Top Performing Models

Rank	Model	Paper	PQ	Date	Code
1	UMG-CLIP-E/14	UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding	31.60	2024-01-12	📦 lygsbw/umg-clip
2	PosSAM	PosSAM: Panoptic Open-vocabulary Segment Anything	29.20	2024-03-14	📦 Vibashan/PosSAM
3	UMG-CLIP-L/14	UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding	29.10	2024-01-12	📦 lygsbw/umg-clip
4	MAFT+	Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation	27.10	2024-08-01	📦 jiaosiyu1999/MAFT-Plus
5	FC-CLIP	Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP	26.80	2023-08-04	📦 bytedance/fc-clip
6	CLIPSelf	CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction	23.70	2023-10-02	📦 wusize/clipself
7	ODISE(Caption)	Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models	23.40	2023-03-08	📦 nvlabs/odise
8	ODISE (Label)	Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models	22.60	2023-03-08	📦 nvlabs/odise
9	FreeSeg	FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation	16.30	2023-03-30	-
10	MaskCLIP	Extract Free Dense Labels from CLIP	15.10	2021-12-02	📦 chongzhou96/maskclip

All Papers (10)

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

2024

UMG-CLIP-E/14

lygsbw/umg-clip

PosSAM: Panoptic Open-vocabulary Segment Anything

2024

PosSAM

Vibashan/PosSAM

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

2024

UMG-CLIP-L/14

lygsbw/umg-clip

Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation

2024

MAFT+

jiaosiyu1999/MAFT-Plus

Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

2023

FC-CLIP

bytedance/fc-clip

CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

2023

CLIPSelf

wusize/clipself

Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models

2023

ODISE(Caption)

nvlabs/odise

Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models

2023

ODISE (Label)

nvlabs/odise

FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation

2023

FreeSeg

Extract Free Dense Labels from CLIP

2021

MaskCLIP

chongzhou96/maskclip

ADE20K

Performance Over Time

Edit Benchmark Results

Edit Result

Top Performing Models

All Papers (10)

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

PosSAM: Panoptic Open-vocabulary Segment Anything

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation

Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models

Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models

FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation

Extract Free Dense Labels from CLIP

Model	Paper	PQ	Date
UMG-CLIP-E/14	UMG-CLIP: A Unified Multi-Granularity Vision Gene…	31.60	2024-01-12
PosSAM	PosSAM: Panoptic Open-vocabulary Segment Anything	29.20	2024-03-14
UMG-CLIP-L/14	UMG-CLIP: A Unified Multi-Granularity Vision Gene…	29.10	2024-01-12
MAFT+	Collaborative Vision-Text Representation Optimizi…	27.10	2024-08-01
FC-CLIP	Convolutions Die Hard: Open-Vocabulary Segmentati…	26.80	2023-08-04
CLIPSelf	CLIPSelf: Vision Transformer Distills Itself for …	23.70	2023-10-02
ODISE(Caption)	Open-Vocabulary Panoptic Segmentation with Text-t…	23.40	2023-03-08
ODISE (Label)	Open-Vocabulary Panoptic Segmentation with Text-t…	22.60	2023-03-08
FreeSeg	FreeSeg: Unified, Universal and Open-Vocabulary I…	16.30	2023-03-30
MaskCLIP	Extract Free Dense Labels from CLIP	15.10	2021-12-02