CoAuthor

Boundary Detection Benchmark

Performance Over Time

📊 Showing 3 results | 📏 Metric: Cohen’s Kappa score

Top Performing Models

Rank | Model | Paper | Cohen’s Kappa score | Date | Code
1 | GigaCheck (Mistral-7B-v0.3) | GigaCheck: Detecting LLM-generated Content | 0.42 | 2024-10-31 | -
2 | DeBERTa-v3 (Naive) | Detecting AI-Generated Sentences in Human-AI Collaborative Hybrid Texts: Challenges, Strategies, and Insights | 0.40 | 2024-03-06 | douglashiwo/aisentencedetection
3 | GigaCheck (DN-DAB-DETR) | GigaCheck: Detecting LLM-generated Content | 0.19 | 2024-10-31 | -
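
For readers unfamiliar with the metric, Cohen's Kappa measures agreement between predicted and reference labels after correcting for the agreement expected by chance: kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e the chance agreement. The sketch below is illustrative only. It assumes one binary label per sentence (0 = human-written, 1 = AI-generated), which may not match how each paper on this leaderboard prepares its labels, and the function name and toy data are hypothetical.

```python
from collections import Counter


def cohen_kappa(y_true, y_pred):
    """Cohen's Kappa for two equal-length label sequences."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("label sequences must be non-empty and equal in length")

    n = len(y_true)

    # Observed agreement: fraction of positions where the labels match.
    p_o = sum(t == p for t, p in zip(y_true, y_pred)) / n

    # Chance agreement: for each class, multiply the two marginal
    # frequencies, then sum over classes.
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    p_e = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)

    if p_e == 1.0:
        # Both sequences contain a single identical class; kappa is undefined.
        return float("nan")

    return (p_o - p_e) / (1.0 - p_e)


# Toy example (assumed labels: 0 = human-written, 1 = AI-generated).
reference = [0, 0, 1, 1, 1, 0, 1, 0]
predicted = [0, 0, 1, 1, 0, 0, 1, 1]
print(cohen_kappa(reference, predicted))  # 0.5
```

In this toy case 6 of 8 labels match (p_o = 0.75) and both sequences are balanced (p_e = 0.5), so kappa is 0.5. A score of 0 means agreement no better than chance, and 1 means perfect agreement.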

All Papers (3)

GigaCheck: Detecting LLM-generated Content

2024
GigaCheck (Mistral-7B-v0.3)