o1-preview
|
A Case Study of Web App Coding with OpenAI Reasonβ¦
|
0.95
|
2024-09-19
|
|
o1-mini
|
A Case Study of Web App Coding with OpenAI Reasonβ¦
|
0.94
|
2024-09-19
|
|
gpt-4o-2024-08-06
|
Insights from Benchmarking Frontier Language Modeβ¦
|
0.89
|
2024-09-08
|
|
claude-3.5-sonnet
|
Insights from Benchmarking Frontier Language Modeβ¦
|
0.88
|
2024-09-08
|
|
deepseek-v2.5
|
A Case Study of Web App Coding with OpenAI Reasonβ¦
|
0.83
|
2024-09-19
|
|
mistral-large-2
|
Insights from Benchmarking Frontier Language Modeβ¦
|
0.78
|
2024-09-08
|
|
deepseek-coder-v2-instruct
|
Insights from Benchmarking Frontier Language Modeβ¦
|
0.70
|
2024-09-08
|
|
llama-v3p1-405b-instruct
|
Insights from Benchmarking Frontier Language Modeβ¦
|
0.30
|
2024-09-08
|
|