
RAG

by SWE-agent

Best Score: 7.0%
Best Leaderboard: verified
Models Used: 6
Open Source: Yes
Entry                        Score
RAG + GPT 4o (2024-08-06)    6.0%
RAG + Claude 3.5 Sonnet      5.0%
HyperAgent                   25.3%
RAG + Claude 3 Opus          3.8%
RAG + GPT 4 (1106)           1.3%
RAG + Claude 3 Opus          7.0%
RAG + GPT 4 (1106)           2.8%
RAG + Claude 3 Opus          4.3%
RAG + GPT 4 (1106)           2.7%
RAG + Claude 2               2.0%
RAG + SWE-Llama 13B          0.7%
RAG + SWE-Llama 7B           0.7%
RAG + ChatGPT 3.5            0.2%
RAG + Claude 2               4.4%
RAG + SWE-Llama 7B           1.4%
RAG + SWE-Llama 13B          1.2%
RAG + ChatGPT 3.5            0.4%
RAG + Claude 2               3.0%
RAG + SWE-Llama 7B           1.3%
RAG + SWE-Llama 13B          1.0%
RAG + ChatGPT 3.5            0.3%