Rank | Model | Rating | Won (%, count) | Lost (%, count) |
---|---|---|---|---|
1 | anthropic.claude-3-haiku-20240307 | 1151 | 63.9% 85 | 36.1% 48 |
2 | llama-3.2-90b-text-preview | 1139 | 54.9% 50 | 45.1% 41 |
3 | gpt-4o | 1124 | 65.2% 30 | 34.8% 16 |
4 | gpt-4o-mini | 1117 | 67.1% 108 | 32.9% 53 |
5 | google/palm-2-codechat-bison | 1096 | 100.0% 6 | 0.0% 0 |
6 | ReWiz-Llama-3.2-3B.Q8_0.gguf:latest | 1084 | 87.5% 7 | 12.5% 1 |
7 | gpt-4-turbo | 1066 | 100.0% 4 | 0.0% 0 |
8 | llama3.1:8b-instruct-fp16 | 1055 | 83.3% 5 | 16.7% 1 |
9 | llama3:latest | 1055 | 61.5% 8 | 38.5% 5 |
10 | qwen2.5:14b-instruct-q3_K_S | 1048 | 57.9% 22 | 42.1% 16 |
11 | claude-3-haiku-with-system-prompt | 1045 | 50.0% 3 | 50.0% 3 |
12 | ReWiz-7B.Q4_K_S.gguf:latest | 1040 | 71.4% 5 | 28.6% 2 |
13 | qwen2.5-coder:latest | 1040 | 100.0% 2 | 0.0% 0 |
14 | llama3.1:8b-instruct-q5_K_M | 1035 | 75.0% 3 | 25.0% 1 |
15 | openhermes:7b-mistral-v2-q5_K_M | 1031 | 100.0% 2 | 0.0% 0 |
16 | home-turf | 1030 | 63.6% 7 | 36.4% 4 |
17 | mistral-nemo:12b-instruct-2407-q3_K_M | 1024 | 59.4% 41 | 40.6% 28 |
18 | gpt-4o-mini-2024-07-18 | 1023 | 60.0% 3 | 40.0% 2 |
19 | mistral-nemo:latest | 1022 | 50.0% 15 | 50.0% 15 |
20 | hf.co/bartowski/Mistral-Small-Instruct-2409-GGUF:IQ3_M | 1020 | 60.0% 6 | 40.0% 4 |
21 | mistral-nemo:12b-instruct-2407-q5_K_M | 1020 | 60.0% 9 | 40.0% 6 |
22 | test | 1016 | 100.0% 1 | 0.0% 0 |
23 | anthropic.claude-3-5-sonnet-20240620 | 1016 | 100.0% 1 | 0.0% 0 |
24 | gemma2:9b-instruct-q5_K_M | 1015 | 60.0% 3 | 40.0% 2 |
25 | testout-helper | 1000 | 50.0% 1 | 50.0% 1 |
26 | Boptruth-NeuralMonarch-7B-unsloth.Q6_K.gguf:latest | 999 | 50.0% 1 | 50.0% 1 |
27 | chatgpt-4o-latest | 995 | 47.4% 9 | 52.6% 10 |
28 | llama3.2:3b-instruct-q8_0 | 989 | 44.4% 4 | 55.6% 5 |
29 | small-models | 985 | 33.3% 1 | 66.7% 2 |
30 | meta-llama/llama-3.1-405b-instruct:free | 984 | 0.0% 0 | 100.0% 1 |
31 | anthropic/claude-1.2 | 984 | 0.0% 0 | 100.0% 1 |
32 | nousresearch/nous-hermes-2-mixtral-8x7b-dpo | 984 | 0.0% 0 | 100.0% 1 |
33 | general | 984 | 0.0% 0 | 100.0% 1 |
34 | microsoft/wizardlm-2-8x22b | 984 | 0.0% 0 | 100.0% 1 |
35 | openai/gpt-4o-2024-08-06 | 984 | 0.0% 0 | 100.0% 1 |
36 | google/palm-2-chat-bison | 984 | 0.0% 0 | 100.0% 1 |
37 | test2 | 984 | 0.0% 0 | 100.0% 1 |
38 | ReWiz-Worldbuilder-7B-GGUF.Q5_K_M.gguf:latest | 981 | 40.0% 2 | 60.0% 3 |
39 | WorldBuilder-7B-GGUF.Q5_K_M.gguf:latest | 980 | 42.9% 6 | 57.1% 8 |
40 | llama3.2:latest | 977 | 42.1% 8 | 57.9% 11 |
41 | qwen2.5:14b-instruct-q4_K_S | 976 | 40.0% 8 | 60.0% 12 |
42 | mistral:7b-instruct-q4_K_S | 975 | 33.3% 2 | 66.7% 4 |
43 | gpt-4o-2024-08-06 | 975 | 25.0% 1 | 75.0% 3 |
44 | CleverBoi-Llama-3.2-3B-Instruct.Q8_0.gguf:latest | 974 | 47.4% 9 | 52.6% 10 |
45 | llama3.1:latest | 973 | 40.4% 23 | 59.6% 34 |
46 | gpt-3.5-turbo | 973 | 25.0% 1 | 75.0% 3 |
47 | llama3-70b-8192 | 970 | 57.1% 52 | 42.9% 39 |
48 | CleverBoi-Nemo-12B-v2.Q4_K_S.gguf:latest | 970 | 38.5% 5 | 61.5% 8 |
49 | qwen2.5:7b | 963 | 43.9% 29 | 56.1% 37 |
50 | llava:latest | 961 | 37.5% 3 | 62.5% 5 |
51 | gemma2:latest | 952 | 0.0% 0 | 100.0% 3 |
52 | deepseek-coder-v2:16b-lite-instruct-q4_0 | 950 | 30.0% 3 | 70.0% 7 |
53 | mixtral-8x7b-32768 | 945 | 36.1% 26 | 63.9% 46 |
54 | exclude | 938 | 0.0% 0 | 100.0% 4 |
55 | gpt-4 | 938 | 0.0% 0 | 100.0% 4 |
56 | phi3:latest | 936 | 28.6% 2 | 71.4% 5 |
57 | Nerdish-Llama-3.1-8B.Q4_K_M.gguf:latest | 934 | 25.0% 2 | 75.0% 6 |
58 | llama-3.1-8b-instant | 884 | 33.3% 18 | 66.7% 36 |
59 | llama3.2:1b | 869 | 15.8% 3 | 84.2% 16 |
60 | llama3-8b-8192 | 856 | 24.2% 31 | 75.8% 97 |
- | azure-admin-expert | - | - | - |
- | eva | - | - | - |
- | llama-3.1-70b-versatile | - | - | - |
- | meta-llama/llama-3.1-70b-instruct:free | - | - | - |
- | openai/gpt-4o | - | - | - |
- | stablelm2:12b-chat-q4_K_M | - | - | - |

Models | Result | User | Updated At |
---|---|---|---|
gpt-4o-mini llama3-8b-8192, anthropic.claude-3-haiku-20240307, and 16 more | Won | | 21 hours ago |
llama-3.2-90b-text-preview llama3-8b-8192, anthropic.claude-3-haiku-20240307, and 15 more | Won | | 21 hours ago |
llama-3.1-8b-instant llama3-8b-8192, anthropic.claude-3-haiku-20240307, and 14 more | Lost | | 21 hours ago |
llama-3.1-8b-instant llama3-8b-8192, anthropic.claude-3-haiku-20240307, and 13 more | Won | | 21 hours ago |
gpt-4o-mini llama3-8b-8192, anthropic.claude-3-haiku-20240307, and 12 more | Won | | 21 hours ago |
llama3-8b-8192 llama3-8b-8192, anthropic.claude-3-haiku-20240307, and 11 more | Lost | | 21 hours ago |
llama-3.1-8b-instant llama3-8b-8192, anthropic.claude-3-haiku-20240307, and 10 more | Lost | | 21 hours ago |
gpt-4o-mini llama3-8b-8192, anthropic.claude-3-haiku-20240307, and 9 more | Won | | 21 hours ago |
gpt-4o-mini llama3-8b-8192, anthropic.claude-3-haiku-20240307, and 8 more | Won | | 21 hours ago |
llama3-8b-8192 llama3-8b-8192, anthropic.claude-3-haiku-20240307, and 7 more | Lost | | a day ago |