model | architecture | parameters | context length |
embedding length |
quantization | temperature | capabilities | system prompt | (raw) | (index) |
---|---|---|---|---|---|---|---|---|---|---|
qwen2.5-coder:7b | qwen2 | 7.6B | 32768 | 3584 | Q4_K_M | completion tools insert |
You are Qwen, created by Alibaba Cloud. You are a helpful assistant. | raw | index | |
codellama:7b | llama | 6.7B | 16384 | 4096 | Q4_0 | completion |
raw | index | ||
starcoder:7b | starcoder | 7.5B | 8192 | 4096 | Q4_0 | completion |
raw | index | ||
stable-code:3b | stablelm | 2.8B | 16384 | 2560 | Q4_0 | completion |
raw | index | ||
deepcoder:1.5b | qwen2 | 1.8B | 131072 | 1536 | Q4_K_M | 0.6 | completion |
raw | index | |
dolphin3:8b | llama | 8.0B | 131072 | 4096 | Q4_K_M | completion |
You are Dolphin, a helpful AI assistant. | raw | index | |
gemma:2b | gemma | 2.5B | 8192 | 2048 | Q4_0 | completion |
raw | index | ||
granite3.3:2b | granite | 2.5B | 131072 | 2048 | Q4_K_M | completion tools |
raw | index | ||
mistral:7b | llama | 7.2B | 32768 | 4096 | Q4_0 | completion tools |
raw | index | ||
qwen3:1.7b | qwen3 | 2.0B | 40960 | 2048 | Q4_K_M | 0.6 | completion tools thinking |
raw | index |