| Model | Architecture | Parameters | Context length |
Embedding length |
Quantization | Temperature | Capabilities | System prompt | (raw) | (index) |
|---|---|---|---|---|---|---|---|---|---|---|
| codellama:7b | llama | 6.7B | 16384 | 4096 | Q4_0 | completion |
raw | index | ||
| cogito:3b | llama | 3.6B | 131072 | 3072 | Q4_K_M | completion tools |
raw | index | ||
| cogito:8b | llama | 8.0B | 131072 | 4096 | Q4_K_M | completion tools |
raw | index | ||
| deepcoder:1.5b | qwen2 | 1.8B | 131072 | 1536 | Q4_K_M | 0.6 | completion |
raw | index | |
| deepseek-r1:1.5b | qwen2 | 1.8B | 131072 | 1536 | Q4_K_M | completion thinking |
raw | index | ||
| deepseek-r1:14b | qwen2 | 14.8B | 131072 | 5120 | Q4_K_M | completion thinking |
raw | index | ||
| deepseek-r1:8b | qwen3 | 8.2B | 131072 | 4096 | Q4_K_M | 0.6 | completion thinking |
raw | index | |
| dolphin-mistral:7b | llama | 7.2B | 32768 | 4096 | Q4_0 | completion |
You are Dolphin, a helpful AI assistant. | raw | index | |
| dolphin3:8b | llama | 8.0B | 131072 | 4096 | Q4_K_M | completion |
You are Dolphin, a helpful AI assistant. | raw | index | |
| gemma3:1b | gemma3 | 999.89M | 32768 | 1152 | Q4_K_M | 1 | completion |
raw | index | |
| gemma3:4b | gemma3 | 4.3B | 131072 | 2560 | Q4_K_M | 1 | completion vision |
raw | index | |
| gemma3n:e2b | gemma3n | 4.5B | 32768 | 2048 | Q4_K_M | completion |
raw | index | ||
| gemma3n:e4b | gemma3n | 6.9B | 32768 | 2048 | Q4_K_M | completion |
raw | index | ||
| gemma:2b | gemma | 2.5B | 8192 | 2048 | Q4_0 | completion |
raw | index | ||
| granite3.3:2b | granite | 2.5B | 131072 | 2048 | Q4_K_M | completion tools |
raw | index | ||
| granite3.3:8b | granite | 8.2B | 131072 | 4096 | Q4_K_M | completion tools |
raw | index | ||
| hermes3:8b | llama | 8.0B | 131072 | 4096 | Q4_0 | completion tools |
raw | index | ||
| llama3.1:8b-instruct-q4_1 | llama | 8.0B | 131072 | 4096 | Q4_1 | completion tools |
raw | index | ||
| llama3.2:1b | llama | 1.2B | 131072 | 2048 | Q8_0 | completion tools |
raw | index | ||
| llama3.2:3b | llama | 3.2B | 131072 | 3072 | Q4_K_M | completion tools |
raw | index | ||
| llava-llama3:8b | llama | 8.0B | 8192 | 4096 | Q4_K_M | completion vision |
raw | index | ||
| llava-phi3:3.8b | llama | 3.8B | 4096 | 3072 | Q4_K_M | completion vision |
raw | index | ||
| llava:7b | llama | 7.2B | 32768 | 4096 | Q4_0 | completion vision |
raw | index | ||
| minicpm-v:8b | qwen2 | 7.6B | 32768 | 3584 | Q4_0 | completion vision |
raw | index | ||
| mistral:7b | llama | 7.2B | 32768 | 4096 | Q4_0 | completion tools |
raw | index | ||
| mistral:7b-instruct | llama | 7.2B | 32768 | 4096 | Q4_0 | completion tools |
raw | index | ||
| qwen2.5-coder:7b | qwen2 | 7.6B | 32768 | 3584 | Q4_K_M | completion tools insert |
You are Qwen, created by Alibaba Cloud. You are a helpful assistant. | raw | index | |
| qwen2.5vl:3b | qwen25vl | 3.8B | 128000 | 2048 | Q4_K_M | 0.0001 | completion vision |
You are a helpful assistant. | raw | index |
| qwen2.5vl:7b | qwen25vl | 8.3B | 128000 | 3584 | Q4_K_M | 0.0001 | completion vision |
You are a helpful assistant. | raw | index |
| qwen3:0.6b | qwen3 | 751.63M | 40960 | 1024 | Q4_K_M | 0.6 | completion tools thinking |
raw | index | |
| qwen3:1.7b | qwen3 | 2.0B | 40960 | 2048 | Q4_K_M | 0.6 | completion tools thinking |
raw | index | |
| qwen3:14b | qwen3 | 14.8B | 40960 | 5120 | Q4_K_M | 0.6 | completion tools thinking |
raw | index | |
| qwen3:4b | qwen3 | 4.0B | 40960 | 2560 | Q4_K_M | 0.6 | completion tools thinking |
raw | index | |
| qwen3:8b | qwen3 | 8.2B | 40960 | 4096 | Q4_K_M | 0.6 | completion tools thinking |
raw | index | |
| smollm2:1.7b | llama | 1.7B | 8192 | 2048 | Q8_0 | completion tools |
You are a helpful AI assistant named SmolLM, trained by Hugging Face | raw | index | |
| smollm2:135m | llama | 134.52M | 8192 | 576 | F16 | completion |
You are a helpful AI assistant named SmolLM, trained by Hugging Face | raw | index | |
| smollm2:360m | llama | 361.82M | 8192 | 960 | F16 | completion |
You are a helpful AI assistant named SmolLM, trained by Hugging Face | raw | index |