Model | Architecture | Parameters | Context length |
Embedding length |
Quantization | Temperature | Capabilities | System prompt | (raw) | (index) |
---|---|---|---|---|---|---|---|---|---|---|
codellama:7b | llama | 6.7B | 16384 | 4096 | Q4_0 | completion |
raw | index | ||
cogito:3b | llama | 3.6B | 131072 | 3072 | Q4_K_M | completion tools |
raw | index | ||
cogito:8b | llama | 8.0B | 131072 | 4096 | Q4_K_M | completion tools |
raw | index | ||
deepcoder:1.5b | qwen2 | 1.8B | 131072 | 1536 | Q4_K_M | 0.6 | completion |
raw | index | |
deepseek-r1:1.5b | qwen2 | 1.8B | 131072 | 1536 | Q4_K_M | completion thinking |
raw | index | ||
deepseek-r1:14b | qwen2 | 14.8B | 131072 | 5120 | Q4_K_M | completion thinking |
raw | index | ||
deepseek-r1:8b | qwen3 | 8.2B | 131072 | 4096 | Q4_K_M | 0.6 | completion thinking |
raw | index | |
dolphin-mistral:7b | llama | 7.2B | 32768 | 4096 | Q4_0 | completion |
You are Dolphin, a helpful AI assistant. | raw | index | |
dolphin3:8b | llama | 8.0B | 131072 | 4096 | Q4_K_M | completion |
You are Dolphin, a helpful AI assistant. | raw | index | |
gemma3:1b | gemma3 | 999.89M | 32768 | 1152 | Q4_K_M | 1 | completion |
raw | index | |
gemma3:4b | gemma3 | 4.3B | 131072 | 2560 | Q4_K_M | 1 | completion vision |
raw | index | |
gemma3n:e2b | gemma3n | 4.5B | 32768 | 2048 | Q4_K_M | completion |
raw | index | ||
gemma3n:e4b | gemma3n | 6.9B | 32768 | 2048 | Q4_K_M | completion |
raw | index | ||
gemma:2b | gemma | 2.5B | 8192 | 2048 | Q4_0 | completion |
raw | index | ||
granite3.3:2b | granite | 2.5B | 131072 | 2048 | Q4_K_M | completion tools |
raw | index | ||
granite3.3:8b | granite | 8.2B | 131072 | 4096 | Q4_K_M | completion tools |
raw | index | ||
hermes3:8b | llama | 8.0B | 131072 | 4096 | Q4_0 | completion tools |
raw | index | ||
llama3.1:8b-instruct-q4_1 | llama | 8.0B | 131072 | 4096 | Q4_1 | completion tools |
raw | index | ||
llama3.2:1b | llama | 1.2B | 131072 | 2048 | Q8_0 | completion tools |
raw | index | ||
llama3.2:3b | llama | 3.2B | 131072 | 3072 | Q4_K_M | completion tools |
raw | index | ||
llava-llama3:8b | llama | 8.0B | 8192 | 4096 | Q4_K_M | completion vision |
raw | index | ||
llava-phi3:3.8b | llama | 3.8B | 4096 | 3072 | Q4_K_M | completion vision |
raw | index | ||
llava:7b | llama | 7.2B | 32768 | 4096 | Q4_0 | completion vision |
raw | index | ||
minicpm-v:8b | qwen2 | 7.6B | 32768 | 3584 | Q4_0 | completion vision |
raw | index | ||
mistral:7b | llama | 7.2B | 32768 | 4096 | Q4_0 | completion tools |
raw | index | ||
mistral:7b-instruct | llama | 7.2B | 32768 | 4096 | Q4_0 | completion tools |
raw | index | ||
qwen2.5-coder:7b | qwen2 | 7.6B | 32768 | 3584 | Q4_K_M | completion tools insert |
You are Qwen, created by Alibaba Cloud. You are a helpful assistant. | raw | index | |
qwen2.5vl:3b | qwen25vl | 3.8B | 128000 | 2048 | Q4_K_M | 0.0001 | completion vision |
You are a helpful assistant. | raw | index |
qwen2.5vl:7b | qwen25vl | 8.3B | 128000 | 3584 | Q4_K_M | 0.0001 | completion vision |
You are a helpful assistant. | raw | index |
qwen3:0.6b | qwen3 | 751.63M | 40960 | 1024 | Q4_K_M | 0.6 | completion tools thinking |
raw | index | |
qwen3:1.7b | qwen3 | 2.0B | 40960 | 2048 | Q4_K_M | 0.6 | completion tools thinking |
raw | index | |
qwen3:14b | qwen3 | 14.8B | 40960 | 5120 | Q4_K_M | 0.6 | completion tools thinking |
raw | index | |
qwen3:4b | qwen3 | 4.0B | 40960 | 2560 | Q4_K_M | 0.6 | completion tools thinking |
raw | index | |
qwen3:8b | qwen3 | 8.2B | 40960 | 4096 | Q4_K_M | 0.6 | completion tools thinking |
raw | index | |
smollm2:1.7b | llama | 1.7B | 8192 | 2048 | Q8_0 | completion tools |
You are a helpful AI assistant named SmolLM, trained by Hugging Face | raw | index | |
smollm2:135m | llama | 134.52M | 8192 | 576 | F16 | completion |
You are a helpful AI assistant named SmolLM, trained by Hugging Face | raw | index | |
smollm2:360m | llama | 361.82M | 8192 | 960 | F16 | completion |
You are a helpful AI assistant named SmolLM, trained by Hugging Face | raw | index |