model | architecture | parameters | context length |
embedding length |
quantization | temperature | capabilities | system prompt | (raw) | (index) |
---|---|---|---|---|---|---|---|---|---|---|
granite3.2-vision:2b | granite | 2.5B | 16384 | 2048 | Q4_K_M | 0 | completion tools vision |
A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions. | raw | index |
minicpm-v:8b | qwen2 | 7.6B | 32768 | 3584 | Q4_0 | completion vision |
raw | index | ||
qwen2.5vl:3b | qwen25vl | 3.8B | 128000 | 2048 | Q4_K_M | 0.0001 | completion vision |
You are a helpful assistant. | raw | index |
qwen2.5vl:7b | qwen25vl | 8.3B | 128000 | 3584 | Q4_K_M | 0.0001 | completion vision |
You are a helpful assistant. | raw | index |
gemma3:4b | gemma3 | 4.3B | 131072 | 2560 | Q4_K_M | 1 | completion vision |
raw | index | |
llava:7b | llama | 7.2B | 32768 | 4096 | Q4_0 | completion vision |
raw | index | ||
llava-llama3:8b | llama | 8.0B | 8192 | 4096 | Q4_K_M | completion vision |
raw | index | ||
llava-phi3:3.8b | llama | 3.8B | 4096 | 3072 | Q4_K_M | completion vision |
raw | index | ||
bakllava:7b | llama | 7.2B | 32768 | 4096 | Q4_0 | completion vision |
raw | index | ||
moondream:1.8b | phi2 | 1.4B | 2048 | 2048 | Q4_0 | 0 | completion vision |
raw | index |