ollama-multirun: my_hovercraft_is_full_of_eels: qwen3:0.6b: 20250708-013839

models: codellama:7b cogito:8b deepcoder:1.5b deepseek-r1:1.5b deepseek-r1:8b dolphin-mistral:7b dolphin3:8b gemma3:1b gemma3:4b gemma3n:e2b gemma:2b granite3.3:2b hermes3:8b llama3.2:1b llama3.2:3b llava-llama3:8b llava-phi3:3.8b llava:7b minicpm-v:8b mistral:7b qwen2.5-coder:7b qwen2.5vl:3b qwen2.5vl:7b qwen3:0.6b qwen3:1.7b qwen3:4b qwen3:8b smollm2:1.7b smollm2:135m smollm2:360m stable-code:3b starcoder:7b

Prompt: (raw) (yaml) words:6 bytes:30

Thinking: qwen3:0.6b (raw)

Output: qwen3:0.6b (raw)

Stats (raw)
words: 43
bytes: 246
total duration: 2.371619625s
load duration: 28.578208ms
prompt eval count: 18 token(s)
prompt eval duration: 116.363291ms
prompt eval rate: 154.69 tokens/s
eval count: 192 token(s)
eval duration: 2.22630925s
eval rate: 86.24 tokens/s
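
The two rate figures above are consistent with dividing each token count by its matching duration. The following is a minimal sketch of that arithmetic, not the tool's own code: the helper names `parse_duration` and `tokens_per_second` are hypothetical, and the parser assumes a single-unit duration string like the ones shown here.

```python
import re

# Unit suffixes mapped to seconds (assumption: one unit per duration string).
_UNIT_SECONDS = {"s": 1.0, "ms": 1e-3, "us": 1e-6, "µs": 1e-6, "ns": 1e-9}


def parse_duration(text: str) -> float:
    """Convert a single-unit duration such as '116.363291ms' to seconds."""
    match = re.fullmatch(r"([0-9.]+)(ns|us|µs|ms|s)", text.strip())
    if match is None:
        raise ValueError(f"unsupported duration: {text!r}")
    value, unit = match.groups()
    return float(value) * _UNIT_SECONDS[unit]


def tokens_per_second(count: int, duration: str) -> float:
    """Token throughput: token count divided by elapsed seconds."""
    return count / parse_duration(duration)


# Values copied from the stats listed above.
prompt_rate = tokens_per_second(18, "116.363291ms")
eval_rate = tokens_per_second(192, "2.22630925s")

print(f"prompt eval rate: {prompt_rate:.2f} tokens/s")
print(f"eval rate: {eval_rate:.2f} tokens/s")
```

Rounded to two decimals, both results agree with the reported 154.69 and 86.24 tokens/s.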
Model (raw)
name: qwen3:0.6b
architecture: qwen3
size: 2.3 GB 9.1 GB
parameters: 751.63M
context length: 40960
embedding length: 1024
quantization: Q4_K_M
capabilities: completion, tools, thinking
System
Ollama proc: 100% GPU 100% GPU
Ollama version: 0.9.5
sys arch: arm64
sys processor: arm
sys memory: 15G + 1929M
sys OS: Darwin 24.5.0