ollama-multirun: hi: deepseek-r1:8b: 20250713-152352

models: codellama:7b cogito:3b cogito:8b deepcoder:1.5b deepseek-r1:1.5b deepseek-r1:14b deepseek-r1:8b dolphin-mistral:7b dolphin3:8b gemma3:1b gemma3:4b gemma3n:e2b gemma3n:e4b gemma:2b granite3.3:2b granite3.3:8b hermes3:8b llama3.1:8b-instruct-q4_1 llama3.2:1b llama3.2:3b llava-llama3:8b llava-phi3:3.8b llava:7b minicpm-v:8b mistral:7b mistral:7b-instruct qwen2.5-coder:7b qwen2.5vl:3b qwen2.5vl:7b qwen3:0.6b qwen3:1.7b qwen3:14b qwen3:4b qwen3:8b smollm2:1.7b smollm2:135m smollm2:360m

Prompt: (raw) (yaml) words:1 bytes:3

Thinking: deepseek-r1:8b (raw)

Output: deepseek-r1:8b (raw)

Stats (raw)
Words: 4
Bytes: 72
Total duration: 1m12.251077459s
Load duration: 32.131542ms
Prompt eval count: 3 token(s)
Prompt eval duration: 3.165768208s
Prompt eval rate: 0.95 tokens/s
Eval count: 208 token(s)
Eval duration: 1m9.049341167s
Eval rate: 3.01 tokens/s
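
The two rate figures above can be reproduced from the counts and durations as count divided by duration in seconds. The following is a minimal sketch of that arithmetic, not part of ollama-multirun itself; the helper names `parse_duration` and `tokens_per_second` are hypothetical, and the parser assumes the durations use the Go-style unit suffixes seen in this report (h, m, s, ms, and so on).

```python
import re

# Seconds per unit for Go-style duration strings (assumed format).
_UNITS = {"h": 3600.0, "m": 60.0, "s": 1.0,
          "ms": 1e-3, "us": 1e-6, "µs": 1e-6, "ns": 1e-9}

# Two-letter units are listed first so "ms" is not read as "m".
_PART = re.compile(r"(\d+(?:\.\d+)?)(ms|us|µs|ns|h|m|s)")


def parse_duration(text: str) -> float:
    """Convert a duration such as '1m9.049341167s' to seconds (hypothetical helper)."""
    parts = _PART.findall(text)
    # Reject input that the pattern does not cover completely.
    if not parts or "".join(num + unit for num, unit in parts) != text:
        raise ValueError(f"unrecognized duration: {text!r}")
    return sum(float(num) * _UNITS[unit] for num, unit in parts)


def tokens_per_second(count: int, duration: str) -> float:
    """Token count divided by the duration in seconds (hypothetical helper)."""
    return count / parse_duration(duration)


# Values copied from the Stats block of this report.
print(round(tokens_per_second(208, "1m9.049341167s"), 2))  # 3.01 (eval rate)
print(round(tokens_per_second(3, "3.165768208s"), 2))      # 0.95 (prompt eval rate)
```

Both results match the reported rates after rounding to two decimals.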
Model (raw)
Name: deepseek-r1:8b
Architecture: qwen3
Size: 21 GB
Parameters: 8.2B
Context length: 131072
Embedding length: 4096
Quantization: Q4_K_M
Capabilities: completion, thinking
System
Ollama proc: 48%/52% CPU/GPU
Ollama context: 65536
Ollama version: 0.9.7-rc0
Multirun timeout: 300 seconds
Sys arch: arm64
Sys processor: arm
Sys memory: 15G + 1382M
Sys OS: Darwin 24.5.0