ollama-multirun: swdub3jligfsbcbwcmv2aw91cybpbnn0cnvjdglvbnmucklnbm: llama3-groq-tool-use:8b: 20250703-153147

models: bakllava:7b codellama:7b deepcoder:1.5b deepseek-r1:1.5b deepseek-r1:8b dolphin-mistral:7b dolphin3:8b gemma3:1b gemma3:4b gemma:2b granite3.2-vision:2b granite3.3:2b huihui_ai/baronllm-abliterated:8b llama3-groq-tool-use:8b llama3.2:1b llava-llama3:8b llava-phi3:3.8b llava:7b minicpm-v:8b mistral:7b qwen2.5-coder:7b qwen2.5vl:3b qwen2.5vl:7b qwen3:1.7b qwen3:8b stable-code:3b starcoder:7b

Prompt: (raw) (yaml) words:1 bytes:229

Output: llama3-groq-tool-use:8b (raw)

Stats (raw)
words24
bytes137
total duration3.6975925s
load duration29.421959ms
prompt eval count164 token(s)
prompt eval duration2.24262425s
prompt eval rate73.13 tokens/s
eval count28 token(s)
eval duration1.424920375s
eval rate19.65 tokens/s
Model (raw)
namellama3-groq-tool-use:8b
architecturellama
size6.7 GB
parameters8.0B
context length8192
embedding length4096
quantizationQ4_0
capabilitiescompletion
tools
System
ollama proc100% GPU
ollama version0.9.3
sys archarm64
sys processorarm
sys memory14G + 3441M
sys OSDarwin 24.5.0