ollama-multirun: llama3.1:8b-instruct-q4_1

ollama-multirun: hi: llama3.1:8b-instruct-q4_1: 20250713-162732

Prompt: (raw) (yaml) words:1 bytes:3
hi

Output: llama3.1:8b-instruct-q4_1 (raw)
How's it going? Is there something I can help you with or would you like to chat?

Stats (raw)
Words	17
Bytes	83
Total duration	1.698175417s
Load duration	32.823375ms
Prompt eval count	11 token(s)
Prompt eval duration	415.571292ms
Prompt eval rate	26.47 tokens/s
Eval count	21 token(s)
Eval duration	1.249219541s
Eval rate	16.81 tokens/s

Model (raw)
Name	llama3.1:8b-instruct-q4_1
Architecture	llama
Size	8.8 GB
Parameters	8.0B
Context length	131072
Embedding length	4096
Quantization	Q4_1
Capabilities	completion tools

System
Ollama proc	100% GPU
Ollama context	16384
Ollama version	0.9.7-rc0
Multirun timeout	300 seconds
Sys arch	arm64
Sys processor	arm
sys memory	13G + 481M
Sys OS	Darwin 24.5.0