Prompt: (raw) (yaml)
words:1 bytes:3
Output: llama3.1:8b-instruct-q4_1 (raw)
Stats (raw) | |
Words | 17 |
Bytes | 83 |
Total duration | 1.694726959s |
Load duration | 31.69ms |
Prompt eval count | 11 token(s) |
Prompt eval duration | 410.469792ms |
Prompt eval rate | 26.80 tokens/s |
Eval count | 21 token(s) |
Eval duration | 1.252014375s |
Eval rate | 16.77 tokens/s |
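
The two rates in the Stats table appear to be each token count divided by its duration. The sketch below checks that against the figures reported above. The helper names (`parse_duration`, `tokens_per_second`) are hypothetical, and the "count / duration" reading is an assumption drawn from the numbers, not from the tool's source.

```python
import re

# Seconds per unit for the duration suffixes seen in the report (plus common siblings).
_UNIT_SECONDS = {"ns": 1e-9, "us": 1e-6, "µs": 1e-6, "ms": 1e-3, "s": 1.0}


def parse_duration(text: str) -> float:
    """Convert a string such as '410.469792ms' or '1.252014375s' to seconds."""
    match = re.fullmatch(r"([0-9.]+)(ns|us|µs|ms|s)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised duration: {text!r}")
    return float(match.group(1)) * _UNIT_SECONDS[match.group(2)]


def tokens_per_second(count: int, duration: str) -> float:
    """Assumed definition of a rate: token count divided by elapsed seconds."""
    return count / parse_duration(duration)


# Values copied from the Stats table.
prompt_rate = tokens_per_second(11, "410.469792ms")
eval_rate = tokens_per_second(21, "1.252014375s")

# Time in the total that is not covered by load, prompt eval, or eval.
overhead = parse_duration("1.694726959s") - (
    parse_duration("31.69ms")
    + parse_duration("410.469792ms")
    + parse_duration("1.252014375s")
)

print(f"prompt eval rate: {prompt_rate:.2f} tokens/s")
print(f"eval rate:        {eval_rate:.2f} tokens/s")
print(f"unaccounted time: {overhead * 1000:.3f} ms")
```

Rounded to two decimals, both computed rates match the table (26.80 and 16.77 tokens/s). The three phase durations sum to slightly less than the total duration, leaving roughly half a millisecond unattributed.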
Model (raw) | |
Name | llama3.1:8b-instruct-q4_1 |
Architecture | llama |
Size | 7.2 GB |
Parameters | 8.0B |
Context length | 131072 |
Embedding length | 4096 |
Quantization | Q4_1 |
Capabilities | completion tools |
System | |
Ollama proc | 100% GPU |
Ollama context | 8192 |
Ollama version | 0.9.7-rc0 |
Multirun timeout | 300 seconds |
Sys arch | arm64 |
Sys processor | arm |
Sys memory | 13G + 572M |
Sys OS | Darwin 24.5.0 |