Prompt: (raw) (yaml)
words:1 bytes:3
Output: llama3.1:8b-instruct-q4_1 (raw)
| Stats (raw) | |
| Words | 17 |
| Bytes | 83 |
| Total duration | 1.694726959s |
| Load duration | 31.69ms |
| Prompt eval count | 11 token(s) |
| Prompt eval duration | 410.469792ms |
| Prompt eval rate | 26.80 tokens/s |
| Eval count | 21 token(s) |
| Eval duration | 1.252014375s |
| Eval rate | 16.77 tokens/s |
| Model (raw) | |
| Name | llama3.1:8b-instruct-q4_1 |
| Architecture | llama |
| Size | 7.2 GB |
| Parameters | 8.0B |
| Context length | 131072 |
| Embedding length | 4096 |
| Quantization | Q4_1 |
| Capabilities | completion tools |
| System | |
| Ollama proc | 100% GPU |
| Ollama context | 8192 |
| Ollama version | 0.9.7-rc0 |
| Multirun timeout | 300 seconds |
| Sys arch | arm64 |
| Sys processor | arm |
| sys memory | 13G + 572M |
| Sys OS | Darwin 24.5.0 |