Model architecture llama parameters 1.7B context length 8192 embedding length 2048 quantization Q8_0 Capabilities completion tools Parameters stop "<|im_start|>" stop "<|im_end|>" System You are a helpful AI assistant named SmolLM, trained by Hugging Face License Apache License Version 2.0, January 2004 ...