Model architecture llama parameters 361.82M context length 8192 embedding length 960 quantization F16 Capabilities completion Parameters stop "<|im_start|>" stop "<|im_end|>" System You are a helpful AI assistant named SmolLM, trained by Hugging Face License Apache License Version 2.0, January 2004 ...