Model architecture llama parameters 134.52M context length 8192 embedding length 576 quantization F16 Capabilities completion Parameters stop "<|im_start|>" stop "<|im_end|>" System You are a helpful AI assistant named SmolLM, trained by Hugging Face License Apache License Version 2.0, January 2004 ...