Model
  architecture        qwen3
  parameters          14.8B
  context length      40960
  embedding length    5120
  quantization        Q4_K_M

Capabilities
  completion
  tools
  thinking

Parameters
  repeat_penalty      1
  stop                "<|im_start|>"
  stop                "<|im_end|>"
  temperature         0.6
  top_k               20
  top_p               0.95

License
  Apache License Version 2.0, January 2004 ...
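
The sampling parameters listed above can be passed explicitly when calling the model. The following is a minimal sketch, not a definitive client: it assumes the model is served behind an Ollama-style chat endpoint that accepts a JSON body with `model`, `messages`, `options`, and `stream` fields. The helper name `build_chat_request` and the model tag `qwen3:14b` are hypothetical and chosen only for illustration; the option values are copied from the listing. The code only builds and prints the payload and does not send a request.

```python
import json

# Sampling defaults copied from the Parameters listing above.
DEFAULT_OPTIONS = {
    "repeat_penalty": 1.0,
    "stop": ["<|im_start|>", "<|im_end|>"],
    "temperature": 0.6,
    "top_k": 20,
    "top_p": 0.95,
}


def build_chat_request(model, prompt, overrides=None):
    """Build a chat request body (hypothetical helper).

    Assumes an Ollama-style JSON schema: model, messages, options, stream.
    `overrides` lets a caller replace individual sampling options.
    """
    options = dict(DEFAULT_OPTIONS)
    # Copy the stop list so callers cannot mutate the shared default.
    options["stop"] = list(DEFAULT_OPTIONS["stop"])
    if overrides:
        options.update(overrides)
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "options": options,
        "stream": False,
    }


if __name__ == "__main__":
    # "qwen3:14b" is an assumed model tag, not taken from the listing.
    payload = build_chat_request("qwen3:14b", "Summarize the Apache 2.0 license.")
    print(json.dumps(payload, indent=2))
```

Keeping the defaults in one dictionary makes it easy to see which values come from the model card and which a caller has overridden, for example `build_chat_request("qwen3:14b", "Hi", {"temperature": 0.2})`.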