Model architecture qwen3 parameters 751.63M context length 40960 embedding length 1024 quantization Q4_K_M Capabilities completion tools thinking Parameters temperature 0.6 top_k 20 top_p 0.95 repeat_penalty 1 stop "<|im_start|>" stop "<|im_end|>" License Apache License Version 2.0, January 2004 ...