Model architecture qwen3 parameters 2.0B context length 40960 embedding length 2048 quantization Q4_K_M Capabilities completion tools thinking Parameters top_k 20 top_p 0.95 repeat_penalty 1 stop "<|im_start|>" stop "<|im_end|>" temperature 0.6 License Apache License Version 2.0, January 2004 ...