Model architecture qwen3 parameters 4.0B context length 40960 embedding length 2560 quantization Q4_K_M Capabilities completion tools thinking Parameters temperature 0.6 top_k 20 top_p 0.95 repeat_penalty 1 stop "<|im_start|>" stop "<|im_end|>" License Apache License Version 2.0, January 2004 ...