Model
  architecture       llama
  parameters         7.2B
  context length     32768
  embedding length   4096
  quantization       Q4_0

Capabilities
  completion
  vision

Projector
  architecture       clip
  parameters         311.89M
  embedding length   1024
  dimensions         768

Parameters
  stop               "[INST]"
  stop               "[/INST]"

License
  Apache License Version 2.0, January 2004 ...