Models
Docs
Pricing
Sign in
Download
Models
Download
Docs
Pricing
Sign in
llava
:34b
12.8M
Downloads
Updated
2 years ago
๐ LLaVA is a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding. Updated to version 1.6.
๐ LLaVA is a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding. Updated to version 1.6.
Cancel
vision
7b
13b
34b
llava:34b
...
/
params
f02dd72bb242 ยท 59B
{
"stop": [
"<|im_start|>",
"<|im_end|>"
]
}