952 3 months ago

A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

vision 8b

12 models