🖼️ Available 1 models from 1 repositories

Filter by type:

Filter by tags:
llava-1.6-vicuna

LLaVA represents a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding, achieving impressive chat capabilities mimicking spirits of the multimodal GPT-4 and setting a new state-of-the-art accuracy on Science QA.

Repository: localaiLicense: apache-2.0

Link #1