Opengvlab Models

Explore the Opengvlab language and embedding models available through our OpenAI Assistants API-compatible service.

Opengvlab logo

OpenGVLab: InternVL3 78B

Context Length:
32,768 tokens
Architecture:
text+image->text
Max Output:
32,768 tokens

Pricing:

Prompt: $0.00000007
Completion: $0.00000026

The InternVL3 series is an advanced multimodal large language model (MLLM). Compared to InternVL 2.5, InternVL3 demonstrates stronger multimodal perception and reasoning capabilities.

In addition, InternVL3 is benchmarked against the Qwen2.5 Chat models, whose pre-trained base models serve as the initialization for its language component. Benefiting from Native Multimodal Pre-Training, the InternVL3 series surpasses the Qwen2.5 series in overall text performance.

Ready to build with Opengvlab?

Start using these powerful models in your applications with our flexible pricing plans.