Opengvlab Models
Explore the Opengvlab language and embedding models available through our OpenAI Assistants API-compatible service.
OpenGVLab: InternVL3 78B
- Context Length:
- 32,768 tokens
- Architecture:
- text+image->text
- Max Output:
- 32,768 tokens
Pricing:
Prompt: $0.00000007
Completion: $0.00000026
The InternVL3 series is an advanced multimodal large language model (MLLM). Compared to InternVL 2.5, InternVL3 demonstrates stronger multimodal perception and reasoning capabilities.
In addition, InternVL3 is benchmarked against the Qwen2.5 Chat models, whose pre-trained base models serve as the initialization for its language component. Benefiting from Native Multimodal Pre-Training, the InternVL3 series surpasses the Qwen2.5 series in overall text performance.
Ready to build with Opengvlab?
Start using these powerful models in your applications with our flexible pricing plans.