InternVL3 78B

Name: InternVL3 78B
Brand: OpenGVLab

InternVL3 is a series of advanced multimodal large language models (MLLMs). Compared with InternVL 2.5, InternVL3 demonstrates stronger multimodal perception and reasoning capabilities. InternVL3 is also benchmarked against the Qwen2.5 Chat models, whose pre-trained base models serve as the initialization for its language component. Benefiting from Native Multimodal Pre-Training, the InternVL3 series surpasses the Qwen2.5 series in overall text performance.

Capabilities

Image Input

Technical Specifications

Context Window: 32,768 tokens
Max Output: 32,768 tokens

Pricing

Token Costs (per 1M tokens)

Cache Miss Input: $0.15
Non-Reasoning Output: $0.60
Cache Read Input: $0.075
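
The listed per-1M-token prices can be turned into a per-request estimate with simple arithmetic. The following is a minimal sketch, not a provider's billing formula: it assumes cost scales linearly with token count and that cache-miss input, cache-read input, and output tokens are billed separately at the rates above. The function name `estimate_cost_usd` and the example token counts are hypothetical.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens, copied from the pricing section.
PRICE_PER_MILLION = {
    "cache_miss_input": Decimal("0.15"),
    "cache_read_input": Decimal("0.075"),
    "output": Decimal("0.60"),
}

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(cache_miss_input_tokens, cache_read_input_tokens, output_tokens):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes linear per-token billing with no minimum charge or rounding,
    which the listing does not confirm.
    """
    counts = {
        "cache_miss_input": cache_miss_input_tokens,
        "cache_read_input": cache_read_input_tokens,
        "output": output_tokens,
    }
    # Sum (token count * price per 1M tokens) / 1M over the three categories.
    return sum(
        Decimal(count) * PRICE_PER_MILLION[kind] / TOKENS_PER_MILLION
        for kind, count in counts.items()
    )


# Illustrative request: 20,000 uncached input tokens, 10,000 cached input
# tokens, and 2,000 output tokens (made-up numbers).
print(estimate_cost_usd(20_000, 10_000, 2_000))
```

Under these assumptions, the illustrative request costs $0.003 + $0.00075 + $0.0012 = $0.00495. `Decimal` is used instead of floats so that small per-token amounts add up exactly.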

Retired

Made legacy on February 17, 2026

Reason: 78B vision model; research-focused; limited provider support
Recommended Replacement: Qwen3.6 Plus