Vision
Improving Translational Accuracy
Improving Translational Accuracy
Language Development
Language and Cognition
Visual System
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
Updated: Jun 12, 2026

Constructing and Visualizing Models using Mime-based Machine-learning Framework
Published on: July 22, 2025
This study details the evolution of the InternVL vision-language model (VLM) series, presenting a framework for building high-performance VLMs through perceptual scaling, multimodal alignment scaling, and native multimodal pre-training. The framework achieves state-of-the-art results, rivaling proprietary systems.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: