A Multidisciplinary Multimodal Aligned Dataset for Academic Data Processing.

Haitao Song1,2,3,4, Hongyi Xu1, Zikai Wang5,6,7

  • 1Shanghai Artificial Intelligence Research Institute Co., Ltd., Shanghai, 200240, China.

Scientific Data | January 29, 2025
Summary
This summary is machine-generated.

A new multimodal aligned dataset (MMAD) for academic data processing includes over 1.1 million scholarly articles with aligned text and visuals. This dataset advances research in scientometrics and bibliometrics by enabling new analyses.

Area of Science:

  • Bibliometrics
  • Scientometrics
  • Data Science

Background:

  • Academic data processing relies heavily on textual data in scientometrics and bibliometrics.
  • Existing datasets overlook the significance of visual elements in scholarly articles.
  • There is a need for comprehensive datasets that integrate both textual and visual information.

Purpose of the Study:

  • Introduce a novel, multidisciplinary multimodal aligned dataset (MMAD) for academic data processing.
  • Enhance research capabilities in areas like trend analysis and citation recommendation by incorporating visual data.
  • Provide a foundation for new research avenues in academic data analysis.

Main Methods:

  • Compiled a dataset of over 1.1 million peer-reviewed scholarly articles.
  • Integrated text with aligned visual elements and associated metadata.
  • Developed a language-model-based quality validation method that uses specific prompts to assess the accuracy of text-to-visual alignment.
  • Assessed dataset representativeness by comparing country/region distribution with SCImago benchmarks.
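
The representativeness check in the last step can be illustrated with a minimal Python sketch. The summary does not say which statistic the authors used, so total variation distance is an assumption chosen here for simplicity; the function names, country labels, and benchmark shares are all hypothetical toy values, not figures from MMAD or SCImago.

```python
from collections import Counter


def share_by_country(countries):
    """Convert a list of country labels into a {country: share} mapping."""
    counts = Counter(countries)
    total = sum(counts.values())
    return {country: n / total for country, n in counts.items()}


def total_variation(p, q):
    """Total variation distance between two share mappings.

    0 means the distributions are identical; 1 means they do not overlap.
    Countries missing from one mapping are treated as having share 0.
    """
    keys = set(p) | set(q)
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)


# Hypothetical toy data: one country label per article in the dataset.
dataset_countries = ["CN", "CN", "US", "US", "US", "DE", "GB", "CN"]

# Hypothetical benchmark shares standing in for an external reference.
benchmark_shares = {"CN": 0.35, "US": 0.40, "DE": 0.15, "GB": 0.10}

dataset_shares = share_by_country(dataset_countries)
tvd = total_variation(dataset_shares, benchmark_shares)
print(f"Total variation distance: {tvd:.3f}")
```

A small distance would suggest that the dataset's country/region mix tracks the benchmark closely; other measures, such as Jensen-Shannon divergence or a chi-square test, could be substituted in the same place.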

Main Results:

  • The multimodal aligned dataset (MMAD) contains over 1.1 million scholarly articles with aligned text and visuals.
  • The novel language-model-based validation method demonstrates effective quality control for text-to-visual alignment.
  • MMAD's representativeness was evaluated against established benchmarks.

Conclusions:

  • The introduction of MMAD addresses the gap in existing datasets by incorporating aligned visual elements.
  • MMAD facilitates advanced academic data processing, enabling new research in automated caption generation and figure trend analysis.
  • The dataset offers fertile ground for future advances in scientometrics and bibliometrics.