Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Light Acquisition02:16

Light Acquisition

8.6K
In order to produce glucose, plants need to capture sufficient light energy. Many modern plants have evolved leaves specialized for light acquisition. Leaves can be only millimeters in width or tens of meters wide, depending on the environment. Due to competition for sunlight, evolution has driven the evolution of increasingly larger leaves and taller plants, to avoid shading by their neighbors with contaminant elaboration of root architecture and mechanisms to transport water and nutrients.
8.6K
Vision01:24

Vision

55.4K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
55.4K
Visual Agnosia01:12

Visual Agnosia

310
Visual agnosia is a condition characterized by the inability to recognize visually presented objects despite having normal vision. For instance, a person with visual agnosia can describe the shape and color of an object but cannot identify or name it. This impairment does not affect their visual field, acuity, color vision, brightness discrimination, language, or memory. An example of this condition in a social setting is someone at a dinner party asking for "that silver thing with a round...
310
Prosopagnosia01:24

Prosopagnosia

260
Prosopagnosia, also known as face blindness, is the inability to recognize faces. In severe cases, individuals with prosopagnosia may not recognize close family members, including parents and spouses, by their faces. For instance, someone with prosopagnosia might walk past their child in a crowd, only realizing their mistake upon noticing their child's distinctive backpack or favorite jacket. Prosopagnosia specifically impairs facial recognition, while the recognition of other objects or...
260
Classification of Illness01:17

Classification of Illness

8.0K
The meaning of illness is individualized to each person who experiences an alteration in health. In contrast, disease is a medical term indicating a pathological change in the structure and function of the body or mind. It is a condition that has specific symptoms and boundaries.
An illness is a response to a disease in which the person's level of functioning is changed compared with a previous level. The general classification of illness includes acute and chronic.
Acute illness is severe...
8.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A grading system based on multiparametric MRI-predicted tumor perfusion, invasiveness and texture (GOLDS-PIT) for surgical difficulty assessment in pituitary macroadenomas: A prospective single-centre cohort study.

World neurosurgery·2026
Same author

Dual role and metabolic reprogramming: Key amino acids in marinade bidirectionally control aflatoxin B<sub>1</sub> biosynthesis in Aspergillus parasiticus.

Food microbiology·2026
Same author

Overexpression of FOXO1 may suppress proliferation and migration and correlate with tumor immune cells in nasopharyngeal carcinoma.

Translational cancer research·2026
Same author

One-stage transplantation of artificial dermis and autologous split-thickness skin graft for the repair of severe foot trauma: a case report.

International journal of burns and trauma·2026
Same author

Development and validation of a stability-indicating high-performance liquid chromatography method for iomeprol.

Scientific reports·2026
Same author

Enhancing crop disease recognition framework via vision-language model with cross-attention and gated fusion.

Scientific reports·2026
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Sep 16, 2025

Author Spotlight: UAV Remote Sensing for Efficient Invasive Plant Biomass Estimation
08:47

Author Spotlight: UAV Remote Sensing for Efficient Invasive Plant Biomass Estimation

Published on: February 9, 2024

1.6K

Cross-Modal Data Fusion via Vision-Language Model for Crop Disease Recognition.

Wenjie Liu1, Guoqing Wu2, Han Wang1

  • 1School of Transportation and Civil Engineering, Nantong University, Nantong 226019, China.

Sensors (Basel, Switzerland)
|July 12, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a new vision-language model for crop disease recognition, combining image and text data. The model significantly improves accuracy in identifying crop diseases, enhancing agricultural productivity.

Keywords:
crop disease recognitioncross-model data fusionimage classificationvision-language model

More Related Videos

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping
07:11

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Published on: December 8, 2023

1.9K
Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.1K

Related Experiment Videos

Last Updated: Sep 16, 2025

Author Spotlight: UAV Remote Sensing for Efficient Invasive Plant Biomass Estimation
08:47

Author Spotlight: UAV Remote Sensing for Efficient Invasive Plant Biomass Estimation

Published on: February 9, 2024

1.6K
Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping
07:11

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Published on: December 8, 2023

1.9K
Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.1K

Area of Science:

  • Agricultural Science
  • Computer Science
  • Artificial Intelligence

Background:

  • Crop diseases significantly threaten global food security and agricultural productivity.
  • Accurate and timely disease identification is essential for crop yield and quality management.
  • Existing deep learning methods primarily rely on image data, often neglecting valuable textual information.

Purpose of the Study:

  • To develop a novel cross-modal data fusion approach for crop disease recognition.
  • To enhance disease identification accuracy by integrating visual and textual features.
  • To leverage a vision-language model for a more comprehensive understanding of crop leaf diseases.

Main Methods:

  • Utilized Zhipu.ai multi-model to generate detailed textual descriptions of crop diseases (global, local lesion, color-texture).
  • Encoded textual descriptions and image features into vectors.
  • Employed a cross-attention mechanism for iterative fusion of multimodal features across layers.
  • Implemented a classification prediction module for disease identification.

Main Results:

  • The proposed cross-modal fusion model outperformed state-of-the-art image-only methods on Soybean Disease, AI Challenge 2018, and PlantVillage datasets.
  • Achieved high recognition accuracies: 98.74% (Soybean Disease), 87.64% (AI Challenge 2018), and 99.08% (PlantVillage).
  • Demonstrated superior performance with a significantly lower parameter count (1.14M).

Conclusions:

  • Cross-modal learning effectively integrates visual and textual data for precise and efficient crop disease recognition.
  • The developed vision-language model offers a scalable and accurate solution for agricultural disease identification.
  • This approach enhances the potential for improving crop yield and global food security through advanced AI techniques.