Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Light Acquisition02:16

Light Acquisition

8.6K
In order to produce glucose, plants need to capture sufficient light energy. Many modern plants have evolved leaves specialized for light acquisition. Leaves can be only millimeters in width or tens of meters wide, depending on the environment. Due to competition for sunlight, evolution has driven the evolution of increasingly larger leaves and taller plants, to avoid shading by their neighbors with contaminant elaboration of root architecture and mechanisms to transport water and nutrients.
8.6K
Plant Breeding and Biotechnology01:59

Plant Breeding and Biotechnology

19.8K
Crop cultivation has a long history in human civilization, with records showing the cultivation of cereal plants beginning at around 8000 BC. This early plant breeding was developed primarily to provide a steady supply of food.
19.8K
Basic Plant Anatomy: Roots, Stems, and Leaves02:27

Basic Plant Anatomy: Roots, Stems, and Leaves

60.9K
The primary organs of vascular plants are roots, stems, and leaves, but these structures can be highly variable, adapted for the specific needs and environment of different plant species.
60.9K
Transgenic Plants02:50

Transgenic Plants

7.4K
Recombinant DNA technology called transgenesis is often used to add a foreign gene or remove a detrimental gene from an organism. Such genetically modified organisms are called transgenic organisms.
The first-ever transgenic plant was a tobacco plant developed in 1983 that showed resistance against the tobacco mosaic virus. Since then, many transgenic plants have been developed and commercialized for improving the agricultural, ornamental, and horticultural value of a crop plant. Transgenic...
7.4K
Plant Tissues01:18

Plant Tissues

7.1K
Plants are multicellular eukaryotes with tissue systems made of various cell types that carry out specific functions. Different tissues work together to perform a unique function and form an organ. Organs working together form organ systems. Vascular plants have two distinct organ systems: a shoot system and a root system. The shoot system consists of two portions: the vegetative (non-reproductive) parts of the plant, such as the leaves and the stems, and the reproductive parts of the plant,...
7.1K
Plant Hormones01:56

Plant Hormones

24.8K
Plant hormones—or phytohormones—are chemical molecules that modulate one or more physiological processes of a plant. In animals, hormones are often produced in specific glands and circulated via the circulatory system. However, plants lack hormone-producing glands.
24.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The Somnolink-Hub: A Central Infrastructure That Unites Sleep Data at Point of Care.

Studies in health technology and informatics·2026
Same author

The global geography of plant invasion risk under future climate and land-use changes.

Nature ecology & evolution·2026
Same author

The global island species-area relationship for plants.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

AI-driven landscape values mapping.

Chaos (Woodbury, N.Y.)·2026
Same author

Crowdsourced biodiversity monitoring fills gaps in global plant trait mapping.

Nature communications·2026
Same author

Climatic niche conservatism in non-native plants depends on introduction history and biogeographic context.

Nature communications·2026
Same journal

Leveraging target enrichment and genome skimming (Hyb-Seq) of herbarium collections to unlock timber DNA barcoding.

Applications in plant sciences·2026
Same journal

Detecting cryptic ghost lineage introgression in four-taxon genomic datasets.

Applications in plant sciences·2026
Same journal

HapAsmbl: A reference-aided pipeline for assembling haplotypes in Nanopore amplicon sequence data of polymorphic populations.

Applications in plant sciences·2026
Same journal

HybSuite: An integrated pipeline for hybrid capture phylogenomics from reads to trees.

Applications in plant sciences·2026
Same journal

Detecting introgression from phylogenetic invariant site patterns using machine learning.

Applications in plant sciences·2026
Same journal

tanggle: An R package for the visualization of phylogenetic networks.

Applications in plant sciences·2026
See all related articles
  1. Home
  2. Using Large Language Models To Extract Plant Functional Traits From Unstructured Text.
  1. Home
  2. Using Large Language Models To Extract Plant Functional Traits From Unstructured Text.

Related Experiment Video

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics
08:09

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics

Published on: June 17, 2012

19.8K

Using large language models to extract plant functional traits from unstructured text.

Viktor Domazetoski1, Holger Kreft1,2,3, Helena Bestova1,2

  • 1Department of Biodiversity, Macroecology, and Biogeography University of Göttingen Göttingen Germany.

Applications in Plant Sciences
|June 27, 2025

View abstract on PubMed

Summary
This summary is machine-generated.

This study introduces a natural language processing pipeline to extract plant functional traits from text. The method significantly improves data extraction accuracy, aiding ecological research.

Keywords:
automatic trait extractionbiodiversityfunctional plant ecologylarge language modelsnatural language processingvascular plants

More Related Videos

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

692
A Bioinformatics Pipeline to Accurately and Efficiently Analyze the MicroRNA Transcriptomes in Plants
06:34

A Bioinformatics Pipeline to Accurately and Efficiently Analyze the MicroRNA Transcriptomes in Plants

Published on: January 21, 2020

8.5K

Related Experiment Videos

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics
08:09

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics

Published on: June 17, 2012

19.8K
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

692
A Bioinformatics Pipeline to Accurately and Efficiently Analyze the MicroRNA Transcriptomes in Plants
06:34

A Bioinformatics Pipeline to Accurately and Efficiently Analyze the MicroRNA Transcriptomes in Plants

Published on: January 21, 2020

8.5K

Area of Science:

  • Ecology
  • Computational Biology
  • Bioinformatics

Background:

  • Functional plant ecology relies on trait data for understanding species distributions and ecosystem functions.
  • Existing global trait datasets have limitations, and manual data extraction is time-consuming and expensive.
  • Automated extraction of plant traits from textual sources using machine learning is needed to complement field data.

Purpose of the Study:

  • To develop and evaluate a natural language processing (NLP) pipeline for extracting plant functional traits from unstructured text.
  • To compare the performance of the NLP pipeline against traditional methods like regular expressions.
  • To assess the pipeline's effectiveness for both categorical and numerical plant traits.

Main Methods:

  • A novel NLP pipeline was designed, incorporating classification models for categorical traits and question-answering models for numerical traits.
  • The pipeline was tested on two extensive databases containing over 50,000 species descriptions.
  • Various machine learning approaches, including transformer architectures and large language models, were explored.
  • Main Results:

    • The optimized NLP pipeline achieved high performance, with a mean precision of 90.8% and mean recall of 88.6% for categorical traits.
    • This represents a significant improvement over regular expression methods, with a 9.83% increase in precision and 42.35% increase in recall.
    • The question-answering model for numerical traits demonstrated a normalized mean absolute error of 10.3%.

    Conclusions:

    • The proposed NLP pipeline effectively automates the extraction of plant functional trait information from diverse textual sources.
    • This approach has the potential to greatly accelerate the digitization and utilization of vast amounts of ecological data.
    • The findings support the use of advanced NLP techniques to overcome data gaps in functional plant ecology.