Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Correlations02:20

Correlations

34.7K
Correlation means that there is a relationship between two or more variables (such as ice cream consumption and crime), but this relationship does not necessarily imply cause and effect. When two variables are correlated, it simply means that as one variable changes, so does the other. We can measure correlation by calculating a statistic known as a correlation coefficient. A correlation coefficient is a number from -1 to +1 that indicates the strength and direction of the relationship between...
34.7K
Extraction: Advanced Methods00:56

Extraction: Advanced Methods

1.2K
Metal ions can be separated from one another by complexation with organic ligands–the chelating agent– to form uncharged chelates. Here, the chelating agent must contain hydrophobic groups and behave as a weak acid, losing a proton to bind with the metal. Since most organic ligands used in this process are insoluble or undergo oxidation in the aqueous phase, the chelating agent is initially added to the organic phase and extracted into the aqueous phase. The metal-ligand complex is...
1.2K
Correlation01:09

Correlation

12.2K
In statistics, two variables are said to be correlated if the values of one variable are associated with the other variable. Depending on the relationship between two variables, correlation can be of three types– positive correlation, negative correlation, and zero correlation.
Two variables, for example, a and b, are said to be positively correlated if both variables move in the same direction. In other words, a positive correlation exists between two variables, a and b, if:
12.2K
Correlation and Regression00:53

Correlation and Regression

3.8K
In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...
3.8K
Data Collection by Observations01:08

Data Collection by Observations

11.0K
Data collection refers to a systematic way of obtaining, observing, measuring, and analyzing accurate information. Observational studies are one of the most widely used methods of data collection. It involves collecting data by observing the behavior and physical characteristics of a sample without making any modifications to the sample.
An astronomer viewing the motion and brightness of stars in the sky and recording the data is an example of observational data collection. A botanist recording...
11.0K
Ecological Niches02:02

Ecological Niches

22.3K
All organisms have a position within an ecosystem. The complete set of living and nonliving factors—including food resources, climate, and terrain—that define the position of a given organism are collectively referred to as the organism’s ecological niche.
22.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Loss of Hfe impairs hematopoietic stem and progenitor cell function.

Blood. Red cells & iron·2026
Same author

Integrating plot-based methods for monitoring biodiversity in island habitats under the scope of BIODIVERSA+ project BioMonI: Beetle monitoring in Pico and Terceira Islands.

Biodiversity data journal·2026
Same author

Are <i>Sepsis</i> flies Batesian mimics of ants?

Current zoology·2026
Same author

Subterranean environments contribute to three-quarters of classified ecosystem services.

Biological reviews of the Cambridge Philosophical Society·2026
Same author

A comprehensive trait dataset for Terrestrial Arthropods of the Azores: insights for conservation, island ecology and species invasion.

Biodiversity data journal·2026
Same author

Species inventory and morphological measurements of spiders (Arachnida, Araneae) and ants (Insecta, Hymenoptera, Formicidae) collected in northern Ghana.

Biodiversity data journal·2026
Same journal

The First Functional Traits Dataset for the Endemic Flora of Greece: morphology, ecology and ecosystem services.

Biodiversity data journal·2026
Same journal

Environmental drivers of Odonate assemblages across biogeographic provinces in the Mexican Transition Zone.

Biodiversity data journal·2026
Same journal

The stoneflies (Plecoptera) of West Virginia: a new occurrence dataset.

Biodiversity data journal·2026
Same journal

16S rRNA sequence dataset for the identification of the French hoverflies (Diptera, Syrphidae).

Biodiversity data journal·2026
Same journal

Description of a new species of the orb-weaver spider genus <i>Zangaraneus</i> Mi, Wang & Feng, 2026 (Araneae, Araneidae) from China.

Biodiversity data journal·2026
Same journal

Rosario: An algorithm to analyse cyclical data in Ecology.

Biodiversity data journal·2026
See all related articles
  1. Home
  2. Recode - Relational Ecological Corpus For Data Extraction.
  1. Home
  2. Recode - Relational Ecological Corpus For Data Extraction.

Related Experiment Video

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

15.8K

RECODE - Relational Ecological COrpus for Data Extraction.

Vasco V Branco1,2,3, Lidia Pivovarova4, Kari-E J Lintulaakso1

  • 1Finnish Museum of Natural History LUOMUS, University of Helsinki, Helsinki, Finland Finnish Museum of Natural History LUOMUS, University of Helsinki Helsinki Finland.

Biodiversity Data Journal
|April 27, 2026

View abstract on PubMed

Summary
This summary is machine-generated.

Researchers created RECODE, a new dataset for training AI models to extract species occurrence and trait data from scientific texts. This resource aids ecological research by making unstructured data machine-readable.

Keywords:
biodiversityfunctional diversityinsectslarge language modelsmachine learningnamed entity recognitionnatural language processingoccurrence dataspecies traitsspiders

More Related Videos

RBDT: A Computerized Task System based in Transposition for the Continuous Analysis of Relational Behavior Dynamics in Humans
11:09

RBDT: A Computerized Task System based in Transposition for the Continuous Analysis of Relational Behavior Dynamics in Humans

Published on: July 17, 2021

2.5K
Integrating Computerized Linguistic and Social Network Analyses to Capture Addiction Recovery Capital in an Online Community
08:53

Integrating Computerized Linguistic and Social Network Analyses to Capture Addiction Recovery Capital in an Online Community

Published on: May 31, 2019

5.8K

Related Experiment Videos

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

15.8K
RBDT: A Computerized Task System based in Transposition for the Continuous Analysis of Relational Behavior Dynamics in Humans
11:09

RBDT: A Computerized Task System based in Transposition for the Continuous Analysis of Relational Behavior Dynamics in Humans

Published on: July 17, 2021

2.5K
Integrating Computerized Linguistic and Social Network Analyses to Capture Addiction Recovery Capital in an Online Community
08:53

Integrating Computerized Linguistic and Social Network Analyses to Capture Addiction Recovery Capital in an Online Community

Published on: May 31, 2019

5.8K

Area of Science:

  • Ecology
  • Conservation Biology
  • Taxonomy

Background:

  • Ecological research relies heavily on species location and trait data.
  • This crucial data is often unstructured text in publications, especially for invertebrates.
  • Automated data extraction using AI is challenging due to the need for labeled training corpora.

Purpose of the Study:

  • To introduce RECODE, a manually annotated corpus of ecological and taxonomic literature.
  • To facilitate the training and fine-tuning of AI models for automated data extraction.
  • To address the lack of standard datasets for ecological and taxonomic text mining.

Main Methods:

  • Manual annotation of ecological and taxonomic literature by subject matter experts.
  • Validation of annotations by experts familiar with the traits of spiders and insects.
  • Development of a corpus for training AI models in named entity recognition and relation extraction.
  • Main Results:

    • Creation of RECODE, a manually annotated corpus of scientific literature.
    • The corpus contains validated occurrence and trait data for spiders and insects.
    • RECODE provides a standardized dataset for AI model development in ecology.

    Conclusions:

    • RECODE is a valuable resource for advancing automated data extraction in ecology and conservation.
    • The availability of such annotated corpora is essential for improving AI model performance.
    • This work supports the development of more efficient methods for ecological data mining.