Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Microbial Classification System01:24

Microbial Classification System

478
Classification is the process of organizing organisms into hierarchically inclusive groups based on their phenotypic similarities or evolutionary relationships. A species comprises one or more strains, and closely related species are grouped into genera. Genera are further classified into families, families into orders, orders into classes, and so forth, up to the domain level, which is the broadest taxonomic rank derived from a combination of phenotypic and genotypic data.The nomenclature of...
478
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

39.2K
A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
39.2K
Classification of Systems-II01:31

Classification of Systems-II

319
Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,
319
Fungal Phylum Basidiomycota01:26

Fungal Phylum Basidiomycota

500
Basidiomycota is a diverse phylum of fungi that includes ecologically significant decomposers such as white rot fungi, symbionts like mycorrhizal fungi, plant pathogens such as rusts and smuts, and edible species like Agaricus bisporus (the common button mushroom). These fungi play crucial roles in nutrient cycling, symbiotic relationships, and even human health. Their defining feature is the basidium, a microscopic club-shaped structure responsible for producing basidiospores.Fruiting Bodies...
500
Classification of Systems-I01:26

Classification of Systems-I

403
Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:
403
Cluster Sampling Method01:20

Cluster Sampling Method

13.5K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
13.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The Open Syndrome Definition as a Machine-Readable Standard for Public Health: Design and Implementation Study.

Journal of medical Internet research·2026
Same author

Clinical, Dietary, Lifestyle and Genetic Factors Associated With Age at Onset of Esophageal Adenocarcinoma.

United European gastroenterology journal·2026
Same author

Evaluating ensemble learning approaches for horizontal gene transfer detection.

Scientific reports·2026
Same author

Generalization of ML Models Between ECG and VCG Representation.

Studies in health technology and informatics·2026
Same author

DicomShield: A Pseudonymization Proxy for the Secondary Use of Imaging Data in the Research Context.

Studies in health technology and informatics·2026
Same author

Harnessing generative AI for predicting and optimizing antimicrobial peptides against drug-resistant infections.

npj antimicrobials and resistance·2026
Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026
Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026
Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026
Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026
Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026
Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Nov 9, 2025

Mycorrhizal Maps as a Tool to Explore Colonization Patterns and Fungal Strategies in the Roots of Festuca rubra and Zea mays
08:28

Mycorrhizal Maps as a Tool to Explore Colonization Patterns and Fungal Strategies in the Roots of Festuca rubra and Zea mays

Published on: August 26, 2022

3.0K

Mushroom data creation, curation, and simulation to support classification tasks.

Dennis Wagner1, Dominik Heider1, Georges Hattab2

  • 1Department of Mathematics and Computer Science, University of Marburg, 35043, Marburg, Germany.

Scientific Reports
|April 15, 2021
PubMed
Summary
This summary is machine-generated.

This study introduces the largest mushroom dataset for classifying edible vs. poisonous species. Random Forests (RF) achieved perfect accuracy, indicating complex, non-linear relationships in the data.

More Related Videos

A Method to Define the Effects of Environmental Enrichment on Colon Microbiome Biodiversity in a Mouse Colon Tumor Model
08:14

A Method to Define the Effects of Environmental Enrichment on Colon Microbiome Biodiversity in a Mouse Colon Tumor Model

Published on: February 28, 2018

9.0K
Investigating Bacterial-Fungal Interactions using Fungal Highway Columns in Diverse Environments and Substrates
05:22

Investigating Bacterial-Fungal Interactions using Fungal Highway Columns in Diverse Environments and Substrates

Published on: January 24, 2025

553

Related Experiment Videos

Last Updated: Nov 9, 2025

Mycorrhizal Maps as a Tool to Explore Colonization Patterns and Fungal Strategies in the Roots of Festuca rubra and Zea mays
08:28

Mycorrhizal Maps as a Tool to Explore Colonization Patterns and Fungal Strategies in the Roots of Festuca rubra and Zea mays

Published on: August 26, 2022

3.0K
A Method to Define the Effects of Environmental Enrichment on Colon Microbiome Biodiversity in a Mouse Colon Tumor Model
08:14

A Method to Define the Effects of Environmental Enrichment on Colon Microbiome Biodiversity in a Mouse Colon Tumor Model

Published on: February 28, 2018

9.0K
Investigating Bacterial-Fungal Interactions using Fungal Highway Columns in Diverse Environments and Substrates
05:22

Investigating Bacterial-Fungal Interactions using Fungal Highway Columns in Diverse Environments and Substrates

Published on: January 24, 2025

553

Area of Science:

  • Mycology
  • Machine Learning
  • Data Science

Background:

  • Accurate mushroom identification is crucial for public safety.
  • Existing datasets may lack comprehensiveness for robust machine learning models.
  • Developing reliable classification rules for edible and poisonous mushrooms is a key challenge.

Purpose of the Study:

  • To create and curate the largest attribute-based dataset for mushroom classification.
  • To evaluate machine learning algorithms for predicting mushroom edibility.
  • To provide a reproducible workflow and FAIR data for future research.

Main Methods:

  • Utilized natural language processing on a mushroom identification textbook to generate primary data.
  • Included simulated and hypothetical entries for comprehensive data simulation.
  • Evaluated Naive Bayes, Logistic Regression, Linear Discriminant Analysis (LDA), and Random Forests (RF).

Main Results:

  • Random Forests (RF) achieved perfect five-fold Cross-Validation accuracy (1.0) and F2-score.
  • The developed dataset demonstrated that mushroom classification is not linearly separable.
  • The dataset comprises 173 species from 23 families, making it the largest available.

Conclusions:

  • The curated dataset and RF model offer a powerful tool for mushroom edibility prediction.
  • The non-linear separability highlights the complexity of mushroom identification.
  • The study provides a reproducible and FAIR-compliant resource for mycological research.