Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Allele Traits01:49

Multiple Allele Traits

34.4K
The Concept of Multiple Allelism
34.4K
Quantitative Analysis01:12

Quantitative Analysis

348
Quantitative analysis is a technique for measuring the amount of specific constituents in a sample. When the sample's composition is unknown, qualitative analysis is performed first to identify its components, which ensures that the correct substances are measured during the quantitative phase.
In quantitative analysis, two key measurements are made: the sample quantity and a property proportional to the amount of the analyte (the substance being analyzed). This forms the basis of the...
348
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

13.7K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
13.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Comparison of Deep Learning Tools for Optic Nerve Axon Quantification Finds Limited Generalizability Upon Independent Validation.

Bioengineering (Basel, Switzerland)·2026
Same author

Development of a Core Outcome Domain Set for Facial Aging.

JAMA dermatology·2026
Same author

Genome-wide association study of cocaine self-administration behavior in Heterogeneous Stock rats.

Nature communications·2026
Same author

Management of Post-Procedure Eruptive Keratoacanthomas: A Retrospective Cohort Study.

Journal of the American Academy of Dermatology·2026
Same author

Genetic Architecture of Addiction-Relevant Behaviors in Outbred Sprague-Dawley Rats Reveals Loci for Anxiety-Like and Nociceptive Traits.

Genes, brain, and behavior·2026
Same author

Substrain-specific behavioral variation in female C57BL/6 and C57BL/10 mice.

Frontiers in behavioral neuroscience·2026
Same journal

Interpretable machine learning for Parkinson's disease diagnosis, staging, and biological mechanism exploration: a multicenter analysis.

BioData mining·2026
Same journal

Learning a distance for the clustering of patients with amyotrophic lateral sclerosis.

BioData mining·2026
Same journal

Multi-domain feature fusion with variational mode decomposition and hybrid LightGBM-Logistic Regression for multi-class seizure classification.

BioData mining·2026
Same journal

Large-scale transcriptomic data mining using explainable XGBoost and SHAP reveals shared biomarkers and molecular mechanisms between type-2 diabetes and triple-negative breast cancer for drug repurposing.

BioData mining·2026
Same journal

AVSeg-XAI: Deep learning framework for A/V segmentation with vascular features reveals retinal oculomics as biomarker for cardiovascular disease.

BioData mining·2026
Same journal

Navigating the uncharted: AI-driven advances in protein structure, dynamics, interactions and ligand interactions for understudied families.

BioData mining·2026
See all related articles

Related Experiment Video

Updated: Aug 3, 2025

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

3.8K

Automated quantitative trait locus analysis (AutoQTL).

Philip J Freda1, Attri Ghosh1, Elizabeth Zhang1

  • 1Department of Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vicente Blvd., Pacific Design Center, Suite G540, West Hollywood, CA, 90069, USA.

Biodata Mining
|April 10, 2023
PubMed
Summary
This summary is machine-generated.

This article introduces AutoQTL, an automated machine learning tool designed to streamline the complex process of identifying genetic variants linked to specific traits. By reducing the manual effort required for data preparation and parameter selection, this system helps researchers uncover both simple and complex genetic interactions. The authors demonstrate its effectiveness using rat body mass index data, showing it can identify standard genetic effects as well as more intricate interactions between genes. This approach offers a powerful new way to analyze large-scale genetic datasets more efficiently.

Keywords:
AutomatedDominanceEpistasisEvolutionary algorithmsGWASGenetic programmingInheritanceMachine learningQTLGenome-Wide Association StudiesMachine LearningPhenotypic VarianceEpistatic InteractionsGenetic Mapping

Frequently Asked Questions

More Related Videos

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation
07:15

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation

Published on: January 16, 2019

11.1K
Automating Aggregate Quantification in Caenorhabditis elegans
07:50

Automating Aggregate Quantification in Caenorhabditis elegans

Published on: October 14, 2021

2.8K

Related Experiment Videos

Last Updated: Aug 3, 2025

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

3.8K
Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation
07:15

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation

Published on: January 16, 2019

11.1K
Automating Aggregate Quantification in Caenorhabditis elegans
07:50

Automating Aggregate Quantification in Caenorhabditis elegans

Published on: October 14, 2021

2.8K

Area of Science:

  • Computational genetics and AutoQTL methodology
  • Bioinformatics and statistical genomics

Background:

Genetic mapping techniques often struggle to balance computational efficiency with the depth of biological insight required for complex traits. Researchers frequently face significant hurdles when attempting to optimize parameters for large-scale genomic investigations. That uncertainty drove the development of new strategies to handle massive, heterogeneous datasets more effectively. Prior research has shown that standard statistical models often overlook intricate non-linear relationships between genetic variants and phenotypes. Manual selection of analytical pipelines remains a time-consuming bottleneck for many laboratories worldwide. No prior work had resolved the challenge of automating these diverse decision-making processes within a single unified framework. This gap motivated the creation of tools that integrate machine learning to assist in data processing and model selection. Such innovations aim to simplify the identification of genetic markers while maintaining high levels of accuracy across varied experimental conditions.

Purpose Of The Study:

The study aims to describe a proof-of-concept for an automated machine learning approach designed to analyze complex genetic traits. Researchers sought to address the significant time and effort required for manual parameter optimization in genomic investigations. The project focuses on automating complicated decision-making processes that occur during the analysis of large, heterogeneous datasets. By creating a unified framework, the authors intend to simplify the identification of genetic variants that capture phenotypic variance. The motivation stems from the difficulty of applying standard statistical methods to increasingly complex and massive genomic information. This work explores whether machine learning can effectively complement traditional association studies to improve analytical efficiency. The investigators also aim to demonstrate the ability of their tool to detect both additive and non-additive genetic effects. Ultimately, the research provides a foundation for more intelligent feature selection and engineering strategies in future genomic analyses.

Main Methods:

The investigators developed a proof-of-concept framework to automate decision-making in the analysis of complex genetic traits. Their review approach involved testing the software against a publicly available dataset containing 18 putative loci. This validation set originated from a large-scale study of body mass index in laboratory rats. The team implemented machine learning algorithms to handle parameter optimization and data pre-processing tasks automatically. They evaluated the system by comparing its output against standard additive models typically used in association studies. The researchers also utilized simulated data to assess the ability of the tool to detect non-additive effects. Feature importance metrics were calculated to provide insights into the predictive power of the identified genetic markers. This systematic evaluation confirms the capacity of the software to generate multiple optimal solutions for describing genetic relationships.

Main Results:

The primary finding shows that the software successfully captures phenotypic variance explained under a standard additive model using rat body mass index data. Key findings from the literature suggest that the tool also detects evidence of non-additive effects in simulated datasets. Specifically, the system identifies deviations from additivity and two-way epistatic interactions through multiple optimal solutions. Feature importance metrics provide distinct insights into the inheritance models of various putative loci derived from association studies. The results demonstrate that automated techniques can complement traditional approaches by uncovering complex genetic relationships. The study confirms that the tool manages complicated analytical decisions that often require extensive manual input. These findings illustrate the potential of machine learning to enhance the depth of genomic investigations. The researchers report that these automated strategies consistently provide reliable outputs across different testing scenarios.

Conclusions:

The authors demonstrate that automated machine learning can successfully complement traditional statistical methods in genomic research. Their findings suggest that AutoQTL effectively identifies both additive and non-additive genetic effects within complex datasets. The study highlights how multiple optimal solutions provide a more comprehensive view of the underlying genetic architecture. Feature importance metrics offer valuable insights into the predictive power of specific genetic variants. These results indicate that automated systems can handle complicated analytical decisions that typically require extensive manual intervention. The researchers propose that such tools are capable of uncovering epistatic interactions that standard models might otherwise miss. This synthesis implies that machine learning integration could significantly enhance the efficiency of large-scale association studies. Future iterations of this technology may accommodate even larger omics-level data structures through advanced feature engineering.

The researchers propose that AutoQTL identifies genetic relationships by automating parameter optimization and model selection. It captures phenotypic variance using an additive model while simultaneously detecting non-additive effects, such as two-way epistatic interactions, through multiple optimal solutions.

The authors utilize feature importance metrics to evaluate the inheritance models and predictive strength of putative quantitative trait loci. These metrics allow the system to rank different genetic variants based on their contribution to phenotypic variance within the analyzed datasets.

The researchers explain that the complexity of large, heterogeneous datasets necessitates automated approaches. Manual selection of methods and pre-processing steps is time-consuming, making automated machine learning essential for efficiently managing the high-dimensional data typical of modern genome-wide association studies.

AutoQTL processes genetic data by integrating machine learning to handle complex decisions regarding trait analysis. It specifically uses a publicly available dataset of 18 putative quantitative trait loci from a large-scale study of body mass index in Rattus norvegicus.

The researchers measure the effectiveness of their tool by comparing its performance against standard additive models. They specifically look for deviations from additivity and the presence of two-way epistatic interactions, which are key indicators of complex genetic architecture.

The authors propose that their automated approach will eventually support omics-level datasets. They intend to incorporate intelligent feature selection and advanced engineering strategies to expand the utility of the software for broader genomic applications.