Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Wald-Wolfowitz Runs Test I01:17

Wald-Wolfowitz Runs Test I

The Wald-Wolfowitz test, also known as the runs test, is a nonparametric statistical test used to assess the randomness of a sequence of two different types of elements (e.g., positive/negative values, successes/failures). It examines whether the order of the elements in a sequence is random or if there is a pattern or trend present. This nonparametric test applies to any ordered data despite the population and sample data distribution, even if a higher sample size is available.
The test works...
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
Randomized Experiments01:13

Randomized Experiments

The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Do One Health Networks Compete, Coordinate, or Both? Insights from a Cross-Network Dialogue.

EcoHealth·2026
Same author

Allele-specific expression in the brain links genetic risk and cortical thinning in psychiatric disorders.

Neuropsychopharmacology : official publication of the American College of Neuropsychopharmacology·2026
Same author

Development of Highly Conductive and Stable Direct Writeable Metal-Free Inks Based on Carbon Nanotube and Functionalized Graphene Oxide.

ACS applied materials & interfaces·2025
Same author

Immune, Developmental, and Synaptic Pathways Define Bipolar Disorder Clinical Heterogeneity.

medRxiv : the preprint server for health sciences·2025
Same author

Development of a Pharmacogenomics Case Repository for Pharmacy Educators.

American journal of pharmaceutical education·2025
Same author

Turnover atlas of proteome and phosphoproteome across mouse tissues and brain regions.

Cell·2025
Same journal

Abstracts from Specialized Centers of Research Excellence (SCORE) on Sex Differences 2025 annual meeting.

BMC proceedings·2026
Same journal

Conference abstracts the 1st UDOM scientific conference on health: healthy lives and well-being for all: opportunities and challenges.

BMC proceedings·2026
Same journal

Entrepreneurship beyond the lab: commercializing your creative outputs.

BMC proceedings·2026
Same journal

The need to strengthen laboratory leadership, systems, and networks to enhance outbreak detection and resilience in Africa: proceedings of a regional workshop.

BMC proceedings·2026
Same journal

Abstracts from the Globesync Community Research and Sustainability 2025 (GlobeCoReS 2025).

BMC proceedings·2026
Same journal

Bauru International Craniofacial Symposium: Comprehensive Care, Policy and Advocacy Proceedings.

BMC proceedings·2026
See all related articles

Related Experiment Video

Updated: May 24, 2026

Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2
10:10

Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2

Published on: September 18, 2021

Performance of random forests and logic regression methods using mini-exome sequence data.

Yoonhee Kim1, Qing Li1, Cheryl D Cropp1

  • 1Inherited Disease Research Branch, National Human Genome Research Institute, National Institutes of Health, Baltimore, MD 21224, USA.

BMC Proceedings
|March 1, 2012
PubMed
Summary
This summary is machine-generated.

Machine learning methods like random forests and logic regression can identify genetic variants influencing quantitative traits. Collapsing rare variants by genes or pathways improved performance for most methods, enhancing genetic analysis.

More Related Videos

Flow-sorting and Exome Sequencing of the Reed-Sternberg Cells of Classical Hodgkin Lymphoma
08:53

Flow-sorting and Exome Sequencing of the Reed-Sternberg Cells of Classical Hodgkin Lymphoma

Published on: June 10, 2017

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

Related Experiment Videos

Last Updated: May 24, 2026

Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2
10:10

Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2

Published on: September 18, 2021

Flow-sorting and Exome Sequencing of the Reed-Sternberg Cells of Classical Hodgkin Lymphoma
08:53

Flow-sorting and Exome Sequencing of the Reed-Sternberg Cells of Classical Hodgkin Lymphoma

Published on: June 10, 2017

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

Area of Science:

  • Genetics
  • Bioinformatics
  • Statistical Modeling

Background:

  • Analyzing large-scale genetic data for quantitative traits is complex.
  • Traditional methods may have limitations with rare variants and distributional assumptions.
  • Machine learning offers alternative approaches for genetic variant detection.

Purpose of the Study:

  • To evaluate machine learning methods (random forests, logic regression) for genetic variant analysis.
  • To compare these methods against standard linear regression.
  • To assess the impact of collapsing rare variants on analytical performance.

Main Methods:

  • Utilized Genetic Analysis Workshop 17 mini-exome data.
  • Applied random forests, logic regression, and univariate linear regression.
  • Compared analyses of individual variants versus variants collapsed by genes or pathways.

Main Results:

  • Random forests and linear regression showed improved performance when rare variants were collapsed by genes or pathways.
  • Logic regression performed best when rare variants were collapsed by genes.
  • Collapsing rare variants generally enhanced the effectiveness of the tested methods.

Conclusions:

  • Machine learning, particularly random forests, is effective for analyzing genetic variants influencing quantitative traits.
  • Collapsing rare variants by biological units (genes, pathways) is a beneficial strategy for improving analytical power.
  • Method choice (e.g., logic regression vs. random forests) can influence optimal variant collapsing strategies.