Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

17.1K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
17.1K
Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

20.3K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
20.3K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

7.3K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
7.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Junctophilin-2-orchestrated calcium signalosome regulates brown adipocyte thermogenesis and energy metabolism.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Comprehensive Evaluation of Indazole-Based Synthetic Cannabinoid Receptor Agonists with Aliphatic Tail Groups: In Vitro Activities, Metabolic Stability, and In Vivo Pharmacokinetics.

ACS chemical neuroscience·2026
Same author

Short-Term Safety and Adverse Event Risk Profiles for SARS-CoV-2 Booster Vaccine Doses in Australian Adults: A Survey Study.

The Medical journal of Australia·2026
Same author

Associations between specialty registered nurse certification and patient outcomes: A nurse-patient-level analysis.

Nursing outlook·2026
Same author

Superficial primary corneal melanoma.

Canadian journal of ophthalmology. Journal canadien d'ophtalmologie·2026
Same author

Brown fat protects against hepatic oxidative stress by remodeling the circulating metabolome.

bioRxiv : the preprint server for biology·2026
Same journal

Tissue identity is the dominant determinant of cross-species transferability of a porcine developmental programme.

BMC genomics·2026
Same journal

Characterization of mitochondrial genomes from three medicinal species of rutaceae and comparative analysis within the family: insights into evolution.

BMC genomics·2026
Same journal

Comparative genomic analysis of Herbaspirillum strains associated with banana sheath rot reveals potential virulence factors.

BMC genomics·2026
Same journal

Transposable element disruption of a second thyroglobulin-like gene confers Vip3Aa resistance in Helicoverpa armigera.

BMC genomics·2026
Same journal

Molecular ontogeny of Bemisia tabaci life stages reveals miRNA-governed development and egg-stalk support for early embryogenesis.

BMC genomics·2026
Same journal

Genome-wide characterization of the accessible chromatin regions in chickpea.

BMC genomics·2026
See all related articles

Related Experiment Video

Updated: Apr 17, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.

Thanh-Tung Nguyen, Joshua Huang, Qingyao Wu

    BMC Genomics
    |February 25, 2015
    PubMed
    Summary
    This summary is machine-generated.

    A novel two-stage quality-based sampling method in Random Forests (RF) improves Genome-wide Association Studies (GWAS) by effectively selecting informative single-nucleotide polymorphisms (SNPs). This approach reduces prediction errors and enhances accuracy in identifying disease-associated genetic variants.

    More Related Videos

    Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry
    05:53

    Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

    Published on: June 21, 2018

    10.9K
    Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
    08:27

    Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

    Published on: July 27, 2021

    5.1K

    Related Experiment Videos

    Last Updated: Apr 17, 2026

    Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
    07:35

    Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

    Published on: October 11, 2018

    8.1K
    Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry
    05:53

    Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

    Published on: June 21, 2018

    10.9K
    Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
    08:27

    Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

    Published on: July 27, 2021

    5.1K

    Area of Science:

    • Genetics
    • Bioinformatics
    • Machine Learning

    Background:

    • Genome-wide association data analysis involves selecting single-nucleotide polymorphisms (SNPs), a challenging task due to high dimensionality and irrelevant data.
    • Random Forests (RF) are effective for identifying genetic variants in complex diseases but struggle with SNP selection and model accuracy in Genome-wide Association Studies (GWAS).

    Purpose of the Study:

    • To propose a new two-stage quality-based sampling method in Random Forests (ts-RF) for improved SNP subspace selection in GWAS.
    • To enhance the accuracy of prediction models and reduce the dimensionality of GWAS data.

    Main Methods:

    • A two-stage quality-based sampling method (ts-RF) was developed for SNP subspace selection in GWAS.
    • The method uses p-value assessment to categorize SNPs into informative and irrelevant groups, further dividing informative SNPs into highly and weakly informative subgroups for sampling.

    Main Results:

    • The ts-RF approach generated more accurate trees with lower prediction error, potentially avoiding overfitting.
    • Experiments on Parkinson's and Alzheimer's datasets demonstrated significant reduction in prediction errors, outperforming existing state-of-the-art random forests.
    • The model identified top SNPs, including genes associated with neurological disorders, and reduced data dimensionality.

    Conclusions:

    • The ts-RF method effectively selects informative SNP subgroups associated with diseases, outperforming traditional statistical approaches.
    • This RF model is well-suited for GWAS data where the number of cases is significantly smaller than the number of SNPs.
    • Experimental results confirm the superiority of ts-RF over existing RF methods like Breiman's RF, GRRF, and wsRF.