Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

15.3K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
15.3K
Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

17.9K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
17.9K
Genetic Screens02:46

Genetic Screens

5.6K
Genetic screens are tools used to identify genes and mutations responsible for phenotypes of interest. Genetic screens help identify individuals or a group of people at risk of developing  genetic diseases and help them with early intervention, targeted therapy, and reproductive options.
Forward genetic screens
Forward or “classical” genetic screens involve creating random mutations in an organism’s DNA using radiation, mutagens, or insertion of additional bases, which...
5.6K
Genomics02:02

Genomics

39.7K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
39.7K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.6K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.6K
DNA Microarrays02:34

DNA Microarrays

20.7K
Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...
20.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Statistical knockoffs improve biomarker discovery from transcriptomic data.

Briefings in bioinformatics·2026
Same author

Multimodal BEHRT: transformers for multimodal electronic health records to predict breast cancer prognosis.

Frontiers in oncology·2025
Same author

Assessing Random Forest self-reproducibility for optimal short biomarker signature discovery.

Briefings in bioinformatics·2025
Same author

Noninvasive Multicancer Detection Using DNA Hypomethylation of LINE-1 Retrotransposons.

Clinical cancer research : an official journal of the American Association for Cancer Research·2024
Same author

Drug-Target Interactions Prediction at Scale: The Komet Algorithm with the LCIdb Dataset.

Journal of chemical information and modeling·2024
Same author

Concomitant medication, comorbidity and survival in patients with breast cancer.

Nature communications·2024
Same journal

DeepMethylation: A deep learning framework for tissue-specific DNA methylation prediction and functional variant annotation.

PLoS computational biology·2026
Same journal

Redefining and estimating the early-phase reproduction ratio for epidemic outbreaks in spatially structured populations.

PLoS computational biology·2026
Same journal

Optimized phenotype definitions boost GWAS power.

PLoS computational biology·2026
Same journal

Detection, communication, and individual identification with deep audio embeddings: A case study with North Atlantic right whales.

PLoS computational biology·2026
Same journal

Exploring the structural lexicon of the Proteome via Metric Geometry.

PLoS computational biology·2026
Same journal

Linking retinal sampling in neural encoding models to temporal profiles of visual processing in humans.

PLoS computational biology·2026
See all related articles

Related Experiment Video

Updated: Jan 18, 2026

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

4.8K

Sparse multitask group Lasso for genome-wide association studies.

Asma Nouira1,2,3,4, Chloé-Agathe Azencott1,2,3

  • 1Mines ParisTech, PSL Research University, CBIO-Centre for Computational Biology, Paris, France.

Plos Computational Biology
|September 12, 2025
PubMed
Summary
This summary is machine-generated.

Sparse Multitask Group Lasso (SMuGLasso) effectively addresses population stratification in Genome-Wide Association Studies (GWAS). This novel method improves the identification of population-specific genetic variants, offering superior performance over existing techniques.

More Related Videos

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry
05:53

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Published on: June 21, 2018

10.6K
Associated Chromosome Trap for Identifying Long-range DNA Interactions
14:49

Associated Chromosome Trap for Identifying Long-range DNA Interactions

Published on: April 23, 2011

14.9K

Related Experiment Videos

Last Updated: Jan 18, 2026

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

4.8K
Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry
05:53

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Published on: June 21, 2018

10.6K
Associated Chromosome Trap for Identifying Long-range DNA Interactions
14:49

Associated Chromosome Trap for Identifying Long-range DNA Interactions

Published on: April 23, 2011

14.9K

Area of Science:

  • Genetics
  • Statistical Genetics
  • Bioinformatics

Background:

  • Population stratification, due to differing ancestry and allele frequencies across subpopulations, presents a significant challenge in Genome-Wide Association Studies (GWAS).
  • This stratification can lead to the identification of non-causal associations and mask true population-specific risk variants.

Purpose of the Study:

  • To introduce and evaluate Sparse Multitask Group Lasso (SMuGLasso), a novel statistical method designed to overcome population stratification in GWAS.
  • To enhance the accurate detection of population-specific genetic variants and improve the robustness of GWAS findings.

Main Methods:

  • SMuGLasso builds upon the Multitask Group Lasso (MuGLasso) framework, utilizing a multitask group lasso approach where tasks represent subpopulations.
  • Groups consist of population-specific Linkage-Disequilibrium (LD)-correlated Single Nucleotide Polymorphisms (SNPs).
  • A key innovation is the addition of an L1-norm regularization for selecting population-specific variants, alongside stability selection and gap-safe screening for efficiency and robustness.

Main Results:

  • SMuGLasso demonstrated superior performance compared to MuGLasso in identifying population-specific SNPs on both simulated and real datasets.
  • Evaluations on a breast cancer GWAS and an Arabidopsis thaliana GWAS confirmed SMuGLasso's effectiveness in handling LD and population stratification.
  • Pathway and network analyses of identified loci on real data corroborated the biological relevance of SMuGLasso's findings, showing greater consistency with existing literature than MuGLasso.

Conclusions:

  • SMuGLasso is a robust and computationally efficient tool for analyzing GWAS data, particularly effective in managing population stratification and linkage disequilibrium.
  • The method enhances the discovery of population-specific genetic variants, contributing to a deeper understanding of population-specific biological mechanisms and disease associations.