Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

15.3K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
15.3K
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

1.1K
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
1.1K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

3.5K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
3.5K
Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

17.9K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
17.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Interpretable convolutional neural networks for sequence-based classification and discovery of plastic-degrading enzymes.

Applied and environmental microbiology·2026
Same author

Echo-SMADS: A hierarchical planning model for predicting ejection fraction using echocardiography.

Computer methods and programs in biomedicine·2026
Same author

Norm-based measures of inequality: A property-focused evaluation.

PloS one·2026
Same author

Prediction of bacterial protein-compound interactions with only positive samples.

Bioinformatics (Oxford, England)·2026
Same author

Environmental Influences on Blood Donation: Reducing Stress and Enhancing Motivation in Young Adults.

HERD·2026
Same author

Trustworthy prediction of enzyme commission numbers using a hierarchical interpretable transformer.

Nature communications·2026
Same journal

Poly(bromophenol blue)/CoSn(OH)<sub>6</sub> cubic particles modified pencil graphite electrode for electrochemical determination of diphenhydramine.

Scientific reports·2026
Same journal

Dietary Chlorella, Spirulina, and acidifier modulate jejunal cytokine-related gene expression in broiler chickens.

Scientific reports·2026
Same journal

Perceived physical activity barriers in university students: associations with fatigue and eating behaviours.

Scientific reports·2026
Same journal

Refuge limitation structures habitat use in agricultural landscapes: evidence from Sunda pangolins.

Scientific reports·2026
Same journal

Lightweight stateless transaction verification with outsourced witness updates for UTXO blockchains.

Scientific reports·2026
Same journal

Efficacy of historical context and exogenous features on deep learning for cooling load forecasting in chilled water plants.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Jan 17, 2026

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry
05:53

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Published on: June 21, 2018

10.6K

Stochastic LASSO for extremely high-dimensional genomic data.

Beomsu Baek1, Jongkwon Jo2,3, Mingon Kang4

  • 1Department of Computer Science, University of Nevada, Las Vegas, 89154, NV, USA.

Scientific Reports
|January 14, 2026
PubMed
Summary
This summary is machine-generated.

Stochastic LASSO enhances feature selection for high-dimensional genomic data by reducing multicollinearity and sampling randomness. This new method outperforms existing models in identifying significant biomarkers and estimating coefficients.

Keywords:
High-dimensional dataLASSOStochastic LASSOVariable selection

More Related Videos

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

4.8K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.7K

Related Experiment Videos

Last Updated: Jan 17, 2026

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry
05:53

Candidate Gene Testing in Clinical Cohort Studies with Multiplexed Genotyping and Mass Spectrometry

Published on: June 21, 2018

10.6K
Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

4.8K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.7K

Area of Science:

  • Genomics
  • Biostatistics
  • Machine Learning

Background:

  • High-dimensional data analysis is crucial for genomic studies.
  • Least Absolute Shrinkage and Selection Operator (LASSO) and its variants are used for biomarker discovery.
  • Existing bootstrap-based LASSO models face challenges like multicollinearity and sampling randomness.

Purpose of the Study:

  • To introduce Stochastic LASSO, a novel bootstrap-based method for feature selection.
  • To address limitations of existing LASSO models in extremely high-dimensional but low sample size (EHDLSS) data.
  • To improve feature selection accuracy and coefficient estimation in genomic analysis.

Main Methods:

  • Developed Stochastic LASSO, a new bootstrap-based approach.
  • Implemented techniques to reduce multicollinearity and sampling randomness.
  • Utilized a two-stage t-test strategy for statistical significance.
  • Evaluated performance through extensive simulations and TCGA cancer gene expression data.

Main Results:

  • Stochastic LASSO demonstrated superior performance in feature selection and coefficient estimation compared to benchmark models.
  • The method effectively reduced multicollinearity and sampling randomness.
  • Identified statistically significant genes associated with survival prediction in TCGA cancer datasets.
  • Showed improved robustness in simulation experiments.

Conclusions:

  • Stochastic LASSO offers a significant advancement over existing LASSO models for EHDLSS genomic data.
  • The proposed method provides a robust and accurate tool for biomarker discovery and feature selection.
  • Stochastic LASSO has practical applications in cancer genomics for survival prediction.