Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

4.0K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
4.0K
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

1.5K
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
1.5K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.2K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.2K
Multiple Regression01:25

Multiple Regression

3.4K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.4K
Outliers and Influential Points01:08

Outliers and Influential Points

5.2K
An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...
5.2K
Linear Approximation in Frequency Domain01:26

Linear Approximation in Frequency Domain

505
Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....
505

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Open Straightening and Direct Puncture of the Superficial Temporal Artery for Tumor Embolization via Transosseous Feeders.

Cureus·2026
Same author

A case of left internal carotid artery-posterior communicating artery aneurysm with right aortic arch treated with coil embolization by direct carotid puncture.

Surgical neurology international·2026
Same author

Instance-dependent Early Stopping for Adaptive Data Pruning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

Class-Distribution-Aware Pseudo-Labeling for Semi-Supervised Multi-Label Learning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

Atlas posterior arch defect with Atlanto-occipital assimilation: Two case reports highlighting fracture mimicry and retro-odontoid pseudotumor.

Surgical neurology international·2026
Same author

Gross-total resection of a large conus-region intradural extramedullary schwannoma with dense adhesion: Operative video and 11-year follow-up.

Surgical neurology international·2026
Same journal

A Model-Free Reinforcement Learning Implementation of Decision Making Under Uncertainty by Sequential Sampling.

Neural computation·2026
Same journal

DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning.

Neural computation·2026
Same journal

Hierarchical Active Inference Using Successor Representations.

Neural computation·2026
Same journal

W-Kernel and Its Principal Space for Frequentist Evaluation of Bayesian Estimators.

Neural computation·2026
Same journal

A Hidden Markov Model-Inspired Sequence Classification Method for Hyperdimensional Computing.

Neural computation·2026
Same journal

Sparse Graphical Modeling for Electrophysiological Phase-Based Connectivity Using Circular Statistics.

Neural computation·2026
See all related articles

Related Experiment Video

Updated: May 7, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.0K

High-dimensional feature selection by feature-wise kernelized Lasso.

Makoto Yamada1, Wittawat Jitkrittum, Leonid Sigal

  • 1Yahoo Labs, 701 1st Ave., Sunnyvale, CA 94098, U.S.A. makotoy@yahoo-inc.com.

Neural Computation
|October 10, 2013
PubMed
Summary
This summary is machine-generated.

This study introduces kernelized Lasso for effective feature selection, capturing nonlinear relationships. The method efficiently identifies relevant features for high-dimensional classification and regression tasks.

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.6K
Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data
04:57

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

17.4K

Related Experiment Videos

Last Updated: May 7, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.0K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.6K
Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data
04:57

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

17.4K

Area of Science:

  • Machine Learning
  • Statistical Learning Theory

Background:

  • Supervised feature selection aims to identify predictive input features.
  • The standard Lasso method efficiently selects features based on linear dependencies.
  • Capturing nonlinear input-output relationships remains a challenge for traditional Lasso.

Purpose of the Study:

  • To develop a kernelized Lasso method for feature selection that captures nonlinear dependencies.
  • To enable efficient computation of globally optimal solutions for high-dimensional data.
  • To demonstrate the method's effectiveness in classification and regression tasks.

Main Methods:

  • A feature-wise kernelized Lasso approach is proposed.
  • Kernel functions are utilized to capture nonlinear input-output dependencies.
  • Kernel-based independence measures, like the Hilbert-Schmidt independence criterion, are employed.

Main Results:

  • The method successfully identifies nonredundant features with strong statistical dependence.
  • Efficient computation of globally optimal solutions is achieved, ensuring scalability.
  • Experiments show effectiveness on datasets with thousands of features for classification and regression.

Conclusions:

  • Kernelized Lasso offers a scalable and effective solution for feature selection in the presence of nonlinear dependencies.
  • The approach enhances the capabilities of Lasso for complex, high-dimensional datasets.
  • This method advances feature selection techniques in machine learning and statistical modeling.