Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Random Sampling Method

Random Sampling Method

Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...

Classification of Systems-II

Classification of Systems-II

Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,

Classification of Systems-I

Classification of Systems-I

Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:

Random Variables

Random Variables

A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...

State Space Representation

State Space Representation

The frequency-domain technique, commonly used in analyzing and designing feedback control systems, is effective for linear, time-invariant systems. However, it falls short when dealing with nonlinear, time-varying, and multiple-input multiple-output systems. The time-domain or state-space approach addresses these limitations by utilizing state variables to construct simultaneous, first-order differential equations, known as state equations, for an nth-order system.
Consider an RLC circuit, a...

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Understanding the determinants of public trust in the health care system in China: an analysis of a cross-sectional survey.

Journal of health services research & policy·2018

Same author

Adverse Childhood Experiences, Epigenetic Measures, and Obesity in Youth.

The Journal of pediatrics·2018

Same author

Regularized Latent Class Model for Joint Analysis of High-Dimensional Longitudinal Biomarkers and a Time-to-Event Outcome.

Biometrics·2018

Same author

LncRNA UCA1 sponges miR-204-5p to promote migration, invasion and epithelial-mesenchymal transition of glioma cells via upregulation of ZEB1.

Pathology, research and practice·2018

Same author

International variations in trust in health care systems.

The International journal of health planning and management·2018

Same author

Toll-like receptor 9 negatively regulates pancreatic islet beta cell growth and function in a mouse model of type 1 diabetes.

Diabetologia·2018

Same journal

Improving Overall Risk Ranking via Subgroup-Level Information Borrowing in Survival Risk Stratification.

Statistics and its interface·2026

Same journal

High-dimensional Bayesian mediation analysis with adaptive Laplace priors.

Statistics and its interface·2026

Same journal

Imaging mediation analysis for longitudinal outcomes: a case study of childhood brain tumor survivorship.

Statistics and its interface·2025

Same journal

Variable selection for doubly robust causal inference.

Statistics and its interface·2025

Same journal

Smooth online parameter estimation for time varying VAR models with application to rat local field potential activity data.

Statistics and its interface·2025

Same journal

A Double Regression Method for Graphical Modeling of High-dimensional Nonlinear and Non-Gaussian Data.

Statistics and its interface·2025

See all related articles

Search research articles

Related Experiment Videos

Weighted random subspace method for high dimensional data classification.

Xiaoye Li¹, Hongyu Zhao

¹Susquehanna International Group L.L.P., 401 City Avenue, Bala Cynwyd, PA 19004.

Statistics and Its Interface

|September 16, 2011

Summary

This summary is machine-generated.

This study introduces a novel weighted random subspace method to improve classification accuracy for high-dimensional data, particularly in genomics. The method optimizes classifier weights, outperforming standard approaches on gene expression and mass spectrometry datasets.

Related Experiment Videos

Area of Science:

Bioinformatics
Computational Biology
Machine Learning

Background:

High-dimensional data from genomics and proteomics challenge traditional classification algorithms.
Existing feature selection methods may overfit or ignore feature interactions.
Aggregating algorithms show promise but lack optimal weight assignment strategies.

Purpose of the Study:

To address limitations in handling high-dimensional biological data.
To propose a heuristic optimization solution for classifier weight assignment.
To develop and evaluate a weighted random subspace method.

Main Methods:

Formulation of the weight assignment problem in classification.
Development of a heuristic optimization approach for assigning weights.
Application of the method to the random subspace algorithm, creating a weighted random subspace method.

Main Results:

The weighted random subspace method was applied to public gene expression and mass spectrometry datasets.
Significant improvements in classification accuracy were observed compared to equal weight assignment.
The novel method demonstrates enhanced performance on complex biological data.

Conclusions:

Optimal weight assignment can substantially improve classification accuracy in high-dimensional data analysis.
The proposed weighted random subspace method offers a promising solution for genomics and proteomics studies.
This approach effectively handles noisy features and potential interactions, advancing classification performance.