Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Random Sampling Method01:09

Random Sampling Method

Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
Classification of Systems-II01:31

Classification of Systems-II

Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,
Classification of Systems-I01:26

Classification of Systems-I

Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:
Random Variables01:09

Random Variables

A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...
State Space Representation01:27

State Space Representation

The frequency-domain technique, commonly used in analyzing and designing feedback control systems, is effective for linear, time-invariant systems. However, it falls short when dealing with nonlinear, time-varying, and multiple-input multiple-output systems. The time-domain or state-space approach addresses these limitations by utilizing state variables to construct simultaneous, first-order differential equations, known as state equations, for an nth-order system.
Consider an RLC circuit, a...
Cluster Sampling Method01:20

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Understanding the determinants of public trust in the health care system in China: an analysis of a cross-sectional survey.

Journal of health services research & policy·2018
Same author

Adverse Childhood Experiences, Epigenetic Measures, and Obesity in Youth.

The Journal of pediatrics·2018
Same author

Regularized Latent Class Model for Joint Analysis of High-Dimensional Longitudinal Biomarkers and a Time-to-Event Outcome.

Biometrics·2018
Same author

LncRNA UCA1 sponges miR-204-5p to promote migration, invasion and epithelial-mesenchymal transition of glioma cells via upregulation of ZEB1.

Pathology, research and practice·2018
Same author

International variations in trust in health care systems.

The International journal of health planning and management·2018
Same author

Toll-like receptor 9 negatively regulates pancreatic islet beta cell growth and function in a mouse model of type 1 diabetes.

Diabetologia·2018
Same journal

Improving Overall Risk Ranking via Subgroup-Level Information Borrowing in Survival Risk Stratification.

Statistics and its interface·2026
Same journal

High-dimensional Bayesian mediation analysis with adaptive Laplace priors.

Statistics and its interface·2026
Same journal

Imaging mediation analysis for longitudinal outcomes: a case study of childhood brain tumor survivorship.

Statistics and its interface·2025
Same journal

Variable selection for doubly robust causal inference.

Statistics and its interface·2025
Same journal

Smooth online parameter estimation for time varying VAR models with application to rat local field potential activity data.

Statistics and its interface·2025
Same journal

A Double Regression Method for Graphical Modeling of High-dimensional Nonlinear and Non-Gaussian Data.

Statistics and its interface·2025
See all related articles

Related Experiment Videos

Weighted random subspace method for high dimensional data classification.

Xiaoye Li1, Hongyu Zhao

  • 1Susquehanna International Group L.L.P., 401 City Avenue, Bala Cynwyd, PA 19004.

Statistics and Its Interface
|September 16, 2011
PubMed
Summary
This summary is machine-generated.

This study introduces a novel weighted random subspace method to improve classification accuracy for high-dimensional data, particularly in genomics. The method optimizes classifier weights, outperforming standard approaches on gene expression and mass spectrometry datasets.

Related Experiment Videos

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Machine Learning

Background:

  • High-dimensional data from genomics and proteomics challenge traditional classification algorithms.
  • Existing feature selection methods may overfit or ignore feature interactions.
  • Aggregating algorithms show promise but lack optimal weight assignment strategies.

Purpose of the Study:

  • To address limitations in handling high-dimensional biological data.
  • To propose a heuristic optimization solution for classifier weight assignment.
  • To develop and evaluate a weighted random subspace method.

Main Methods:

  • Formulation of the weight assignment problem in classification.
  • Development of a heuristic optimization approach for assigning weights.
  • Application of the method to the random subspace algorithm, creating a weighted random subspace method.

Main Results:

  • The weighted random subspace method was applied to public gene expression and mass spectrometry datasets.
  • Significant improvements in classification accuracy were observed compared to equal weight assignment.
  • The novel method demonstrates enhanced performance on complex biological data.

Conclusions:

  • Optimal weight assignment can substantially improve classification accuracy in high-dimensional data analysis.
  • The proposed weighted random subspace method offers a promising solution for genomics and proteomics studies.
  • This approach effectively handles noisy features and potential interactions, advancing classification performance.