Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Comparison Tests01:13

Multiple Comparison Tests

4.0K
Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...
4.0K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

1.8K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
1.8K
Multiple Regression01:25

Multiple Regression

3.1K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.1K
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

139
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
139
Cluster Sampling Method01:20

Cluster Sampling Method

12.3K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
12.3K
Contingency Table01:29

Contingency Table

2.6K
A contingency table provides a way of portraying data that can facilitate calculating probabilities. It is a method of displaying a frequency distribution as a table with rows and columns to show how two variables may be dependent (contingent) upon each other; The table helps determine conditional probabilities quite quickly and can help systematically organize, analyze and quantify data. The table displays sample values concerning two variables that may be dependent or contingent on one...
2.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Stump Appendicitis: A 25-Year Review of Pathophysiology, Diagnosis, and Management (2000-2025).

Cureus·2026
Same author

Incidentally Detected Low-Grade Appendiceal Mucinous Neoplasm Following Laparoscopic Appendectomy for Acute Appendicitis.

Cureus·2026
Same author

Incidentally Detected Gallbladder Adenocarcinoma Presenting as Acute Calculous Cholecystitis: A Case Report and Management Considerations.

Cureus·2026
Same author

SenSet defines cell-type specific senescence signatures in the aged human lung.

The EMBO journal·2026
Same author

Emergence of Local Ordering and Mesoscale Giant Number Fluctuations in Active Turbulence.

Physical review letters·2026
Same author

MolQuery: Prediction of Lipid Synthesizability Using Active Learning.

ACS omega·2026
Same journal

A computational method to design broad-spectrum T cell-inducing vaccines applied to Betacoronaviruses.

Cell reports methods·2026
Same journal

MalDeepSeq panel: A targeted ultra-deep sequencing approach to trace drug resistance markers in Plasmodium falciparum.

Cell reports methods·2026
Same journal

Induced pluripotent stem cell-derived macrophages enable broad modeling of human inflammasome signaling.

Cell reports methods·2026
Same journal

Rapid discovery of cell-surface glycosylation regulators using a lectin-based magnetic CRISPR screen.

Cell reports methods·2026
Same journal

A real-time FRET ubiquitin transfer assay for quantitative characterization of ternary complexes in targeted protein degradation.

Cell reports methods·2026
Same journal

A high-throughput, end-to-end pipeline for extracellular miRNA biomarker discovery from human biofluids.

Cell reports methods·2026
See all related articles

Related Experiment Video

Updated: Aug 19, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K

Multiset multicover methods for discriminative marker selection.

Euxhen Hasanaj1, Amir Alavi2, Anupam Gupta3

  • 1Machine Learning Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA.

Cell Reports Methods
|December 1, 2022
PubMed
Summary
This summary is machine-generated.

New algorithms address the phenotype cover (PC) problem for selecting gene markers in large biological datasets. These methods improve the discriminative power of marker sets for accurate cell type identification in high-throughput studies.

Keywords:
algorithmbiomarkercross-entropy methodgene setsmarker discoverymultiset multicoverphenotype coverscRNA-seqset cover

More Related Videos

Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

16.9K
Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

20.0K

Related Experiment Videos

Last Updated: Aug 19, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K
Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

16.9K
Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

20.0K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Genomics

Background:

  • Markers are crucial for high-throughput data analysis, including cell type assignment in single-cell RNA sequencing (scRNA-seq) and deconvolving bulk gene expression.
  • Current marker selection methods, primarily based on differential expression (DE) analysis, are inadequate for large-scale datasets with numerous cell types and tissues.

Purpose of the Study:

  • To address the limitations of existing marker selection methods for large, complex biological datasets.
  • To introduce a novel framework, the phenotype cover (PC) problem, for marker selection.
  • To develop and present algorithms that enhance the discriminative power of marker sets.

Main Methods:

  • Definition of the phenotype cover (PC) problem tailored for marker selection in large datasets.
  • Development of new algorithms designed to solve the PC problem.
  • Evaluation of the proposed algorithms on various marker-selection tasks using high-throughput biological data.

Main Results:

  • The developed algorithms effectively address the phenotype cover problem.
  • The proposed marker selection methods demonstrate improved discriminative power compared to traditional approaches.
  • Analysis confirms the ability of these methods to accurately distinguish different phenotypes within complex datasets.

Conclusions:

  • The novel phenotype cover (PC) problem and associated algorithms offer a significant advancement for marker selection in large-scale biological studies.
  • These methods provide accurate solutions for distinguishing phenotypes, particularly in scRNA-seq and spatial proteomics.
  • The findings pave the way for more robust and precise analysis of complex biological systems using marker-based approaches.