Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Biodiversity and Human Values01:24

Biodiversity and Human Values

16.6K
Human civilization relies on biodiversity in many ways. Sudden changes in species biodiversity result in environmental changes that can modify weather patterns and therefore human civilizations.
16.6K
Professional Values01:29

Professional Values

10.5K
Nurses are responsible for caring for patients during birth, death, illness, and healing. Professional values guide the decisions and actions that nurses make in their careers. If nurses know the decisions and actions to take, providing patients with exceptional care is possible.
The values that are the foundation of the nursing profession are altruism, autonomy, human dignity, and social justice.
First, altruism refers to the concern for the welfare and well-being of others without personal...
10.5K
Critical Values01:31

Critical Values

10.2K
A critical value is a definite value obtained from a particular probability distribution at a predecided confidence level (or a predecided significance level) for a given population parameter. The critical value provides demarcation that separates the sample statistics that are likely to occur from the ones that are unlikely to occur based on the given probability distribution and the population parameter to be estimated. The critical value for normal distribution is obtained from the z...
10.2K
z Scores and Unusual Values01:07

z Scores and Unusual Values

11.0K
The z score is one of the three measures of relative standing. It describes the location of a value in a dataset relative to the mean. z scores are obtained after the standardization of the values in a dataset. The z score for the mean is 0.
 This score indicates how far a value is from the mean in terms of standard deviation. For example, if a data value has a z score of +1, the researcher can infer that the particular data value is one standard deviation above the mean. If another data...
11.0K
Absolute and Local Extreme Values01:22

Absolute and Local Extreme Values

58
The highest and lowest values of a function, relative to a reference axis, are known as extreme values. These include absolute maximum and absolute minimum values, which represent the highest and lowest points the function reaches across its entire domain. Within a restricted portion of the function, the highest and lowest values are referred to as local maximum and local minimum values, respectively.Periodic functions, such as sine and cosine, show extreme values at infinitely many points due...
58
Finding Critical Values for Chi-Square01:18

Finding Critical Values for Chi-Square

4.3K
Consider a curve representing sample data drawn randomly from a normally distributed population. One must construct confidence intervals to estimate or to test a claim regarding the population standard deviation. For example, a 95% confidence interval covers 95% of the area under the curve, and the remaining 5% is equally distributed on either side of the curve. To achieve such confidence intervals, one must determine the critical values. The critical values are simply the values separating the...
4.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

An integrated computational antigen discovery pipeline with hierarchical filtering for emerging viral variants.

NAR molecular medicine·2026
Same author

Enhancing protein immunogenicity prediction via uncertainty weighted deep ensemble.

Oxford open immunology·2026
Same author

ImmUQBench: a benchmark on uncertainty quantification of protein immunogenicity prediction.

Oxford open immunology·2026
Same author

Epidemiological model calibration via graybox Bayesian optimization.

Infectious Disease Modelling·2026
Same author

Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same author

CYP1B1-AS1 regulates CYP1B1 to promote Coxiella burnetii pathogenesis by inhibiting ROS and host cell death.

Nature communications·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Jan 23, 2026

CRISPR Gene Editing Tool for MicroRNA Cluster Network Analysis
10:40

CRISPR Gene Editing Tool for MicroRNA Cluster Network Analysis

Published on: April 25, 2022

2.8K

Optimal clustering with missing values.

Shahin Boluki1, Siamak Zamani Dadaneh1, Xiaoning Qian1,2

  • 1Department of Electrical and Computer Engineering, Texas A&M University, MS3128 TAMU, College Station, 77843, TX, USA.

BMC Bioinformatics
|June 21, 2019
PubMed
Summary
This summary is machine-generated.

This study introduces optimal clustering that directly handles missing values in biomedical data, avoiding imputation. The new method demonstrates superior performance and accuracy in clustering complex datasets.

Keywords:
ClusteringMissing dataOptimal designPattern recognition

More Related Videos

Spatial Separation of Molecular Conformers and Clusters
10:37

Spatial Separation of Molecular Conformers and Clusters

Published on: January 9, 2014

11.7K
Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry
12:11

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry

Published on: April 8, 2020

8.7K

Related Experiment Videos

Last Updated: Jan 23, 2026

CRISPR Gene Editing Tool for MicroRNA Cluster Network Analysis
10:40

CRISPR Gene Editing Tool for MicroRNA Cluster Network Analysis

Published on: April 25, 2022

2.8K
Spatial Separation of Molecular Conformers and Clusters
10:37

Spatial Separation of Molecular Conformers and Clusters

Published on: January 9, 2014

11.7K
Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry
12:11

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry

Published on: April 8, 2020

8.7K

Area of Science:

  • Biostatistics
  • Computational Biology
  • Genomics

Background:

  • Missing values are common in biomedical studies, complicating clustering.
  • Imputation is a standard but potentially flawed approach to handle missing data before clustering.

Purpose of the Study:

  • To develop an optimal clustering framework that directly addresses missing values.
  • To integrate missing data mechanisms into the random labeled point process (RLPP) for robust clustering.

Main Methods:

  • Incorporating missing value mechanisms into the RLPP framework.
  • Marginalizing out the missing-value process within optimal clustering.
  • Demonstration using Gaussian models with arbitrary covariance structures.

Main Results:

  • The proposed optimal clustering framework effectively handles missing values without imputation.
  • Experimental studies on synthetic and RNA-seq data show superior performance compared to existing methods.
  • The approach achieves smaller clustering errors in the presence of missing data.

Conclusions:

  • Optimal clustering with missing values eliminates the need for imputation pre-processing.
  • This method offers improved accuracy and reduced clustering errors for biomedical data with missing values.