Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Biodiversity and Human Values

Biodiversity and Human Values

Human civilization relies on biodiversity in many ways. Sudden changes in species biodiversity result in environmental changes that can modify weather patterns and therefore human civilizations.

Professional Values

Professional Values

Nurses are responsible for caring for patients during birth, death, illness, and healing. Professional values guide the decisions and actions that nurses make in their careers. If nurses know the decisions and actions to take, providing patients with exceptional care is possible.
The values that are the foundation of the nursing profession are altruism, autonomy, human dignity, and social justice.
First, altruism refers to the concern for the welfare and well-being of others without personal...

Critical Values

Critical Values

A critical value is a definite value obtained from a particular probability distribution at a predecided confidence level (or a predecided significance level) for a given population parameter. The critical value provides demarcation that separates the sample statistics that are likely to occur from the ones that are unlikely to occur based on the given probability distribution and the population parameter to be estimated. The critical value for normal distribution is obtained from the z...

z Scores and Unusual Values

z Scores and Unusual Values

The z score is one of the three measures of relative standing. It describes the location of a value in a dataset relative to the mean. z scores are obtained after the standardization of the values in a dataset. The z score for the mean is 0.
This score indicates how far a value is from the mean in terms of standard deviation. For example, if a data value has a z score of +1, the researcher can infer that the particular data value is one standard deviation above the mean. If another data...

Absolute and Local Extreme Values

Absolute and Local Extreme Values

The highest and lowest values of a function, relative to a reference axis, are known as extreme values. These include absolute maximum and absolute minimum values, which represent the highest and lowest points the function reaches across its entire domain. Within a restricted portion of the function, the highest and lowest values are referred to as local maximum and local minimum values, respectively.Periodic functions, such as sine and cosine, show extreme values at infinitely many points due...

Finding Critical Values for Chi-Square

Finding Critical Values for Chi-Square

Consider a curve representing sample data drawn randomly from a normally distributed population. One must construct confidence intervals to estimate or to test a claim regarding the population standard deviation. For example, a 95% confidence interval covers 95% of the area under the curve, and the remaining 5% is equally distributed on either side of the curve. To achieve such confidence intervals, one must determine the critical values. The critical values are simply the values separating the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

An integrated computational antigen discovery pipeline with hierarchical filtering for emerging viral variants.

NAR molecular medicine·2026

Same author

Enhancing protein immunogenicity prediction via uncertainty weighted deep ensemble.

Oxford open immunology·2026

Same author

ImmUQBench: a benchmark on uncertainty quantification of protein immunogenicity prediction.

Oxford open immunology·2026

Same author

Epidemiological model calibration via graybox Bayesian optimization.

Infectious Disease Modelling·2026

Same author

Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same author

CYP1B1-AS1 regulates CYP1B1 to promote Coxiella burnetii pathogenesis by inhibiting ROS and host cell death.

Nature communications·2025

Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026

Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026

Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026

Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026

Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026

Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 23, 2026

CRISPR Gene Editing Tool for MicroRNA Cluster Network Analysis

CRISPR Gene Editing Tool for MicroRNA Cluster Network Analysis

Published on: April 25, 2022

Optimal clustering with missing values.

Shahin Boluki¹, Siamak Zamani Dadaneh¹, Xiaoning Qian^1,2

¹Department of Electrical and Computer Engineering, Texas A&M University, MS3128 TAMU, College Station, 77843, TX, USA.

BMC Bioinformatics

|June 21, 2019

Summary

This summary is machine-generated.

This study introduces optimal clustering that directly handles missing values in biomedical data, avoiding imputation. The new method demonstrates superior performance and accuracy in clustering complex datasets.

Keywords:

Clustering Missing data Optimal design Pattern recognition

More Related Videos

Spatial Separation of Molecular Conformers and Clusters

Spatial Separation of Molecular Conformers and Clusters

Published on: January 9, 2014

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry

Published on: April 8, 2020

Related Experiment Videos

Last Updated: Jan 23, 2026

CRISPR Gene Editing Tool for MicroRNA Cluster Network Analysis

CRISPR Gene Editing Tool for MicroRNA Cluster Network Analysis

Published on: April 25, 2022

Spatial Separation of Molecular Conformers and Clusters

Spatial Separation of Molecular Conformers and Clusters

Published on: January 9, 2014

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry

Published on: April 8, 2020

Area of Science:

Biostatistics
Computational Biology
Genomics

Background:

Missing values are common in biomedical studies, complicating clustering.
Imputation is a standard but potentially flawed approach to handle missing data before clustering.

Purpose of the Study:

To develop an optimal clustering framework that directly addresses missing values.
To integrate missing data mechanisms into the random labeled point process (RLPP) for robust clustering.

Main Methods:

Incorporating missing value mechanisms into the RLPP framework.
Marginalizing out the missing-value process within optimal clustering.
Demonstration using Gaussian models with arbitrary covariance structures.

Main Results:

The proposed optimal clustering framework effectively handles missing values without imputation.
Experimental studies on synthetic and RNA-seq data show superior performance compared to existing methods.
The approach achieves smaller clustering errors in the presence of missing data.

Conclusions:

Optimal clustering with missing values eliminates the need for imputation pre-processing.
This method offers improved accuracy and reduced clustering errors for biomedical data with missing values.