Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Data Reporting and Recording01:24

Data Reporting and Recording

4.7K
Reporting and recording are crucial in data documentation. The timely, thorough, and accurate documentation of facts is essential when recording patient data. Failure to record findings during an assessment or interpretation of a problem will result in loss of information and make the patient document unreliable. The reader is left with general impressions if the information is not specific. A recording is documenting data of the individual's health information in a traceable, secure, and...
4.7K
Censoring Survival Data01:09

Censoring Survival Data

95
Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different...
95
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

6.1K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
6.1K
Data Collection by Observations01:08

Data Collection by Observations

12.0K
Data collection refers to a systematic way of obtaining, observing, measuring, and analyzing accurate information. Observational studies are one of the most widely used methods of data collection. It involves collecting data by observing the behavior and physical characteristics of a sample without making any modifications to the sample.
An astronomer viewing the motion and brightness of stars in the sky and recording the data is an example of observational data collection. A botanist recording...
12.0K
Weighted Mean00:57

Weighted Mean

5.2K
While taking the arithmetic, geometric, or harmonic mean of a sample data set, equal importance is assigned to all the data points. However, all the values may not always be equally important in some data sets. An intrinsic bias might make it more important to give more weightage to specific values over others.
For example, consider the number of goals scored in the matches of a tournament. While computing the average number of goals scored in the tournament, it may be more important to...
5.2K
Data Collection I01:30

Data Collection I

6.2K
Data collection gathers information needed to make accurate judgments about a patient's present condition. During a health history interview, subjective data is collected from the patient, their caregivers, or family members, and objective data is collected through observations and physical assessment. Patients are the primary source of subjective data. Thus information gathered from patients through interviews, observations, and physical examination is primary data. Secondary sources of...
6.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

ROI-aware uncertainty fusion for label-efficient glioma MRI segmentation.

Physics in medicine and biology·2026
Same author

H-NGPCA: Hierarchical clustering of data streams with adaptive number of clusters and adaptive dimensionality.

PloS one·2026
Same author

Seminal angiotensin II as a predictive factor of spermatogenic activity in non-obstructive azoospermia.

Revista internacional de andrologia·2025
Same author

Image restoration in frequency space using complex-valued CNNs.

Frontiers in artificial intelligence·2024
Same author

Evaluation of Swin Transformer and knowledge transfer for denoising of super-resolution structured illumination microscopy data.

GigaScience·2024
Same author

sEMG-based prediction of human forearm movements utilizing a biomechanical model based on individual anatomical/ physiological measures and a reduced set of optimization parameters.

PloS one·2023
Same journal

Intervention Feasible Region and Driver Risk Capacity Aware Human-Machine Collaborative Safe Trajectory Planning.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Unified Differential Denoising Learning Framework With a Pre-Trained Model and Fuzzy Graph Networks for Drug-Drug Interaction Prediction.

IEEE transactions on neural networks and learning systems·2026
Same journal

Self-Supervised Continuous Dynamic Graph Representation Learning via Hawkes Processes.

IEEE transactions on neural networks and learning systems·2026
Same journal

cPU: Consistent Risk Estimator for Positive-Unlabeled Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Tuning-Free Latent Diffusion Models for Ultrahigh-Resolution Image Editing.

IEEE transactions on neural networks and learning systems·2026
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Jul 4, 2025

Eye-tracking Technology and Data-mining Techniques used for a Behavioral Analysis of Adults engaged in Learning Processes
10:43

Eye-tracking Technology and Data-mining Techniques used for a Behavioral Analysis of Adults engaged in Learning Processes

Published on: June 10, 2021

5.4K

Active Learning for Handling Missing Data.

Alaa Tharwat, Wolfram Schenck

    IEEE Transactions on Neural Networks and Learning Systems
    |January 26, 2024
    PubMed
    Summary
    This summary is machine-generated.

    Active learning addresses unlabeled data challenges by selecting informative samples. This study introduces a novel strategy that accounts for imputation uncertainty, improving model performance on incomplete datasets.

    More Related Videos

    Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering
    09:43

    Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

    Published on: November 22, 2019

    6.3K
    Project-Based Learning Guidelines for Health Sciences Students: An Analysis with Data Mining and Qualitative Techniques
    13:44

    Project-Based Learning Guidelines for Health Sciences Students: An Analysis with Data Mining and Qualitative Techniques

    Published on: December 9, 2022

    3.5K

    Related Experiment Videos

    Last Updated: Jul 4, 2025

    Eye-tracking Technology and Data-mining Techniques used for a Behavioral Analysis of Adults engaged in Learning Processes
    10:43

    Eye-tracking Technology and Data-mining Techniques used for a Behavioral Analysis of Adults engaged in Learning Processes

    Published on: June 10, 2021

    5.4K
    Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering
    09:43

    Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

    Published on: November 22, 2019

    6.3K
    Project-Based Learning Guidelines for Health Sciences Students: An Analysis with Data Mining and Qualitative Techniques
    13:44

    Project-Based Learning Guidelines for Health Sciences Students: An Analysis with Data Mining and Qualitative Techniques

    Published on: December 9, 2022

    3.5K

    Area of Science:

    • Machine Learning
    • Data Science
    • Artificial Intelligence

    Background:

    • Massive growth in IoT devices generates vast unlabeled data, posing challenges for supervised learning.
    • Labeling data is costly and time-consuming, hindering the development of effective machine learning models.
    • Active learning (AL) offers a solution by selecting informative data points for labeling, but struggles with incomplete data.

    Purpose of the Study:

    • To develop a novel active learning strategy that addresses the challenge of incomplete data with missing values.
    • To improve the selection of informative and representative data points by accounting for imputation uncertainty.
    • To enhance the performance of machine learning models trained on datasets with missing values.

    Main Methods:

    • Introduced a novel multiple imputation method considering feature importance for missing value estimation.
    • Developed a query selection strategy that quantifies and incorporates imputation uncertainty.
    • Integrated imputation uncertainty into both exploration and exploitation phases of the active learner to reduce the selection of uncertain points.

    Main Results:

    • The proposed active learner effectively reduces the probability of selecting data points with high imputation uncertainty.
    • Demonstrated improved classification performance on various binary and multiclass datasets with different missing rates.
    • The novel strategy enhances the generalizability and robustness of active learning models in the presence of missing data.

    Conclusions:

    • Accounting for imputation uncertainty is crucial for effective active learning with incomplete datasets.
    • The proposed method offers a significant advancement in handling missing data within active learning frameworks.
    • This approach holds promise for improving machine learning applications in fields with abundant, but incomplete, data like healthcare and industry.