Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Ratio Level of Measurement00:54

Ratio Level of Measurement

19.5K
The way a set of data is measured is called its level of measurement. Correct statistical procedures depend on a researcher being familiar with levels of measurement. For analysis, data are classified into four levels of measurement—nominal, ordinal, interval, and ratio.
A set of data measured using the ratio scale takes care of the ratio problem and provides complete information. Ratio scale data are like interval scale data, except they have a zero point and ratios can be calculated....
19.5K
Review and Preview01:10

Review and Preview

8.0K
In statistics, several tools are used to interpret the data. Measures of central tendency represent the characteristics of the data, such as mean, median, and mode. Additionally, measures of variance like standard deviation and range are used to find the spread of data from the mean. Relative standing measures the distance between data locations. Commonly used measures of relative standings are percentile, z score, and quartiles.
Percentiles are a type of fractile that partition data into...
8.0K
How Data are Classified: Numerical Data00:59

How Data are Classified: Numerical Data

32.9K
Data that are countable or measurable in specific units are called numerical or quantitative data. Quantitative data are always numbers. Quantitative data are the result of counting or measuring the attributes of a population. Amount of money, pulse rate, weight, number of people living in a town, and number of students who opt for statistics are examples of quantitative data.
Quantitative data may be either discrete or continuous. All quantitative data that take on only specific numerical...
32.9K
Interval Level of Measurement00:55

Interval Level of Measurement

16.5K
For effective statistical analysis, data are classified into four levels of measurement—nominal, ordinal, interval, and ratio.
Data measured using the interval scale are similar to ordinal level data because they have a definite arrangement. However, in the interval level of measurement, the differences between data values are meaningful even though the data does not have a starting point.
Temperature is measured using the interval scale. It is measurable data, and the difference between...
16.5K
Ordinal Level of Measurement00:55

Ordinal Level of Measurement

27.3K
The way a set of data is measured is called its level of measurement. Correct statistical procedures depend on a researcher being familiar with levels of measurement. For analysis, data are classified into four levels of measurement—nominal, ordinal, interval, and ratio.
Data measured using an ordinal scale are similar to nominal scale data, but there is one major difference. The ordinal scale data can be ordered. An example of ordinal scale data is a list of the top five national parks...
27.3K
Central Tendency: Analysis01:10

Central Tendency: Analysis

266
Measures of central tendency are tools used in biostatistics to identify the average or center of a dataset. They offer a single representative value for understanding and summarizing data distribution.
The mean is one such measure, calculated by totaling all values in a dataset and dividing by the number of values. For instance, the mean blood pressure reading (120, 130, 140, 150) would be 135. However, the mean can be affected by extreme values or outliers.
The median, another measure,...
266

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Reduction of cancer cell quantity and viability in autologous blood salvaged from patients with liver cancer: efficacy of intraoperative cell salvage coupled with leukocyte depletion filtration.

BMC cancer·2026
Same author

Influence of polystyrene microplastics on bioconcentration and metabolism of 2-ethylhexyl diphenyl phosphate in common carp (Cyprinus carpio).

Journal of environmental sciences (China)·2026
Same author

Quantifying the benefits of improved operations for polio outbreak response: A model-based analysis.

Vaccine·2026
Same author

Global cessation of bivalent oral polio vaccine (bOPV): A comprehensive review of modeling studies from 2000 to 2025.

Vaccine·2026
Same author

Temporal trends and contributing factors of racial and ethnic disparities in cutaneous melanoma survival: A SEER database analysis.

Journal of the American Academy of Dermatology·2026
Same author

Correction: Proximity proteomics reveals OTUD6B regulation of stress granule dynamics through coalescence with VCP/p97.

Cell death & disease·2026
Same journal

CardiaTics: An explainable AI integrated heart disease diagnosis model with feature engineering and stacked ensemble approach.

Journal of big data·2026
Same journal

Comprehensive representation of health-related phenotypes in one million dogs using topic modelling of electronic health records.

Journal of big data·2026
Same journal

UniqueNOSD: a novel framework for NoSQL over SQL databases.

Journal of big data·2025
Same journal

<i>F</i>u<i>n</i>Da: scalable serverless data analytics and in situ query processing.

Journal of big data·2025
Same journal

Integrating Big Data, Artificial Intelligence, and motion analysis for emerging precision medicine applications in Parkinson's Disease.

Journal of big data·2024
Same journal

Interpolation-split: a data-centric deep learning approach with big interpolated data to boost airway segmentation performance.

Journal of big data·2024
See all related articles

Related Experiment Video

Updated: Oct 13, 2025

Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans
09:23

Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans

Published on: August 16, 2017

8.3K

A data value metric for quantifying information content and utility.

Morteza Noshad1,2, Jerome Choi3,4, Yuming Sun3,5

  • 1Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109 USA.

Journal of Big Data
|November 15, 2021
PubMed
Summary
This summary is machine-generated.

A new Data Value Metric (DVM) quantifies information utility in large datasets, balancing analytical value and model complexity. This metric helps determine if expanding data enhances or degrades its usefulness for specific tasks.

Keywords:
Artificial intelligenceData energyData utilityInformation contentMachine learning

More Related Videos

Author Spotlight: Quantifying Pain Experience &#8211; An Illustrative Approach Using the Pain Body Diagram
09:00

Author Spotlight: Quantifying Pain Experience – An Illustrative Approach Using the Pain Body Diagram

Published on: July 7, 2023

3.9K
Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

8.9K

Related Experiment Videos

Last Updated: Oct 13, 2025

Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans
09:23

Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans

Published on: August 16, 2017

8.3K
Author Spotlight: Quantifying Pain Experience &#8211; An Illustrative Approach Using the Pain Body Diagram
09:00

Author Spotlight: Quantifying Pain Experience – An Illustrative Approach Using the Pain Body Diagram

Published on: July 7, 2023

3.9K
Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

8.9K

Area of Science:

  • Data Science
  • Information Theory
  • Machine Learning

Background:

  • Data-driven innovation relies on massive, heterogeneous datasets.
  • Assessing data quality and informativeness is crucial for effective decision support.
  • Existing measures like VoI, QoI, and MI have limitations in quantifying utility for complex data.

Purpose of the Study:

  • Introduce a novel information-theoretic measure, the Data Value Metric (DVM).
  • Quantify the useful information content and utility of large, complex datasets.
  • Determine if data expansion enhances or degrades information content for specific tasks.

Main Methods:

  • Developed the Data Value Metric (DVM) based on a regularized model.
  • DVM balances data analytical value (utility) with model complexity.
  • Incorporated fidelity (model performance) and regularization (computational complexity) terms, inspired by information bottleneck.

Main Results:

  • DVM effectively quantifies the useful information content (energy) of heterogeneous datasets.
  • Demonstrated DVM's ability to capture the trade-off between analytical value and algorithmic complexity.
  • Showcased DVM's utility in assessing the impact of data size and feature richness on information content.

Conclusions:

  • DVM provides a robust method for evaluating data utility in data-driven innovation.
  • The metric aids in optimizing dataset characteristics for various supervised and unsupervised learning tasks.
  • DVM supports informed decisions on data acquisition and augmentation strategies.