



PROX: Approximated Summarization of Data Provenance.

Eleanor Ainy (1), Pierre Bourhis (2), Susan B Davidson (3)

  • (1) Tel Aviv University.

Advances in Database Technology : Proceedings. International Conference on Extending Database Technology
August 30, 2016
Summary
This summary is machine-generated.

We introduce approximated summarized provenance, a compact representation of data provenance that accepts some information loss in exchange for a smaller size. PROX, a system built on this notion, is demonstrated on a complex application: a crowd-sourced movie rating system.


Area of Science:

  • Computer Science
  • Data Management

Background:

  • Modern applications generate vast datasets from multiple sources, which makes it hard to understand the application logic and how each piece of data was derived.
  • Full data provenance is often infeasible to maintain and present due to its size and complexity.

Purpose of the Study:

  • To introduce and develop a system for approximated summarized provenance.
  • To address the challenges of managing and presenting data provenance in complex applications.

Main Methods:

  • Developed PROX, a system for managing, presenting, and utilizing approximated summarized provenance.
  • Proposed a notion of approximated summarized provenance, balancing compactness with potential information loss.
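
The trade-off between compactness and information loss can be illustrated with a small Python sketch. This is not PROX's algorithm or API. It is a toy model under stated assumptions: provenance is a list of per-user rating annotations, the summary merges users into groups, and the user names, group names, and helper functions (`summarize`, `average_without_users`, `average_without_groups`) are all hypothetical.

```python
from collections import defaultdict

# Hypothetical fine-grained provenance: every rating is annotated
# with the user who contributed it.
ratings = [
    ("alice", "Titanic", 5),
    ("bob", "Titanic", 4),
    ("carol", "Titanic", 2),
    ("dave", "Titanic", 1),
]

# Hypothetical mapping from fine-grained annotations (users)
# to coarser summary annotations (groups).
group_of = {
    "alice": "critics",
    "bob": "critics",
    "carol": "casual",
    "dave": "casual",
}


def summarize(ratings, group_of):
    """Merge per-user annotations into per-group (score_sum, count) pairs."""
    merged = defaultdict(lambda: [0, 0])
    for user, _movie, score in ratings:
        entry = merged[group_of[user]]
        entry[0] += score
        entry[1] += 1
    return {group: tuple(pair) for group, pair in merged.items()}


def average_without_users(ratings, removed_users):
    """Exact answer from full provenance: drop individual users."""
    kept = [score for user, _movie, score in ratings if user not in removed_users]
    return sum(kept) / len(kept) if kept else None


def average_without_groups(summary, removed_groups):
    """Approximate answer from the summary: only whole groups can be dropped."""
    kept = [pair for group, pair in summary.items() if group not in removed_groups]
    total = sum(score_sum for score_sum, _count in kept)
    count = sum(n for _score_sum, n in kept)
    return total / count if count else None


summary = summarize(ratings, group_of)

# The summary holds 2 annotations instead of 4 ...
print(len(ratings), "->", len(summary))
# ... but "what if alice's rating is ignored?" can no longer be answered
# exactly: the closest question the summary supports is "ignore all critics".
print(average_without_users(ratings, {"alice"}))
print(average_without_groups(summary, {"critics"}))
```

In this toy model the summary halves the number of annotations, and the gap between the two "what if" answers is one simple way to see the information that the summary has lost.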

Main Results:

  • PROX facilitates the management and presentation of summarized data provenance.
  • Demonstrated PROX's utility in a movie rating crowd-sourcing system for gaining application insights.

Conclusions:

  • Approximated summarized provenance offers a practical approach to handling large-scale data provenance.
  • PROX provides a viable solution for understanding complex application logic and data derivation through summarized provenance.