Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Random Error

Random Error

Random or indeterminate errors originate from various uncontrollable variables, such as variations in environmental conditions, instrument imperfections, or the inherent variability of the phenomena being measured. Usually, these errors cannot be predicted, estimated, or characterized because their direction and magnitude often vary in magnitude and direction even during consecutive measurements. As a result, they are difficult to eliminate. However, the aggregate effect of these errors can be...

Wald-Wolfowitz Runs Test I

Wald-Wolfowitz Runs Test I

The Wald-Wolfowitz test, also known as the runs test, is a nonparametric statistical test used to assess the randomness of a sequence of two different types of elements (e.g., positive/negative values, successes/failures). It examines whether the order of the elements in a sequence is random or if there is a pattern or trend present. This nonparametric test applies to any ordered data despite the population and sample data distribution, even if a higher sample size is available.
The test works...

Random Variables

Random Variables

A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...

Wald-Wolfowitz Runs Test II

Wald-Wolfowitz Runs Test II

The Wald-Wolfowitz runs test, commonly referred to as the runs test, is a nonparametric test used to assess the randomness of ordered data. The test evaluates the number of runs, which are consecutive sequences of similar elements within the data. If the number of runs is significantly higher or lower than expected, the data is considered non-random, indicating a detectable pattern or structure.
For binary data, runs are identified using symbols such as + and −, or equivalently, 1s and 0s. In...

Random Sampling Method

Random Sampling Method

Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...

Randomized Experiments

Randomized Experiments

The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms.

... International Conference on Learning Representations·2026

Same author

Pseudorandom Hashing for Space-bounded Computation with Applications in Streaming.

Proceedings ... annual Symposium on Foundations of Computer Science. Symposium on Foundations of Computer Science·2025

Same author

Learning-augmented sketching offers improved performance for privacy preserving and secure GWAS.

iScience·2025

Same author

<math> </math> -Regression in the Arbitrary Partition Model of Communication.

Proceedings of machine learning research·2024

Same author

Improved Algorithms for White-Box Adversarial Streams.

Proceedings of machine learning research·2024

Same author

Tight Lower Bounds for Directed Cut Sparsification and Distributed Min-Cut.

Proceedings of the ACM on management of data·2024

Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026

Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026

Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026

Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026

Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026

Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 14, 2026

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Published on: September 8, 2023

True Randomness from Big Data.

Periklis A Papakonstantinou¹, David P Woodruff², Guang Yang³

¹Rutgers University, MSIS, Piscataway, NJ 08853, USA.

Scientific Reports

|September 27, 2016

Summary

This summary is machine-generated.

This study introduces a new method to generate provably random bits from large datasets, crucial for cryptography and simulations. The approach efficiently extracts high-quality randomness from big data sources, outperforming previous methods.

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

Related Experiment Videos

Last Updated: Mar 14, 2026

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Published on: September 8, 2023

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

Area of Science:

Computer Science
Information Theory
Cryptography

Background:

Generating high-quality random bits is essential for various applications, including physical systems simulation and cryptography.
Existing methods for extracting randomness from large datasets often rely on statistical assumptions and can be computationally inefficient.

Purpose of the Study:

To develop a general method for generating provably random bits from massive datasets.
To introduce the concept of 'big sources' into the randomness extraction literature.
To provide an efficient and practical solution for randomness generation from large-scale data.

Main Methods:

Viewing large datasets as samples from a 'big source' (a random variable of at least a few gigabytes).
Developing a novel randomness extraction technique applicable to these big sources.
Empirically validating the method on real-world datasets.

Main Results:

The proposed method provably extracts almost-uniform random bits from big sources.
The method is computationally efficient and practical for handling large datasets.
Empirical validation shows the method's quality matches or exceeds existing approaches like quantum randomness expanders.

Conclusions:

This research establishes a new paradigm for randomness extraction from big data.
The developed method offers an efficient and reliable way to generate high-quality random bits.
The findings have significant implications for fields requiring robust randomness, such as cryptography and scientific simulations.