Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Random Error01:04

Random Error

10.0K
Random or indeterminate errors originate from various uncontrollable variables, such as variations in environmental conditions, instrument imperfections, or the inherent variability of the phenomena being measured. Usually, these errors cannot be predicted, estimated, or characterized because their direction and magnitude often vary in magnitude and direction even during consecutive measurements. As a result, they are difficult to eliminate. However, the aggregate effect of these errors can be...
10.0K
Wald-Wolfowitz Runs Test I01:17

Wald-Wolfowitz Runs Test I

1.0K
The Wald-Wolfowitz test, also known as the runs test, is a nonparametric statistical test used to assess the randomness of a sequence of two different types of elements (e.g., positive/negative values, successes/failures). It examines whether the order of the elements in a sequence is random or if there is a pattern or trend present. This nonparametric test applies to any ordered data despite the population and sample data distribution, even if a higher sample size is available.
The test works...
1.0K
Random Variables01:09

Random Variables

18.7K
A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...
18.7K
Wald-Wolfowitz Runs Test II01:17

Wald-Wolfowitz Runs Test II

608
The Wald-Wolfowitz runs test, commonly referred to as the runs test, is a nonparametric test used to assess the randomness of ordered data. The test evaluates the number of runs, which are consecutive sequences of similar elements within the data. If the number of runs is significantly higher or lower than expected, the data is considered non-random, indicating a detectable pattern or structure.
For binary data, runs are identified using symbols such as + and −, or equivalently, 1s and 0s. In...
608
Random Sampling Method01:09

Random Sampling Method

15.5K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
15.5K
Randomized Experiments01:13

Randomized Experiments

9.2K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
9.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms.

... International Conference on Learning Representations·2026
Same author

Pseudorandom Hashing for Space-bounded Computation with Applications in Streaming.

Proceedings ... annual Symposium on Foundations of Computer Science. Symposium on Foundations of Computer Science·2025
Same author

Learning-augmented sketching offers improved performance for privacy preserving and secure GWAS.

iScience·2025
Same author

<math> </math> -Regression in the Arbitrary Partition Model of Communication.

Proceedings of machine learning research·2024
Same author

Improved Algorithms for White-Box Adversarial Streams.

Proceedings of machine learning research·2024
Same author

Tight Lower Bounds for Directed Cut Sparsification and Distributed Min-Cut.

Proceedings of the ACM on management of data·2024
Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026
Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026
Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026
Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026
Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026
Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Mar 14, 2026

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit
05:30

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Published on: September 8, 2023

1.2K

True Randomness from Big Data.

Periklis A Papakonstantinou1, David P Woodruff2, Guang Yang3

  • 1Rutgers University, MSIS, Piscataway, NJ 08853, USA.

Scientific Reports
|September 27, 2016
PubMed
Summary
This summary is machine-generated.

This study introduces a new method to generate provably random bits from large datasets, crucial for cryptography and simulations. The approach efficiently extracts high-quality randomness from big data sources, outperforming previous methods.

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

3.0K

Related Experiment Videos

Last Updated: Mar 14, 2026

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit
05:30

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Published on: September 8, 2023

1.2K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

3.0K

Area of Science:

  • Computer Science
  • Information Theory
  • Cryptography

Background:

  • Generating high-quality random bits is essential for various applications, including physical systems simulation and cryptography.
  • Existing methods for extracting randomness from large datasets often rely on statistical assumptions and can be computationally inefficient.

Purpose of the Study:

  • To develop a general method for generating provably random bits from massive datasets.
  • To introduce the concept of 'big sources' into the randomness extraction literature.
  • To provide an efficient and practical solution for randomness generation from large-scale data.

Main Methods:

  • Viewing large datasets as samples from a 'big source' (a random variable of at least a few gigabytes).
  • Developing a novel randomness extraction technique applicable to these big sources.
  • Empirically validating the method on real-world datasets.

Main Results:

  • The proposed method provably extracts almost-uniform random bits from big sources.
  • The method is computationally efficient and practical for handling large datasets.
  • Empirical validation shows the method's quality matches or exceeds existing approaches like quantum randomness expanders.

Conclusions:

  • This research establishes a new paradigm for randomness extraction from big data.
  • The developed method offers an efficient and reliable way to generate high-quality random bits.
  • The findings have significant implications for fields requiring robust randomness, such as cryptography and scientific simulations.