Consistently estimating network statistics using aggregated relational data.

Emily Breza¹, Arun G Chandrasekhar², Shane Lubold³

¹Department of Economics, Harvard University, Cambridge, MA 02138.

Proceedings of the National Academy of Sciences of the United States of America

May 16, 2023

Summary

This summary is machine-generated.

Aggregated Relational Data (ARD) offers a cost-effective approach to network analysis when collecting complete network data is infeasible. This study establishes conditions under which ARD consistently estimates the features and parameters of an unobserved network.

Area of Science:

Network Science
Sociology
Statistics
Data Analysis

Background:

Collecting comprehensive network data is often prohibitively expensive and time-consuming.
Aggregated Relational Data (ARD) offers a lower-cost alternative: respondents are asked how many of their contacts have specific traits, rather than about each dyadic connection directly.
A systematic understanding of ARD's accuracy in recovering unobserved network features is lacking.
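
To make the distinction between dyadic data and ARD concrete, the sketch below builds ARD-style responses from a small, fully known network. The four-node graph, the trait labels, and the function name `ard_counts` are illustrative assumptions, not taken from the study; in practice the network is unobserved and only the counts are collected.

```python
# Toy illustration: turn a known network into ARD-style survey responses.
# The graph, traits, and names here are hypothetical.

def ard_counts(edges, trait_of, n_traits):
    """Return, for each node, how many of its neighbours hold each trait."""
    counts = [[0] * n_traits for _ in trait_of]
    for i, j in edges:              # undirected edge between i and j
        counts[i][trait_of[j]] += 1
        counts[j][trait_of[i]] += 1
    return counts

edges = [(0, 1), (0, 2), (1, 3)]    # dyadic data a full network survey would need
trait_of = [0, 1, 1, 0]             # trait index of each node
responses = ard_counts(edges, trait_of, n_traits=2)

for node, row in enumerate(responses):
    print(f"node {node}: {row[0]} contacts with trait 0, {row[1]} with trait 1")
```

Each row of `responses` is what a single respondent would report; the edge list itself never has to be collected.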

Purpose of the Study:

To systematically characterize the conditions under which Aggregated Relational Data (ARD) can accurately recover features of unobserved networks.
To derive conditions for the consistent estimation of network statistics and model parameters using ARD.
To evaluate the efficacy of ARD for probabilistic network models including beta-models, stochastic block models, and latent geometric space models.

Keywords:

aggregated relational data, consistency, social networks, survey methods

Main Methods:

Derivation of conditions for consistent estimation of network statistics and model parameters from ARD.
Estimation of network model parameters for beta-models, stochastic block models, and latent geometric space models using ARD.
Simulation of networks based on fitted ARD models to analyze the estimation of unobserved network statistics (e.g., eigenvector centrality) and response functions (e.g., regression coefficients).
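
As a rough illustration of parameter estimation for a stochastic block model, the sketch below simulates a two-group network, collects ARD from a subset of respondents, and recovers the link probabilities with a simple moment estimator (observed counts divided by the number of possible contacts). It assumes that the ARD traits coincide with the blocks and that group sizes and respondents' groups are known. These assumptions, the parameter values, the sampling design, and all function names are hypothetical, and the estimator is not necessarily the one used in the study.

```python
import random

def simulate_sbm(sizes, p, rng):
    """Draw an undirected SBM graph; returns (adjacency lists, group labels)."""
    group = [g for g, size in enumerate(sizes) for _ in range(size)]
    n = len(group)
    nbrs = [[] for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < p[group[i]][group[j]]:
                nbrs[i].append(j)
                nbrs[j].append(i)
    return nbrs, group

def estimate_block_probs(respondents, nbrs, group, sizes):
    """Moment estimator: ARD counts divided by the number of possible contacts."""
    k = len(sizes)
    total = [[0] * k for _ in range(k)]     # summed ARD counts, by respondent group
    surveyed = [0] * k                      # number of respondents per group
    for i in respondents:
        surveyed[group[i]] += 1
        for j in nbrs[i]:                   # only these tallies are used, not who j is
            total[group[i]][group[j]] += 1
    p_hat = [[0.0] * k for _ in range(k)]
    for a in range(k):
        for b in range(k):
            possible = sizes[b] - (1 if a == b else 0)   # exclude the respondent
            p_hat[a][b] = total[a][b] / (surveyed[a] * possible)
    return p_hat

rng = random.Random(0)
sizes = [200, 200]
p_true = [[0.30, 0.05], [0.05, 0.20]]       # hypothetical true link probabilities
nbrs, group = simulate_sbm(sizes, p_true, rng)

# Survey 50 respondents from each group (hypothetical sampling design).
respondents = list(range(0, 50)) + list(range(200, 250))
p_hat = estimate_block_probs(respondents, nbrs, group, sizes)
for row in p_hat:
    print([round(x, 3) for x in row])
```

In this toy setting the printed estimates should sit close to `p_true`, and surveying more respondents per group tightens them further.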

Main Results:

Established conditions under which statistics and parameters of unobserved networks can be consistently estimated using ARD.
Demonstrated that cross-group link probabilities are sufficient for estimating parameters in commonly used probabilistic network models.
Characterized when simulated networks from ARD facilitate consistent estimation of network statistics and regression coefficients.
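
The simulation idea in the last point can be sketched as follows: given fitted block link probabilities, draw several networks from the model and average a statistic over the draws. Mean degree is used here only because it is easy to check against a closed form; the fitted values, group sizes, number of draws, and function names are all assumptions made for illustration, not details from the study.

```python
import random

def draw_network(sizes, p, rng):
    """Draw one undirected network from a stochastic block model."""
    group = [g for g, size in enumerate(sizes) for _ in range(size)]
    n = len(group)
    nbrs = [[] for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < p[group[i]][group[j]]:
                nbrs[i].append(j)
                nbrs[j].append(i)
    return nbrs

def mean_degree(nbrs):
    """Average number of neighbours per node."""
    return sum(len(x) for x in nbrs) / len(nbrs)

def simulated_statistic(stat, sizes, p_fit, draws, rng):
    """Average a network statistic over networks drawn from the fitted model."""
    return sum(stat(draw_network(sizes, p_fit, rng)) for _ in range(draws)) / draws

rng = random.Random(1)
sizes = [60, 40]
p_fit = [[0.20, 0.05], [0.05, 0.30]]        # hypothetical fitted values
estimate = simulated_statistic(mean_degree, sizes, p_fit, draws=20, rng=rng)

# Closed-form expected mean degree under the same model, for comparison.
expected_edges = (60 * 59 / 2) * 0.20 + (40 * 39 / 2) * 0.30 + 60 * 40 * 0.05
expected = 2 * expected_edges / sum(sizes)
print(round(estimate, 2), round(expected, 2))
```

Other statistics, such as a centrality measure, could be passed in place of `mean_degree`, provided they take adjacency lists as input.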

Conclusions:

ARD provides a viable and statistically sound method for inferring network structures and statistics when complete data collection is infeasible.
The derived conditions offer guidance on the appropriate application and interpretation of results obtained from ARD.
This work clarifies both the theoretical underpinnings and the practical utility of ARD in network analysis.