Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Extraction: Advanced Methods00:56

Extraction: Advanced Methods

564
Metal ions can be separated from one another by complexation with organic ligands–the chelating agent– to form uncharged chelates. Here, the chelating agent must contain hydrophobic groups and behave as a weak acid, losing a proton to bind with the metal. Since most organic ligands used in this process are insoluble or undergo oxidation in the aqueous phase, the chelating agent is initially added to the organic phase and extracted into the aqueous phase. The metal-ligand complex is...
564
Data Collection by Survey01:07

Data Collection by Survey

7.2K
The systematic method of obtaining and analyzing accurate information of a population is called data collection. A survey is a standard method of data collection that involves collecting information from a target human population about their experience, opinion, or knowledge of a product, service, or process. The responses are recorded and interpreted. The most common survey examples are written questionnaires, face-to-face or telephonic conversations, focus groups, and electronic (e-mail or...
7.2K
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

37.1K
A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
37.1K
Stratified Sampling Method01:16

Stratified Sampling Method

13.1K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a stratified sample, divide the population into groups called strata and then take a...
13.1K
Statistical Analysis: Overview01:11

Statistical Analysis: Overview

8.6K
When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...
8.6K
Convenience Sampling Method00:55

Convenience Sampling Method

9.8K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population.
Convenience sampling is a non-random method of sample selection; this method selects individuals that are easily accessible and may result in biased data. For example, a marketing...
9.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Spatio-Temporal Modeling for Multi-County Opioid Overdose Surveillance: A Unified Graph Convolutional Framework.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same author

Geographic Information Systems as Data Sharing Infrastructure for Clinical Data Warehouses.

Journal of the Society for Clinical Data Management·2026
Same author

Evaluation and enhancement of suspected opioid overdose definitions in emergency medical services data using machine learning with natural language processing.

PloS one·2026
Same author

Addressing diagnostic code variability in intimate partner violence surveillance through natural language processing: Evidence from substance use disorder populations.

Drug and alcohol dependence·2026
Same author

Implementation and Assessment of Machine Learning Models for Forecasting Suspected Opioid Overdoses in Emergency Medical Services Data.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same author

The impact of the communities that HEAL intervention on the provision of jail-based medication for opioid use disorder & linkage programs at release: Results from a randomized, wait-list controlled trial.

Journal of substance use and addiction treatment·2025
Same journal

Hootation: A GUI and API library for ontology validation and verbalization.

Proceedings. IEEE International Conference on Semantic Computing·2026
Same journal

Developing a high-performing network computation of big bipartite network data toward alcohol use disorder treatment referrals.

Proceedings. IEEE International Conference on Semantic Computing·2025
Same journal

Visual Enhancement and Semantic Segmentation of Murine Tissue Scans with Pulsed THz Spectroscopy.

Proceedings. IEEE International Conference on Semantic Computing·2024
Same journal

Speeding up Batch Alignment of Large Ontologies Using MapReduce.

Proceedings. IEEE International Conference on Semantic Computing·2014
Same journal

Adopting Graph Traversal Techniques for Context-Driven Value Sets Extraction from Biomedical Knowledge Sources.

Proceedings. IEEE International Conference on Semantic Computing·2011
See all related articles

Related Experiment Video

Updated: Oct 1, 2025

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

16.0K

Extracting Semantics from Census-based Reference Data.

Daniel R Harris1, Nima Seyedtalebi2

  • 1Center for Clinical and Translational Sciences, Institute for Pharmaceutical Outcomes and Policy, University of Kentucky, Lexington, KY USA.

Proceedings. IEEE International Conference on Semantic Computing
|March 7, 2022
PubMed
Summary
This summary is machine-generated.

We used natural language processing to extract meaning from complex United States Census Bureau data. This approach simplifies vast datasets, making demographic and socioeconomic information more accessible for research and analysis.

Keywords:
natural language processingsemantic technology

More Related Videos

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

8.9K
Applying Cheminformatics to Develop a Structure Searchable Database of Analytical Methods
05:34

Applying Cheminformatics to Develop a Structure Searchable Database of Analytical Methods

Published on: June 6, 2025

867

Related Experiment Videos

Last Updated: Oct 1, 2025

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

16.0K
Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

8.9K
Applying Cheminformatics to Develop a Structure Searchable Database of Analytical Methods
05:34

Applying Cheminformatics to Develop a Structure Searchable Database of Analytical Methods

Published on: June 6, 2025

867

Area of Science:

  • Data Science
  • Computational Social Science
  • Natural Language Processing

Background:

  • United States Census Bureau reference data contains thousands of variables, posing challenges for analysis.
  • The complexity of census data hinders direct use, leading researchers to rely on simplified subsets.
  • Integrating comprehensive census data into research is often difficult due to its scale and structure.

Purpose of the Study:

  • To develop a method for extracting semantic meaning from United States Census Bureau reference data.
  • To map complex census variables to established ontologies for better organization.
  • To reduce the dimensionality of census datasets by identifying conceptual variables.

Main Methods:

  • Application of natural language processing (NLP) techniques to census reference data.
  • Semantic analysis to identify underlying meanings within variable descriptions.
  • Ontology mapping to link extracted semantics to existing knowledge structures.

Main Results:

  • Demonstrated preliminary success in extracting meaningful semantics from US Census data.
  • Developed a process to translate numerous variables into a more manageable set of conceptual variables.
  • Showcased the potential for organizing census data by semantic type and meaning.

Conclusions:

  • Natural language processing offers a viable solution for understanding and utilizing complex US Census data.
  • Semantic extraction can significantly simplify large-scale demographic and socioeconomic datasets.
  • This approach enhances the accessibility and integration of census data for diverse research applications.