Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Steps in Outbreak Investigation01:18

Steps in Outbreak Investigation

263
In the ever-evolving field of public health, statistical analysis serves as a cornerstone for understanding and managing disease outbreaks. By leveraging various statistical tools, health professionals can predict potential outbreaks, analyze ongoing situations, and devise effective responses to mitigate impact. For that to happen, there are a few possible stages of the analysis:
263
Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

638
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
638
Principles of Disease Surveillance01:26

Principles of Disease Surveillance

270
Disease surveillance is the systematic collection, analysis, and interpretation of health data essential to the planning, implementation, and evaluation of public health practice. This process integrates data dissemination to entities responsible for preventing and controlling disease, injury, and disability. Surveillance systems provide crucial information for action, helping public health authorities make informed decisions to manage and prevent outbreaks, ensure public safety, optimize...
270
Statistical Software for Data Analysis and Clinical Trials01:12

Statistical Software for Data Analysis and Clinical Trials

974
Statistical software is pivotal in data analysis and clinical trials by providing tools to analyze data, draw conclusions, and make predictions. These software packages range from simple data management applications to complex analytical platforms, supporting various statistical tests, models, and simulation techniques. Their significance lies in their ability to handle vast amounts of data with precision and efficiency, enabling researchers to validate hypotheses, identify trends, and make...
974
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

14.8K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
14.8K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

19.6K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
19.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Perspective: Machine Learning for Health Should Consider Social Drivers of Health.

Proceedings of machine learning research·2026
Same author

A Microsimulation-Based Approach for Mitigating Societal Bias in Chronic Kidney Disease Data.

Medical decision making : an international journal of the Society for Medical Decision Making·2026
Same author

A Novel Decision-Modeling Framework for Health Policy Analyses When Outcomes Are Influenced by Social and Disease Processes.

Medical decision making : an international journal of the Society for Medical Decision Making·2026
Same author

Causal modeling of chronic kidney disease in a participatory framework for informing the inclusion of social drivers in health algorithms.

Journal of the American Medical Informatics Association : JAMIA·2026
Same author

A microsimulation-based framework for mitigating societal bias in primary care data.

medRxiv : the preprint server for health sciences·2025
Same author

Causal modeling of chronic kidney disease in a participatory framework for informing the inclusion of social drivers in health algorithms.

medRxiv : the preprint server for health sciences·2025
Same journal

DataAtlas: automatic generation of data dictionaries using large language models.

JAMIA open·2026
Same journal

An examination of the availability and characteristics of social needs data in the electronic health records: a path to social data harmonization and standardization at Johns Hopkins medicine.

JAMIA open·2026
Same journal

Generative artificial intelligence implementation in REDCap.

JAMIA open·2026
Same journal

Improving readability of layperson abstracts and summaries in oncology using task-specific large language model powered tool: results from the BRIDGE-AI 7 study.

JAMIA open·2026
Same journal

Accuracy of administrative data in ascertaining health conditions: a systematic review.

JAMIA open·2026
Same journal

Building a consumer health informatics introductory course consensus curriculum: an eDelphi study.

JAMIA open·2026
See all related articles

Related Experiment Video

Updated: Oct 25, 2025

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.5K

A generalizable data assembly algorithm for infectious disease outbreaks.

Maimuna S Majumder1,2, Sherri Rose3

  • 1Computational Health Informatics Program, Boston Children's Hospital, Boston, Massachusetts, USA.

JAMIA Open
|August 5, 2021
PubMed
Summary
This summary is machine-generated.

This study presents an automated algorithm to convert text-based infectious disease outbreak data into a machine-readable format. The tool accurately curates information from various sources, aiding outbreak researchers.

Keywords:
automationdata curationinfectious diseasesoutbreaksregular expressions

More Related Videos

Multi-target Parallel Processing Approach for Gene-to-structure Determination of the Influenza Polymerase PB2 Subunit
22:10

Multi-target Parallel Processing Approach for Gene-to-structure Determination of the Influenza Polymerase PB2 Subunit

Published on: June 28, 2013

13.4K
Use of an Influenza Antigen Microarray to Measure the Breadth of Serum Antibodies Across Virus Subtypes
08:52

Use of an Influenza Antigen Microarray to Measure the Breadth of Serum Antibodies Across Virus Subtypes

Published on: July 26, 2019

8.3K

Related Experiment Videos

Last Updated: Oct 25, 2025

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.5K
Multi-target Parallel Processing Approach for Gene-to-structure Determination of the Influenza Polymerase PB2 Subunit
22:10

Multi-target Parallel Processing Approach for Gene-to-structure Determination of the Influenza Polymerase PB2 Subunit

Published on: June 28, 2013

13.4K
Use of an Influenza Antigen Microarray to Measure the Breadth of Serum Antibodies Across Virus Subtypes
08:52

Use of an Influenza Antigen Microarray to Measure the Breadth of Serum Antibodies Across Virus Subtypes

Published on: July 26, 2019

8.3K

Area of Science:

  • Epidemiology
  • Public Health Informatics
  • Computational Biology

Background:

  • Health agencies disseminate crucial outbreak data as text, posing accessibility challenges for researchers.
  • Lack of machine-readable data hinders timely analysis and response during infectious disease emergencies.

Purpose of the Study:

  • To develop and validate a generalizable algorithm for automatically curating text-based outbreak information.
  • To transform unstructured outbreak data into a machine-readable format for enhanced research accessibility.

Main Methods:

  • Developed a data assembly algorithm utilizing regular expressions.
  • Curated data from formal reports, email newsletters, and Twitter across three distinct outbreaks.
  • Created manual validation datasets for performance comparison.

Main Results:

  • The algorithm demonstrated high accuracy, with cumulative missingness and misidentification rates ≤2% and ≤1%, respectively.
  • Successfully curated data across diverse sources and three different infectious disease outbreaks.
  • Validated performance against manually curated datasets.

Conclusions:

  • The developed algorithm effectively transforms text-based outbreak information into machine-readable data.
  • Addresses a critical need for generalizable tools in outbreak research.
  • Facilitates improved data analysis and response for future infectious disease events.