A biomedically oriented automatically annotated Twitter COVID-19 dataset.

Luis Alberto Robles Hernandez¹, Tiffany J Callahan², Juan M Banda¹

¹Department of Computer Science, Georgia State University, Atlanta, GA 30303, USA.

Genomics & Informatics

October 12, 2021

Summary

This summary is machine-generated.

Researchers created a large dataset of over 120 million automatically annotated tweets for biomedical research. The resource addresses the need for accessible social media data for studying diseases such as COVID-19 and their impacts.

Keywords:

COVID-19; biomedical annotations; datasets; social media data

Area of Science:

Biomedical Informatics
Computational Social Science
Public Health Informatics

Background:

Social media, particularly Twitter, is increasingly used for biomedical research.
The COVID-19 pandemic highlighted the need for real-time clinical data from non-traditional sources.
Manual annotation of social media data is costly, time-consuming, and often results in small, non-generalizable datasets.

Purpose of the Study:

To develop and release a large-scale, automatically annotated dataset of tweets for biomedical research.
To address the limitations of manually curated datasets in terms of cost, size, and generalizability.
To facilitate near-real-time analysis of diseases, interventions, and sequelae using social media data.

Main Methods:

Applied established best practices for identifying tweets with potential clinical relevance.
Evaluated multiple spaCy-based annotation frameworks against a manually annotated gold-standard dataset.
Selected the optimal automatic annotation method and applied it to over 120 million tweets.
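
The evaluation and selection steps above can be illustrated with a minimal sketch. It assumes exact-match precision, recall, and F1 over annotation spans as the comparison metric; the summary does not say which metric the authors used. The annotator names, labels, and spans below are hypothetical and exist only for illustration.

```python
from typing import Dict, Set, Tuple

# An annotation is (tweet_id, start_char, end_char, label).
# All values in this sketch are invented for illustration.
Annotation = Tuple[str, int, int, str]


def precision_recall_f1(
    predicted: Set[Annotation], gold: Set[Annotation]
) -> Tuple[float, float, float]:
    """Exact-match precision, recall, and F1 of predicted annotations against gold."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


def select_best(candidates: Dict[str, Set[Annotation]], gold: Set[Annotation]) -> str:
    """Return the name of the candidate annotator with the highest F1 (assumed criterion)."""
    return max(candidates, key=lambda name: precision_recall_f1(candidates[name], gold)[2])


# Hypothetical gold-standard annotations for three tweets.
gold = {
    ("t1", 0, 5, "SYMPTOM"),
    ("t1", 10, 15, "DRUG"),
    ("t2", 3, 8, "SYMPTOM"),
    ("t3", 0, 4, "DRUG"),
}

# Hypothetical outputs of two candidate annotation frameworks.
candidates = {
    "annotator_a": {("t1", 0, 5, "SYMPTOM"), ("t2", 3, 8, "SYMPTOM"), ("t2", 20, 25, "DRUG")},
    "annotator_b": {("t1", 0, 5, "SYMPTOM"), ("t1", 10, 15, "DRUG"), ("t2", 3, 8, "SYMPTOM")},
}

for name, predicted in sorted(candidates.items()):
    p, r, f1 = precision_recall_f1(predicted, gold)
    print(f"{name}: precision={p:.3f} recall={r:.3f} f1={f1:.3f}")

print("selected:", select_best(candidates, gold))
```

Exact span matching is the strictest choice. A relaxed (overlap-based) match would score partial hits as correct and could change which framework ranks first.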

Main Results:

Publicly released a dataset of over 120 million automatically annotated tweets for biomedical research.
Demonstrated the feasibility of large-scale automatic annotation for clinical relevance.
Established a benchmark for automatic tweet annotation in the biomedical domain.

Conclusions:

The developed dataset provides a valuable, scalable resource for biomedical research.
Automatic annotation offers a cost-effective solution to the data scarcity problem in social media research.
This work enables broader and more efficient use of social media data for public health surveillance and clinical studies.