Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

612
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
612

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Interpretable machine learning uncovers structural determinants of Wnt-Wntless binding specificity from atomistic simulations.

Communications chemistry·2026
Same author

Holistic evaluation of large language models for medical tasks with MedHELM.

Nature medicine·2026
Same author

Monitoring Deployed AI Systems in Health Care.

ArXiv·2025
Same author

Mondo: integrating disease terminology across communities.

Genetics·2025
Same author

Feasibility of Automated Precharting using GPT-4 in New Specialty Referrals.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2025
Same author

Red teaming ChatGPT in medicine to yield real-world insights on model behavior.

NPJ digital medicine·2025
Same journal

Precision Medicine Gene Network Analyser: part I-cancer driver gene identification through network topology and ensemble machine learning.

Genomics & informatics·2026
Same journal

A bioinformatics pipeline for the design of a SART3-targeted cancer vaccine with enhanced immunogenicity.

Genomics & informatics·2026
Same journal

Measuring the gap: correlating synthetic-to-real drift with PHI de-identification performance.

Genomics & informatics·2026
Same journal

Correction: Towards a transparent and reproducible AI-assisted research paper writing.

Genomics & informatics·2026
Same journal

Correction: Peptide‑based therapeutics targeting the SLC39A14‑PIWIL2 fusion in hepatocellular carcinoma.

Genomics & informatics·2026
Same journal

BioOne: a national-scale platform for integrated discovery and utilization of diverse biological resources in South Korea.

Genomics & informatics·2026
See all related articles

Related Experiment Video

Updated: Oct 17, 2025

Lung CT Segmentation to Identify Consolidations and Ground Glass Areas for Quantitative Assesment of SARS-CoV Pneumonia
08:05

Lung CT Segmentation to Identify Consolidations and Ground Glass Areas for Quantitative Assesment of SARS-CoV Pneumonia

Published on: December 19, 2020

14.4K

A biomedically oriented automatically annotated Twitter COVID-19 dataset.

Luis Alberto Robles Hernandez1, Tiffany J Callahan2, Juan M Banda1

  • 1Department of Computer Science, Georgia State University, Atlanta, GA 30303, USA.

Genomics & Informatics
|October 12, 2021
PubMed
Summary
This summary is machine-generated.

Researchers created a large, automatically annotated dataset of 120 million tweets for biomedical research. This resource addresses the need for accessible social media data to study diseases like COVID-19 and their impacts.

Keywords:
COVID-19biomedical annotationsdatasetssocial media data

More Related Videos

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

16.1K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

746

Related Experiment Videos

Last Updated: Oct 17, 2025

Lung CT Segmentation to Identify Consolidations and Ground Glass Areas for Quantitative Assesment of SARS-CoV Pneumonia
08:05

Lung CT Segmentation to Identify Consolidations and Ground Glass Areas for Quantitative Assesment of SARS-CoV Pneumonia

Published on: December 19, 2020

14.4K
A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

16.1K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

746

Area of Science:

  • Biomedical Informatics
  • Computational Social Science
  • Public Health Informatics

Background:

  • Social media, particularly Twitter, is increasingly used for biomedical research.
  • The COVID-19 pandemic highlighted the need for real-time clinical data from non-traditional sources.
  • Manual annotation of social media data is costly, time-consuming, and often results in small, non-generalizable datasets.

Purpose of the Study:

  • To develop and release a large-scale, automatically annotated dataset of tweets for biomedical research.
  • To address the limitations of manually curated datasets in terms of cost, size, and generalizability.
  • To facilitate near-real-time analysis of diseases, interventions, and sequelae using social media data.

Main Methods:

  • Leveraged best-practices for identifying tweets with potential clinical relevance.
  • Evaluated multiple SpaCy-based annotation frameworks against a manually annotated gold-standard dataset.
  • Selected the optimal automatic annotation method and applied it to over 120 million tweets.

Main Results:

  • A publicly released dataset of 120 million automatically annotated tweets for biomedical research.
  • Demonstrated the feasibility of large-scale automatic annotation for clinical relevance.
  • Established a benchmark for automatic tweet annotation in the biomedical domain.

Conclusions:

  • The developed dataset provides a valuable, scalable resource for biomedical research.
  • Automatic annotation offers a cost-effective solution to the data scarcity problem in social media research.
  • This work enables broader and more efficient use of social media data for public health surveillance and clinical studies.