Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genomics02:02

Genomics

Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
Chi-square Analysis02:46

Chi-square Analysis

The chi-square test is a statistical hypothesis test. It is used to check whether there is a significant difference between an expected value and an observed value. In the context of genetics, it enables us to either accept or reject a hypothesis, based on how much the observed values deviate from the expected values.
The chi-square test was developed by Pearson in 1990.
The first step of performing a Chi-square analysis is to establish a null hypothesis, which assumes that there is no real...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Applications of Machine Learning, Natural Language Processing, and Generative Artificial Intelligence in Dermatology Education and Research: A Scoping Review.

International journal of dermatology·2026
Same author

Digital Transformation of Rural, Regional, and Remote Australian Hospitals: A Pragmatic Strategy for Introducing AI.

The Australian journal of rural health·2026
Same author

Using clinical simulation to evaluate a video telehealth consultation summary application.

NPJ digital medicine·2026
Same author

Crossover Evaluation of Two Ambient AI Scribe Tools in the Emergency Department.

Applied clinical informatics·2026
Same author

Implementation of a digital coordination centre in a hospital: a qualitative evaluation of enablers, barriers and strategies.

BMC health services research·2025
Same author

International partnership for governing generative artificial intelligence models in medicine.

Nature medicine·2025
Same journal

Sensitivity Analyses of a Scoring System for a Contraception Decision Aid.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Improving electronic health record processing of large language models via retrieval-augmented generation: A case study on dietary supplements.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Developing a User-Centered Mobile Application Prototype: Bridging Lower-Limb Fracture Care from Skilled Nursing Facility and Back to the Community.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Automating Adjudication of Cardiovascular Events Using Large Language Models.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Predictive Factors and State-Level Barriers to Postpartum Birth Control Usage in the United States: Insights from PRAMS Phase 8.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
See all related articles

Related Experiment Video

Updated: Jun 28, 2026

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

Identifying data sharing in biomedical literature.

Heather A Piwowar1, Wendy W Chapman, Wendy Chapman

  • 1University of Pittsburgh, Pittsburgh, PA, USA.

AMIA ... Annual Symposium Proceedings. AMIA Symposium
|November 13, 2008
PubMed
Summary
This summary is machine-generated.

Researchers developed a new method using natural language processing (NLP) to find shared research datasets in scientific papers. This approach helps evaluate data sharing policies more effectively.

More Related Videos

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases
07:41

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases

Published on: May 17, 2019

A Robust Discovery Platform for the Identification of Novel Mediators of Melanoma Metastasis
07:41

A Robust Discovery Platform for the Identification of Novel Mediators of Melanoma Metastasis

Published on: March 8, 2022

Related Experiment Videos

Last Updated: Jun 28, 2026

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases
07:41

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases

Published on: May 17, 2019

A Robust Discovery Platform for the Identification of Novel Mediators of Melanoma Metastasis
07:41

A Robust Discovery Platform for the Identification of Novel Mediators of Melanoma Metastasis

Published on: March 8, 2022

Area of Science:

  • Biomedical Informatics
  • Data Science
  • Scientific Publishing

Background:

  • Increasing emphasis on open science and data sharing in research.
  • Challenges in tracking and measuring the effectiveness of data sharing initiatives due to diverse sharing mechanisms.
  • Need for automated methods to identify shared datasets within scientific literature.

Purpose of the Study:

  • To propose and evaluate a novel approach for identifying shared datasets using natural language processing (NLP) techniques.
  • To assess the feasibility of using NLP to detect dataset sharing declarations in full-text research articles.
  • To provide a method for better evaluation of data sharing policies and initiatives.

Main Methods:

  • Application of NLP techniques, including regular expression patterns and machine learning algorithms.
  • Analysis of open access biomedical literature to identify mentions of dataset sharing.
  • Development and testing of two classifier versions with varying precision and recall.

Main Results:

  • A sophisticated NLP system identified 61% of articles with shared datasets with 80% precision.
  • A simpler classifier achieved higher recall (86%) but lower precision (49%).
  • Demonstrated the feasibility of using NLP for automated dataset retrieval.

Conclusions:

  • NLP techniques offer a viable method for discovering shared research datasets within publications.
  • The developed approach can aid in evaluating the impact of data sharing policies.
  • Further research into automated dataset retrieval and policy evaluation is encouraged.