Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genomics02:02

Genomics

41.4K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
41.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

An AI-Driven Pipeline for Localization, Segmentation, and Classification of Carpal Tunnel Syndrome Using Ultrasound Images of the Median Nerve.

Hand (New York, N.Y.)·2026
Same author

Trustworthy artificial intelligence in predictive medicine: cancer survival analysis using ethical-by-design and explainable artificial intelligence models.

JAMIA open·2026
Same author

Trustworthy and Uncertainty-Aware AI for Predicting Respiratory Complications Following Total Hip and Knee Arthroplasty.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same author

Making sense of TILs: recommendations for morphological assessment of tumour-infiltrating lymphocytes in gastro-oesophageal carcinoma: A report on behalf of the International Immuno-Oncology Biomarker Working Group.

Histopathology·2026
Same author

Explainable machine learning to predict prolonged post-operative opioid use in rotator cuff patients.

BMC musculoskeletal disorders·2025
Same author

Explainable Contrastive Learning for KL Grading Classification in Knee Osteoarthritis.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same journal

Thymidylate synthase inhibitory drugs induce p53-dependent pathways differently.

PloS one·2026
Same journal

Top-down and bottom-up attention for joint pattern classification and reconstruction.

PloS one·2026
Same journal

Short- and long-term scaling behavior of blood pressure and pulse arrival time during sleep in healthy controls and patients with obstructive sleep apnea.

PloS one·2026
Same journal

Double DQN-based secrecy energy efficiency and fairness performance in IRS-assisted NOMA systems with friendly jamming.

PloS one·2026
Same journal

10 recommendations for strengthening citizen science for improved societal and ecological outcomes: A co-produced analysis of challenges and opportunities in the 21st century.

PloS one·2026
Same journal

Paying in public: Peer effects, impression management, and willingness to pay on digital payment platforms.

PloS one·2026
See all related articles

Related Experiment Video

Updated: Mar 14, 2026

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

9.3K

SparkText: Biomedical Text Mining on Big Data Framework.

Zhan Ye1, Ahmad P Tafti2,3, Karen Y He4

  • 1Biomedical Informatics Research Center, Marshfield Clinic Research Foundation, Marshfield, WI, 54449, United States of America.

Plos One
|September 30, 2016
PubMed
Summary
This summary is machine-generated.

SparkText, a Big Data text mining framework, efficiently classifies cancer types from scientific articles. It achieved 93.81% accuracy in just 6 minutes, outperforming traditional methods.

More Related Videos

Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.9K
Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases
07:41

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases

Published on: May 17, 2019

9.7K

Related Experiment Videos

Last Updated: Mar 14, 2026

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

9.3K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.9K
Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases
07:41

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases

Published on: May 17, 2019

9.7K

Area of Science:

  • Biomedical Informatics
  • Computational Biology
  • Data Science

Background:

  • Daily publication of biomedical research generates vast data on genes, diseases, and treatments.
  • Accurate text mining of scientific literature is crucial for advancing disease understanding and improving healthcare.

Purpose of the Study:

  • To develop an efficient text mining framework for large-scale biomedical literature.
  • To demonstrate the framework's capability in classifying cancer types.

Main Methods:

  • Developed SparkText on a Big Data infrastructure using Apache Spark, machine learning, and a Cassandra NoSQL database.
  • Extracted cancer-related information from tens of thousands of PubMed articles.
  • Utilized Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression for classification models.

Main Results:

  • SparkText achieved 93.81% accuracy in predicting cancer types using SVM on 29,437 full-text articles.
  • The framework processed the dataset in approximately 6 minutes, significantly faster than competing tools (over 11 hours).

Conclusions:

  • Big Data infrastructure enables efficient mining of large-scale scientific articles with real-time updates.
  • SparkText demonstrates potential for real-time biomedical knowledge discovery and can be applied to other research areas.