Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Proteins: From Genes to Degradation02:11

Proteins: From Genes to Degradation

13.2K
Within a biological system, the DNA encodes the RNA, and the nucleotide sequence in the RNA further defines the amino acid sequence in the protein. This is referred to as “The Central Dogma of Molecular Biology” - a term coined by Francis Crick.  Central dogma is a firm principle in biology that defines the flow of genetic information within any life form. The two fundamental steps in central dogma are - transcription and translation.
Transcription is the synthesis of RNA...
13.2K
Protein Networks02:26

Protein Networks

2.5K
2.5K
Peptide Identification Using Tandem Mass Spectrometry01:33

Peptide Identification Using Tandem Mass Spectrometry

7.3K
Tandem mass spectrometry, also known as MS/MS or MS2, is an analytical technique that employs two mass analyzers. Essentially it is a series of mass spectrometers that helps isolate a particular biomolecule and then helps study its chemical properties.
This technique helps gather information regarding the protein from which the peptide was obtained and to study the peptides’ amino acid sequence. Identifying peptides from a complex mixture is an important component of the growing field of...
7.3K
Signal Sequences and Sorting Receptors01:41

Signal Sequences and Sorting Receptors

11.5K
Signal sequences are short amino acid sequences that guide newly synthesized proteins to their proper location within the cell. Classical signal sequences are fifteen to sixty amino acids long and present at the N-terminus of a polypeptide chain. Each signal sequence has a conserved segment of basic residues towards their N terminus, a hydrophobic core, and a C-terminus rich in polar residues. The C-terminus also contains a signal cleavage site and features a -3 -1 sequence motif. The -3-1...
11.5K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

19.6K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
19.6K
Genetic Lingo01:11

Genetic Lingo

108.1K
Overview
108.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Alexithymia in schizophrenia spectrum disorders and dissociative disorders: two meta-analytic reviews.

Schizophrenia (Heidelberg, Germany)·2026
Same author

Immediate Postoperative Pacing Dependency Is Not Associated With Worse Mid-Term Survival in Redo Mitral Valve Surgery.

The American journal of cardiology·2026
Same author

Beta-Blocker therapy improves survival after concomitant septal myectomy and aortic valve replacement.

Journal of cardiothoracic surgery·2026
Same author

Using large language models to address the bottleneck of georeferencing natural history collections.

Nature plants·2025
Same author

PromptLink: Leveraging Large Language Models for Cross-Source Biomedical Concept Linking.

International ACM SIGIR Conference on Research and Development in Information Retrieval. Annual International ACMSIGIR Conference on Research & Development in Information Retrieval·2025
Same author

EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2025
Same journal

Evaluation of temporal preservation in synthetic longitudinal patient data.

Journal of biomedical informatics·2026
Same journal

ARKE: An ontology-driven framework for automated mapping of local radiology procedure terms to the LOINC-RadLex playbook using large language model.

Journal of biomedical informatics·2026
Same journal

A validation-driven training controller for cross-lingual biomedical NER via reinforcement learning-based adaptive loss weighting.

Journal of biomedical informatics·2026
Same journal

ASP-HR: An Adaptive Spatial Perception and Hierarchical Reasoning mechanism for document-level biomedical relation extraction.

Journal of biomedical informatics·2026
Same journal

Beyond Accuracy: Safety-Centered guidelines for the evaluation of LLM-based therapy recommendation systems for chronic multimorbidity patients.

Journal of biomedical informatics·2026
Same journal

DeepEN: A deep reinforcement learning framework for personalized enteral nutrition in critical care.

Journal of biomedical informatics·2026
See all related articles

Related Experiment Video

Updated: Oct 24, 2025

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

8.9K

PMCVec: Distributed phrase representation for biomedical text processing.

Zelalem Gero1, Joyce Ho1

  • 1Emory University, Department of Computer Science, Atlanta, USA.

Journal of Biomedical Informatics
|August 13, 2021
PubMed
Summary
This summary is machine-generated.

This study introduces PMCVec, a new method for creating better biomedical text representations. It effectively captures the meaning of both single words and important multi-word phrases, improving various natural language processing tasks.

Keywords:
Biomedical NLPPhrase embeddingsPubMed abstracts

More Related Videos

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

1.9K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

775

Related Experiment Videos

Last Updated: Oct 24, 2025

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

8.9K
A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

1.9K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

775

Area of Science:

  • Biomedical Natural Language Processing
  • Computational Linguistics
  • Bioinformatics

Background:

  • Distributed semantic representations enhance biomedical text analysis, including classification and information retrieval.
  • Current models like Word2Vec and GloVe primarily offer unigram representations, limiting the accurate semantic capture of multi-word biomedical phrases.
  • Accurate representation of technical phrases (diseases, drugs) as single entities is crucial for biomedical text processing.

Purpose of the Study:

  • To introduce PMCVec, an unsupervised technique for generating phrase embeddings from biomedical text.
  • To enable simultaneous learning of embeddings for both single words and important multi-word phrases.
  • To improve semantic representation for enhanced biomedical text analysis.

Main Methods:

  • PMCVec utilizes an unsupervised approach to identify and extract key phrases from PubMed abstracts.
  • The technique learns distributed semantic representations (embeddings) for both individual words and identified multi-word phrases.
  • Embeddings are generated simultaneously to capture contextual meaning effectively.

Main Results:

  • PMCVec demonstrates significant performance improvements in benchmark evaluations.
  • Qualitative and quantitative analyses confirm the effectiveness of the learned embeddings.
  • The method successfully addresses the limitations of unigram-based models for biomedical phrases.

Conclusions:

  • PMCVec offers a novel and effective solution for learning comprehensive semantic representations in biomedical text.
  • The simultaneous embedding of words and phrases enhances the understanding of complex biomedical terminology.
  • This approach holds significant potential for advancing various biomedical NLP applications.