Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Ligand Binding Sites02:40

Ligand Binding Sites

12.8K
Proteins are dynamic macromolecules that carry out a wide variety of essential processes; however, the activities of most proteins depend on their interactions with other molecules or ions, known as ligands.
Protein-ligand interactions are quite specific; even though numerous potential ligands surround a cellular protein at any given time, only a particular ligand can bind to that protein. Moreover, a ligand binds only to a dedicated area on the surface of the protein, known as the...
12.8K
Improving Translational Accuracy02:07

Improving Translational Accuracy

2.6K
2.6K
Leaky Scanning02:28

Leaky Scanning

5.1K
During most eukaryotic translation processes, the small 40S ribosome subunit scans an mRNA from its 5' end until it encounters the first start AUG codon. The large 60S ribosomal subunit then joins the smaller one to initiate protein synthesis. The location of the translation initiation is largely determined by the nucleotides near the start codon as there may be multiple translation initiation sites present on the mRNA.  Marilyn Kozak discovered that the sequence RCCAUGG (where R...
5.1K
Super-resolution Fluorescence Microscopy01:37

Super-resolution Fluorescence Microscopy

7.0K
Super-resolution fluorescence microscopy (SRFM) provides a better resolution than conventional fluorescence microscopy by reducing the point spread function (PSF). PSF is the light intensity distribution from a point that causes it to appear blurred. Due to PSF, each fluorescing point appears bigger than its actual size, and it is the PSF interference of nearby fluorophores that causes the blurred image. Various approaches to achieving higher resolution through SRFM have recently been...
7.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Predicting the Thermodynamic Limits of Metal-Organic Framework Metastability.

Journal of the American Chemical Society·2026
Same author

Ion correlations explain kinetic selectivity in diffusion-limited solid-state synthesis reactions.

Nature materials·2026
Same author

Identification of Solid-Electrolyte Interphase Species by Joint Characterization of Li-Ion Battery Chemistry by Mass Spectrometry and Electrochemical Reaction Networks.

Journal of the American Chemical Society·2026
Same author

Blood-based RNA-Seq of 5412 individuals with rare disease identifies new candidate diagnoses in the National Genomic Research Library.

medRxiv : the preprint server for health sciences·2026
Same author

Generative Models for Crystalline Materials.

Advanced materials (Deerfield Beach, Fla.)·2026
Same author

Li<sup>+</sup>/H<sup>+</sup> Exchange in Solid-State Oxide Li-Ion Conductors.

ACS energy letters·2026
Same journal

Demonstration of a quantum C-NOT gate in a time-multiplexed fully reconfigurable photonic processor.

Nature communications·2026
Same journal

Nonlinear quantum light source with van der Waals ferroelectric NbOX<sub>2</sub> (X = Br, I).

Nature communications·2026
Same journal

Antagonistic histone H2A variants and autonomous heterochromatin formation shape epigenomic patterns in Arabidopsis.

Nature communications·2026
Same journal

The long tail of nitrate pollution in groundwater challenges governance of global water quality.

Nature communications·2026
Same journal

Select microbial metabolites promote tau aggregation in a murine tauopathy model.

Nature communications·2026
Same journal

Warming climate has lengthened global intense tropical cyclone seasons.

Nature communications·2026
See all related articles

Related Experiment Video

Updated: Jul 3, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

558

Structured information extraction from scientific text with large language models.

John Dagdelen1,2, Alexander Dunn1,2, Sanghoon Lee1,2

  • 1Lawrence Berkeley National Laboratory, Berkeley, CA, USA.

Nature Communications
|February 15, 2024
PubMed
Summary
This summary is machine-generated.

We developed a simple machine learning method using large language models to extract structured scientific knowledge from text. This approach efficiently creates large databases for materials chemistry research.

More Related Videos

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

1.6K
A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

15.9K

Related Experiment Videos

Last Updated: Jul 3, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

558
A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

1.6K
A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

15.9K

Area of Science:

  • Materials Chemistry
  • Computational Science
  • Natural Language Processing

Background:

  • Extracting structured knowledge from scientific literature is a significant challenge for machine learning.
  • Existing methods may lack the flexibility to handle diverse scientific data types.

Purpose of the Study:

  • To present a simple, accessible approach for joint named entity recognition and relation extraction.
  • To demonstrate the effectiveness of fine-tuning pretrained large language models (LLMs) for scientific knowledge extraction.
  • To create large, structured databases from scientific research papers.

Main Methods:

  • Fine-tuning pretrained LLMs (GPT-3, Llama-2) for named entity recognition and relation extraction.
  • Applying the method to three materials chemistry tasks: dopant-host linking, metal-organic framework cataloging, and composition/phase/morphology/application extraction.
  • Processing text from single sentences to entire paragraphs.

Main Results:

  • Successfully extracted structured records of complex scientific knowledge.
  • Demonstrated flexibility in output formats, including simple English sentences and JSON objects.
  • Showcased the potential for creating large, specialized scientific knowledge databases.

Conclusions:

  • The proposed approach offers a highly flexible and accessible route to automated scientific knowledge extraction.
  • Fine-tuning LLMs is a viable strategy for building structured databases from unstructured scientific text.
  • This method can significantly accelerate the curation of scientific data for research and development.