Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein Networks02:26

Protein Networks

An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
Protein Networks02:26

Protein Networks

An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
Protein-protein Interfaces02:04

Protein-protein Interfaces

Many proteins form complexes to carry out their functions, making protein-protein interactions (PPIs) essential for an organism's survival. Most PPIs are stabilized by numerous weak noncovalent chemical forces. The physical shape of the interfaces determines the way two proteins interact. Many globular proteins have closely-matching shapes on their surfaces, which form a large number of weak bonds. Additionally, many PPIs occur between two helices or between a surface cleft and a polypeptide...
Proteomics01:33

Proteomics

A proteome is the entire set of proteins that a cell type produces. We can study proteomes using the knowledge of genomes because genes code for mRNAs, and the mRNAs encode proteins. Although mRNA analysis is a step in the right direction, not all mRNAs are translated into proteins.
Proteomics is the study of proteomes' function. It involves the large-scale systematic study of the proteome to denote the protein complement expressed by a genome. Scientist Mark Wilkins coined the term proteomics...
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same authorSame journal

Folding the unfoldable 2: using AlphaFold and ESMFold to explore spurious proteins.

Bioinformatics advances·2026
Same author

Viral non-coding RNA structure annotation and API-based data retrieval with Rfam and R2DT.

bioRxiv : the preprint server for biology·2026
Same author

Empowering biological knowledgebases: advances in human-in-the-loop AI-driven literature curation.

Bioinformatics advances·2026
Same author

Amos Bairoch (1957-2025): pioneer of bioinformatics and founder of Swiss-Prot.

Bioinformatics advances·2026
Same author

Same-different learning of odour stimuli in dogs.

Animal cognition·2026
Same author

Expression Atlas in 2026: enabling FAIR and open expression data through community collaboration and integration.

Nucleic acids research·2025
Same journal

MOREshiny: a user-friendly application for the inference of phenotype-specific multi-omic regulatory networks.

Bioinformatics advances·2026
Same journal

spammR: an R package designed for analysis and integration of spatial multi-omic measurements.

Bioinformatics advances·2026
Same journal

Interpretable prediction and generation of ASC-speck aptamers using multiscale deep biological learning models.

Bioinformatics advances·2026
Same journal

vClassifier: a toolkit for high-resolution phylogenetic classification of prokaryotic viruses.

Bioinformatics advances·2026
Same journal

GWAIS-Web: a free and secure web service for ultra-fast and large-scale genome-wide association interaction studies.

Bioinformatics advances·2026
See all related articles

Related Experiment Video

Updated: Jun 2, 2026

Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames
07:38

Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames

Published on: April 11, 2019

InterProScan 6: a modern large-scale protein function annotation pipeline.

Matthias Blum1, Emma Hobbs1, Laise Florentino1

  • 1European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, CB10 1SD, United Kingdom.

Bioinformatics Advances
|June 1, 2026
PubMed
Summary
This summary is machine-generated.

InterProScan 6, a Nextflow pipeline, enhances protein function annotation by improving scalability and reproducibility. This new version offers faster analysis and integrates modern predictors for efficient genome-scale protein classification.

More Related Videos

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes
09:10

A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes

Published on: May 22, 2018

Related Experiment Videos

Last Updated: Jun 2, 2026

Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames
07:38

Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames

Published on: April 11, 2019

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes
09:10

A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes

Published on: May 22, 2018

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • InterProScan 5 is a widely used tool for protein function classification, integral to major annotation pipelines.
  • InterProScan 5 faced challenges in scalability, installation, data management, and workflow integration.

Purpose of the Study:

  • To present InterProScan 6, a reimplementation addressing InterProScan 5's limitations.
  • To enhance scalability, portability, and reproducibility of protein annotation across diverse computational environments.

Main Methods:

  • Reimplemented InterProScan as a Nextflow pipeline.
  • Decoupled application code from signature data.
  • Integrated native container support, on-demand data management, and deep-learning predictors.
  • Developed a redesigned Matches API for JSON annotation retrieval.

Main Results:

  • InterProScan 6 demonstrated approximately two-fold speedups on large eukaryotic proteomes compared to InterProScan 5.
  • Runtimes were reduced to minutes when using the Matches API for pre-computed annotations.
  • InterProScan 6 reproduced InterProScan 5 results with near-identical precision and sensitivity.

Conclusions:

  • InterProScan 6 provides a more efficient, flexible, and reproducible solution for genome-scale protein function annotation.
  • The Nextflow implementation enhances usability across local, HPC, and cloud platforms.