Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

16.5K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
16.5K
Protein Networks02:26

Protein Networks

3.6K
An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
3.6K
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

11.7K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
11.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Coronary Artery Calcium Scans by Area Deprivation Index in a Multicenter Health System.

JACC. Advances·2026
Same author

Cohort Study of Initial Diuretic Dosing and Outcomes Among Patients Hospitalized for Congestive Heart Failure: Insights From the Cardiovascular Quality Improvement and Care Innovation Consortium.

Journal of the American Heart Association·2026
Same author

Association Between Artificial Intelligence-Detected Features on the ECG and Presence of Microvascular Obstruction.

Circulation. Cardiovascular interventions·2025
Same author

Three-Year Outcomes of Second Bioprosthesis After Early Surgical Aortic Bioprosthetic Failure.

JACC. Advances·2025
Same author

High-Profile Cardiac Arrests Correlate With Online Searches and Public Interest: A Call for Action.

Journal of the American Heart Association·2025
Same author

Incidence, predictors and outcomes of tricuspid regurgitation progression after left-sided valvular intervention.

Heart (British Cardiac Society)·2025
Same journal

Real-Time XFEL Data Analysis at SLAC and NERSC: a Trial Run of Nascent Exascale Experimental Data Analysis.

Concurrency and computation : practice & experience·2025
Same journal

BrainForge: an online data analysis platform for integrative neuroimaging acquisition, analysis, and sharing.

Concurrency and computation : practice & experience·2023
Same journal

Data-driven analysis and predictive modeling on COVID-19.

Concurrency and computation : practice & experience·2023
Same journal

A face mask detection system: An approach to fight with COVID-19 scenario.

Concurrency and computation : practice & experience·2023
Same journal

Poor and rich dolphin optimization algorithm with modified deep fuzzy clustering for COVID-19 patient analysis.

Concurrency and computation : practice & experience·2023
Same journal

Evaluating COVID-19 risk under the estimation of population mean using two attributes.

Concurrency and computation : practice & experience·2023
See all related articles

Related Experiment Video

Updated: Apr 22, 2026

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

1.3K

Optimizing high performance computing workflow for protein functional annotation.

Larissa Stanberry1, Bhanu Rekepalli2, Yuan Liu2

  • 1Bioinformatics & High-Throughput Analysis Laboratory and High-Throughput Analysis Core, Seattle Children's Research Institute (SCRI), DELSA Global, Seattle, WA 98101, USA.

Concurrency and Computation : Practice & Experience
|October 15, 2014
PubMed
Summary
This summary is machine-generated.

Annotating vast amounts of protein data is challenging. This study introduces an optimized automated workflow using high-performance computing for efficient large-scale protein annotation, ensuring high accuracy.

Keywords:
BLASTCOGHSPp-BLASTPSPSI-BLASTXSEDEcomputational bioinformaticsdata-enabled life sciencespetascaleprotein annotationprotein sequence universescience gatewayssequence similarity

More Related Videos

Label-Free Immunoprecipitation Mass Spectrometry Workflow for Large-scale Nuclear Interactome Profiling
11:19

Label-Free Immunoprecipitation Mass Spectrometry Workflow for Large-scale Nuclear Interactome Profiling

Published on: November 17, 2019

18.0K
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

70.1K

Related Experiment Videos

Last Updated: Apr 22, 2026

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

1.3K
Label-Free Immunoprecipitation Mass Spectrometry Workflow for Large-scale Nuclear Interactome Profiling
11:19

Label-Free Immunoprecipitation Mass Spectrometry Workflow for Large-scale Nuclear Interactome Profiling

Published on: November 17, 2019

18.0K
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

70.1K

Area of Science:

  • Genomics and Bioinformatics
  • Computational Biology
  • Molecular Biology

Background:

  • The rapid expansion of genomic sequencing generates massive amounts of protein data, overwhelming manual annotation efforts.
  • Existing automated protein annotation methods face limitations due to high computational costs.
  • Accurate functional annotation of newly sequenced genomes is crucial for biological discovery.

Purpose of the Study:

  • To develop and optimize an automated workflow for large-scale protein annotation.
  • To address the challenge of annotating millions of newly sequenced bacterial proteins efficiently.
  • To provide a scalable solution for the growing volume of genomic data.

Main Methods:

  • Implementation of an optimized automated workflow leveraging high-performance computing architectures.
  • Utilization of a low-complexity classification algorithm to assign proteins into clusters of orthologous groups.
  • Application of the Position-Specific Iterative Basic Local Alignment Search Tool (PSI-BLAST) for classification, ensuring high specificity and sensitivity (≥80%).
  • Employment of highly scalable parallel applications for sequence alignment and classification.

Main Results:

  • The workflow successfully processed 1,200,000 newly sequenced bacterial proteins using the Extreme Science and Engineering Discovery Environment (XSEDE) supercomputers.
  • The automated approach demonstrated high specificity and sensitivity in protein classification.
  • The optimized workflow significantly enhances the efficiency of large-scale protein annotation.

Conclusions:

  • The proposed automated workflow provides an efficient and scalable solution for the functional annotation of big genome data.
  • This approach overcomes the limitations of manual curation and computationally expensive existing methods.
  • The workflow is essential for keeping pace with the rapid expansion of the protein sequence universe and enabling future biological research.