Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Setting Time of Cement01:12

Setting Time of Cement

709
The setting time of cement refers to the process of cement paste transitioning from a plastic state to a solid state. This process is crucial in construction as it dictates the timeframe for concrete placement, compaction, and finishing. The onset of this solidification is termed the initial set, indicating when the paste becomes unworkable. The final set is when the paste has solidified completely, and further handling or manipulation can no longer affect its shape. The cement strength is...
709
Linear time-invariant Systems01:23

Linear time-invariant Systems

937
A system is linear if it displays the characteristics of homogeneity and additivity, together termed the superposition property. This principle is fundamental in all linear systems. Linear time-invariant (LTI) systems include systems with linear elements and constant parameters.
The input-output behavior of an LTI system can be fully defined by its response to an impulsive excitation at its input. Once this impulse response is known, the system's reaction to any other input can be...
937
Linear Approximation in Time Domain01:21

Linear Approximation in Time Domain

376
Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...
376
Voltammetric Techniques: Linear-Scan (E vs Time)01:12

Voltammetric Techniques: Linear-Scan (E vs Time)

1.3K
Polarography is a classical voltammetric technique used to analyze electrochemical reactions. This method applies a linear potential sweep to a dropping mercury electrode (DME), and the resulting current is measured. A dropping mercury electrode is commonly used as the working electrode in polarography. It consists of a capillary tube filled with mercury, where the tiny droplet forms at the tip. This droplet continuously drops from the capillary, creating a new electrode surface for each...
1.3K
Linear Circuits01:17

Linear Circuits

882
A linear circuit is characterized by its output having a direct proportionality to its input, adhering to the linearity property, which encompasses the principles of homogeneity (scaling) and additivity. Homogeneity dictates that when the input, also referred to as the excitation, is multiplied by a constant factor, the output, known as the response, is correspondingly scaled by the same constant factor. For instance, if the current is multiplied by a constant 'k,' the voltage likewise...
882
Linear Equations01:27

Linear Equations

498
Linear equations form the foundation of many algebraic and real-world applications, characterized by their simplicity and utility. A linear equation is an algebraic statement in which each term is either a constant or a product of a constant and a single variable. These equations represent straight lines when plotted on a Cartesian coordinate plane, reflecting a constant rate of change between two quantities.A typical linear equation in one variable has the form: ax + b = c, where a, b, and c...
498

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Structural motif search across the protein universe with Folddisco.

Nature biotechnology·2026
Same author

A novel method to select Reference Proteomes in UniProt.

bioRxiv : the preprint server for biology·2026
Same author

Multiple protein structure alignment at scale with FoldMason.

Science (New York, N.Y.)·2026
Same author

Protein structure-informed bacteriophage genome annotation with Phold.

Nucleic acids research·2026
Same author

AlphaFold Protein Structure Database 2025: a redesigned interface and updated structural coverage.

Nucleic acids research·2025
Same author

Democratizing protein language model training, sharing and collaboration.

Nature biotechnology·2025
Same journal

Demonstration of a quantum C-NOT gate in a time-multiplexed fully reconfigurable photonic processor.

Nature communications·2026
Same journal

Nonlinear quantum light source with van der Waals ferroelectric NbOX<sub>2</sub> (X = Br, I).

Nature communications·2026
Same journal

Antagonistic histone H2A variants and autonomous heterochromatin formation shape epigenomic patterns in Arabidopsis.

Nature communications·2026
Same journal

The long tail of nitrate pollution in groundwater challenges governance of global water quality.

Nature communications·2026
Same journal

Select microbial metabolites promote tau aggregation in a murine tauopathy model.

Nature communications·2026
Same journal

Warming climate has lengthened global intense tropical cyclone seasons.

Nature communications·2026
See all related articles

Related Experiment Video

Updated: Feb 8, 2026

Linearization of the Bradford Protein Assay
06:35

Linearization of the Bradford Protein Assay

Published on: April 12, 2010

103.6K

Clustering huge protein sequence sets in linear time.

Martin Steinegger1,2,3, Johannes Söding4

  • 1Quantitative and Computational Biology group, Max-Planck Institute for Biophysical Chemistry, Am Fassberg 11, 37077, Göttingen, Germany. martin.steinegger@mpibpc.mpg.de.

Nature Communications
|July 1, 2018
PubMed
Summary
This summary is machine-generated.

We developed Linclust, a novel protein sequence clustering algorithm that significantly accelerates the analysis of large metagenomic datasets. This new method enables rapid functional annotation and structure prediction from billions of protein sequences.

More Related Videos

Investigating Protein Sequence-structure-dynamics Relationships with Bio3D-web
09:51

Investigating Protein Sequence-structure-dynamics Relationships with Bio3D-web

Published on: July 16, 2017

16.1K
JUMPn: A Streamlined Application for Protein Co-Expression Clustering and Network Analysis in Proteomics
07:28

JUMPn: A Streamlined Application for Protein Co-Expression Clustering and Network Analysis in Proteomics

Published on: October 19, 2021

3.7K

Related Experiment Videos

Last Updated: Feb 8, 2026

Linearization of the Bradford Protein Assay
06:35

Linearization of the Bradford Protein Assay

Published on: April 12, 2010

103.6K
Investigating Protein Sequence-structure-dynamics Relationships with Bio3D-web
09:51

Investigating Protein Sequence-structure-dynamics Relationships with Bio3D-web

Published on: July 16, 2017

16.1K
JUMPn: A Streamlined Application for Protein Co-Expression Clustering and Network Analysis in Proteomics
07:28

JUMPn: A Streamlined Application for Protein Co-Expression Clustering and Network Analysis in Proteomics

Published on: October 19, 2021

3.7K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Metagenomic datasets contain billions of protein sequences, offering vast potential for functional annotation and structure prediction.
  • Current similarity clustering algorithms struggle with large datasets due to near-quadratic runtime scaling with input size (N).

Purpose of the Study:

  • To develop a scalable and efficient clustering algorithm for massive protein sequence datasets.
  • To overcome the computational limitations of existing methods for analyzing metagenomic data.

Main Methods:

  • Development of Linclust, a novel clustering algorithm with runtime scaling linearly with input size (N), independent of cluster count (K).
  • Implementation of Linclust capable of processing datasets exceeding available main memory.
  • Clustering of 1.6 billion metagenomic sequence fragments at 50% sequence identity.

Main Results:

  • Linclust achieved a runtime of 10 hours on a single server for 1.6 billion sequences.
  • The new algorithm is over 1000 times faster than previous methods for large-scale clustering.
  • Linclust demonstrates scalability for datasets significantly larger than main memory.

Conclusions:

  • Linclust offers a breakthrough in efficiently clustering massive protein sequence datasets.
  • This advancement will facilitate large-scale functional annotation and structure prediction from metagenomic and genomic data.
  • Linclust unlocks the potential of vast sequence databases for biological discovery.