Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Blast sampling for structural and functional analyses.

Anne Friedrich1, Raymond Ripp, Nicolas Garnier

  • 1Laboratoire de Bioinformatique et Génomique Intégratives, Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France. friedric@igbmc.u-strasbg.fr <friedric@igbmc.u-strasbg.fr>

BMC Bioinformatics
|February 27, 2007
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Microstructural Properties and Pressure Distribution in Ultra-Short-Pulse Welds of Sapphire to Iron.

Nanomaterials (Basel, Switzerland)·2026
Same author

Population-scale chemical response revealed by a barcoded yeast collection.

Nature communications·2026
Same author

Novel <i>PCDH12</i> pathogenic missense variants cause neurodevelopmental disorders with ocular malformation.

medRxiv : the preprint server for health sciences·2026
Same author

Domestication drives repeated evolution of sexual-asexual life cycle trade-offs in yeast.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Preferences, attitudes and views regarding genetic newborn screening (gNBS) for rare diseases: a systematic review of the literature and synthesis from 2009 to 2022.

Orphanet journal of rare diseases·2026
Same author

Dynamics of genome evolution in the era of pangenome analysis.

Cell genomics·2025
Same journal

SNPio: a Python interface for population genomic data processing.

BMC bioinformatics·2026
Same journal

SpaHNR: a spatial domain identification method via sparse attention-based hierarchical node representation and multi-view contrastive learning.

BMC bioinformatics·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
See all related articles

Efficiently sampling homologous sequences is crucial in the post-genomic era. This study compares four methods to reduce sequence data while retaining key information for structural and functional analysis.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • The post-genomic era generates vast amounts of biological data, making sequence similarity searches yield numerous homologous sequences.
  • Analyzing this large volume of data for structural, functional, and evolutionary insights is challenging.
  • Efficient strategies are needed to sample sequence space and reduce the number of sequences for analysis while retaining critical information.

Purpose of the Study:

  • To compare the efficiency of four different sequence sampling methods.
  • To identify a sampling strategy that reduces the number of homologous sequences while preserving essential information for structural and functional studies.

Main Methods:

  • Evaluated four distinct sampling methods on a large dataset of 284 protein families.

Related Experiment Videos

  • Compared methods based on user-defined parameters (maximal number or percentage) and automatic program-determined selection.
  • Utilized multiple sequence alignment as the primary validation tool to assess information content.
  • Analyzed the impact of sampling on E-value distributions, sequence coverage, alignment quality, and active site characterization.
  • Main Results:

    • Four sampling methods were compared for their efficiency in selecting pertinent homologous sequences.
    • The study analyzed the information content of sampled sets using multiple sequence alignment.
    • The influence of sampling methods on E-value distributions, sequence coverage, alignment quality, and active site characterization was investigated.

    Conclusions:

    • A suitable sampling strategy was identified through comparative analysis of the four methods.
    • The proposed strategy significantly reduces the number of homologous sequences needed for alignment.
    • This approach effectively maintains relevant information, particularly concerning active site residues.