Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Minimal-risk scoring matrices for sequence analysis.

T D Wu1, C G Nevill-Manning, D L Brutlag

  • 1Department of Biochemistry, Stanford University School of Medicine, California 94305-5307, USA. thomas.wu@stanford.edu

Journal of Computational Biology : a Journal of Computational Molecular Cell Biology
|July 27, 1999
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Disrupted Iron Storage in Dental Fluorosis.

Journal of dental research·2019
Same author

Kinetics of intracolloidal iodine in thyroid of iodine-deficient or equilibrated newborn rats. Direct imaging using secondary ion mass spectrometry.

Cellular and molecular biology (Noisy-le-Grand, France)·2008
Same author

Study of the localization of iron, ferritin, and hemosiderin in Alzheimer's disease hippocampus by analytical microscopy at the subcellular level.

Journal of structural biology·2005
Same author

Expression of vascular endothelial growth factor, hypoxia inducible factor 1alpha, and carbonic anhydrase IX in human tumours.

Journal of clinical pathology·2004
Same author

Automated construction of structural motifs for predicting functional sites on protein structures.

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing·2003
Same author

Bioinformatics in the post-genomic era.

Trends in biotechnology·2001
Same journal

GMSA: A Graph Matching and Point Cloud Registration-Based Method for Spatial Transcriptomics Data Alignment.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Investigations on Multiple Protein Scaffold Filling.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Cell Type Prediction for Single-Cell RNA Sequencing Utilizing Unsupervised Domain Adaptation and Semi-Supervised Learning.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

PPIGAN: Prediction of Protein-Protein Interactions Using Generative Adversarial Networks.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Deep Structure-Enhanced Cell Clustering Model for Single-Cell RNA Sequencing Data.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Asymmetric Drug-Drug Interaction Prediction Based on Generative Adversarial Networks and Knowledge Graph.

Journal of computational biology : a journal of computational molecular cell biology·2026
See all related articles

We developed a minimal-risk method to accurately estimate amino acid frequencies in protein families. This approach optimizes weighting between observed data and prior information, outperforming other methods in cross-validation for sequence analysis.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Protein Sequence Analysis

Background:

  • Accurate estimation of amino acid frequencies is crucial for understanding protein families and conserved positions.
  • Existing methods may not optimally balance observed data with prior biological information.

Purpose of the Study:

  • To introduce a novel minimal-risk method for estimating amino acid frequencies at conserved protein positions.
  • To provide a robust framework for generating scoring matrices used in sequence analysis.

Main Methods:

  • Developed a minimal-risk estimation technique balancing observed amino acid counts with pseudofrequencies.
  • Utilized squared-error and relative-entropy metrics to minimize expected distance between estimated and true frequencies.
  • Incorporated pseudofrequency sources, including background distributions and substitution matrices.

Related Experiment Videos

Main Results:

  • Minimal-risk estimation, particularly with the squared-error metric, demonstrated superior performance in a large-scale cross-validation study.
  • The method's frequency estimates are influenced by observed data characteristics and pseudofrequency origins.
  • Converted frequency estimates into effective minimal-risk scoring matrices for sequence analysis.

Conclusions:

  • Minimal-risk estimation offers an improved approach for determining amino acid frequencies in conserved protein regions.
  • The Ematrix package provides a practical implementation of this superior method for bioinformatics applications.