Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

EST clustering error evaluation and correction.

Ji-Ping Z Wang1, Bruce G Lindsay, James Leebens-Mack

  • 1Department of Statistics, Northwestern University, Evanston, IL 60208, USA. jzwang@northwestern.edu

Bioinformatics (Oxford, England)
|June 11, 2004
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Natural history and 12-month progression of multiple system atrophy in a Chinese cohort.

BMC neurology·2026
Same author

Low-Frequency Repetitive Nerve Stimulation in Myasthenia Gravis and Lambert-Eaton Myasthenic Syndrome.

Journal of clinical neurophysiology : official publication of the American Electroencephalographic Society·2026
Same author

Identity Styles and Social Adaptation in Emerging Adulthood: The Mediating Role of Coping Styles and the Moderating Role of Tolerance for Uncertainty.

The Journal of genetic psychology·2026
Same author

Interface-homogenized engineering of red phosphorus/activated carbon for high-performance lithium-ion battery anodes.

Nanoscale·2026
Same author

DR-MHEA-GCN: A Semisupervised Graph Framework with Dual-Residual Edge Attention for Chemical Bond Classification in 2D Nanomaterials.

The journal of physical chemistry letters·2026
Same author

Slowly progressive amyotrophic lateral sclerosis associated with a rare SOD1 Pro67Arg mutation: a case report.

Neurological sciences : official journal of the Italian Neurological Society and of the Italian Society of Clinical Neurophysiology·2026
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Expressed Sequence Tag (EST) clustering errors inflate gene counts. This study quantifies Type I and II errors, finding Type I errors in 5' ESTs are significantly higher, and proposes a novel statistical method to correct these errors for accurate gene profiling.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Expressed Sequence Tag (EST) data provides gene expression insights but is hindered by clustering errors.
  • Inaccurate gene number and expression patterns result from inflated estimates due to these errors.
  • Systematic investigation of EST clustering error structure and correction methods is crucial.

Purpose of the Study:

  • To identify and quantify types of EST clustering errors (Type I and Type II).
  • To analyze the impact of clustering criteria on error rates.
  • To develop a novel statistical approach for correcting specific EST clustering errors.

Main Methods:

  • Utilized the CAP3 assembly program for EST clustering.
  • Quantified Type I and Type II errors in both 5' and 3' EST datasets.

Related Experiment Videos

  • Developed and proposed a statistical method to correct ISO (InSufficient Overlap) error.
  • Main Results:

    • Identified and quantified Type I and Type II EST clustering errors.
    • Type II error rate remained low (<1.5%) for both 5' and 3' ESTs.
    • Type I error rate was substantially higher for 5' ESTs (approx. 30%) compared to 3' ESTs (approx. 3%).
    • Over-stringent identity rules (e.g., P >/= 95%) exacerbated Type I errors.
    • Identified insufficient overlap among sibling ESTs (ISO error) as the primary cause (approx. 80%) of Type I errors in 5' EST clustering.

    Conclusions:

    • EST clustering errors, particularly Type I, significantly impact gene expression analysis.
    • A novel statistical approach effectively corrects ISO errors, leading to more accurate gene cluster profiles.
    • The findings provide a method to improve the reliability of gene expression studies using EST data.