Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

SPLASH: structural pattern localization analysis by sequential histograms.

A Califano1

  • 1IBM TJ Watson Research Center, PO Box 704, Yorktown Heights, NY 10598, USA. acal@us.ibm.com

Bioinformatics (Oxford, England)
|June 27, 2000
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

SARS-CoV-2 infection and H1N1 vaccination: does a relationship between the two factors really exist? A retrospective analysis of a territorial cohort in Ferrara, Italy.

European review for medical and pharmacological sciences·2021
Same author

Human papillomavirus vaccine effectiveness within a cervical cancer screening programme: cohort study.

BJOG : an international journal of obstetrics and gynaecology·2020
Same author

A meta-analysis of the accuracy of a neuroendocrine tumor mRNA genomic biomarker (NETest) in blood.

Annals of oncology : official journal of the European Society for Medical Oncology·2020
Same author

TOpic: Rare and Special Cases, The Real "Strange Cases".

Hernia : the journal of hernias and abdominal wall surgery·2015
Same author

STK38 is a critical upstream regulator of MYC's oncogenic activity in human B-cell lymphoma.

Oncogene·2012
Same author

Modeling heat transfer and inactivation of Escherichia coli O157:H7 in precooked meat products in Argentina using the finite element method.

Meat science·2011
Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026
Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026
Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026
Same journal

Informative Relational Learning for Adverse Reaction Prediction with Enhanced Generalization to Novel Drugs.

Bioinformatics (Oxford, England)·2026
Same journal

An interpretable deep learning framework uncovers features governing CRISPR-Cas9 genome-editing efficiency.

Bioinformatics (Oxford, England)·2026
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
See all related articles

Splash is a new algorithm for discovering sparse patterns in DNA and protein sequences. It is efficient, parallelizable, and can identify conserved regions for building accurate biological models.

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Genomics

Background:

  • Discovering sparse amino acid patterns in protein sequences is crucial for identifying evolutionarily conserved functional or structural regions.
  • Existing algorithms for sparse pattern discovery are often inefficient or limited in the types of patterns they can detect.
  • Conserved DNA patterns also hold significant biological information, necessitating effective discovery methods.

Purpose of the Study:

  • Introduce Splash, a novel deterministic algorithm for discovering sparse patterns in biological sequences.
  • Enable the identification of both identical and similar sparse patterns in protein and DNA sequences.
  • Overcome the limitations of existing algorithms in terms of efficiency and pattern type.

Main Methods:

Related Experiment Videos

  • Developed a deterministic pattern discovery algorithm named Splash.
  • Designed Splash to be highly efficient and inherently parallelizable.
  • Ensured Splash can discover sparse patterns of any length without performance degradation.
  • Main Results:

    • Splash successfully identifies sparse amino acid and nucleic acid patterns in protein and DNA sequences.
    • The algorithm demonstrates high efficiency, processing large databases like SWISS-PROT within hours on a standard workstation.
    • Identified biologically relevant motifs in histone I and G-Protein Coupled Receptor families, showcasing its utility.

    Conclusions:

    • Splash provides an efficient and versatile tool for discovering conserved sequence patterns.
    • Its speed and parallelizability allow for exhaustive analysis of protein families and large databases.
    • The discovered patterns can be leveraged to construct accurate Position-Specific Scoring Matrix (PSSM) or Hidden Markov Model (HMM) profiles for enhanced sequence analysis.