Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Extendable words in nucleotide sequences.

M S Gelfand1, C G Kozhukhin, P A Pevzner

  • 1Institute of Protein Research, Acad. Sci. USSR, Pushchino, Moscow Region.

Computer Applications in the Biosciences : CABIOS
|April 1, 1992
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

ProAD - A database of rotary ion-translocating ATPases in prokaryotic genomes.

Frontiers in molecular biosciences·2025
Same author

Towards practical applications in quantum computational biology.

Nature computational science·2024
Same author

Re: 'Cefazolin versus anti-staphylococcal penicillins for the treatment of patients with Staphylococcus aureus bacteremia' by Weis et al.

Clinical microbiology and infection : the official publication of the European Society of Clinical Microbiology and Infectious Diseases·2019
Same author

Serum-derived bovine immunoglobulin/protein isolate should be considered in patients with HIV gut barrier dysfunction.

Infection·2015
Same author

Posaconazole as successful treatment for fungemia due to Cryptococcus albidus in a liver transplant recipient.

QJM : monthly journal of the Association of Physicians·2012
Same author

Regulation and evolution of malonate and propionate catabolism in proteobacteria.

Journal of bacteriology·2012
Same journal

DCA: an efficient implementation of the divide-and-conquer approach to simultaneous multiple sequence alignment.

Computer applications in the biosciences : CABIOS·1998
Same journal

Two applications to facilitate the viewing of database search result files on the Macintosh.

Computer applications in the biosciences : CABIOS·1998
Same journal

BioWish: a molecular biology command extension to Tcl/Tk.

Computer applications in the biosciences : CABIOS·1998
Same journal

The Sequence Alerting Server--a new WEB server.

Computer applications in the biosciences : CABIOS·1998
Same journal

A software tool for the analysis of mass spectrometric disulfide mapping experiments.

Computer applications in the biosciences : CABIOS·1998
Same journal

SAMBA: hardware accelerator for biological sequence comparison.

Computer applications in the biosciences : CABIOS·1998
See all related articles

Statistical analyses reveal unique patterns in DNA and RNA sequences, distinguishing them from random text. These peculiarities, particularly word distribution anomalies, challenge existing models for nucleotide sequence description.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Statistical analyses have identified peculiarities in nucleotide sequences.
  • These peculiarities prevent description by existing models, enabling distinction from random texts.
  • Anomalous distributions of specific words (homonucleotide runs, palindromes, non-stationary words) are observed.

Purpose of the Study:

  • Introduce a novel probabilistic approach for nucleotide sequence analysis.
  • Investigate anomalies in the distribution of words with poor nucleotide composition.
  • Identify obstacles hindering simple Markov model descriptions of nucleotide sequences.

Main Methods:

  • Developed a probabilistic approach inspired by linguistic principles.
  • Analyzed word distribution patterns in DNA and RNA sequences.

Related Experiment Videos

  • Examined anomalies in homonucleotide/homopurine/homopyrimidine runs, palindromes, and non-stationary words.
  • Main Results:

    • Confirmed that certain word distributions in nucleotide sequences deviate from standard models.
    • Identified anomalies in the distribution of words with poor nucleotide composition.
    • Demonstrated that these specific word classes impede accurate Markov model descriptions.

    Conclusions:

    • The unique statistical properties of DNA/RNA sequences necessitate advanced descriptive models.
    • A probabilistic, linguistics-inspired approach offers a new framework for sequence analysis.
    • Anomalous word distributions are key challenges for current sequence modeling techniques.