Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

RNA-seq03:21

RNA-seq

11.6K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
11.6K
Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

491
Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...
491
Sanger Sequencing01:57

Sanger Sequencing

771.9K
DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...
771.9K
Nucleic Acid Structure01:25

Nucleic Acid Structure

8.2K
The pentose sugar in DNA is deoxyribose, while in RNA the pentose sugar is ribose. The difference between the sugars is the presence of the hydroxyl group on the ribose's second carbon and a hydrogen on the deoxyribose's second carbon. The phosphate residue attaches to the hydroxyl group of the 5′ carbon of one sugar and the hydroxyl group of the 3′ carbon of the sugar of the next nucleotide, which forms  a 5′ to 3′ phosphodiester linkage.
DNA Structure
DNA...
8.2K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.7K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.7K
Next-generation Sequencing03:00

Next-generation Sequencing

97.3K
The first human genome sequencing project cost $2.7 billion and was declared complete in 2003, after 15 years of international cooperation and collaboration between several research teams and funding agencies. Today, with the advent of next-generation sequencing technologies, the cost and time of sequencing a human genome have dropped over 100 fold.
Next-Generation Sequencing Methods
Although all next-generation methods use different technologies, they all share a set of standard features....
97.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

High-throughput sequencing-based profiling of endophytic bacterial community composition and diversity in seeds of Yunnan cytoplasmic male-sterile rice.

Antonie van Leeuwenhoek·2026
Same author

A Novel Glucomannan from <i>Dendrobium officinale</i> Inhibits Obesity and Improves Insulin Sensitivity through Regulating Adipogenesis and Lipid Metabolism.

Journal of agricultural and food chemistry·2025
Same author

Post-subsidy Era: Potential for Carbon Pricing in Industrial Fisheries among Global Major Fishing Countries.

Environmental science & technology·2025
Same author

Untargeted metabolomics reveals the inhibition effect of a high-fat diet on colorectal cancer tumorigenesis in obesity-resistant mice <i>via</i> regulating bile acid, glutathione, and glycerophospholipid metabolisms.

Food & function·2025
Same author

Green Tea Epigallocatechin-3-gallate Ameliorates Lipid Accumulation and Obesity-Associated Metabolic Syndrome via Regulating Autophagy and Lipolysis in Preadipocytes and Adipose Tissue.

Journal of agricultural and food chemistry·2025
Same author

UPLC-Q-TOF/MS-based study on chemical composition, in vivo metabolites, and tissue distribution of ethanol extract of Ganoderma lucidum.

Journal of pharmaceutical and biomedical analysis·2025
Same journal

Association between intestinal functional disorders and anal fistula: evidence from a retrospective case-control study.

PeerJ·2026
Same journal

Automated recognition of Meso-Cenozoic foraminifera from Senegalese sedimentary deposits using convolutional neural networks.

PeerJ·2026
Same journal

Genome-wide analysis of <i>HSP70</i> gene superfamily in kelp (<i>Saccharina japonica</i>): identification, characterization, and heat stress-responsive expression profiles.

PeerJ·2026
Same journal

Morphological and molecular evidence of the Antarctic sleeper shark <i>Somniosus antarcticus</i> (Somniosidae) in northern Chile.

PeerJ·2026
Same journal

Stroboscopic balance training enhances dynamic stability and postural control in collegiate badminton players: a randomized controlled trial.

PeerJ·2026
Same journal

Frequent exposure to biologics is associated with small intestinal bacterial overgrowth in patients with Crohn's disease: a retrospective case-control study.

PeerJ·2026
See all related articles

Related Experiment Video

Updated: Dec 23, 2025

DNA Sequence Recognition by DNA Primase Using High-Throughput Primase Profiling
08:04

DNA Sequence Recognition by DNA Primase Using High-Throughput Primase Profiling

Published on: October 8, 2019

9.0K

CFSP: a collaborative frequent sequence pattern discovery algorithm for nucleic acid sequence classification.

He Peng1

  • 1School of Information Science and Engineering, Xiamen University, Xiamen, Fujian, China.

Peerj
|April 29, 2020
PubMed
Summary
This summary is machine-generated.

A new algorithm, CFSP, efficiently extracts frequent subsequences from nucleic acid data, improving motif discovery for biological applications. This method captures long-distance correlations and handles sequence variations effectively.

Keywords:
Mutational information miningLong range correlationSequence feature extraction

More Related Videos

Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

9.0K
Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.4K

Related Experiment Videos

Last Updated: Dec 23, 2025

DNA Sequence Recognition by DNA Primase Using High-Throughput Primase Profiling
08:04

DNA Sequence Recognition by DNA Primase Using High-Throughput Primase Profiling

Published on: October 8, 2019

9.0K
Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

9.0K
Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.4K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Conserved nucleic acid sequences are vital for transcriptional regulation and serve as biomarkers.
  • Motif extraction from large-scale data is crucial for developing predictive models.
  • Template-based classification often outperforms feature extraction methods like N-gram.

Purpose of the Study:

  • To propose a novel feature extraction algorithm for discovering frequent subsequences in nucleic acid data.
  • To improve motif discovery by capturing long-distance correlations and handling sequence variations.
  • To enhance the creation of predictive models for biochemical properties and non-coding RNAs.

Main Methods:

  • Introduced a Teiresias-like algorithm called CFSP (Common Frequent Sub-sequence Pattern) for motif discovery.
  • CFSP allows for larger gaps in sequence pairs compared to existing algorithms.
  • Incorporated mutation information for fuzzy matching to accommodate single nucleotide variants and indels.

Main Results:

  • The CFSP algorithm successfully identified frequent subsequence combinations, capturing long-distance correlations.
  • Validated the algorithm with case studies involving miRNA, piRNA, and Sigma 54 promoters.
  • Demonstrated superior performance compared to existing feature descriptor-based methods.

Conclusions:

  • The proposed CFSP algorithm is effective for nucleic acid sequence prediction.
  • CFSP offers improved motif extraction capabilities, outperforming current methods.
  • The algorithm is publicly available in C++ and shell script.