



MACFP: Maximal Approximate Consecutive Frequent Pattern Mining under Edit Distance.

Jingbo Shang¹, Jian Peng¹, Jiawei Han¹

  • ¹Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA.

Proceedings of the ... SIAM International Conference on Data Mining. SIAM International Conference on Data Mining | February 9, 2017
PubMed
Summary
This summary is machine-generated.

This study introduces an efficient algorithm for mining approximate consecutive frequent patterns under edit distance, addressing a limitation of existing methods, which ignore insertions and deletions (indels). The algorithm checks the support threshold in linear time, which reduces the computational cost of analyzing long biological sequences.


Area of Science:

  • Bioinformatics
  • Computational Biology
  • Data Mining

Background:

  • Frequent pattern mining is crucial for biological sequence analysis, time series, and network logs.
  • Existing methods often overlook insertions/deletions (indels), limiting approximate string pattern discovery.
  • High computational complexity hinders approximate pattern mining under edit distance, especially for large DNA sequences.
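To make the role of indels concrete, the following minimal sketch compares a substitution-only measure (Hamming distance) with edit distance on two short DNA strings. The function names and the example strings are illustrative assumptions, not taken from the paper; the edit-distance routine is the textbook dynamic program, not the authors' algorithm.

```python
def hamming_distance(a: str, b: str) -> int:
    """Count mismatching positions; defined only for equal-length strings."""
    if len(a) != len(b):
        raise ValueError("Hamming distance needs equal-length strings")
    return sum(x != y for x, y in zip(a, b))


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of substitutions, insertions, deletions."""
    prev = list(range(len(b) + 1))  # distance from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


# A one-base shift makes every position mismatch, yet only two indels are needed.
print(hamming_distance("ACGTACGT", "CGTACGTA"))  # 8
print(edit_distance("ACGTACGT", "CGTACGTA"))     # 2
```

The two strings differ by a single shifted base, which a substitution-only measure scores as a complete mismatch. That gap is the reason indel-aware matching matters for sequence data.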

Purpose of the Study:

  • To address the challenge of approximate consecutive frequent pattern mining under edit distance.
  • To develop an efficient algorithm for identifying substring patterns with indels in long sequences.
  • To reduce the computational burden of Maximal Approximate Consecutive Frequent Pattern Mining (MACFP).

Main Methods:

  • Formulation of the Maximal Approximate Consecutive Frequent Pattern Mining (MACFP) problem.
  • Proposal of a novel algorithm with linear time complexity for support threshold checking.
  • Integration of indexing and searching techniques for efficient pattern discovery.
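As a point of reference for what support checking involves, here is a baseline sketch that counts approximate occurrences of one candidate pattern in a sequence using the classic approximate-substring dynamic program (Sellers' algorithm). It runs in time proportional to pattern length times sequence length and is not the linear-time method proposed in the paper. The names `approx_end_positions`, `is_frequent`, `k`, and `min_support`, and the choice to count support as the number of matching end positions, are assumptions made for illustration only.

```python
def approx_end_positions(pattern: str, text: str, k: int) -> list[int]:
    """Return 1-based end positions j such that some substring of `text`
    ending at j is within edit distance k of `pattern`."""
    m = len(pattern)
    col = list(range(m + 1))  # column for the empty text prefix
    ends = []
    for j, tc in enumerate(text, start=1):
        new = [0]  # row 0 stays 0: a match may start at any text position
        for i in range(1, m + 1):
            cost = 0 if pattern[i - 1] == tc else 1
            new.append(min(col[i] + 1,          # extra character in text
                           new[i - 1] + 1,      # pattern character missing
                           col[i - 1] + cost))  # match or substitution
        col = new
        if col[m] <= k:
            ends.append(j)
    return ends


def is_frequent(pattern: str, text: str, k: int, min_support: int) -> bool:
    """Assumed support definition: number of approximate end positions."""
    return len(approx_end_positions(pattern, text, k)) >= min_support


print(approx_end_positions("ACGT", "TTACGTTT", 0))  # [6]
print(approx_end_positions("ACGT", "ACT", 1))       # [3]
```

Repeating this check for every candidate pattern is what makes the naive approach expensive on long sequences, which is the cost the indexing and searching techniques listed above are meant to reduce.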

Main Results:

  • A significant reduction in computational complexity for approximate consecutive frequent pattern mining.
  • Demonstrated effectiveness and efficiency through comprehensive experiments on sequence pattern analysis.
  • Successful application in a cancer genomics study, showcasing practical utility.

Conclusions:

  • The developed algorithm provides an efficient solution for approximate consecutive frequent pattern mining under edit distance.
  • The approach overcomes limitations of existing methods by effectively handling indels.
  • The algorithm shows promise for applications in bioinformatics and genomics, particularly with large datasets.