Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

18.7K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
18.7K
Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

18.3K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
18.3K
Chromatin Position Affects Gene Expression02:35

Chromatin Position Affects Gene Expression

24.8K
Chromatin is the massive complex of DNA and proteins packaged inside the nucleus. The complexity of chromatin folding and how it is packaged inside the nucleus greatly influences  access to genetic information. Generally, the nucleus' periphery is considered transcriptionally repressive, while the cell's interior is considered a transcriptionally active area. 
Topologically Associated Domains (TADs)
The 3-dimensional positioning of chromatin in the nucleus influences the...
24.8K
Cluster Sampling Method01:20

Cluster Sampling Method

14.4K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
14.4K
Vesicular Tubular Clusters01:45

Vesicular Tubular Clusters

3.2K
After budding out from the ER membrane, some COPII vesicles lose their coat and fuse with one another to form larger vesicles and interconnected tubules called vesicular tubular clusters or VTCs. These clusters constitute a compartment at the ER-Golgi interface known as ERGIC (Endoplasmic Reticulum Golgi Intermediate Compartment). The ERGIC is a mobile membrane-bound cargo transport system that sorts proteins secreted from ER and delivers them to the Golgi.
With the help of motor proteins such...
3.2K
Position-effect Variegation02:32

Position-effect Variegation

7.1K
In 1928, a German botanist Emil Heitz observed the moss nuclei with a DNA binding dye. He observed that while some chromatin regions decondense and spread out in the interphase nucleus, others do not. He termed them euchromatin and heterochromatin, respectively. He proposed that the heterochromatin regions reflect a functionally inactive state of the genome. It was later confirmed that heterochromatin is transcriptionally repressed, and euchromatin is transcriptionally active chromatin.
7.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Pattern matching with Elastic-Degenerate strings and Elastic-Founder graphs.

Algorithms for molecular biology : AMB·2026
Same author

Faster run-length compressed suffix arrays.

Oasics : openaccess series in informatics·2026
Same author

RettDb: the Rett syndrome omics database to navigate the Rett syndrome genomic landscape.

Database : the journal of biological databases and curation·2024
Same author

Pangenome comparison via ED strings.

Frontiers in bioinformatics·2024
Same author

Space-time Trade-offs for the LCP Array of Wheeler DFAs.

International Symposium on String Processing and Information Retrieval : SPIRE ... : proceedings. SPIRE (Symposium)·2024
Same author

Computing matching statistics on Wheeler DFAs.

Proceedings. Data Compression Conference·2024
Same journal

Haplotype-aware long-read error correction.

Algorithms for molecular biology : AMB·2026
Same journal

Extension of partial atom-to-atom maps: uniqueness and algorithms.

Algorithms for molecular biology : AMB·2026
Same journal

Lossless pangenome indexing using tag arrays.

Algorithms for molecular biology : AMB·2026
Same journal

Dolphyin: a combinatorial algorithm for identifying 1-Dollo phylogenies in cancer.

Algorithms for molecular biology : AMB·2026
Same journal

Probing transcription factor subsets in gene regulatory networks.

Algorithms for molecular biology : AMB·2026
Same journal

Comparing the ability of embedding methods on metabolic hypergraphs for capturing taxonomy-based features.

Algorithms for molecular biology : AMB·2026
See all related articles

Related Experiment Video

Updated: Jan 28, 2026

Spatial Separation of Molecular Conformers and Clusters
10:37

Spatial Separation of Molecular Conformers and Clusters

Published on: January 9, 2014

11.7K

SNPs detection by eBWT positional clustering.

Nicola Prezza1, Nadia Pisanti1,2, Marinella Sciortino3

  • 11Dipartimento di Informatica, University of Pisa, Pisa, Italy.

Algorithms for Molecular Biology : AMB
|March 7, 2019
PubMed
Summary
This summary is machine-generated.

We introduce positional clustering theory to analyze sequencing data. This alignment-free method efficiently identifies single nucleotide polymorphisms (SNPs) directly from raw reads, offering a promising approach for variant calling.

Keywords:
Assembly-freeBWTLCP arrayReference-freeSNPs

More Related Videos

CRISPR Gene Editing Tool for MicroRNA Cluster Network Analysis
10:40

CRISPR Gene Editing Tool for MicroRNA Cluster Network Analysis

Published on: April 25, 2022

2.9K
Differentiation of Human Pluripotent Stem Cells into Insulin-Producing Islet Clusters
08:41

Differentiation of Human Pluripotent Stem Cells into Insulin-Producing Islet Clusters

Published on: June 23, 2023

4.3K

Related Experiment Videos

Last Updated: Jan 28, 2026

Spatial Separation of Molecular Conformers and Clusters
10:37

Spatial Separation of Molecular Conformers and Clusters

Published on: January 9, 2014

11.7K
CRISPR Gene Editing Tool for MicroRNA Cluster Network Analysis
10:40

CRISPR Gene Editing Tool for MicroRNA Cluster Network Analysis

Published on: April 25, 2022

2.9K
Differentiation of Human Pluripotent Stem Cells into Insulin-Producing Islet Clusters
08:41

Differentiation of Human Pluripotent Stem Cells into Insulin-Producing Islet Clusters

Published on: June 23, 2023

4.3K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Rapid advancements in sequencing technology necessitate efficient data structures for storing and analyzing raw sequencing reads.
  • There is a growing demand for alignment-free and reference-free variant calling methods that operate directly on indexed raw reads.
  • Current methods often rely on reference genomes, limiting their applicability in certain research scenarios.

Purpose of the Study:

  • To develop a novel theoretical framework, positional clustering, for analyzing sequencing data.
  • To design and implement an alignment-free and reference-free method for single nucleotide polymorphism (SNP) calling.
  • To evaluate the efficacy of the proposed method on both synthetic and real sequencing data.

Main Methods:

  • Development of the positional clustering theory based on the extended Burrows-Wheeler Transform (eBWT) and LCP array.
  • Design and implementation of an alignment-free and reference-free SNP calling pipeline utilizing the eBWT and LCP arrays.
  • Experimental validation using synthetic datasets and real sequencing data to assess performance and accuracy.

Main Results:

  • The positional clustering theory accurately describes how bases covering the same genome position cluster in the eBWT.
  • A simple scan of the eBWT and LCP arrays allows for the detection of SNPs within these clusters.
  • The implemented tool provides an intrinsic reference-free evaluation of accuracy by reporting SNP coverage.

Conclusions:

  • The positional clustering framework is effective for identifying SNPs directly from raw sequencing data.
  • This approach offers a promising avenue for calling other types of genetic variants without relying on a reference genome.
  • The developed software, ebwt2snp, is freely available for academic use, facilitating further research in this area.