Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

21.4K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
21.4K
RNA-seq03:21

RNA-seq

12.4K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
12.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Continuous multi-omics pathway enrichment analysis resolves hidden functional heterogeneity.

Briefings in bioinformatics·2026
Same author

Improved RNA-DNA interaction calling suggests RNA-based gene regulation of phenotypic transitions.

Nucleic acids research·2026
Same author

Predicting gene-specific regulation with transcriptomic and epigenetic single-cell data.

Bioinformatics (Oxford, England)·2026
Same author

Probing transcription factor subsets in gene regulatory networks.

Algorithms for molecular biology : AMB·2026
Same author

Quantifying Components in a Model Vaccine with Machine-Learning-Augmented Raman Spectroscopy.

Analytical chemistry·2026
Same author

Cell type-specific epigenetic regulatory circuitry of coronary artery disease loci.

Nature communications·2026
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
Same journal

KASSPer: Kinase Active Site Structure Prediction using Protein and Ligand Language Models and Its Application to Virtual Screening.

Bioinformatics (Oxford, England)·2026
Same journal

IDR searcher: a search engine solution for public image resources.

Bioinformatics (Oxford, England)·2026
Same journal

KCFtools: Rapid alignment-free method for introgression screening and GWAS using k-mer profiles.

Bioinformatics (Oxford, England)·2026
Same journal

Meta2DB: Curated shotgun metagenomic feature sets and metadata for health state prediction.

Bioinformatics (Oxford, England)·2026
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Mar 21, 2026

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms
10:41

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms

Published on: May 9, 2017

9.7K

Informed kmer selection for de novo transcriptome assembly.

Dilip A Durai1, Marcel H Schulz1

  • 1Cluster of Excellence on Multimodal Computing and Interaction, Saarland University, Saarbrücken, 66123, Germany Department for Computational Biology and Applied Algorithmics, Max Planck Institute for Informatics, Saarbrücken, 66123, Germany.

Bioinformatics (Oxford, England)
|May 7, 2016
PubMed
Summary
This summary is machine-generated.

This study introduces an automated method to optimize de novo transcriptome assembly by determining the ideal k-mer value. This approach saves computational time without sacrificing assembly quality, making multi-k-mer methods more accessible.

More Related Videos

Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

9.2K
Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies
12:08

Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies

Published on: August 20, 2021

5.9K

Related Experiment Videos

Last Updated: Mar 21, 2026

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms
10:41

Leveraging CyVerse Resources for De Novo Comparative Transcriptomics of Underserved Non-model Organisms

Published on: May 9, 2017

9.7K
Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

9.2K
Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies
12:08

Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies

Published on: August 20, 2021

5.9K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • De novo transcriptome assembly is crucial for RNA-seq workflows, especially for non-model organisms.
  • Most assemblers utilize de Bruijn graphs (DBGs), where assembly quality depends heavily on the chosen k-mer length.
  • Current multi-k-mer approaches merge assemblies from various k-mer values but lack an optimal strategy for selecting these values.

Purpose of the Study:

  • To develop an automated method for identifying the optimal k-mer value to stop transcriptome assembly.
  • To improve the efficiency and reliability of multi-k-mer assembly strategies.
  • To reduce computational time in transcriptome assembly without compromising assembly quality.

Main Methods:

  • Investigated the contribution of individual k-mer values in multi-k-mer assembly.
  • Employed comparative clustering of related assemblies to estimate the importance of additional k-mer assemblies.
  • Developed a model-fit algorithm to predict the optimal k-mer value for assembly termination.

Main Results:

  • Demonstrated that comparative clustering can effectively estimate the importance of k-mer assemblies.
  • Successfully predicted the k-mer value at which further assembly is unnecessary.
  • Validated the approach across different de novo assemblers, coverage values, and read lengths.
  • Identified a post-processing step that significantly enhances multi-k-mer assembly quality.

Conclusions:

  • An automated method was developed to limit k-mer values in transcriptome assembly.
  • The method achieves significant savings in assembly time with minimal loss in assembly quality.
  • This advancement makes multi-k-mer assembly methods more practical and user-friendly.