Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

20.5K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
20.5K
DNA as a Genetic Template02:05

DNA as a Genetic Template

27.3K
Two structural features of the DNA molecule provide a basis for the mechanisms of heredity: the four nucleotide bases and its double-stranded nature. The Watson-Crick model of double-helical DNA structure, proposed in 1952, drew heavily upon the X-ray crystallography work of researchers Rosalind Franklin and Maurice Wilkins. Watson, Crick, and Wilkins jointly received the Nobel Prize in Physiology or Medicine for their work in 1962. Franklin was, controversially, excluded from the prize for...
27.3K
DNA as a Genetic Template02:05

DNA as a Genetic Template

9.3K
9.3K
Genomics02:02

Genomics

39.6K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
39.6K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.8K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.8K
Genomic DNA in Prokaryotes00:46

Genomic DNA in Prokaryotes

48.2K
The genome of most prokaryotic organisms consists of double-stranded DNA organized into one circular chromosome in a region of cytoplasm called the nucleoid. The chromosome is tightly wound, or supercoiled, for efficient storage. Prokaryotes also contain other circular pieces of DNA called plasmids. These plasmids are smaller than the chromosome and often carry genes that confer adaptive functions, such as antibiotic resistance.
Genomic Diversity in Bacteria
Although bacterial genomes are much...
48.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Decoding RNA N6-Methyladenosine Methylome of Wheat Using Machine Learning and Nanopore Direct RNA Sequencing.

Genomics, proteomics & bioinformatics·2026
Same author

Translating functional molecular knowledge into crop-breeding success.

Nature reviews. Genetics·2026
Same author

Observational epidemiological studies can mitigate genetic confounding with a genetic relatedness matrix.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Phenotypic complexity determines the predictability of molecular convergence.

bioRxiv : the preprint server for biology·2026
Same author

Discovery of additional ancient genome duplications in yeasts.

Current biology : CB·2026
Same author

Characterizing and mitigating confounding by unreplicated evolutionary events in phylogenetic regression.

bioRxiv : the preprint server for biology·2026
Same journal

Genetic Impacts on Variability of Body Fat Distribution Uncover Gene-Environment and Gene-Gene Interactions.

bioRxiv : the preprint server for biology·2026
Same journal

16S ribosomal RNA modification drives transcript-specific translation efficiency.

bioRxiv : the preprint server for biology·2026
Same journal

FlcE latches onto the FliL-stator complex to turbocharge flagellar motility in <i>Borrelia burgdorferi</i>.

bioRxiv : the preprint server for biology·2026
Same journal

Synaptic pruning, myelination and the emergence of psychiatric disorders in late adolescence.

bioRxiv : the preprint server for biology·2026
Same journal

Structural and functional insights into the Rcs phosphorelay.

bioRxiv : the preprint server for biology·2026
Same journal

The structural basis of RanGAP1 regulation and catalysis in nuclear transport.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Jan 10, 2026

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine
10:40

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine

Published on: December 22, 2017

10.9K

GeneCAD: Plant Genome Annotation with a DNA Foundation Model.

Zong-Yan Liu1, Ana Berthel2, Eric Czech3

  • 1Section of Plant Breeding and Genetics, School of Integrative Plant Sciences, Cornell University, Ithaca, NY USA 14853.

Biorxiv : the Preprint Server for Biology
|November 24, 2025
PubMed
Summary
This summary is machine-generated.

GeneCAD is a new DNA-only method for plant genome annotation. It accurately predicts gene models, overcoming challenges like polyploidy and repeats, improving accuracy over existing tools.

More Related Videos

Analyzing and Building Nucleic Acid Structures with 3DNA
16:24

Analyzing and Building Nucleic Acid Structures with 3DNA

Published on: April 26, 2013

21.2K
Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

10.3K

Related Experiment Videos

Last Updated: Jan 10, 2026

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine
10:40

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine

Published on: December 22, 2017

10.9K
Analyzing and Building Nucleic Acid Structures with 3DNA
16:24

Analyzing and Building Nucleic Acid Structures with 3DNA

Published on: April 26, 2013

21.2K
Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

10.3K

Area of Science:

  • Genomics
  • Bioinformatics

Background:

  • Accurate plant genome annotation is challenging due to polyploidy and repetitive sequences.
  • Existing homology- and RNA-based annotation methods struggle with these complexities.

Purpose of the Study:

  • To develop a sequence-only method for accurate plant gene model prediction directly from DNA.
  • To overcome limitations of current annotation pipelines in plants.

Main Methods:

  • GeneCAD utilizes a plant DNA foundation model (PlantCAD2) and a ModernBERT encoder.
  • A chromosome-wide conditional random field enforces biological constraints.
  • A protein language model filters repeat-driven open reading frames.
  • Public annotations are refined using a masked-motif score and fine-tuned on high-quality references.

Main Results:

  • GeneCAD improves transcript-level F1 scores by 8-10% compared to Helixer and BRAKER3 across five angiosperms.
  • The method increases exact match transcripts and refines boundaries at start/stop codons and splice junctions.
  • Demonstrated accuracy on the allotetraploid *Nicotiana tabacum*.

Conclusions:

  • GeneCAD offers an accurate and scalable approach for plant gene model prediction using DNA sequence alone.
  • It eliminates the need for species-matched RNA-seq or proteomics data.
  • Provides biologically coherent gene models essential for plant genomics research.