Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

18.4K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
18.4K
Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

17.7K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
17.7K
Genome Copying Errors02:46

Genome Copying Errors

4.9K
DNA replication is a well-evolved process that copies millions of base pairs with high fidelity during each cell division. Occasionally a wrong base or a long stretch of wrong bases may get added to the daughter strands. If the errors are left unchecked, cells might accumulate several mutations that might endanger their  survival. Therefore, the copying errors are checked and repaired at three levels.
4.9K
Genetic Variation01:25

Genetic Variation

1.1K
Genetic variation is the diversity in DNA sequences found among individuals of the same species. This diversity is crucial for a species' survival because it helps organisms adapt to environmental changes. Genetic variation begins with fertilization, where an egg and sperm cell merge. Each of these cells carries 23 chromosomes, up to 46 in the fertilized egg. Chromosomes are long DNA strands that contain genes, the basic units of heredity.
Genes exist in different versions called alleles,...
1.1K
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

15.1K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
15.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Prussian Blue Analogue-Derived NiFe Sulfide Enabling Synergistic ORR/OER via Tuned Electronic Structures for Zn-Air Batteries.

Langmuir : the ACS journal of surfaces and colloids·2026
Same author

Genome-wide analysis of the LAR gene family and the role of OvLAR71 in proanthocyanidin biosynthesis in Onobrychis viciifolia.

Plant physiology and biochemistry : PPB·2026
Same author

Statistics and AI - A Fireside Conversation.

Harvard data science review·2026
Same author

GatedGeoGO:Multi-Modal Geometry-Aware Network with Gated Fusion and GO Semantic Attention for Protein Function Prediction.

Journal of chemical information and modeling·2026
Same author

A comparative study of machine learning models for microbiome-based diagnosis and multi-class staging of colorectal cancer.

Scientific reports·2026
Same author

Jujube Polysaccharide Promotes Neuroprotection and Longevity in <i>Caenorhabditis elegans</i> Through Oxidative Stress Resistance and Stress-Response Signaling.

International journal of molecular sciences·2026
Same journal

Elective genomic sequencing for adults in research, clinical and commercial contexts.

BMC medical genomics·2026
Same journal

Targeting acetylcholine: a novel strategy for treating lung adenocarcinoma.

BMC medical genomics·2026
Same journal

Role of the PTPN22 C1858T (R263Q) variant in tuberculosis susceptibility: genetic and functional evidence from a South Asian cohort.

BMC medical genomics·2026
Same journal

Improved prognostic survival models for pediatric medulloblastoma using high dimensional gene expression data.

BMC medical genomics·2026
Same journal

Identification of a novel pathogenic variant in MYLK in an Iranian family with non-syndromic familial aortic aneurysm and dissection by whole-exome sequencing and literature review.

BMC medical genomics·2026
Same journal

Genomic determinants of fluoroquinolone resistance in Escherichia coli in Nigeria: dominance of QRDR mutations and limited contribution of PMQR in a cross-sectional study.

BMC medical genomics·2026
See all related articles

Related Experiment Video

Updated: Dec 10, 2025

Detection of Copy Number Alterations Using Single Cell Sequencing
09:45

Detection of Copy Number Alterations Using Single Cell Sequencing

Published on: February 17, 2017

12.0K

A machine learning framework for genotyping the structural variations with copy number variant.

Tian Zheng1, Xiaoyan Zhu2, Xuanping Zhang1

  • 1School of Computer Science and Technology, Xi'an Jiaotong University, Xi'an, 710049, China.

BMC Medical Genomics
|August 29, 2020
PubMed
Summary
This summary is machine-generated.

This study introduces a new machine learning method for accurate genotyping of structural variations in cancer genomes, even with copy number variants (CNVs). The developed approach outperforms existing tools in accuracy and efficiency for next-generation sequencing data analysis.

Keywords:
Cancer genomicsCopy number variantGenotyping structural variationMulticlass relevance vector machineNGS data analysis

More Related Videos

Array Comparative Genomic Hybridization Array CGH for Detection of Genomic Copy Number Variants
09:16

Array Comparative Genomic Hybridization Array CGH for Detection of Genomic Copy Number Variants

Published on: February 21, 2015

20.2K
Pre-Implantation Genetic Testing for Aneuploidy on a Semiconductor Based Next-Generation Sequencing Platform
09:30

Pre-Implantation Genetic Testing for Aneuploidy on a Semiconductor Based Next-Generation Sequencing Platform

Published on: August 17, 2022

3.4K

Related Experiment Videos

Last Updated: Dec 10, 2025

Detection of Copy Number Alterations Using Single Cell Sequencing
09:45

Detection of Copy Number Alterations Using Single Cell Sequencing

Published on: February 17, 2017

12.0K
Array Comparative Genomic Hybridization Array CGH for Detection of Genomic Copy Number Variants
09:16

Array Comparative Genomic Hybridization Array CGH for Detection of Genomic Copy Number Variants

Published on: February 21, 2015

20.2K
Pre-Implantation Genetic Testing for Aneuploidy on a Semiconductor Based Next-Generation Sequencing Platform
09:30

Pre-Implantation Genetic Testing for Aneuploidy on a Semiconductor Based Next-Generation Sequencing Platform

Published on: August 17, 2022

3.4K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Genotyping structural variations in next-generation sequencing data is computationally challenging.
  • Copy number variants (CNVs) in cancer genomes reduce the accuracy of existing genotyping methods by causing biases in sequencing coverage and variant allelic frequency.
  • CNVs can lead to misinterpretation of heterozygotes as homozygotes and misjudgment of other data signals like split-mapped reads.

Purpose of the Study:

  • To develop a computational method for accurate genotyping of indels within CNV regions.
  • To address the challenge of coexisting structural variations and CNVs in cancer genomes.
  • To improve the accuracy and reliability of structural variation genotyping in complex genomic regions.

Main Methods:

  • Proposed a machine learning framework to incorporate multiple data features and their interactions for genotyping.
  • Extracted fifteen types of classification features as input, considering five variant types: normal homozygote, homozygous variant, heterozygous variant without CNV, heterozygous variant with CNV on mutated haplotype, and heterozygous variant with CNV on wild haplotype.
  • Utilized a Multiclass Relevance Vector Machine (M-RVM) combined with feature distribution characteristics.

Main Results:

  • The proposed method was applied to simulated and real data, outperforming existing software (Gindel, Facets, GATK) and other machine learning algorithms (SVM, Naïve Bayes, BP Neural Network).
  • Demonstrated superior accuracy, stability, and efficiency compared to current popular genotyping tools.
  • The method effectively handles the complexity introduced by coexisting CNVs.

Conclusions:

  • Genotyping structural variations in CNV regions requires a multi-feature approach beyond traditional methods.
  • The proposed method is a practical algorithm for correcting genotype structural variations with CNVs in next-generation sequencing data.
  • The developed computational method offers improved accuracy for complex genomic analyses.