RCANE: a deep learning algorithm for whole-genome pan-cancer somatic copy number aberration prediction using RNA-seq data

  • 0Graduate Group of Applied Mathematics and Computational Science, University of Pennsylvania, Philadelphia, PA, USA.

|

|

Summary

This summary is machine-generated.

RCANE, a deep-learning tool, predicts genome-wide somatic copy-number aberrations (SCNAs) from RNA sequencing data. This approach offers a cost-effective method for cancer research and diagnostics.

Area Of Science

  • Genomics
  • Bioinformatics
  • Cancer Research

Background

  • Transcriptome sequencing (RNA-seq) is crucial for cancer research, analyzing gene expression.
  • Somatic copy-number aberrations (SCNAs) are key drivers of cancer development.
  • Inferring SCNAs from RNA-seq offers a cost-effective alternative to DNA assays.

Purpose Of The Study

  • To introduce RCANE, a deep-learning framework for predicting genome-wide SCNAs using only RNA-seq data.
  • To demonstrate RCANE's effectiveness across diverse cancer types.
  • To provide a scalable and robust solution for SCNA profiling.

Main Methods

  • Developed a deep-learning framework named RCANE.
  • Trained RCANE on The Cancer Genome Atlas (TCGA) and DepMap cell-line cohorts.
  • Evaluated RCANE's performance against existing SCNA inference methods.

Main Results

  • RCANE accurately predicts genome-wide SCNAs from RNA-seq data.
  • The framework demonstrates superior performance compared to current approaches.
  • RCANE provides a scalable and robust solution for SCNA profiling.

Conclusions

  • RCANE enhances SCNA profiling using only RNA-seq data.
  • This method improves cancer diagnostics and therapeutic decision-making.
  • RCANE offers a valuable tool for cancer research and clinical applications.

Related Concept Videos

Comparing Copy Number Variations and SNPs 02:26

18.6K

Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...

RNA-seq 03:21

11.8K

RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...

Genome Copying Errors 02:46

5.0K

DNA replication is a well-evolved process that copies millions of base pairs with high fidelity during each cell division. Occasionally a wrong base or a long stretch of wrong bases may get added to the daughter strands. If the errors are left unchecked, cells might accumulate several mutations that might endanger their  survival. Therefore, the copying errors are checked and repaired at three levels.

Proofreading - DNA polymerase enzymes, for example, introduce 1 incorrect nucleotide...