Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

RNA-seq03:21

RNA-seq

10.6K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
10.6K
RACE - Rapid Amplification of cDNA Ends02:35

RACE - Rapid Amplification of cDNA Ends

6.7K
Rapid Amplification of cDNA Ends, or RACE, is one of the most effective methods to obtain a full-length cDNA from an mRNA sequence between a known internal region to the unknown sequence at the 5’ or 3’ end. The unknown region is cloned in the cDNA by a gene-specific primer that binds the known end, and a hybrid primer that attaches a predefined anchor sequence to the unknown end of the cDNA. The sequence in between is amplified by PCR with an anchor primer and a gene-specific...
6.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Potentially functional variants of PLCE1 identified by GWASs contribute to gastric adenocarcinoma susceptibility in an eastern Chinese population.

PloS one·2012
Same author

Pennogenin tetraglycoside stimulates secretion-dependent activation of rat platelets: evidence for critical roles of adenosine diphosphate receptor signal pathways.

Thrombosis research·2012
Same author

Polymorphisms in the XPG gene and risk of gastric cancer in Chinese populations.

Human genetics·2012
Same author

[Influences of D-galactosamine and lipopolysaccharide on liver tissue regeneration and repair in mice with partial hepatectomy].

Nan fang yi ke da xue xue bao = Journal of Southern Medical University·2012
Same author

Large and giant ventral paraclinoid carotid aneurysms: surgical techniques, complications and outcomes.

Clinical neurology and neurosurgery·2012
Same author

Association of mitochondrial DNA variations with lung cancer risk in a Han Chinese population from southwestern China.

PloS one·2012
Same journal

Transcriptomic analysis reveals FcγR-mediated phagocytosis as a key pathway for the anti-inflammatory action of <i>Polygonatum sibiricum</i> polysaccharides in loach.

Frontiers in genetics·2026
Same journal

A novel <i>ABO</i> splice site variant underlying the A<sub>3</sub> phenotype: immunogenetic basis and functional dissection.

Frontiers in genetics·2026
Same journal

Case Report: Identification of two novel <i>ALMS1</i> variants in a patient with a ciliopathy resembling Alström syndrome.

Frontiers in genetics·2026
Same journal

Integrative analysis identifies Hspa5 as a key regulator of the ERS/UPR-immune axis in spinal cord injury.

Frontiers in genetics·2026
Same journal

Evaluation of genomic selection to improve survival of eastern oysters infected with <i>Perkinsus marinus</i>.

Frontiers in genetics·2026
Same journal

A rescue assay for genetic diagnosis of oculocutaneous albinism using melanocytic MNT1 knock-out cells.

Frontiers in genetics·2026
See all related articles

Related Experiment Video

Updated: Oct 19, 2025

Author Spotlight: Advancing Alzheimer's Research &#8211; Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.4K

AdImpute: An Imputation Method for Single-Cell RNA-Seq Data Based on Semi-Supervised Autoencoders.

Li Xu1,2, Yin Xu3, Tong Xue1

  • 1College of Computer Science and Technology, Harbin Engineering University, Harbin, China.

Frontiers in Genetics
|September 27, 2021
PubMed
Summary
This summary is machine-generated.

Single-cell RNA sequencing allows scientists to measure gene activity in individual cells, but these datasets often contain many missing values that hinder analysis. This article introduces a new computational tool called AdImpute that uses machine learning to fill in these gaps more accurately. By combining existing techniques with a specialized neural network, the method improves data quality while preserving important biological signals. Tests show that this approach outperforms several other common methods in clustering accuracy. AdImpute provides a reliable way to clean complex genetic data for better biological insights.

Keywords:
autoencoderimputation methodmissing value fillingscRNA-seqsemi-supervised learningmachine learninggene expressionneural networksdata imputation

Frequently Asked Questions

More Related Videos

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues
10:12

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

18.7K
RNA-Seq Analysis of Differential Gene Expression in Electroporated Chick Embryonic Spinal Cord
11:13

RNA-Seq Analysis of Differential Gene Expression in Electroporated Chick Embryonic Spinal Cord

Published on: November 1, 2014

14.8K

Related Experiment Videos

Last Updated: Oct 19, 2025

Author Spotlight: Advancing Alzheimer's Research &#8211; Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.4K
Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues
10:12

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

18.7K
RNA-Seq Analysis of Differential Gene Expression in Electroporated Chick Embryonic Spinal Cord
11:13

RNA-Seq Analysis of Differential Gene Expression in Electroporated Chick Embryonic Spinal Cord

Published on: November 1, 2014

14.8K

Area of Science:

  • Computational biology and AdImpute bioinformatics research
  • Genomics and transcriptomics data analysis

Background:

No prior work had fully resolved the challenge of missing values in single-cell transcriptomic datasets. These gaps frequently arise due to technical limitations during the sequencing process. Such incomplete information complicates downstream tasks like cell type identification. Prior research has shown that existing imputation techniques often struggle to balance noise reduction with signal preservation. That uncertainty drove the development of more sophisticated computational frameworks. Researchers have attempted to address these issues using various statistical and machine learning approaches. This gap motivated the creation of tools that leverage deep learning architectures for better performance. The current landscape requires methods that maintain biological integrity while improving data completeness.

Purpose Of The Study:

The aim of this study is to introduce AdImpute, a novel imputation method designed for single-cell RNA sequencing data. This research addresses the persistent challenge of missing values that frequently occur during the sequencing process. These gaps often obscure biological insights and complicate downstream statistical analyses. The authors seek to improve data quality by leveraging the power of semi-supervised autoencoders. By incorporating imputation weights from existing tools, the method strives to learn more accurate latent representations of the data. The motivation stems from the need for more robust computational solutions in the field of transcriptomics. This work focuses on achieving higher precision while ensuring that important biological signals remain preserved. The researchers intend to provide a reliable framework that enhances the utility of high-resolution genetic information.

Main Methods:

The authors developed a semi-supervised learning approach to address data sparsity in single-cell studies. Their review approach involved testing the model against four established imputation algorithms. They utilized simulated datasets to provide a controlled environment for performance validation. Real-world sequencing data were also processed to ensure the method functions under complex experimental conditions. The design incorporates an autoencoder architecture that learns latent features from the input matrix. Researchers integrated preliminary imputation results from DrImpute to generate specific weights for the cost function. This configuration allows the neural network to prioritize accurate reconstruction of missing values. The entire pipeline focuses on optimizing the balance between noise removal and signal retention.

Main Results:

Key findings from the literature indicate that AdImpute consistently outperforms four other publicly available imputation methods in clustering tasks. The model demonstrates high accuracy across both simulated and real-world datasets. The researchers observed that the method effectively fills missing values while minimally modifying genes that are biologically silent. This indicates a high level of preservation for essential biological signals during the correction process. The results confirm that the integration of weighted cost functions leads to more precise latent information learning. Overall, the performance metrics highlight the robustness of this autoencoder-based strategy. The comparative analysis shows that the tool maintains superior data quality compared to existing alternatives. These findings suggest that the approach is a reliable choice for processing complex transcriptomic information.

Conclusions:

The authors propose that their model offers superior accuracy compared to four other publicly available imputation tools. Synthesis and implications suggest that this approach effectively handles missing data while minimizing alterations to biologically silent genes. The researchers claim that their method provides a robust solution for processing single-cell transcriptomic information. Evidence indicates that the integration of weighted cost functions enhances the learning of latent data structures. This study demonstrates that combining existing imputation techniques with autoencoders improves overall performance. The authors conclude that their framework maintains high fidelity to original biological signals during the correction process. The findings imply that this strategy is suitable for diverse datasets, including both simulated and real-world examples. Future applications could benefit from the increased precision provided by this semi-supervised learning architecture.

The researchers propose a semi-supervised autoencoder framework that utilizes weights derived from an initial imputation step, such as DrImpute. This mechanism allows the model to learn latent data representations more effectively, leading to higher accuracy than conventional approaches.

The authors utilize autoencoders, which are neural networks designed to learn efficient data codings. These tools are specifically configured to incorporate imputation weights, ensuring that the model focuses on relevant information while filling in missing values.

A weighted cost function is necessary to guide the learning process. This component ensures the model prioritizes accurate imputation based on the initial estimates, which helps in refining the final output while preserving biological signals.

Imputation weights act as a guide for the autoencoder during the training phase. These values, derived from preliminary processing, inform the network about the reliability of different data points, thereby improving the overall reconstruction of gene expression levels.

The researchers measured performance through clustering experiments using both simulated and real-world datasets. They compared their results against four other publicly available methods to evaluate the accuracy and robustness of the proposed approach.

The authors claim that their method is more accurate than four other publicly available imputation techniques. They also note that their approach minimizes modifications to biologically silent genes, ensuring that the original biological information remains intact.