Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genetic Lingo01:11

Genetic Lingo

Overview
Genomics02:02

Genomics

Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
Human Genetics01:28

Human Genetics

Human genetics provides a profound framework for understanding the interplay between genetic predispositions and human psychology. At the heart of this discipline lies the study of how genes influence physical traits, behaviors, and susceptibility to diseases. Each person carries a unique genetic code that subtly or significantly shapes their psychological and behavioral landscape.
The complex relationship between genetics and psychology is observable through common biological components such...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

ResSAT: enhancing spatial transcriptomics prediction from H&E-stained histology images with an interactive spot transformer.

Genome biology·2026
Same author

Characteristics of pulmonary perfusion and ventilation in healthy adults: a prospective observational study with phase-resolved functional lung magnetic resonance imaging.

BMC medical imaging·2026
Same author

Pseudo-obstruction on single-phase CTPA: unilateral pulmonary vein atresia as the cause of 50 years of hemoptysis.

BMC pulmonary medicine·2026
Same author

Enhanced functional mitochondrial donation through glucose-responsive antioxidant microcarrier-engineered native-like MSCs accelerates diabetic wound healing.

Materials today. Bio·2026
Same author

Ursodeoxycholic acid mitigates cerebral ischemia/reperfusion injury by inhibiting thrombin-induced lipid peroxidation through activation of ALDH3A1.

Redox biology·2026
Same author

Phase-resolved functional lung magnetic resonance imaging for evaluation of lung perfusion and ventilation in fibrosing interstitial lung diseases.

Insights into imaging·2026
Same journal

Poisoning the Genome: Targeted Backdoor Attacks on DNA Foundation Models.

ArXiv·2026
Same journal

Mechanistic mathematical model of the in vitro infection dynamics of Bunyamwera and Batai viruses including MOI-dependent shortening of the eclipse phase.

ArXiv·2026
Same journal

AI-Driven Lumped-Element Modeling of Human Respiratory System for Studying Voice Mechanics.

ArXiv·2026
Same journal

Beyond Algorithms: Conceptual Innovation in Medical Imaging AI.

ArXiv·2026
Same journal

Feynman Kac Reweighted Schrödinger Bridge Matching for Surface-Based Tau PET Harmonization.

ArXiv·2026
Same journal

Agentic Discovery of Non-Canonical Antimicrobial Peptides with AMPGAN v3.

ArXiv·2026
See all related articles

Related Experiment Video

Updated: May 9, 2026

In Vivo Modeling of the Morbid Human Genome using Danio rerio
12:31

In Vivo Modeling of the Morbid Human Genome using Danio rerio

Published on: August 24, 2013

GenoBERT: A Language Model for Accurate Genotype Imputation.

Lei Huang1, Chuan Qiu2, Kuan-Jui Su2

  • 1School of Computing Sciences and Computer Engineering, University of Southern Mississippi, Hattiesburg, MS, USA.

Arxiv
|May 8, 2026
PubMed
Summary
This summary is machine-generated.

Genotype imputation accuracy is improved by GenoBERT, a novel transformer-based framework. This reference-free method enhances genomic studies by overcoming ancestry bias and improving rare-variant detection.

Keywords:
attentiongenotype imputationtransformer

More Related Videos

Infinium Assay for Large-scale SNP Genotyping Applications
13:33

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

Related Experiment Videos

Last Updated: May 9, 2026

In Vivo Modeling of the Morbid Human Genome using Danio rerio
12:31

In Vivo Modeling of the Morbid Human Genome using Danio rerio

Published on: August 24, 2013

Infinium Assay for Large-scale SNP Genotyping Applications
13:33

Infinium Assay for Large-scale SNP Genotyping Applications

Published on: November 19, 2013

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Conventional genotype imputation methods are limited by ancestry bias and reduced accuracy for rare variants.
  • Reference-panel based imputation struggles with diverse populations and rare variants.

Purpose of the Study:

  • To introduce GenoBERT, a novel transformer-based, reference-free framework for genotype imputation.
  • To evaluate GenoBERT's performance against existing methods across diverse datasets and missingness levels.

Main Methods:

  • GenoBERT tokenizes phased genotypes and employs a self-attention mechanism to model linkage disequilibrium (LD).
  • The framework was benchmarked on the Louisiana Osteoporosis Study (LOS) and 1000 Genomes Project (1KGP) datasets.
  • Performance was assessed across various ancestry groups and genotype missingness levels (5-50%).

Main Results:

  • GenoBERT achieved the highest overall imputation accuracy compared to four baseline methods.
  • High imputation accuracy ($r^2 \approx 0.98$) was maintained at up to 25% missingness and robust performance ($r^2 > 0.90$) at 50% missingness.
  • Consistent accuracy gains were observed across different ancestries, with resilience to small sample sizes and weak LD.

Conclusions:

  • GenoBERT offers a scalable and robust solution for genotype imputation, eliminating reference-panel dependence.
  • The framework provides a strong foundation for downstream genomic modeling, including genome-wide association and risk-prediction studies.
  • GenoBERT demonstrates superior performance and broad applicability in genomic data analysis.