Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Transformers with Off-Nominal Turns Ratios

Transformers with Off-Nominal Turns Ratios

In scenarios involving parallel transformers with disparate ratings, developing per-unit models requires accommodating off-nominal turns ratios. This situation arises when the selected base voltages are not proportional to the transformer’s voltage ratings. Consider a transformer where the rated voltages are related by the term a. If the chosen voltage bases satisfy a relationship involving term b, term c is defined as the ratio of these bases. This ratio is then substituted into the...

Types Of Transformers

Types Of Transformers

Transformers can provide desired voltages to a circuit by modifying the number of turns in the secondary windings.
If the ratio of the number of turns in the secondary winding to that of the primary winding is greater than one, then the transformer is said to be a step-up transformer. In a step-up transformer, the voltage at the secondary winding is greater than the voltage applied at the primary winding.
However, if this ratio is less than one, the transformer is said to be a step-down...

Survival Tree

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
Building a Survival Tree
Constructing a...

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

The Ideal Transformer

The Ideal Transformer

In single-phase two-winding transformers, two windings are coiled around a magnetic core characterized by cross-sectional area A and magnetic permeability μ. A phasor current i1 enters the left winding while i2 exits the right winding, establishing the fundamental working of the transformer through electromagnetic principles.
Ampere's Law forms the basis of understanding the magnetic field within the transformer. It states that the integral of the magnetic field intensity's...

Per-Unit Sequence Models

Per-Unit Sequence Models

An ideal Y-Y transformer, grounded through neutral impedances, displays per-unit sequence networks akin to those of a single-phase ideal transformer when subjected to balanced positive- or negative-sequence currents. These currents do not produce neutral currents, and their associated voltage drops.
Zero-sequence currents, which are identical in magnitude and phase, generate a neutral current, resulting in voltage drops across the neutral impedance and the low-voltage winding. If the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Impact of age and clinical factors on the feasibility of mobile digital monitoring in people at risk of suicide.

PloS one·2026

Same author

Modeling recurrent suicide attempts using probabilistic Hawkes processes.

Spanish journal of psychiatry and mental health·2026

Same author

Digital relapse prevention plan for substance use disorders: study protocol for a multicentre randomised controlled trial.

BMJ health & care informatics·2025

Same author

A Closer Look at Benchmarking Self-supervised Pre-training with Image Classification.

International journal of computer vision·2025

Same author

Automated web-based typing of Clostridioides difficile ribotypes via MALDI-TOF MS.

BMC bioinformatics·2025

Same author

Scalable Random Feature Latent Variable Models.

IEEE transactions on pattern analysis and machine intelligence·2025

Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Aug 9, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

Regularizing transformers with deep probabilistic layers.

Aurora Cobo Aguilera¹, Pablo M Olmos¹, Antonio Artés-Rodríguez¹

¹Department of Signal Theory and Communications, Universidad Carlos III de Madrid, Avda. de la Universidad 30, 28911, Leganés, Madrid, Spain.

Neural Networks : the Official Journal of the International Neural Network Society

|February 22, 2023

Summary

This summary is machine-generated.

This study introduces a Gaussian Mixture Variational Autoencoder (GMVAE) as a novel regularizer for Transformer language models (LM). Integrating GMVAE enhances model versatility, generalization, and text imputation capabilities.

Keywords:

Deep learning Missing data Natural language processing Regularization Transformers Variational auto-encoder

More Related Videos

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Related Experiment Videos

Last Updated: Aug 9, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Area of Science:

Artificial Intelligence
Natural Language Processing
Deep Learning

Background:

Transformer architectures have dominated recent advancements in language models (LM).
Regularization techniques are crucial for improving LM performance but remain underexplored in Transformer structures.
Deep generative models offer potential for enhancing LM regularization.

Purpose of the Study:

To investigate the effectiveness of a Gaussian Mixture Variational Autoencoder (GMVAE) as a regularization layer within Transformer-based language models.
To analyze the impact of GMVAE placement depth on model performance.
To demonstrate the versatility and improved generalization of Transformer models augmented with GMVAE.

Main Methods:

Implementation of a Gaussian Mixture Variational Autoencoder (GMVAE) as a regularization layer.
Integration of the GMVAE into Transformer architectures like BERT, RoBERTa, and XLM-R.
Evaluation of model performance on tasks including SST-2 and TREC, focusing on generalization and imputation scores.

Main Results:

The inclusion of GMVAE as a regularizer significantly enhances the versatility of Transformer-based LMs.
Models incorporating GMVAE demonstrate improved generalization capabilities across various tasks.
GMVAE integration leads to superior performance in text imputation, including handling missing or noisy words with richer output.

Conclusions:

Deep generative models, specifically GMVAE, can be effectively employed as regularizers in Transformer architectures.
GMVAE regularization offers a promising approach to developing more robust and capable language models.
This method provides a pathway to richer text generation and improved performance on challenging NLP tasks.