Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies
  1. Home
  2. Dual-plane Wavefront Sensing Using A Vision Transformer.
  1. Home
  2. Dual-plane Wavefront Sensing Using A Vision Transformer.

Related Concept Videos

Transformers01:26

Transformers

2.2K
A device that transforms voltages from one value to another using induction is called a transformer. A transformer consists of two separate coils, or windings, wrapped around the same soft iron core. However, they are electrically insulated from each other.
The iron core has a substantial relative permeability. Therefore, the magnetic field lines generated due to the current in one winding are almost entirely confined within the core, such that the same magnetic flux permeates each turn of both...
2.2K
Vision01:24

Vision

61.2K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
61.2K
Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

2.5K
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
2.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Emergent dynamics in heterogeneous pulsatile swarmalators.

Chaos (Woodbury, N.Y.)·2026
Same author

Dynamics of pulsating swarmalators on a ring.

Physical review. E·2025
Same author

Stability of the 1D swarmalator model in the continuum limit.

Chaos (Woodbury, N.Y.)·2025
Same author

Effects of coupling range on the dynamics of swarmalators.

Physical review. E·2025
Same author

Global synchronization theorem for coupled swarmalators.

Chaos (Woodbury, N.Y.)·2025
Same author

Forced one-dimensional swarmalator model.

Physical review. E·2024
Same journal

Long-term stabilization of intensity-difference squeezing from four-wave mixing in rubidium vapor.

Optics express·2026
Same journal

Robust 3D topography measurement of large-range high-aspect-ratio structures based on dual-domain statistical filtering in SD-OCT.

Optics express·2026
Same journal

Broadband transmissive terahertz metasurface for simultaneous quad-mode OAM multiplexing.

Optics express·2026
Same journal

Leveraging two-dimensional materials for high-sensitivity optical sensors: quasi-bound states in the continuum within hybrid metasurfaces.

Optics express·2026
Same journal

Resolution investigation for dual-spherical-wave optical scanning holographic microscopy: methods and performance.

Optics express·2026
Same journal

Robustness of parallel subnetwork-filtered diffractive deep neural networks.

Optics express·2026
See all related articles

Related Experiment Video

A Multimodal Wide-Field Fourier-Transform Raman Microscope
06:48

A Multimodal Wide-Field Fourier-Transform Raman Microscope

Published on: December 30, 2025

679

Dual-plane wavefront sensing using a vision transformer.

Evan O'Rourke, Kevin O'Keeffe

    Optics Express
    |March 18, 2026

    View abstract on PubMed

    Summary
    This summary is machine-generated.

    Deep learning for wavefront sensing shows promise. A vision transformer model outperforms convolutional neural networks in estimating Zernike coefficients from downsampled images.

    More Related Videos

    Time Multiplexing Super Resolving Technique for Imaging from a Moving Platform
    06:25

    Time Multiplexing Super Resolving Technique for Imaging from a Moving Platform

    Published on: February 12, 2014

    8.9K
    Transmission of Multiple Signals through an Optical Fiber Using Wavefront Shaping
    09:43

    Transmission of Multiple Signals through an Optical Fiber Using Wavefront Shaping

    Published on: March 20, 2017

    10.4K

    Related Experiment Videos

    A Multimodal Wide-Field Fourier-Transform Raman Microscope
    06:48

    A Multimodal Wide-Field Fourier-Transform Raman Microscope

    Published on: December 30, 2025

    679
    Time Multiplexing Super Resolving Technique for Imaging from a Moving Platform
    06:25

    Time Multiplexing Super Resolving Technique for Imaging from a Moving Platform

    Published on: February 12, 2014

    8.9K
    Transmission of Multiple Signals through an Optical Fiber Using Wavefront Shaping
    09:43

    Transmission of Multiple Signals through an Optical Fiber Using Wavefront Shaping

    Published on: March 20, 2017

    10.4K

    Area of Science:

    • Optics and Photonics
    • Machine Learning
    • Image Processing

    Background:

    • Deep learning enables direct estimation of Zernike coefficients from intensity measurements in wavefront sensing.
    • Convolutional Neural Networks (CNNs) are the predominant deep learning models used for this task.
    • Limitations exist in CNN performance, particularly with downsampled image data.

    Purpose of the Study:

    • To introduce and evaluate a dual-plane wavefront sensor utilizing a vision transformer (ViT) model.
    • To compare the performance of the ViT-based sensor against a CNN-based approach.
    • To assess the efficacy of ViT in handling downsampled image data for Zernike coefficient estimation.

    Main Methods:

    • Development of a dual-plane wavefront sensor architecture.
    • Training a vision transformer model for wavefront sensing.
    • Comparative analysis of ViT and CNN performance using experimental and simulation data.
    • Evaluation of prediction accuracy for high-order Zernike coefficients.

    Main Results:

    • The vision transformer model demonstrated superior performance compared to the CNN.
    • Outperformance was particularly evident when dealing with significantly downsampled image data.
    • The ViT model showed enhanced accuracy in predicting high-order Zernike coefficients.

    Conclusions:

    • Vision transformers offer a powerful alternative to CNNs for image-based wavefront sensing.
    • ViT-based wavefront sensing is particularly advantageous for applications with limited image resolution.
    • This approach advances the field of optical metrology and adaptive optics.